In the last post of this series we dealt with axis systems. In this post we are also dealing with axes but this time we are taking a look at the position scales of dates, time and datetimes. Since we at STATWORX are often forecasting – and thus plotting – time series, this is an important issue for us. The …
Coordinate systems in ggplot2: easily overlooked and rather underrated
All plots have coordinate systems. Perhaps because they are such an integral element of plots, they are easily overlooked. However, in ggplot2, there are several very useful options to customize the coordinate systems of plots, which we will not overlook but explore in this blog post. Since it is spring, we will use a random subset of the famous iris …
About Risks and Side-Effects… Consult your Purrr-Macist
Capture errors, warnings and messages, but keep your list operations going In a recent post about text mining, I discussed some solutions to webscraping the contents of our STATWORX blog using the purrr-package. However, while preparing the next the episode of my series on text mining, I remembered a little gimmick that I found quite helpful along the way. Thus, …
Make RStudio Look the Way You Want — Because Beauty Matters
Introduction RStudio is a powerful IDE that helped me so many times with conveniently debugging large and complex R programs as well as shiny apps, but there is one thing that bugs me about it: there is no easy option or interface in RStudio that lets you customize your theme, just as you can do in more developed text editors …
strsplit – but keeping the delimiter
One of the functions I use the most is strsplit. It is quite useful if you want to separate a string by a specific character. Even if you have some complex rule for the split, most of the time you can solve this with a regular expression. However, recently I came across a problem I could not get my head …
Regularized Greedy Forest – The Scottish Play (Act I)
Macbeth shall never vanquish'd be until Great Birnam Wood to high Dunsinane Hill Shall come against him. (Act 4, Scene 1) In Shakespeare's The Tragedy of Macbeth, the prophecy of Birnam Wood is one of three misleading prophecies foreshadowing the defeat of the protagonist of the same name. While highly unlikely, the event of a nearby forest moving towards his …
Diamonds and Faceting are a Data Scientist’s best Friends
In the last post of this series, we took a first look at strategies for the effective visualization and exploration of data patterns within large data sets. Namely, we examined ways to overcome overplotting, with a focus on a two-dimensional feature space defined by two continuous features. However, oftentimes we want to visualize the distribution of data across several subgroups. …
Empirische Bestimmung von Elastizität – Teil 1
Wie kann man Preiselastizität bestimmen? Die Antwort auf diese Frage ist nicht eindeutig, sondern fallspezifisch. Es gibt viele Verfahren, um Preiselastizität empirisch zu bestimmen. Direkte Expertenbefragungen, Kundenbefragungen, indirekte Kundenbefragungen durch Conjoint Analysen und vielfältige experimentelle Testmethoden. Wenn die Datenlage es erlaubt, wenden wir Methoden an, die auf historischen Marktdaten basieren. Unterfüttert mit Daten zu Faktoren, wie Wettbewerbspreise, Werbeinformationen etc., lassen …
Als Data Science Praktikant bei STATWORX
Neben dem Einstieg als Trainee oder Data Science Consultant bei STATWORX gibt es ebenso die Möglichkeit, ein Praktikum im Bereich Data Science zu absolvieren. Unsere aktuellen Stellenausschreibungen findet ihr übrigens hier. Bewerbung bei STATWORX Das Berufsbild des Data Scientists ist durch seine vielfältigen Aufgaben und die bunte Durchmischung der Kompetenzen vor allem in den letzten Jahren sehr attraktiv geworden. Dies …
Show me your pipe!
At STATWORX, we all love R – even so much, that we have decided to visit eRum 2018, an R conference hosted in Budapest! And just as much as we love R, we love the piping operator %>% , as it makes our R codes much neater. I guess, many of you have already seen it in action, but you …