In this blog post, Stephan explains how to translate a simple R script, which transforms tables from wide to long format into a REST API with the R package Plumber and how to run it locally or with Docker.
STATWORX – R for Data Science Workshop
Beim Erarbeiten neuer Skills und Tools ist der Einstieg nicht immer einfach. In der STATWORX Academy bieten wir daher eine Fülle an Schulungsformaten an, in denen wir Dir den Einsteig in den Bereich Data Science erleichtern wollen. Unser Kollege Jan erzählt in diesem Blogartikel vom „R for Data Science“ Workshop, den er in unserem Frankfurter Office geleitet hat. Erfahre in diesem Artikel, an wen sich der Kurs richtet und welche Inhalte geboten werden.
A Collection of Benchmarks in R
Benchmarking your code is one thing – another thing is to keep and use the gained knowledge for future projects. In this blog, Jakob presents his collection of benchmarks and creates an easy to use a setup for new ones.
Tuning Random Forest on Time Series Data
Training random forests on time series is one thing, but tuning them? It’s not like you can just apply cross validation and be done with it. Or can you? This post forms part two our mini-series “Time Series Forecasting with Random Forest”. Find out how you can tune the hyperparameters of the random forest algorithm when dealing with time series data. The answers might surprise you!
rBokeh – Don’t be stopped by missing arguments!
rBokeh is an interactive plotting library. Since it functions lack some arguments compared to its Python counterpart, plots are sometimes difficult to customize. I will show how to overcome those issues and drill out the plot objects.
rBokeh – Tipps and Tricks with JavaScript and beyond!
rBokeh is an interactive plotting library. It has the capability to extend its features with customized JavaScript code. I show the most important aspects, guide through an example, and provide some useful tipps how to handle likely issues.
Evaluating Model Performance by Building Cross-Validation from Scratch
Cross-validation is a widely used technique to assess the generalization performance of a machine learning model. In this blog post I will introduce the basics of cross-validation, provide guidelines to tweak its parameters, and illustrate how to build it from scratch in an efficient way.
Using emojis and .png as icons in your ggplot
Time flies by and we are more people now here at STATWROX. But, did we change our behavior in using emojis or are we still the same? I am revisiting my analysis and will have a look!
Dynamic UI Elements in Shiny
Shiny enables its users to quickly create a fixed UI with code. Although simple, this can prove to be quite limiting. Applying the principles of reactivity to the UI part of a ShinyApp is a natural progression from reactive coding as we know it already from Shiny’s server side. This blog entry reviews and discusses two of the most convenient tools for doing so.
R or Python
Data Science beginners often encounter the same question: which programming language should one learn first? The choice usually falls on one of the two major providers, R or Python. With this blog article, we want to help you with the search for the right programming language for you.