It’s easy to create a function in R, but what if you want to call that function from a different application, with the scale to support a large number of simultaneous requests? This article shows how you can deploy an R fitted model as a Plumber web service in Kubernetes, using Azure Container Registry (ACR) and Azure Kubernetes Service (AKS). We use the AzureContainers package to create the necessary resources and deploy the service.
Teaching and Learning Materials for Data Visualization
Intuit: Staff Data Scientist – Business Analytics [Mountain View, CA]
At: Intuit Location: Mountain View, CAWeb: intuit.comPosition: Staff Data Scientist - Business Analytics
Code for case study – Customer Churn with Keras/TensorFlow and H2O
This is code that accompanies a book chapter on customer churn that I have written for the German dpunkt Verlag. The book is in German and will probably appear in February: https://www.dpunkt.de/buecher/13208/9783864906107-data-science.html.
Day 12 – little helper dive
We at STATWORX work a lot with R and we often use the same little helper functions within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier to share and improve those functions, if they are within a package. Up till the 24th December I will present one function each day from helfRlein
. So, on the 12th day of Christmas my true love gave to me…
If you did not already know
Sequential Adaptive Nonlinear Modeling of Vector Time Series (SLANTS) We propose a method for adaptive nonlinear sequential modeling of vector-time series data. Data is modeled as a nonlinear function of past values corrupted by noise, and the underlying non-linear function is assumed to be approximately expandable in a spline basis. We cast the modeling of data as finding a good fit representation in the linear span of multi-dimensional spline basis, and use a variant of l1-penalty regularization in order to reduce the dimensionality of representation. Using adaptive filtering techniques, we design our online algorithm to automatically tune the underlying parameters based on the minimization of the regularized sequential prediction error. We demonstrate the generality and flexibility of the proposed approach on both synthetic and real-world datasets. Moreover, we analytically investigate the performance of our algorithm by obtaining both bounds of the prediction errors, and consistency results for variable selection. …
Cummins: Advanced Analytics Platform Principle Engineer [Columbus, IN]
At: Cummins Location: Columbus, INWeb: www.cummins.comPosition: Advanced Analytics Platform Principle Engineer
10 Data Science Skills to Land your Dream Job in 2019
In a 2017 business research article IBM predicted that the need for Data Scientists will increase by 28% by 2020, with nearly 3 million job openings for Data Science professionals. According to a Forbes report, Data Science is the best job in America for three consecutive years, with a median base salary of $110,000 and over 4,524 job openings.
Network Centrality in R: New ways of measuring Centrality
This is the third post of a series on the concept of “network centrality” withapplications in R and the package netrankr
. The last part introduced the concept ofneighborhood-inclusion and its implications for centrality. In this post, weextend the concept to a broader class of dominance relations by deconstructing indicesinto a series of building blocks and introduce new ways of evaluating centrality.
Using ggplot2 for functional time series
This week I’ve been attending the Functional Data and Beyond workshop at the Matrix centre in Creswick.