Federated learning: distributed machine learning with data locality and privacy
We’re excited to release Federated Learning, the latest report and prototype from Cloudera Fast Forward Labs.
R Packages worth a look
Visualization of Subgroups for Decision Trees (visTree) Provides a visualization for characterizing subgroups defined by a decision tree structure. The visualization simplifies the ability to interpret indiv …
What's new on arXiv
Forecasting Transportation Network Speed Using Deep Capsule Networks with Nested LSTM Models
KDnuggets™ News 18:n43, Nov 14: To get hired as a data scientist, don’t follow the herd; LinkedIn Top Voices in Data Science & Analytics
Features
Distilled News
29 Statistical Concepts Explained in Simple English – Part 3
Windows Clipboard Access with R
The Windows clipboard is a quick way to get data in and out of R. How can we exploit this feature to accomplish our basic data exploration needs, and when might its use be inappropriate? Read on.
Notes on the Frank-Wolfe Algorithm, Part II: A Primal-dual Analysis
This blog post extends the convergence theory from the first part of my notes on the Frank-Wolfe (FW) algorithm with convergence guarantees on the primal-dual gap, which generalize and strengthen those obtained in the first part.
Chocolate milk! Another stunning discovery from an experiment on 24 people!
I was reading over this JAMA Brief Report and could not figure out what they were doing with the composite score. Here are the cliff notes:
NLP for Log Analysis – Tokenization
This is part 1 of a series of posts based on a presentation I gave at the Silicon Valley Cyber Security Meetup on behalf of my company, Insight Engines. Some of the ideas are speculative and I do not know if they are used in practice. If you have any experience applying these techniques to logs, please share in the comments below.