Donny Williams writes:
What I’ve learned from competing in machine learning contests on Kaggle
A Real World Reinforcement Learning Research Program
We are hiring for reinforcement learning related research at all levels and all MSR labs. If you are interested, apply, talk to me at COLT or ICML, or email me.
On this 4th of July, let’s declare independence from “95%”
Plan your experiment, gather your data, do your inference for all effects and interactions of interest. When all is said and done, accept some level of uncertainty in your conclusions: you might not be 97.5% sure that the treatment effect is positive, but that’s fine. For one thing, decisions need to be made. You were already going to make some decision with much less information—that is, with much more uncertainty. Now that you have more information, you can make a more informed decision. The other thing is, even if you did have a super-clean experiment with excellent measurements and a large and stable effect, so that you had that 95% interval excluding zero for your quantity of interest . . . so what? Whatever you care about is in the future, so even if your treatment was so great compared to the alternative in your sample being studied, there’s no saying what it will be in future populations under different conditions.
Data Science at Scale: Six Major Trends
Last week, Dataiku hosted an EGG-citing day of presentations and panel discussions from industry leaders and experts. The insights they provided tell us a lot about the current state of data science and where it seems to be heading.
Tutorial: The practical application of complicated statistical methods to fill up the scientific literature with confusing and irrelevant analyses
Build this media monitoring Slack bot in 20 minutes without writing code
Media monitoring has become a vital tool for marketers in every industry – being aware of the stories about you or your competitors that are being published in the news or shared on social media is simply a central part of how we work today. This is why, by 2022, the spend on media monitoring solutions is set to hit is set to hit $4.2 billion.
How to update your scikit-learn code for 2018
In 2015, I created a 4-hour video series called Introduction to machine learning in Python with scikit-learn. In the years since, hundreds of thousands of students have watched these videos, and thousands continue to do so every month.
PNAS forgets basic principles of game theory, thus dooming thousands of Bothans to the fate of Alderaan
Under the subject line, “I needed this information to make a go/no-go decision on my new Death Star,” Kevin Lewis points to this press release from a prestigious journal:
Using WSL Linux on Windows 10 for Deep Learning Development.
To explain briefly, WSL enables you to run Linux on Win10 and you can use your favorite Linux tools (bash, zsh, vim) for your development cycle and you can enjoy Win10 for the rest. It obviates the need for dual-boot configuration which might be a nightmare sometimes.