In my post Skill vs Strategy I made the following proposition:
Step by step Kaggle competition tutorial
Kaggle is a Data Science community where thousands of Data Scientists compete to solve complex data problems.
Becoming More Efficient
Busy day yesterday, was thinking about things that make me more efficient. Here are three.
Learning in Brains and Machines (2): The Dogma of Sparsity
· Read in 10 minutes · 1700 words · collected posts ·
Deep Learning for Chatbots, Part 1 – Introduction
Chatbots, also called Conversational Agents or Dialog Systems, are a hot topic. Microsoft is making big bets on chatbots, and so are companies like Facebook (M), Apple (Siri), Google, WeChat, and Slack. There is a new wave of startups trying to change how consumers interact with services by building consumer apps like Operator or x.ai, bot platforms like Chatfuel, and bot libraries like Howdy’s Botkit. Microsoft recently released their own bot developer framework.
The Frog of CIFAR 10
Image produced by a CPPN network trained on CIFAR-10’s frog class.**Same network generates the image at both 30x30
and 1080x1080
pixel resolution.
Genome Analysis Toolkit: Now Using Apache Spark for Data Processing
Users of the latest release of the Genome Analysis Toolkit, an open source framework for analyzing high-throughput DNA sequencing data, can now choose Apache Spark for data processing.
On Software Demos and Potemkin Villages
** Wed 06 April 2016
Travis CI: "You Have Too Many Tests LOLZ!"
As part of getting RSiteCatalyst 1.4.8 ready for CRAN, I’ve managed to accumulate hundreds of testthat tests across 63 test files. Each of these tests runs on Travis CI against an authenticated API, and the API frequently queues long-running reports. Long-story-short, my builds started failing, creating the error log message quoted below:
Inverting a Neural Net
In this experiment, I “invert” a simple two-layer MNIST model to visualize what the final hidden layer representations look like when projected back into the original sample space.