Imagine that you are a nutritionist trying to explore the nutritional content of food. What is the best way to differentiate food items? By vitamin content? Protein levels? Or perhaps a combination of both?
Document Similarity with Word Mover’s Distance
The Power of IPython Notebook + Pandas + Scikit-learn
contact@andreykurenkov.com
IPython Notebook, Numpy, Pandas, MongoDB, R — for the better part of a year now, I have been trying out these technologies as part of Udacity’s Data Analyst Nanodegree. My undergrad education barely touched on data visualization or, more broadly, data science, so I figured being exposed to the aforementioned technologies would be fun. And fun it has been, with R’s powerful IDE-driven data munging and visualization techniques proving particularly revelatory. I learned enough R to create some complex visualizations, and was impressed by how easy it is to import data into its dataframe representation and then transform and visualize that data. I also found RStudio’s paradigm of continuously intermixed code editing and execution superior to my habitual workflow of endlessly cycling between tweaking and executing Python scripts.
Animate NBA shot events with Paper.js
Yuki Katoh (yukiegosapporo@gmail.com)
All the shots and FT attempts in one animation, made with NBA spatio-temporal data (maintained by neilmj) and Paper.js. The data is from the Golden State Warriors vs. Denver Nuggets game on January 13th, 2016.
Model-Free Prediction and Control
The problem with the methods covered earlier is that they require a model of the environment. Oftentimes, the agent does not know how the environment works and must figure it out on its own, purely from sampled experience.
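To make this concrete, here is a minimal sketch of model-free control using tabular Q-learning on a toy five-state chain. The environment, hyperparameters, and episode count are illustrative assumptions, not taken from the post; the point is that the update rule only uses sampled transitions, never the transition function itself.

```python
import random

# Toy 5-state chain: actions move left/right; reward 1 only at the right end.
N_STATES = 5
ACTIONS = [0, 1]  # 0 = left, 1 = right

def step(state, action):
    nxt = max(0, state - 1) if action == 0 else min(N_STATES - 1, state + 1)
    reward = 1.0 if nxt == N_STATES - 1 else 0.0
    done = nxt == N_STATES - 1
    return nxt, reward, done

Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
alpha, gamma, eps = 0.1, 0.9, 0.1
random.seed(0)

for episode in range(500):
    s, done = 0, False
    while not done:
        # Epsilon-greedy action selection.
        if random.random() < eps:
            a = random.choice(ACTIONS)
        else:
            a = max(ACTIONS, key=lambda act: Q[(s, act)])
        s2, r, done = step(s, a)
        # Q-learning update: uses only the sampled (s, a, r, s') tuple,
        # so no model of step() is needed.
        Q[(s, a)] += alpha * (r + gamma * max(Q[(s2, b)] for b in ACTIONS) - Q[(s, a)])
        s = s2

print(max(ACTIONS, key=lambda act: Q[(0, act)]))  # greedy action learned at the start state
```

After training, the greedy policy at the start state should be "move right", since that is the only path to reward.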
Translating W2v Embedding From One Space To Another
The Problem With Word Embeddings
A Guide to Gradient Boosted Trees with XGBoost in Python
A Gentle Introduction to Bloom Filter
Bloom filters are probabilistic, space-efficient data structures. They are similar to hash tables, but they are used exclusively to test membership in a set. They have a very powerful property: they let you trade off space against the false-positive rate of membership queries. Because of this trade-off between space and false positives, they are called probabilistic data structures.
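A minimal sketch of the idea, using Python's standard library (the class name, sizes, and the trick of deriving k hash functions by salting SHA-256 are my own illustrative choices, not from the post):

```python
import hashlib

class BloomFilter:
    """Minimal Bloom filter: k hash positions over an m-bit array."""

    def __init__(self, m=1024, k=3):
        self.m = m                 # number of bits
        self.k = k                 # number of hash functions
        self.bits = [False] * m

    def _positions(self, item):
        # Derive k positions by salting a single hash function.
        for i in range(self.k):
            digest = hashlib.sha256(f"{i}:{item}".encode()).hexdigest()
            yield int(digest, 16) % self.m

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos] = True

    def might_contain(self, item):
        # False means definitely absent; True means possibly present.
        return all(self.bits[pos] for pos in self._positions(item))

bf = BloomFilter()
bf.add("apple")
print(bf.might_contain("apple"))   # True
print(bf.might_contain("banana"))  # almost certainly False
```

The trade-off is visible in the constructor: a larger bit array `m` (or a well-chosen `k`) lowers the false-positive rate at the cost of more memory, while a query for an absent item can only err by answering "possibly present", never "absent" for a present one.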
LSTMs
In past posts, I’ve described how Recurrent Neural Networks (RNNs) can be used to learn patterns in sequences of inputs, and how the idea of unrolling can be used to train them. It turns out that there are some significant limitations to the types of patterns that a typical RNN can learn, due to the way their weight matrices are used. As a result, there has been a lot of interest in a variant of RNNs called Long Short-Term Memory networks (LSTMs). As I’ll describe below, LSTMs have more control than typical RNNs over what they remember, which allows them to learn much more complex patterns.
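The extra control the paragraph describes comes from gates. As an illustration, here is a single LSTM cell step in NumPy in its common formulation (the weight shapes, random initialization, and toy input sequence are my own assumptions for the sketch, not code from the post):

```python
import numpy as np

rng = np.random.default_rng(0)
n_in, n_hidden = 4, 8

# One weight matrix and bias per gate (input i, forget f, output o) and candidate g.
W = {g: rng.normal(scale=0.1, size=(n_hidden, n_in + n_hidden)) for g in "ifog"}
b = {g: np.zeros(n_hidden) for g in "ifog"}

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c):
    z = np.concatenate([x, h])
    i = sigmoid(W["i"] @ z + b["i"])   # input gate: how much new info to write
    f = sigmoid(W["f"] @ z + b["f"])   # forget gate: how much old memory to keep
    o = sigmoid(W["o"] @ z + b["o"])   # output gate: how much memory to expose
    g = np.tanh(W["g"] @ z + b["g"])   # candidate values to write
    c = f * c + i * g                  # gated update of the cell state (the "memory")
    h = o * np.tanh(c)                 # hidden state, read out through the output gate
    return h, c

h = np.zeros(n_hidden)
c = np.zeros(n_hidden)
for t in range(5):                     # unroll over a short random input sequence
    h, c = lstm_step(rng.normal(size=n_in), h, c)
print(h.shape)  # (8,)
```

The key line is `c = f * c + i * g`: because the forget gate `f` multiplies the old cell state rather than repeatedly squashing it through a weight matrix, the network can learn to preserve information over many steps, which is exactly the control a plain RNN lacks.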