Introduction
Two papers released on arXiv: "Operator Variational Inference" and "Model Criticism for Bayesian Causal Inference"
Two papers of mine were released today on arXiv.
Once Again: Prefer Confidence Intervals to Point Estimates
Today I saw a claim being made on Twitter that 17% of Jill Stein supporters in Louisiana are also David Duke supporters. For anyone familiar with US politics, this claim is a priori implausible, although certainly not impossible.
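To see why a confidence interval matters here, suppose, purely for illustration, that the 17% figure came from 8 Duke supporters out of a subsample of 47 Stein supporters. A quick sketch with statsmodels (the counts are hypothetical):

```python
from statsmodels.stats.proportion import proportion_confint

# Hypothetical counts: 8 of 47 Stein supporters, i.e. about 17%
count, nobs = 8, 47
low, high = proportion_confint(count, nobs, alpha=0.05, method="wilson")
print(f"point estimate: {count / nobs:.1%}")
print(f"95% Wilson interval: [{low:.1%}, {high:.1%}]")
```

Under these made-up numbers the interval runs from roughly 9% to 30%, so reporting the bare 17% badly overstates how much the data actually pin down.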
Interacting with ML Models
The main difference between data analysis today, compared with a decade or two ago, is the way that we interact with it. Previously, the role of statistics was primarily to extend our mental models by discovering new correlations and causal rules. Today, we increasingly delegate parts of our reasoning processes to algorithmic models that live outside our mental models. In my next few posts, I plan to explore some of the issues that arise from this delegation and how ideas such as model interpretability can potentially address them. Throughout this series of posts, I will argue that while current research has barely scratched the surface of understanding the interaction between algorithmic and mental models, these issues will be much more important to the future of data analysis than the technical performance of the models themselves. In this first post, I’ll use a relatively mundane case study – personalized movie recommendations – to demonstrate some of these issues, keeping in mind that the same issues impact models in more serious contexts like healthcare and finance.
Random forest interpretation – conditional feature contributions
In two of my previous blog posts, I explained how the black box of a random forest can be opened up by tracking decision paths along the trees and computing feature contributions. This way, any prediction can be decomposed into contributions from features, such that prediction = bias + feature_1 contribution + ... + feature_n contribution.
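For instance, the treeinterpreter package turns this decomposition into a single call; a minimal sketch, with the dataset and hyperparameters chosen only for illustration:

```python
import numpy as np
from sklearn.datasets import load_diabetes
from sklearn.ensemble import RandomForestRegressor
from treeinterpreter import treeinterpreter as ti

X, y = load_diabetes(return_X_y=True)
rf = RandomForestRegressor(n_estimators=100, random_state=0).fit(X, y)

# Decompose each prediction into bias + per-feature contributions
prediction, bias, contributions = ti.predict(rf, X[:3])
for pred, b, contrib in zip(prediction, bias, contributions):
    # The decomposition is exact: prediction = bias + sum(contributions)
    print(float(np.ravel(pred)[0]), "=",
          float(np.ravel(b)[0]), "+", float(np.sum(contrib)))
```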
AI ‘judge’ doesn’t explain why it reaches certain decisions
The Guardian reports on a recent paper by University College London researchers who are using artificial intelligence to predict the outcomes of trials at the European Court of Human Rights.
DynamoDB Learnings
At Hinge, we have been using DynamoDB in production for more than eight months, and we relaunched at full capacity two weeks ago. Since I own rating processing in the application, I want to share a couple of lessons learned and explain why it made sense for us to store ratings in DynamoDB. We process millions of ratings per day, and so far DynamoDB is holding up well. Ratings are also crucial for making our recommender smarter, so they need careful handling.
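For concreteness, here is roughly what writing a rating can look like with boto3; the table name and attribute schema below are hypothetical, not Hinge's actual design:

```python
import time
import boto3

dynamodb = boto3.resource("dynamodb")
table = dynamodb.Table("ratings")  # hypothetical table name

# Hypothetical schema: partition key = rater_id, sort key = rated_id,
# so all of one user's ratings live under a single partition key.
table.put_item(
    Item={
        "rater_id": "user-123",
        "rated_id": "user-456",
        "rating": 1,                    # e.g. 1 = like, 0 = pass
        "created_at": int(time.time()),
    }
)
```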
Clustering Zeppelin on Zeppelin
Intro to Implicit Matrix Factorization: Classic ALS with Sketchfab Models
Last post I described how I collected implicit feedback data from the website Sketchfab. I then claimed I would write about how to actually build a recommendation system with this data. Well, here we are! Let’s build.
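If you'd rather not hand-roll the solver, the implicit library ships a classic ALS implementation. A small sketch on toy data (note that the expected matrix orientation has changed across implicit versions; 0.5+ expects rows = users):

```python
import scipy.sparse as sparse
import implicit

# Toy implicit-feedback matrix: rows = users, cols = items,
# entries = interaction strength (e.g. Sketchfab "likes")
user_items = sparse.random(100, 50, density=0.05, format="csr", random_state=0)

model = implicit.als.AlternatingLeastSquares(
    factors=32, regularization=0.01, iterations=15
)
model.fit(user_items)

# Predicted preference of user u for item i = dot product of latent factors
u, i = 0, 7
print(model.user_factors[u] @ model.item_factors[i])
```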
Recurrent Neural Network Gradients, and Lessons Learned Therein
I’ve spent the last week hand-rolling recurrent neural networks. I’m currently taking Udacity’s Deep Learning course, and when I arrived at the section on RNNs and LSTMs, I decided to build a few for myself.
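One lesson those hand-rolled gradients make vivid: backpropagation through time multiplies the gradient by the recurrent weight matrix at every step, so it shrinks or blows up geometrically with sequence length. A minimal numpy sketch (ignoring the tanh derivative, which can only shrink the gradient further):

```python
import numpy as np

rng = np.random.default_rng(0)
W = 0.01 * rng.normal(size=(50, 50))  # recurrent weights, small init
grad = np.ones(50)                    # gradient arriving at the final step

# Backprop through time: each earlier step multiplies by W.T
for t in range(81):
    if t % 20 == 0:
        print(f"step {t:2d}: |grad| = {np.linalg.norm(grad):.3e}")
    grad = W.T @ grad
```

With this small initialization the norm collapses toward zero within a few dozen steps; scale W up and the same loop explodes instead. That instability is exactly what gradient clipping and LSTM-style gating are designed to manage.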