SunJackson Blog

Preparing for the Data Science Job Hunt

转载自：https://www.dataquest.io/blog/preparing-for-the-data-science-job-hunt/

Erica Freedman

发表于 2018-07-11

Editor’s note: This piece was written in collaboration with SwitchUp, an online platform for researching and reviewing technology learning programs. Erica Freedman is a Content Specialist at SwitchUp.

阅读全文 »

John Mount speaking on rquery and rqdatatable

转载自：http://www.win-vector.com/blog/2018/07/john-mount-speaking-on-rquery-and-rqdatatable/

John Mount

发表于 2018-07-11

rquery and rqdatatable are new R packages for data wrangling; either at scale (in databases, or big data systems such as Apache Spark), or in-memory. The packages speed up both execution (through optimizations) and development (though a good mental model and up-front error checking) for data wrangling tasks.

阅读全文 »

Do Bayesians Overfit?

转载自：http://www.nowozin.net/sebastian/blog/do-bayesians-overfit.html

Sebastian Nowozin

发表于 2018-07-11

TLDR: Yes, and there are precise results, although they are not as well known as they perhaps should be.

阅读全文 »

BD reviews

转载自：http://andrewgelman.com/2018/07/11/bd-reviews/

Andrew

发表于 2018-07-11

I read BD’s (bandes dessinées or, as we say in English, graphic literature or picture storybooks) to keep up with my French. Regular books are too difficult for me. When it comes to BDs, some of the classic kids strips and albums are charming, but the ones for adults, which are more like Hollywood movies, are easier for me to read because I find the stories more compelling: I want to find out what happens next.

阅读全文 »

Exercise and weight loss： long-term follow-up

转载自：http://andrewgelman.com/2018/07/10/exercise-weight-loss-long-term-follow/

Phil

发表于 2018-07-10

This post is by Phil Price, not Andrew.

阅读全文 »

He wants to model a proportion given some predictors that sum to 1

转载自：http://andrewgelman.com/2018/07/10/wants-model-proportion-given-predictors-sum-1/

Andrew

发表于 2018-07-10

Joël Gombin writes:

阅读全文 »

Top-Down vs. Bottom-Up Approaches to Data Science

转载自：https://blog.dataiku.com/top-down-vs.-bottom-up-approaches-to-data-science

alex.reutter@dataiku.com (Alex Reutter)

发表于 2018-07-10

Data projects are generally organized in one of two ways: top-down (that is, starting with the business question) or bottom-up (starting with the data and working up to insights). But is there a “right way,” and is one approach better or more effective than the other?

阅读全文 »

Using Siamese Networks and Pre-Trained Convolutional Neural Networks (CNNs) for Fashion Similarity Matching

转载自：https://blogs.technet.microsoft.com/machinelearning/2018/07/10/how-to-use-siamese-network-and-pre-trained-cnns-for-fashion-similarity-matching/

ML Blog Team

发表于 2018-07-10

This post is co-authored by Erika Menezes, Software Engineer at Microsoft, and Chaitanya Kanitkar, Software Engineer at Twitter. This project was completed as part of the coursework for Stanford’s CS231n in Spring 2018.

阅读全文 »