When you speak with researchers, data scientists, and practitioners who work with data in any capacity, you are bound to hear one word multiple times in a conversation: Python.
Evaluating the Business Value of Predictive Models in Python and R
By Jurriaan Nagelkerke, Data Science Consultant, and Pieter Marcus, Data Scientist
Decolonising Artificial Intelligence
Machine Reading Comprehension: Learning to Ask & Answer
By Han Xiao, Tencent AI.
Using Confusion Matrices to Quantify the Cost of Being Wrong
There are so many confusing and sometimes even counter-intuitive concepts in statistics. I mean, come on… even explaining the difference between the null hypothesis and the alternative hypothesis can be an ordeal. All I want to do is understand and quantify the cost of my analytical models being wrong.
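To make that idea concrete, here is a minimal sketch of how a confusion matrix can be combined with a cost matrix to put a price on misclassifications. The counts and dollar figures below are entirely hypothetical, and this is a generic illustration of the technique rather than the article's own example.

```python
import numpy as np

# Hypothetical confusion matrix for a binary classifier:
# rows = actual class, columns = predicted class.
confusion = np.array([[850,  50],   # actual negative: TN, FP
                      [ 30,  70]])  # actual positive: FN, TP

# Assumed business costs per outcome (illustrative numbers only):
# correct predictions cost nothing, a false positive costs $10
# (say, a wasted mailing), a false negative costs $100 (a lost customer).
cost_matrix = np.array([[  0,  10],
                        [100,   0]])

# Element-wise product prices every cell, then sum over all outcomes.
total_cost = (confusion * cost_matrix).sum()
expected_cost_per_case = total_cost / confusion.sum()

print(f"Total cost of errors: ${total_cost}")
print(f"Expected cost per prediction: ${expected_cost_per_case:.2f}")
```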
Guest Post: Galin Jones on criteria for promotion and tenure in (bio)statistics departments
Editor’s Note: I attended an ASA Chair’s meeting and spoke about ways we could support junior faculty in data science. After my talk, Galin Jones, Professor and Director of Statistics at the University of Minnesota, and I had an interesting conversation about how his department had changed its promotion criteria in response to a faculty candidate with a unique profile. I asked him to write about his experience, and he kindly contributed the following post.
Document worth reading: “The Risk of Machine Learning”
Many applied settings in empirical economics involve simultaneous estimation of a large number of parameters. In particular, applied economists are often interested in estimating the effects of many-valued treatments (like teacher effects or location effects), treatment effects for many groups, and prediction models with many regressors. In these settings, machine learning methods that combine regularized estimation and data-driven choices of regularization parameters are useful to avoid over-fitting. In this article, we analyze the performance of a class of machine learning estimators that includes ridge, lasso and pretest in contexts that require simultaneous estimation of many parameters. Our analysis aims to provide guidance to applied researchers on (i) the choice between regularized estimators in practice and (ii) data-driven selection of regularization parameters. To address (i), we characterize the risk (mean squared error) of regularized estimators and derive their relative performance as a function of simple features of the data generating process. To address (ii), we show that data-driven choices of regularization parameters, based on Stein’s unbiased risk estimate or on cross-validation, yield estimators with risk uniformly close to the risk attained under the optimal (unfeasible) choice of regularization parameters. We use data from recent examples in the empirical economics literature to illustrate the practical applicability of our results. The Risk of Machine Learning
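As a rough illustration of point (ii), the snippet below selects ridge and lasso regularization parameters by cross-validation on simulated many-regressor data. This is a generic scikit-learn sketch under assumed settings, not the paper's estimators, data, or risk calculations.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import LassoCV, RidgeCV

# Simulated many-regressor setting (illustrative only): 100 regressors,
# of which just 10 carry signal, so regularization matters.
X, y = make_regression(n_samples=200, n_features=100, n_informative=10,
                       noise=10.0, random_state=0)

# Data-driven choice of the regularization parameter via cross-validation,
# one of the selection rules whose risk properties the paper analyzes.
alphas = np.logspace(-3, 3, 50)
ridge = RidgeCV(alphas=alphas, cv=5).fit(X, y)
lasso = LassoCV(cv=5, random_state=0).fit(X, y)

print(f"CV-selected ridge penalty: {ridge.alpha_:.4f}")
print(f"CV-selected lasso penalty: {lasso.alpha_:.4f}")
```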
Distilled News
Practicing ‘No Code’ Data Science
Top KDnuggets tweets, Oct 3–9: 5 Reasons Logistic Regression should be the first thing you learn when becoming a Data Scientist
Most Retweeted, Favorited, Viewed & Clicked: 5 Reasons Logistic Regression should be the first thing you learn when becoming a Data Scientist https://t.co/lobXcyIzpj https://t.co/3xuHbVQvR3