Thomas Perneger points us to an amusing quiz on statistics terminology.
7 Best Practices for Machine Learning on a Data Lake
Why a data lake? Machine learning typically involves an iterative process that can drag down the performance of a traditional data warehouse. Data lakes are built for scale and experimentation, and they supply the ample, diverse training data that makes models more accurate and more dependable once they reach production.
Direct access to Amazon SageMaker notebooks from Amazon VPC by using an AWS PrivateLink endpoint
Amazon SageMaker now supports AWS PrivateLink for notebook instances. In this post, I will show you how to set up AWS PrivateLink to secure your connection to Amazon SageMaker notebooks.
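As a rough sketch, assuming placeholder VPC, subnet, and security group IDs, the core of the setup is creating an interface VPC endpoint for the SageMaker notebook service; in boto3 that looks roughly like this:

```python
# Sketch only: create an interface VPC endpoint for SageMaker notebooks.
# All resource IDs below are placeholders; the service name follows
# AWS's published pattern for SageMaker notebook PrivateLink endpoints.
import boto3

ec2 = boto3.client("ec2", region_name="us-east-1")

response = ec2.create_vpc_endpoint(
    VpcEndpointType="Interface",
    VpcId="vpc-0123456789abcdef0",               # placeholder VPC ID
    ServiceName="aws.sagemaker.us-east-1.notebook",
    SubnetIds=["subnet-0123456789abcdef0"],      # placeholder subnet
    SecurityGroupIds=["sg-0123456789abcdef0"],   # placeholder security group
    PrivateDnsEnabled=True,  # resolve the notebook URL to private IPs
)
print(response["VpcEndpoint"]["VpcEndpointId"])
```

With private DNS enabled, the notebook URL resolves to the endpoint's private addresses, so traffic to the notebook stays inside the VPC.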
Causal mediation estimation measures the unobservable
I put together a series of demos for a group of epidemiology students who are studying causal mediation analysis. Since mediation analysis is not always clear or intuitive, I thought that walking through some examples of simulating data for this process could clarify things a bit.
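To make the "unobservable" part concrete, here is a minimal numpy sketch (the post's demos are in R, and the coefficients below are invented for illustration): simulation lets us generate both potential mediators for every subject, something no real dataset provides, and read off the natural direct and indirect effects by averaging.

```python
# Illustrative simulation for mediation analysis: exposure A raises the
# mediator M, and both A and M affect the outcome Y. All coefficients
# are made up for this sketch.
import numpy as np

rng = np.random.default_rng(0)
n = 100_000

# Potential mediators under exposure a = 0 and a = 1
M0 = 2.0 + rng.normal(0, 1, n)
M1 = 2.0 + 1.5 + rng.normal(0, 1, n)   # exposure shifts the mediator by 1.5

def outcome(a, m):
    # Outcome has a direct exposure effect (0.8*a) and a mediator effect (0.5*m)
    return 1.0 + 0.8 * a + 0.5 * m + rng.normal(0, 1, n)

Y0_M0 = outcome(0, M0)   # Y(0, M(0))
Y1_M0 = outcome(1, M0)   # Y(1, M(0)): flip exposure, hold mediator at its a=0 value
Y1_M1 = outcome(1, M1)   # Y(1, M(1))

print("Natural direct effect:  ", (Y1_M0 - Y0_M0).mean())  # ~0.8
print("Natural indirect effect:", (Y1_M1 - Y1_M0).mean())  # ~0.5 * 1.5 = 0.75
print("Total effect:           ", (Y1_M1 - Y0_M0).mean())  # ~1.55
```

The quantity Y(1, M(0)) can never be observed for a real subject, which is exactly why simulation is a useful teaching device here.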
New: Maintained Datasets
Can you trust the data you use on Kaggle? Is it licensed? Has it been updated recently?
Turbocharge Tech Transformation: Integrate AI Across Insurance
Sponsored post by Insurance Nexus.
Customize your notebook volume size, up to 16 TB, with Amazon SageMaker
Amazon SageMaker now allows you to customize the notebook storage volume when you need to store larger amounts of data.
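A minimal boto3 sketch, with a placeholder instance name and role ARN: the storage size is just the VolumeSizeInGB parameter at creation time, which now accepts values up to 16,384 GB.

```python
# Sketch: request a larger EBS volume when creating a notebook instance.
# The instance name and role ARN are placeholders.
import boto3

sm = boto3.client("sagemaker", region_name="us-east-1")

sm.create_notebook_instance(
    NotebookInstanceName="my-big-notebook",                     # placeholder
    InstanceType="ml.t3.medium",
    RoleArn="arn:aws:iam::123456789012:role/MySageMakerRole",   # placeholder
    VolumeSizeInGB=1024,  # 1 TB instead of the 5 GB default; max is 16384
)
```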
R plus Magento 2 REST API revisited: part 1 - authentication and universal search
Last year I wrote a post about getting Magento 2 data into R using the REST API. Now I provide more usage examples and a reusable wrapper over the API that makes pulling data from Magento 2 into R a bit more convenient.
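The post itself works in R; purely to illustrate the authentication flow against Magento 2's standard token endpoint, here is a hedged Python sketch (the store URL and credentials are placeholders):

```python
# Sketch of Magento 2 token authentication, not the post's R wrapper.
# BASE and the credentials are placeholders.
import requests

BASE = "https://example-store.com"  # placeholder store URL

# 1. Exchange admin credentials for a bearer token
token = requests.post(
    f"{BASE}/rest/V1/integration/admin/token",
    json={"username": "api_user", "password": "api_password"},
).json()

# 2. Use the token on subsequent REST calls, e.g. an order search
orders = requests.get(
    f"{BASE}/rest/V1/orders",
    params={"searchCriteria[pageSize]": 10},
    headers={"Authorization": f"Bearer {token}"},
).json()
print(orders["total_count"])
```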
What's new on arXiv
ADEPOS: Anomaly Detection based Power Saving for Predictive Maintenance using Edge Computing
Postdocs and Research fellows for combining probabilistic programming, simulators and interactive AI
Here’s a great opportunity for those interested in probabilistic programming and workflows for Bayesian data analysis: