SunJackson Blog


  • 首页

  • 分类

  • 关于

  • 归档

  • 标签

  • 站点地图

  • 公益404

标签

How good are your beliefs? Part 2: The Quiz

09-18

Simple Feed Ranking Algorithm

10-28

Probability and Tennis

08-13

How to update your scikit-learn code for 2018

07-04

Cribbage Scores

02-25

How good are your beliefs? Part 2: The Quiz

09-18

Six Sigma DMAIC Series in R – Part4

12-15

Recreating the NBA lead tracker graphic

12-13

Probability and Tennis

08-13

Cribbage Scores

02-25

Deep Learning for Chatbots, Part 2 – Implementing a Retrieval-Based Model in Tensorflow

07-04

Mini AI app using TensorFlow and Shiny

01-15

How good are your beliefs? Part 2: The Quiz

09-18

How good are your beliefs? Part 2: The Quiz

09-18

How good are your beliefs? Part 2: The Quiz

09-18

Self Avoiding Walks

12-08

Simulating Chutes & Ladders in Python

12-18

Twenty Peg Puzzle

09-19

Twenty Peg Puzzle

09-19

Twenty Peg Puzzle

09-19

R Objects

08-24

Exploring Line Lengths in Python Packages

11-09

Cake cutting part 3

04-10

Recurrent Neural Networks in Tensorflow III - Variable Length Sequences

11-15

Koch Snowflake

01-05

Twenty Peg Puzzle

09-19

Sleeping Giant Rural Postman Problem

12-01

Intro to graph optimization: solving the Chinese Postman Problem

10-07

Where Will Your Country Stand in World War III?

04-12

Twenty Peg Puzzle

09-19

How-to: Train Models in R and Python using Apache Spark MLlib and H2O

01-29

Six lines to install and start SparkR on Mac OS X Yosemite

09-21

Six lines to install and start SparkR on Mac OS X Yosemite

09-21

Top KDnuggets tweets, Oct 17-23: Machine Learning Cheat Sheets

10-24

Apache Spark Introduction for Beginners

10-18

Distilled News

08-29

Announcing hs2client, A Fast New C++ / Python Thrift Client for Impala and Hive

06-16

Why pandas users should be excited about Apache Arrow

02-22

Introducing Apache Arrow: A Fast, Interoperable In-Memory Columnar Data Structure Standard

02-18

Six lines to install and start SparkR on Mac OS X Yosemite

09-21

Package Paths in R

03-31

Six lines to install and start SparkR on Mac OS X Yosemite

09-21

Six lines to install and start SparkR on Mac OS X Yosemite

09-21

MIMIC Data

09-22

MIMIC Data

09-22

MIMIC Data

09-22

Principles of Database Management: The Practical Guide to Storing, Managing and Analyzing Big and Small Data

01-10

binb 0.0.3: Now with Monash

10-12

The Microsoft AI Idea Challenge – Breakthrough Ideas Wanted!

08-14

MIMIC Data

09-22

MIMIC Data

09-22

10 Companies to Work with After a Data Science Course

01-10

Document worth reading: “Marketing Analytics: Methods, Practice, Implementation, and Links to Other Fields”

12-07

Why Primary Research?

12-04

Distilled News

10-28

Applications of R presented at EARL London 2018

09-21

Aella Credit empowers underbanked individuals by using Amazon Rekognition for identity verification

08-15

TINT uses Amazon Comprehend to find and aggregate the best social media content for customers

08-15

Machine Learning Making Big Moves in Marketing

07-30

Crossing Your Data Science Chasm

03-22

Why Indian companies should take on different projects than competing Valley companies - an application of Cobb-Douglas

11-07

Business Execution

09-22

Business Execution

09-22

Top 5 Data Visualization Tools for 2019

01-03

In case you missed it: November 2018 roundup

12-14

In case you missed it: October 2018 roundup

11-15

Review: Excel TV’s Data Science with Power BI and R

10-12

Speed Up With Microsoft

10-04

How to Implement AI-First Business Models at Scale

09-21

Cosmos DB for Data Science

09-07

In case you missed it: August 2018 roundup

09-06

AI, Machine Learning and Data Science Roundup: August 2018

08-17

AI, Machine Learning and Data Science Roundup: July 2018

07-23

AI Lab: Learn to Code with the Cutting-Edge Microsoft AI Platform

06-19

ML/NLP Publications in 2017

01-02

Algorithms, Machine Learning, and Optimization: we are hiring!

11-12

AI and ML Futures 2: The Quiet Revolution

01-17

Business Execution

09-22

Business Execution

09-22

Why Vegetarians Miss Fewer Flights – Five Bizarre Insights from Data

01-12

Apps gather your location and then sell the data

12-13

Exploring the Gender Pay Gap with Publicly Available Data

12-12

6 Step Plan to Starting Your Data Science Career

12-05

Kick Start Your Data Career! Tips From the Frontline

12-05

“Statistical insights into public opinion and politics” (my talk for the Columbia Data Science Society this Wed 9pm)

12-04

Why Machine Learning Interpretability Matters

12-04

Leaving NYC for Nashville

12-03

Deep Learning for the Masses (… and The Semantic Layer)

11-30

Cathy O’Neil discusses the current lack of fairness in artificial intelligence and much more.

11-26

EARL Houston: Interview with Hadley Wickham

11-05

Peter Bull discusses the importance of human-centered design in data science.

11-05

Sharing the Recipe for rOpenSci’s Unconf Ice Breaker

11-01

Our Favorite Spooky AI & Data Articles

10-30

Distilled News

10-22

I’m an Analyst and the software engineers made fun of my code!

10-19

The Golden Rule of Nudge

10-10

(People are missing the point on Wansink, so) what’s the lesson we should be drawing from this story?

09-27

One Drink Per Day, Your Chances of Developing an Alcohol-Related Condition

09-25

Why, oh why, do so many people embrace the Pacific Garbage Cleanup nonsense? (I have a theory).

09-18

How to set up a voting system for a Hall of Fame?

09-02

Why you can't have privacy on the internet

08-22

What is Data Science?

08-20

Discussion of the value of a mathematical model for the dissemination of propaganda

08-11

Discussion of the value of a mathematical model for the dissemination of propaganda

08-11

Some thoughts after reading “Bad Blood: Secrets and Lies in a Silicon Valley Startup”

08-09

Trapped in the spam folder? Here’s what to do.

08-07

The persistence of bad reporting and the reluctance of people to criticize it

07-12

Import AI

06-05

How to Overcome Imposter Syndrome For Good

05-30

Technology and Information: Data Science and UX

05-01

R Spatial Resources

04-06

Moravec's Paradox

01-31

Retrospective on leaving academia for industry data science

04-09

Reasons I left academia

02-12

Questions on Artificial Intelligence

01-16

Ten Ways Your Data Project is Going to Fail

11-01

PyData DC 2016 Talk

10-11

Quora Q&A Session Answers

03-09

CES 2016

01-11

Go easy on Volkswagen

10-26

Business Execution

09-22

Pear Therapeutics: Data Scientist [San Francisco, CA]

01-11

7 Reasons for Policy Professionals to Get Pumped About R Programming in 2019

01-07

New Year's Resolution: Help Data Scientists Help You

12-31

Hackathon Winner Interview: Penn State | Kaggle University Club

12-19

The Netflix Data War

12-19

Think Twice Before You Accept That Fancy Data Science Job

12-19

All the (NBA) box scores you ever wanted

12-18

Top Insights from 50 Chief Data Officers

12-14

Intuit: Staff Data Scientist [Mountain View, CA]

12-12

Intuit: Staff Data Scientist [Mountain View, CA]

12-11

Let Automation Carry You from BI to AI in 2019

12-11

Tribes.ai: Sr Data Scientist [Remote, India / Eastern Europe]

12-01

You Can’t Do AI Without Augmented Analytics and AutoML

11-26

Quidditch: is it all about the Snitch?

11-24

Driving Success through Business Insight, One Customer at a Time

11-21

From Project Manager to Data Champion — Conquer Your Data Projects

10-18

Here is How You Can build a data science team from scratch in 2018 - The Definitive Guide

10-05

What Does it Take to Train Deep Learning Models On-Device?

10-04

Dataiku 5.0: Enterprise AI Within Reach

09-12

See How AI is Inspiring the Next Generation of Developers

09-05

The Hidden Costs of Data Silos

08-07

Data Science at Scale: Six Major Trends

07-05

Predicting World Cup dark horses from press coverage using the AYLIEN News API – Monthly Media Roundup

06-15

Your First Job

11-15

A Few Tips To Make Distributed Teams Work Well

09-23

A Few Tips To Make Distributed Teams Work Well

09-23

The Key to Getting a Data Science Job, According to Briana Brownell

12-20

Document worth reading: “Internet of Things: An Overview”

11-25

Federated Learning: Machine Learning with Privacy on the Edge

10-29

Developing effective data scientists

02-11

A Few Tips To Make Distributed Teams Work Well

09-23

Distilled News

01-13

Top Skills Needed to Work as Data Scientist in iGaming

01-10

Distilled News

12-29

Alternative approaches to scaling Shiny with RStudio Shiny Server, ShinyProxy or custom architecture.

12-18

Four Real-Life Machine Learning Use Cases

12-13

The Machine Learning Project Checklist

12-07

ggQC | ggplot Quality Control Charts – New Release

12-05

Distilled News

11-12

Multi-object tracking with dlib

10-29

Coding is hard

10-24

Get a 2–6x Speed-up on Your Data Pre-processing with Python

10-23

Distilled News

10-05

Robust Quality – Powerful Integration of Data Science and Process Engineering

10-01

R Packages worth a look

09-20

Understanding Different Components & Roles in Data Science

09-18

If you did not already know

09-11

Understanding Different Components & Roles in Data Science

08-30

New Dynamics for Topic Models

07-31

Revisiting “Is the scientific paper a fraud?”

07-29

Parallel computation with two lines of code

05-18

Bayesian Inference via Simulated Annealing

02-07

Don't Panic: Deep Learning will be Mostly Harmless

11-29

David MacKay Symposium

03-15

Deep Learning, Pachinko, and James Watt: Efficiency is the Driver of Uncertainty

03-04

A Few Tips To Make Distributed Teams Work Well

09-23

A Few Tips To Make Distributed Teams Work Well

09-23

Why Vegetarians Miss Fewer Flights – Five Bizarre Insights from Data

01-12

If you did not already know

01-10

Ensemble Learning: 5 Main Approaches

01-03

If you did not already know

01-01

Document worth reading: “Instance-Level Explanations for Fraud Detection: A Case Study”

01-01

Text classification with tidy data principles

12-24

Magister Dixit

12-18

Dr. Data Show Video: Five Reasons Computers Predict When You’ll Die

12-08

Top 5 domains Big Data analytics helps to transform

11-23

Mega-PAW Las Vegas Registration is Live & Super Early Bird Pricing is Now Available!

11-20

The tidy caret interface in R

11-16

Dr. Data Show Video: What the Hell Does “Data Science” Really Mean?

11-10

Machine Learning Basics – Random Forest

10-30

How I Learned to Stop Worrying and Love Uncertainty

10-24

If you did not already know

10-19

Dr. Data Show Video: Why Machine Learning Is the Coolest Science

10-01

Streamlining Production with Predictive Maintenance and Essilor

09-04

Vulcan Post: This AI Startup Is Run By The World’s Top Data Scientists – Lets Anyone Build Predictive Models

09-04

Magister Dixit

08-09

When LOO and other cross-validation approaches are valid

08-03

Divisibility in statistics: Where is it needed?

07-08

Flaws in stupid horrible algorithm revealed because it made numerical predictions

07-03

Performance metrics aren't everything

02-09

Lessons learned in my first year as a data scientist

01-25

AutoML on AWS

12-04

AWS Machine Learning Big Data NYC

10-24

Is the Universe Random?

06-19

Safe Crime Detection

06-05

Applying Machine Learning To March Madness

03-12

Random forest interpretation – conditional feature contributions

10-24

Denoising Dirty Documents: Part 7

09-23

Ensemble Learning: 5 Main Approaches

01-03

The brain as a neural network: this is why we can’t get along

12-19

Machine Learning (ML) Essentials

12-11

Failure Pressure Prediction Using Machine Learning

12-10

Example of Overfitting

11-16

The tidy caret interface in R

11-16

Machine Learning Basics – Random Forest

10-30

Machine Learning Trick of the Day (8): Instrumental Thinking

10-15

Because it's Friday: Hurricane Trackers

09-14

GovernmentCIO: How AI Can Stop Doctors Likely to Overprescribe Opioids — and Stem the Crisis

09-04

Logistic Regression: Concept & Application

09-03

Is the Universe Random?

06-19

Applying Machine Learning To March Madness

03-12

Properties of Interpretability

12-06

Random forest interpretation – conditional feature contributions

10-24

Random Forest Tutorial: Predicting Crime in San Francisco

08-25

Decision Trees Tutorial

07-27

Making Python on Apache Hadoop Easier with Anaconda and CDH

02-17

My Top 10% Solution for Kaggle Rossman Store Sales Forecasting Competition

01-16

Denoising Dirty Documents: Part 7

09-23

If you did not already know

01-10

Ensemble Learning: 5 Main Approaches

01-03

If you did not already know

01-01

Document worth reading: “Instance-Level Explanations for Fraud Detection: A Case Study”

01-01

4 Reasons Santa Needs Machine Learning & AI

12-24

Failure Pressure Prediction Using Machine Learning

12-10

Dr. Data Show Video: Five Reasons Computers Predict When You’ll Die

12-08

Data Science Projects Employers Want To See: How To Show A Business Impact

12-04

The tidy caret interface in R

11-16

How I Learned to Stop Worrying and Love Uncertainty

10-24

If you did not already know

10-19

GovernmentCIO: How AI Can Stop Doctors Likely to Overprescribe Opioids — and Stem the Crisis

09-04

Logistic Regression: Concept & Application

09-03

Document worth reading: “A rational analysis of curiosity”

08-20

Distilled News

08-10

Parsimonious principle vs integration over all uncertainties

07-26

Divisibility in statistics: Where is it needed?

07-08

Performance metrics aren't everything

02-09

AutoML on AWS

12-04

House Price Prediction using a Random Forest Classifier

11-29

Is the Universe Random?

06-19

Applying Machine Learning To March Madness

03-12

Principle Component Analysis in Regression

03-08

Getting Rich using Bitcoin stockprices and Twitter!

02-22

Where Predictive Modeling Goes Astray

01-27

Random forest interpretation – conditional feature contributions

10-24

Random Forest Tutorial: Predicting Crime in San Francisco

08-25

Decision Trees Tutorial

07-27

Making Python on Apache Hadoop Easier with Anaconda and CDH

02-17

Predicting Fantasy Football Points

10-07

Denoising Dirty Documents: Part 7

09-23

Whats new on arXiv

01-13

Whats new on arXiv

01-12

Ensure consistency in data processing code between training and inference in Amazon SageMaker

01-11

Whats new on arXiv

01-10

Hackathon Winner Interview: Friendship University of Russia | Kaggle University Club

01-10

Automated and continuous deployment of Amazon SageMaker models with AWS Step Functions

01-10

If you did not already know

01-10

4 Myths of Big Data and 4 Ways to Improve with Deep Data

01-09

Whats new on arXiv

01-09

Distilled News

01-09

“The Book of Why” by Pearl and Mackenzie

01-08

You did a sentiment analysis with tidytext but you forgot to do dependency parsing to answer WHY is something positive/negative

01-08

Whats new on arXiv

01-08

AI Gotchas (& How to Avoid Them)

01-08

Dow Jones Stock Market Index (3/4): Log Returns GARCH Model

01-08

On deck for the first half of 2019

01-07

February 21st & 22nd: End-2-End from a Keras/TensorFlow model to production

01-07

Auto-Keras and AutoML: A Getting Started Guide

01-07

Distilled News

01-07

Scaling H2O analytics with AWS and p(f)urrr (Part 1)

01-06

Distilled News

01-05

If you did not already know

01-05

R Packages worth a look

01-04

Whats new on arXiv

01-04

Whats new on arXiv

01-03

Magister Dixit

01-03

If you did not already know

01-03

Whats new on arXiv

01-03

Whats new on arXiv

01-01

If you did not already know

01-01

Whats new on arXiv

12-31

Leaf Plant Classification: Statistical Learning Model – Part 2

12-31

Whats new on arXiv

12-30

Distilled News

12-29

Whats new on arXiv

12-29

Supervised Learning: Model Popularity from Past to Present

12-28

The business case for federated learning

12-28

Whats new on arXiv

12-28

If you did not already know

12-27

Statistical Assessments of AUC

12-26

BERT: State of the Art NLP Model, Explained

12-26

R Packages worth a look

12-26

Whats new on arXiv

12-25

Distilled News

12-25

Text classification with tidy data principles

12-24

Pivot Billions and Deep Learning enhanced trading models achieve 100% net profit

12-24

Whats new on arXiv

12-22

November 2018: “Top 40” New Packages

12-21

Top 10 Data Science Tools (other than SQL Python R)

12-21

Distilled News

12-20

Whats new on arXiv

12-20

Amazon SageMaker adds Scikit-Learn support

12-20

Whats new on arXiv

12-20

4 Strategies to Deal With Large Datasets Using Pandas

12-19

Whats new on arXiv

12-19

AI, Machine Learning and Data Science Roundup: December 2018

12-19

Whats new on arXiv

12-18

If you did not already know

12-18

Distilled News

12-18

If you did not already know

12-17

If you did not already know

12-17

Meta-Learning For Better Machine Learning

12-17

The 2019 PAW Business Agenda is Live – Super Early Bird expires this Friday

12-17

Whats new on arXiv

12-17

Distilled News

12-17

If you did not already know

12-16

Data Scientist’s Dilemma – The Cold Start Problem

12-15

If you did not already know

12-15

NLP Breakthrough Imagenet Moment has arrived

12-14

Implementing ResNet with MXNET Gluon and Comet.ml for Image Classification

12-14

Why You Shouldn’t be a Data Science Generalist

12-14

Solve any Image Classification Problem Quickly and Easily

12-13

R Packages worth a look

12-13

Whats new on arXiv

12-13

R Packages worth a look

12-13

Cummins: Reliability Analytics Leader [Columbus, IN]

12-13

Amazon SageMaker Automatic Model Tuning now supports early stopping of training jobs

12-13

WNS Hackathon Solutions by Top Finishers

12-13

Four Approaches to Explaining AI and Machine Learning

12-12

P&G: Data Scientist – Machine Learning/NLP [Cincinnati, OH]

12-11

If you did not already know

12-11

R Packages worth a look

12-11

Machine Learning (ML) Essentials

12-11

Distilled News

12-11

Whats new on arXiv

12-11

Whats new on arXiv

12-10

Distilled News

12-10

Failure Pressure Prediction Using Machine Learning

12-10

Whats new on arXiv

12-10

Keras – Save and Load Your Deep Learning Models

12-10

Whats new on arXiv

12-09

Distilled News

12-09

Whats new on arXiv

12-08

If you did not already know

12-08

R Packages worth a look

12-08

If you did not already know

12-07

Distilled News

12-07

The Machine Learning Project Checklist

12-07

If you did not already know

12-06

If you did not already know

12-06

Common mistakes when carrying out machine learning and data science

12-06

Distilled News

12-06

Explainable Artificial Intelligence (Part 2) – Model Interpretation Strategies

12-06

Trust in ML models. Slides from TWiML & AI EMEA Meetup + iX Articles

12-05

Building Gene Expression Atlases with Deep Generative Models for Single-cell Transcriptomics

12-05

Distilled News

12-05

Anomaly detection on Amazon DynamoDB Streams using the Amazon SageMaker Random Cut Forest algorithm

12-05

Data Science Projects Employers Want To See: How To Show A Business Impact

12-04

Bayesian Nonparametric Models in NIMBLE, Part 1: Density Estimation

12-04

Deep Learning and Medical Image Analysis with Keras

12-03

Distilled News

12-03

Whats new on arXiv

12-03

Best Machine Learning languages, Data Visualization Tools, DL Frameworks, and Big Data Tools

12-03

If you did not already know

12-03

TSstudio 0.1.3

12-02

Distilled News

12-02

Interpretability is crucial for trusting AI and machine learning

12-01

NYC buses: C5.0 classification with R; more than 20 minute delay?

12-01

Whats new on arXiv

12-01

Distilled News

11-30

Amazon SageMaker now comes with new capabilities for accelerating machine learning experimentation

11-29

Whats new on arXiv

11-29

Whats new on arXiv

11-29

Whats new on arXiv

11-28

Whats new on arXiv

11-28

Multilevel models for multiple comparisons! Varying treatment effects!

11-28

R Packages worth a look

11-28

Filter Clickbait from News Content with our custom Natural Language Processing Model

11-28

Introducing Dynamic Training for deep learning with Amazon EC2

11-27

Bringing Machine Learning Research to Product Commercialization

11-27

How to Engineer Your Way Out of Slow Models

11-27

“Economic predictions with big data” using partial pooling

11-26

My secret sauce to be in top 2% of a Kaggle competition

11-26

Whats new on arXiv

11-26

If you did not already know

11-26

Data Pro Cyber Monday – Choose Your Savings

11-26

OneR – fascinating insights through simple rules

11-25

Whats new on arXiv

11-25

Whats new on arXiv

11-23

R Packages worth a look

11-21

Whats new on arXiv

11-20

Machine Learning in Action: Going Beyond Decision Support Data Science

11-20

Distilled News

11-20

Quantcast: Sr Applied Scientist, Audience Platform [Seattle, WA]

11-20

Detect suspicious IP addresses with the Amazon SageMaker IP Insights algorithm

11-19

Distilled News

11-19

Don’t Peek part 2: Predictions without Test Data

11-18

Graphs and tables, tables and graphs

11-18

Whats new on arXiv

11-17

If you did not already know

11-16

Whats new on arXiv

11-16

Distilled News

11-16

Using Uncertainty to Interpret your Model

11-16

The tidy caret interface in R

11-16

Whats new on arXiv

11-15

Mastering The New Generation of Gradient Boosting

11-15

Whats new on arXiv

11-15

Distilled News

11-14

Federated learning: distributed machine learning with data locality and privacy

11-14

Whats new on arXiv

11-14

Magister Dixit

11-13

TWIMLAI European Online Meetup about Trust in Predictions of ML Models

11-13

Whats new on arXiv

11-13

Preview my new book: Introduction to Reproducible Science in R

11-12

Whats new on arXiv

11-12

Deriving Expectation-Maximization

11-11

Multi-Class Text Classification with Doc2Vec & Logistic Regression

11-09

Why would I ever NEED Bayesian Statistics?

11-09

Exploring Models with lime

11-09

If you did not already know

11-08

Deep Learning Performance Cheat Sheet

11-08

Practical statistics books for software engineers

11-08

Egg-Not-Egg Deep Learning Model

11-08

Now easily perform incremental learning on Amazon SageMaker

11-07

Whats new on arXiv

11-07

EMNLP 2018 Highlights: Inductive bias, cross-lingual learning, and more

11-07

Whats new on arXiv

11-07

Whats new on arXiv

11-06

Whats new on arXiv

11-06

Can we predict the crawling of the Google-Bot?

11-06

Mastering the Learning Rate to Speed Up Deep Learning

11-06

Whats new on arXiv

11-05

Quantum Machine Learning: A look at myths, realities, and future projections

11-05

R Packages worth a look

11-05

Maps, models, and analytic problem framing

11-05

Distilled News

11-05

Don’t use AI when BI will suffice!

11-05

The 3Ds of Machine Learning Systems Design

11-05

R tip: Make Your Results Clear with sigr

11-04

R tip: Make Your Results Clear with sigr

11-04

Whats new on arXiv

11-04

Whats new on arXiv

11-04

Visualize the Business Value of your Predictive Models with modelplotr

11-03

If you did not already know

11-03

“Simulations are not scalable but theory is scalable”

11-02

Whats new on arXiv

11-02

My two talks in Austria next week, on two of your favorite topics!

11-02

R Packages worth a look

11-01

Distilled News

10-31

If you did not already know

10-31

Multilevel Modeling Solves the Multiple Comparison Problem: An Example with R

10-31

Labeling Unstructured Text for Meaning to Achieve Predictive Lift

10-31

Whats new on arXiv

10-30

Additional Strategies for Confronting the Partition Function

10-30

R Packages worth a look

10-29

Data Science With R Course Series – Week 7

10-29

Top Obstacles to Overcome when Implementing Predictive Maintenance

10-29

Learning to learn in a model-agnostic way

10-29

Distilled News

10-28

Whats new on arXiv

10-27

Can we do better than using averaged measurements?

10-26

Whats new on arXiv

10-26

Whats new on arXiv

10-26

Marketing Analytics and Data Science

10-26

Notes on Feature Preprocessing: The What, the Why, and the How

10-26

Whats new on arXiv

10-25

Distilled News

10-25

AI, Machine Learning and Data Science Roundup: October 2018

10-25

SiliconANGLE: Machine learning automation startup DataRobot lands $100M round

10-24

R Packages worth a look

10-24

Computer Vision for Model Assessment

10-23

Whats new on arXiv

10-23

Computer Vision for Model Assessment

10-23

Whats new on arXiv

10-23

Whats new on arXiv

10-22

Distilled News

10-22

MVP for Data Projects

10-22

Whats new on arXiv

10-22

Data Science With R Course Series – Week 6

10-22

Whats new on arXiv

10-21

Distilled News

10-21

A Thorough Introduction to Boltzmann Machines

10-20

Holy Grail of AI for Enterprise — Explainable AI

10-19

Will Models Rule the World? Data Science Salon Miami, Nov 6-7

10-19

Distilled News

10-18

Distilled News

10-18

Four machine learning strategies for solving real-world problems

10-17

Slides from my talk at the R-Ladies Meetup about Interpretable Deep Learning with R, Keras and LIME

10-17

RStudio 1.2 Preview: Stan

10-16

Whats new on arXiv

10-16

Applied Data Science: Solving a Predictive Maintenance Business Problem Part 3

10-16

If you did not already know

10-14

Whats new on arXiv

10-13

Limitations of “Limitations of Bayesian Leave-One-Out Cross-Validation for Model Selection”

10-12

Modeling Airbnb prices

10-12

Whats new on arXiv

10-11

Why are functional programming languages so popular in the programming languages community?

10-11

Evaluating the Business Value of Predictive Models in Python and R

10-11

Machine Reading Comprehension: Learning to Ask & Answer

10-11

Using Confusion Matrices to Quantify the Cost of Being Wrong

10-11

Whats new on arXiv

10-10

Whats new on arXiv

10-09

Whats new on arXiv

10-09

Whats new on arXiv

10-09

Don’t Peek: Deep Learning without looking … at test data

10-08

If you did not already know

10-07

Document worth reading: “Learning Tree Distributions by Hidden Markov Models”

10-07

Distilled News

10-06

Challenges & Solutions for Production Recommendation Systems

10-05

Whats new on arXiv

10-05

Whats new on arXiv

10-04

Segmenting brain tissue using Apache MXNet with Amazon SageMaker and AWS Greengrass ML Inference – Part 2

10-04

Whats new on arXiv

10-04

Document worth reading: “Bayesian model reduction”

10-03

Whats new on arXiv

10-03

5 Reasons Why You Should Use Cross-Validation in Your Data Science Projects

10-02

Whats new on arXiv

10-02

Modeling muti-category Outcomes With vtreat

10-01

A Review of the Neural History of Natural Language Processing

10-01

If you did not already know

09-30

Python Vs R : The Eternal Question for Data Scientists

09-29

Whats new on arXiv

09-28

Machine Learning and Deep Learning : Differences

09-28

Whats new on arXiv

09-28

Whats new on arXiv

09-27

Deploy your own TensorFlow object detection model to AWS DeepLens

09-27

Understanding Regression Error Metrics

09-26

Whats new on arXiv

09-25

Whats new on arXiv

09-25

Distilled News

09-24

R Packages worth a look

09-24

Python Vs R : The Eternal Question for Data Scientists

09-24

Whats new on arXiv

09-24

Whats new on arXiv

09-21

Whats new on arXiv

09-21

AI-Based Virtual Tutors – The Future of Education?

09-21

Whats new on arXiv

09-20

Three Mighty Good Reasons to Learn R for Data Science

09-19

Training models with unequal economic error costs using Amazon SageMaker

09-18

Whats new on arXiv

09-18

Distilled News

09-18

BRUNO: A Deep Recurrent Model for Exchangeable Data

09-17

Deep learning made easier with transfer learning

09-17

Whats new on arXiv

09-17

Distilled News

09-17

Monotonicity constraints in machine learning

09-16

Distilled News

09-15

If you did not already know

09-15

If you did not already know

09-15

Whats new on arXiv

09-14

Divergent and Convergent Phases of Data Analysis

09-14

Whats new on arXiv

09-13

R Packages worth a look

09-13

Dataiku 5.0: Enterprise AI Within Reach

09-12

Against Arianism 2: Arianism Grande

09-12

Distilled News

09-12

Whats new on arXiv

09-12

Data Science Glossary

09-12

Distilled News

09-11

Whats new on arXiv

09-11

If you did not already know

09-09

Distilled News

09-08

Whats new on arXiv

09-08

Whats new on arXiv

09-07

Whats new on arXiv

09-07

Whats new on arXiv

09-07

Whats new on arXiv

09-07

Bothered by non-monotonicity? Here’s ONE QUICK TRICK to make you happy.

09-07

If you did not already know

09-05

Distilled News

09-04

StanCon 2018 Helsinki tutorial videos online

09-04

Distilled News

09-04

Logistic Regression: Concept & Application

09-03

Whats new on arXiv

09-03

Whats new on arXiv

09-01

R Tip: How to Pass a formula to lm

09-01

Amazon SageMaker runtime now supports the CustomAttributes header

08-31

Whats new on arXiv

08-31

If you did not already know

08-30

Visual search on AWS—Part 1: Engine implementation with Amazon SageMaker

08-30

R Packages worth a look

08-30

Whats new on arXiv

08-29

Whats new on arXiv

08-28

Whats new on arXiv

08-28

Distilled News

08-28

Whats new on arXiv

08-28

How I got in the top 1 % on Kaggle.

08-28

“To get started, I suggest coming up with a simple but reasonable model for missingness, then simulate fake complete data followed by a fake missingness pattern, and check that you can recover your missing-data model and your complete data model in that fake-data situation. You can then proceed from there. But if you can’t even do it with fake data, you’re sunk.”

08-27

If you did not already know

08-27

Bayesian model comparison in ecology

08-26

Whats new on arXiv

08-24

Whats new on arXiv

08-24

Whats new on arXiv

08-22

Distilled News

08-21

Whats new on arXiv

08-21

Whats new on arXiv

08-21

Forecasting financial time series with dynamic deep learning on AWS

08-20

R Packages worth a look

08-19

If you did not already know

08-18

R Packages worth a look

08-18

Whats new on arXiv

08-17

Whats new on arXiv

08-16

Document worth reading: “Sequences, yet Functions: The Dual Nature of Data-Stream Processing”

08-16

Deploy a TensorFlow trained image classification model to AWS DeepLens

08-15

Distilled News

08-14

Distilled News

08-13

Document worth reading: “Theoretical Impediments to Machine Learning With Seven Sparks from the Causal Revolution”

08-11

Distilled News

08-11

If you did not already know

08-11

Distilled News

08-10

Whats new on arXiv

08-10

Distilled News

08-08

Whats new on arXiv

08-08

The Hidden Costs of Data Silos

08-07

Distilled News

08-07

Whats new on arXiv

08-07

When Recurrent Models Don't Need to be Recurrent

08-06

Whats new on arXiv

08-04

Bring your own pre-trained MXNet or TensorFlow models into Amazon SageMaker

08-03

Whats new on arXiv

08-03

When LOO and other cross-validation approaches are valid

08-03

Distilled News

08-02

Whats new on arXiv

08-02

Tips & Tricks for Starting Your First Data Project

08-01

Progress in machine learning interpretability

07-31

Whats new on arXiv

07-31

Quantum Computing: Cats, Crushes, and Chemistry

07-30

Transfer learning for custom labels using a TensorFlow container and “bring your own algorithm” in Amazon SageMaker

07-27

Parsimonious principle vs integration over all uncertainties

07-26

AWS Deep Learning AMIs now include ONNX, enabling model portability across deep learning frameworks

07-26

Classify your own images using Amazon SageMaker

07-20

Model Updates: Entity-level Sentiment Analysis and Brand New Entity Extraction Models Now Live in the Text Analysis API

07-17

Using Siamese Networks and Pre-Trained Convolutional Neural Networks (CNNs) for Fashion Similarity Matching

07-10

Joint inference or modular inference? Pierre Jacob, Lawrence Murray, Chris Holmes, Christian Robert discuss conditions on the strength and weaknesses of these choices

07-09

Design Patterns for Production NLP Systems

07-09

Divisibility in statistics: Where is it needed?

07-08

Data science books - theory and practice

06-29

My Thoughts on Synthetic Data

06-27

DIY AI for the Future

06-27

Building a Diabetic Retinopathy Prediction Application using Azure Machine Learning

06-25

Add Constrained Optimization To Your Toolbelt

06-21

How to Do Distributed Deep Learning for Object Detection Using Horovod on Azure

06-20

The Role of Resources in Data Analysis

06-18

The Dynamics of Philippine Senate Bills: Gensim, Topic Modeling and All That Good NLP Stuff

06-09

Deep Learning for Emojis with VS Code Tools for AI – Part 2

06-05

Forbes: DataRobot Puts the Power of Machine Learning in the Hands of Business Analysts

06-04

How to Use FPGAs for Deep Learning Inference to Perform Land Cover Mapping on Terabytes of Aerial Images

05-29

My steps into Data Science

05-21

Profiling Top Kagglers: Bestfitting, Currently

05-07

Gensim Survey 2018

04-30

Time Series for scikit-learn People (Part II): Autoregressive Forecasting Pipelines

03-22

When Men and Women talk to Siri

03-09

Text to Speech Deep Learning Architectures

02-20

Production Recommendation Systems with Cloudera

02-20

Neural Networks and the generalisation problem

01-28

57 Summaries of Machine Learning and NLP Research

01-17

Weekly Review: 12/23/2017

12-23

Why mere Machine Learning cannot predict Bitcoin price

12-18

Weekly Review: 12/10/2017

12-10

Using Artificial Intelligence to Augment Human Intelligence

12-04

AutoML on AWS

12-04

Java Handwritten Digit Recognition with Neural Networks

11-29

On Pyro - Deep Probabilistic Programming on PyTorch

11-03

Deep learning with Apache MXNet on Cloudera Data Science Workbench

10-19

How to use Tensorboard with PyTorch

10-16

Building a Visual Search Algorithm

10-13

How To Predict ICU Mortality with Digital Health Data, DL4J, Apache Spark and Cloudera

09-18

Deep Learning with Intel’s BigDL and Apache Spark

09-06

When (not) to use Deep Learning for NLP

09-04

The Advent of Analytics Engineering

09-01

How much compute do we need to train generative models?

08-31

Hierarchical Softmax

08-01

Logistic Regression

07-30

Prophecy Fulfilled: Keras and Cloudera Data Science Workbench

07-25

Introductory Machine Learning Terminology with Food

07-18

Kaggle’s Mercedes-Benz Greener Manufacturing

07-01

Deep learning on Apache Spark and Apache Hadoop with Deeplearning4j

06-27

Machine Learning Fraud Detection: A Simple Machine Learning Approach

06-15

Teaching Machines to Draw

05-19

Transfer Learning for Flight Delay Prediction via Variational Autoencoders

05-08

The Benefits of Migrating HPC Workloads To Apache Spark

05-04

Machine Learning in Science and Industry slides

04-20

Sentiment analysis on Twitter using word2vec and keras

04-20

Deriving the Softmax from First Principles

04-19

Sentiment Analysis model deployed!

04-17

Time Series Analysis with Generalized Additive Models

04-04

A Practical Guide to the Lomb-Scargle Periodogram

03-30

Model AUC depends on test set difficulty

03-19

Random-Walk Bayesian Deep Networks: Dealing with Non-Stationary Data

03-14

Cognitive Machine Learning (2): Uncertain Thoughts

03-12

Deep and Hierarchical Implicit Models

02-28

What is an Interaction Effect?

02-25

Cognitive Machine Learning (1): Learning to Explain

02-05

RescueTime Inference via the "Poor Man's Dirichlet"

02-03

Where Predictive Modeling Goes Astray

01-27

Doing magic and analyzing seasonal time series with GAM (Generalized Additive Model) in R

01-24

Customer lifetime value and the proliferation of misinformation on the internet

01-08

Attending to characters in neural sequence labeling models

01-06

Recurrent Neural Network Tutorial for Artists

01-01

Post NIPS Reflections

12-13

On Model Mismatch and Bayesian Analysis

12-13

3D printing glass and bronze: Lost-PLA casting

12-11

Properties of Interpretability

12-06

Don't Panic: Deep Learning will be Mostly Harmless

11-29

Goals of Interpretability

11-17

Two papers released on arXiv, "Operator Variational Inference" and "Model Criticism for Bayesian Causal Inference"

10-30

Intro to Implicit Matrix Factorization: Classic ALS with Sketchfab Models

10-19

A Billion Words and The Limits of Language Modeling

09-23

Discussion of "Fast Approximate Inference for Arbitrarily Large Semiparametric Regression Models via Message Passing"

09-19

The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3)

08-24

Preliminary Note on the Complexity of a Neural Network

08-16

Why I’m Not a Fan of R-Squared

07-24

Linear regression can be understood in many ways (optimization, probabilistic, bayesian)

07-20

Recurrent Neural Networks in Tensorflow I

07-11

Translating W2v Embedding From One Space To Another

06-06

Bayesian Deep Learning

06-01

The Frog of CIFAR 10

04-06

Diagnosing Heart Diseases with Deep Neural Networks

03-15

Online Representation Learning in Recurrent Neural Language Models

01-24

A New Library for Analyzing Time-Series Data with Apache Spark

12-14

A Challenge to Data Scientists

11-22

Denoising Dirty Documents: Part 11

11-08

Deep Learning for Visual Question Answering

11-02

Predicting Fantasy Football Points

10-07

Denoising Dirty Documents: Part 7

09-23

Denoising Dirty Documents: Part 9

10-15

Denoising Dirty Documents: Part 8

10-02

Denoising Dirty Documents: Part 7

09-23

Top 5 Data Science Courses in 2019

01-09

Learn Python for Data Science From Scratch

01-09

Top KDnuggets tweets, Dec 19 – Jan 1: Deep Learning Cheat Sheets

01-02

My book ‘Practical Machine Learning in R and Python: Third edition’ on Amazon

01-02

Nimble tweak to use specific python version or virtual environment in RStudio

01-01

R or Python? Why not both? Using Anaconda Python within R with {reticulate}

12-30

Distilled News

12-25

Top Python Libraries in 2018 in Data Science, Deep Learning, Machine Learning

12-19

Top Stories of 2018: 9 Must-have skills you need to become a Data Scientist, updated; Python eats away at R: Top Software for Analytics, Data Science, Machine Learning in 2018

12-14

My book ‘Deep Learning from first principles:Second Edition’ now on Amazon

12-14

Here are the most popular Python IDEs / Editors

12-07

Best Machine Learning languages, Data Visualization Tools, DL Frameworks, and Big Data Tools

12-03

Why R for data science – and not Python?

12-02

Free ebook: Exploring Data with python

11-29

KDnuggets™ News 18:n44, Nov 21: What is the Best Python IDE for Data Science?; Anticipating the next move in data science

11-21

Distilled News

11-20

Top 10 Python Data Science Libraries

11-16

Top KDnuggets tweets, Oct 31 – Nov 6: 10 More Free Must-Read Books for Machine Learning and Data Science

11-07

Python vs R: Head to Head Data Analysis

11-01

Data Science Interview Questions with Answers

10-28

Get a 2–6x Speed-up on Your Data Pre-processing with Python

10-23

Distilled News

10-22

Accelerating Your Algorithms in Production [Webinar Replay]

10-16

Hitchhiker's guide to Exploratory Data Analysis

10-12

The One reason you should learn Python

10-11

Top 8 Python Machine Learning Libraries

10-09

Python Vs R : The Eternal Question for Data Scientists

09-29

Python Vs R : The Eternal Question for Data Scientists

09-24

Distilled News

09-04

Distilled News

08-31

What is Data Science?

08-20

On the growth of our PyDataLondon community

08-16

Distilled News

08-10

Essential Tips and Tricks for Starting Machine Learning with Python

08-05

What makes the Python Cool.

07-31

Python数据分析之pandas

07-18

Using WSL Linux on Windows 10 for Deep Learning Development.

07-04

5 Tips To Learn Machine Learning

06-17

World Models Experiments

06-09

Python and Tidyverse

06-01

Introducing Python for data scientists - Pt2

03-23

Why you should start using .npy file more often…

03-20

Introducing Python for data scientists - Pt1

03-15

Python Tutorial: Learn Python in one Day

11-28

8 Important Python Interview Questions and Answers

11-17

PyConUK 2017, PyDataCardiff and “Machine Learning Libraries You’d Wish You’d Known About”

11-05

How to launch your data science career (with Python)

07-12

Use your favorite Python library on PySpark cluster with Cloudera Data Science Workbench

04-26

Deep Learning Frameworks on CDH and Cloudera Data Science Workbench

04-20

Getting Rich using Bitcoin stockprices and Twitter!

02-22

Simple Stock Ticker App

02-04

Django and Elastic Beanstalk, a perfect combination

11-28

Simple python to LaTeX parser

11-18

Quick reference to Python in a single script (and notebook)

10-13

Python 2.7 still reigns supreme in pip installs

09-03

Grokking Deep Learning

08-17

Announcing hs2client, A Fast New C++ / Python Thrift Client for Impala and Hive

06-16

Why pandas users should be excited about Apache Arrow

02-22

Theano Tutorial

01-25

Why is Keras Running So Slow?

12-05

The problem with the data science language wars

11-02

Yet Another PhD to Data Science Post (Part I)

09-23

Distilled News

01-13

Generating Synthetic Data Sets with ‘synthpop’ in R

01-13

Making sense of the METS and ALTO XML standards

01-13

Showing a difference in means between two groups

01-13

Practical Data Science with R, 2nd Edition discount!

01-12

Why Vegetarians Miss Fewer Flights – Five Bizarre Insights from Data

01-12

The year in AI/Machine Learning advances: Xavier Amatriain 2018 Roundup

01-11

If you did not already know

01-11

R Packages worth a look

01-11

epubr 0.6.0 CRAN release

01-11

Visualizing the Asian Cup with R!

01-11

vitae: Dynamic CVs with R Markdown

01-10

Explainable Artificial Intelligence

01-10

Tutorial: Time Series Analysis with Pandas

01-10

Principles of Database Management: The Practical Guide to Storing, Managing and Analyzing Big and Small Data

01-10

10 Companies to Work with After a Data Science Course

01-10

Core Principles of Sustainable Data Science, Machine Learning and AI Product Development: Research as a core driver

01-09

How Data Scientists Think - A Mini Case Study

01-09

On the Road to 0.8.0 — Some Additional New Features Coming in the sergeant Package

01-09

Top 5 Data Science Courses in 2019

01-09

Top December Stories: Why You Shouldn’t be a Data Science Generalist

01-09

Distilled News

01-09

Top KDnuggets tweets, Jan 02-08: 10 Free Must-Read Books for Machine Learning and Data Science

01-09

AI Gotchas (& How to Avoid Them)

01-08

If you did not already know

01-08

Don’t reinvent the wheel: making use of shiny extension packages. Join MünsteR for our next meetup!

01-08

Comparison of the Text Distance Metrics

01-07

7 Reasons for Policy Professionals to Get Pumped About R Programming in 2019

01-07

RTest: pretty testing of R packages

01-07

Role of Computer Science in Data Science World

01-07

Stock Price prediction using ML and DL

01-07

Dow Jones Stock Market Index (2/4): Trade Volume Exploratory Analysis

01-07

Rev Summit for Data Science Leaders featuring Daniel Kahneman

01-07

Distilled News

01-07

R Packages worth a look

01-06

Distilled News

01-06

R Packages worth a look

01-06

Magister Dixit

01-06

Distilled News

01-05

Distilled News

01-05

Here’s why 2019 is a great year to start with R: A story of 10 year old R code then and now

01-05

My Activities in 2018 with R and ShinyApp

01-04

In case you missed it: December 2018 roundup

01-04

Strata Data SF 2019 KDnuggets Offer

01-04

The cold start problem: how to build your machine learning portfolio

01-04

x-mas tRees with gganimate, ggplot, plotly and friends

01-03

gganimate has transitioned to a state of release

01-03

Music listener statistics: last.fm’s last.year as an R package

01-02

Why Learning Data Science Live is Better than Self-Paced Learning

01-02

Magister Dixit

01-02

What to do when you read a paper and it’s full of errors and the author won’t share the data or be open about the analysis?

01-02

Seeing the wood for the trees

01-01

Silent Duels and an Old Paper of Restrepo

12-31

Distilled News

12-31

New Year's Resolution: Help Data Scientists Help You

12-31

If you did not already know

12-30

If you did not already know

12-29

Part 5: Code corrections to optimism corrected bootstrapping series

12-29

Deep Learning for Media Content

12-28

Part 4: Why does bias occur in optimism corrected bootstrapping?

12-28

R Packages worth a look

12-28

The business case for federated learning

12-28

The Essence of Machine Learning

12-28

Some fun with {gganimate}

12-27

9 Reasons Excel Users Should Consider Learning Programming

12-27

How AI Will Change Brick-and-Mortar Retail in 2019

12-26

If you did not already know

12-26

Miami University: Assistant Provost for Institutional Research and Effectiveness [Oxford, OH]

12-26

Deep learning in Satellite imagery

12-26

Finally, You Can Plot H2O Decision Trees in R

12-26

R Packages worth a look

12-26

Data Science & ML : A Complete Interview Guide

12-26

Distilled News

12-25

4 Reasons Santa Needs Machine Learning & AI

12-24

Dreaming of a white Christmas – with ggmap in R

12-24

Twas the Night Before Analysis or A Visit from the Chief Data Scientist

12-24

Zak David expresses critical views of some published research in empirical quantitative finance

12-24

Pivot Billions and Deep Learning enhanced trading models achieve 100% net profit

12-24

The Semantic Web: Where is it now?

12-23

R Packages worth a look

12-23

R Packages worth a look

12-22

November 2018: “Top 40” New Packages

12-21

Feature engineering, Explained

12-21

10 More Must-See Free Courses for Machine Learning and Data Science

12-20

The Key to Getting a Data Science Job, According to Briana Brownell

12-20

Examining the Tweeting Patterns of Prominent Crossfit Gyms

12-20

Distilled News

12-20

The importance of Data Analytics skills in today’s MBA roles

12-19

Hackathon Winner Interview: Penn State | Kaggle University Club

12-19

UnitedHealth Group: Director, Data Science [Minnetonka, MN]

12-19

Distilled News

12-19

Distilled News

12-19

Data Science & ML : A Complete Interview Guide

12-19

The Netflix Data War

12-19

R Packages worth a look

12-19

Think Twice Before You Accept That Fancy Data Science Job

12-19

Top Python Libraries in 2018 in Data Science, Deep Learning, Machine Learning

12-19

Data, movies and ggplot2

12-19

Industry Predictions: AI, Machine Learning, Analytics & Data Science Main Developments in 2018 and Key Trends for 2019

12-18

vtreat Variable Importance

12-18

Statistics in Glaucoma: Part III

12-18

Exploring the Data Jungle Free eBook

12-18

vtreat Variable Importance

12-18

Magister Dixit

12-18

Distilled News

12-18

LoyaltyOne: Consultant Category Manager / Analyst, Client Services [Westborough, MA]

12-17

Vanguard: Senior AI Architect [Malvern, PA]

12-17

Top 10 Advantages of a Data Science Certification

12-17

Day 17 – little helper to_na

12-17

The 2019 PAW Business Agenda is Live – Super Early Bird expires this Friday

12-17

Introduction to Pandas, NumPy and RegEx in Python

12-17

Vanguard: Senior AI Engineer [Malvern, PA]

12-17

LoyaltyOne: Associate Director, Client Services [Westborough, MA]

12-17

LoyaltyOne: Associate Director, CPG [Westborough, MA]

12-17

Distilled News

12-17

Six Sigma DMAIC Series in R – Part4

12-15

Top Insights from 50 Chief Data Officers

12-14

Learning R: A gentle introduction to higher-order functions

12-14

CBH Group: Sr Data Engineer [Perth, Australia]

12-14

LoyaltyOne: Consultant Category Manager / Analyst, Client Services [Westborough, MA]

12-14

In case you missed it: November 2018 roundup

12-14

LoyaltyOne: Manager, CPG [Westborough, MA]

12-14

Top Stories of 2018: 9 Must-have skills you need to become a Data Scientist, updated; Python eats away at R: Top Software for Analytics, Data Science, Machine Learning in 2018

12-14

Spark + AI Summit: learn best practices in ML and DL, latest frameworks, and more – special KDnuggets offer

12-14

Distilled News

12-14

Cummins: Data Engineering Technical Specialist [Columbus, IN]

12-13

Cummins: Data Engineering Apps and Solutions Architect [Columbus, IN]

12-13

MINDBODY: Business Intelligence Analyst II [San Luis Obispo, CA]

12-13

What's the future of the pandas library?

12-12

Intuit: Staff Data Scientist [Mountain View, CA]

12-12

Intuit: Staff Experimentation Data Scientist [Mountain View, CA]

12-12

How to deploy a predictive service to Kubernetes with R and the AzureContainers package

12-12

Intuit: Staff Data Scientist – Business Analytics [Mountain View, CA]

12-12

Code for case study – Customer Churn with Keras/TensorFlow and H2O

12-12

Using ggplot2 for functional time series

12-12

Distilled News

12-12

P&G: Data Scientist – Machine Learning/NLP [Cincinnati, OH]

12-11

CBH Group: Data Scientist [Perth, Australia]

12-11

Intuit: Staff Data Scientist [Woodland Hills, CA and Mountain View, CA]

12-11

“Do you have any recommendations for useful priors when datasets are small?”

12-11

CBH Group: Sr Data Scientist [Perth, Australia]

12-11

The Role of Theory in Data Analysis

12-11

A Machine Learning Deep Dive [Webinar, Dec 13]

12-11

Machine Learning (ML) Essentials

12-11

Document worth reading: “Taxonomy of Big Data: A Survey”

12-11

Top November Stories: The Most in Demand Skills for Data Scientists; What is the Best Python IDE for Data Science?

12-11

Intuit: Staff Data Scientist [Mountain View, CA]

12-11

Machine Learning & AI Main Developments in 2018 and Key Trends for 2019

12-11

Learning Machine Learning vs Learning Data Science

12-11

Let Automation Carry You from BI to AI in 2019

12-11

Reflections on the 10th anniversary of the Revolutions blog

12-10

Should you become a data scientist?

12-10

5½ Reasons to Ditch Spreadsheets for Data Science: Code is Poetry

12-10

R Packages worth a look

12-10

Day 10 – little helper %nin%

12-10

Reflections on the 10th anniversary of the Revolutions blog

12-10

Canada Map

12-09

An 8-hour course on R and Data Mining

12-09

An 8-hour course on R and Data Mining

12-09

Distilled News

12-09

Interesting packages taken from R/Pharma

12-09

Timing Grouped Mean Calculation in R

12-08

Timing Grouped Mean Calculation in R

12-08

R Packages worth a look

12-08

Day 07 – little helper count_na

12-07

The Machine Learning Project Checklist

12-07

One-stop-learning-shop for data pros – get exclusive access for less than a cup of coffee

12-06

An Intro to Deep Learning in Python

12-06

Automated Dashboard visualizations with Deviation in R

12-06

If you did not already know

12-06

Distilled News

12-06

Top KDnuggets tweets, Nov 28 – Dec 4: Deep Learning Cheat Sheets; Amazon opens its internal

12-05

ggQC | ggplot Quality Control Charts – New Release

12-05

KDnuggets™ News 18:n46, Dec 5: AI, Data Science, Analytics 2018 Main Developments, 2019 Key Trends; Deep Learning Cheat Sheets

12-05

Automated Dashboard with various correlation visualizations in R

12-05

Kick Start Your Data Career! Tips From the Frontline

12-05

Document worth reading: “A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition”

12-05

Heatmaps of Mortality Rates

12-04

Deep learning in Satellite imagery

12-04

Day 04 – little helper evenstrings

12-04

Data Mining Book – Chapter Download

12-04

Why Primary Research?

12-04

Distilled News

12-03

AI, Data Science, Analytics Main Developments in 2018 and Key Trends for 2019

12-03

The State of Data in Astronomy

12-03

Graph-Powered Machine Learning

12-03

One Recipe Step to Rule Them All

12-03

Statistics in Glaucoma: Part I

12-03

Ronin: Sr Machine Learning and AI Data Scientist [San Mateo, CA]

12-03

Ronin: Data Engineer [San Mateo, CA]

12-03

R Packages worth a look

12-02

Day 02 – little helper na_omitlist

12-02

R plus Magento 2 REST API revisited: part 3 – more complex samples of use

12-02

Interpretability is crucial for trusting AI and machine learning

12-01

NYC buses: C5.0 classification with R; more than 20 minute delay?

12-01

Tribes.ai: Sr Data Scientist [Remote, India / Eastern Europe]

12-01

Whats new on arXiv

12-01

If you did not already know

12-01

WPI: Research Scientist [Worcester, MA]

11-30

Number of births in the twentieth century by @ellis2013nz

11-30

NYC buses: Cubist regression with more predictors

11-30

Distilled News

11-30

University of Tennessee Knoxville: Assistant or Associate Professor in Data Science [Knoxville, TN]

11-30

ML Methods for Prediction and Personalization

11-30

Distilled News

11-29

Document worth reading: “Big Data and Fog Computing”

11-29

October 2018: “Top 40” New Packages

11-29

NYC buses: simple Cubist regression

11-29

Linking Data Science Activities to Business Initiatives Using the Hypothesis Development Canvas

11-29

How to Find Mentors for Data Science?

11-29

Plotting Scottish census data with some tidyverse magic

11-28

Top KDnuggets tweets, Nov 21-27: Intro to

11-28

KDnuggets™ News 18:n45, Nov 28: Your Favorite Python IDE/editor? Intro to Data Science for Managers

11-28

Sales Forecasting Using Facebook’s Prophet

11-28

R Packages worth a look

11-28

Marginal Effects for (mixed effects) regression models

11-28

NYC buses: company level predictors with R

11-28

Lessons from posting a fake map about pies

11-28

Visualization of NYC bus delays with R

11-27

Magister Dixit

11-27

Humana: Principal Data Scientist/Informatics Principal [Chicago, IL, Dallas, TX and Louisville, KY]

11-27

Drexel University: 2 Teaching Faculty Positions in Data Science [Philadelphia, PA]

11-27

Distilled News

11-26

Distilled News

11-26

My secret sauce to be in top 2% of a Kaggle competition

11-26

Talking on “High Performance Python” at Linuxing In London last week

11-26

Global Legal Entity Identifier Foundation (GLEIF): Data Analyst [Frankfurt, Germany]

11-26

You Can’t Do AI Without Augmented Analytics and AutoML

11-26

If you did not already know

11-26

Data Pro Cyber Monday – Choose Your Savings

11-26

OneR – fascinating insights through simple rules

11-25

OneR – fascinating insights through simple rules

11-24

Distilled News

11-24

Distilled News

11-24

EARL conference recap: Seattle 2018

11-24

Document worth reading: “Learning From Positive and Unlabeled Data: A Survey”

11-23

Magister Dixit

11-23

Top 5 domains Big Data analytics helps to transform

11-23

If you did not already know

11-23

Beautiful Chaos: The Double Pendulum

11-22

Cartoon: Thanksgiving, Big Data, and Turkey Data Science.

11-22

If you did not already know

11-22

Dealing with failed projects

11-22

6 Goals Every Wannabe Data Scientist Should Make for 2019

11-22

Machine Learning. In conversation with Jelena Ilic, Senior Data Scientist at Mango Solutions

11-21

An Introduction to AI

11-21

The best way to visit Luxembourguish castles is doing data science + combinatorial optimization

11-21

Machine Learning in Action: Going Beyond Decision Support Data Science

11-20

Address Your Data Science Strategy at DSNY

11-20

Data Shows No Increase In NYC Plowing as Storm Picked Up

11-20

Quantcast: Sr Applied Scientist, Audience Platform [Seattle, WA]

11-20

Introducing Octoparse New Version 7.1 – web scraping for dummies is official

11-20

UnitedHealth Group: Sr Manager, Data Engineering [Minnetonka, MN]

11-19

Change over time is not “treatment response”

11-19

The Big Data Game Board™

11-19

The Distribution of Time Between Recessions: Revisited (with MCHT)

11-19

Insights on the role data can play in your organization

11-19

ML Methods for Prediction and Personalization

11-19

What I Learned About Machine Learning at ODSC West 2018

11-19

Distilled News

11-19

Predictive Analytics in 2018: Salaries & Industry Shifts

11-19

UnitedHealth Group: Sr Manager, Data Science [Telecommute, Central or Eastern Time Zones]

11-19

Cognitive Services in Containers

11-19

Cognitive Services in Containers

11-19

R Packages worth a look

11-18

Document worth reading: “Graphical Models for Processing Missing Data”

11-18

Graphs and tables, tables and graphs

11-18

Statistics Sunday: Reading and Creating a Data Frame with Multiple Text Files

11-18

A more systematic look at suppressed data by @ellis2013nz

11-17

UnitedHealth Group: Clinical Data Statistical Analyst – SQL SAS (Clinician Required) [Telecommute]

11-16

Top 10 Python Data Science Libraries

11-16

UnitedHealth Group: Data Analytics and Reporting Lead [Minnetonka, MN or Telecommute]

11-16

UnitedHealth Group: Senior Principal Data Scientist [Telecommute, Central or Eastern Time Zones]

11-16

Distilled News

11-16

The tidy caret interface in R

11-16

Report from the Enterprise Applications of the R Language conference

11-16

Report from the Enterprise Applications of the R Language conference

11-16

(Webinar) Farmers and Chubb on Humanizing Claims with AI

11-15

Introducing Drexel new online MS in Data Science

11-15

Strategy: Customer Analytics: Are you Profiting from your Data?

11-14

Bright Lights, Bright Future. TDWI Is Back in Vegas

11-14

Federated learning: distributed machine learning with data locality and privacy

11-14

KDnuggets™ News 18:n43, Nov 14: To get hired as a data scientist, don’t follow the herd; LinkedIn Top Voices in Data Science & Analytics

11-14

Distilled News

11-14

Windows Clipboard Access with R

11-14

LinkedIn Top Voices 2018: Data Science & Analytics

11-13

The ultimate guide to starting AI

11-13

The 5 Basic Statistics Concepts Data Scientists Need to Know

11-13

Distilled News

11-12

Preview my new book: Introduction to Reproducible Science in R

11-12

Data Science With R Course Series – Week 9

11-12

Distilled News

11-12

Angela Bassa discusses managing data science teams and much more.

11-12

Distilled News

11-11

If you did not already know

11-10

If you did not already know

11-10

Dr. Data Show Video: What the Hell Does “Data Science” Really Mean?

11-10

The Gamification Of Fitbit: How an API Provided the Next Level of tRaining

11-10

R Packages worth a look

11-10

Top October Stories: 9 Must-have skills you need to become a Data Scientist, updated; 10 Best Mobile Apps for Data Scientist / Data Analysts

11-09

Top 5 Trends in Data Science

11-09

Why would I ever NEED Bayesian Statistics?

11-09

Exploring Models with lime

11-09

Deep Learning Performance Cheat Sheet

11-08

Practical statistics books for software engineers

11-08

Distilled News

11-08

Hilary Mason and Gilad Lotan to Keynote at MADS 2019

11-08

Latest Trends in Computer Vision Technology and Applications

11-07

Working with US Census Data in R

11-07

Working with US Census Data in R

11-07

DePaul University: Professor of Practice position in Data Science [Chicago, IL]

11-07

Distilled News

11-07

If you did not already know

11-07

7 Best Practices for Machine Learning on a Data Lake

11-07

New: Maintained Datasets

11-06

R plus Magento 2 REST API revisited: part 1- authentication and universal search

11-06

Document worth reading: “Toward a System Building Agenda for Data Integration”

11-06

Data Feminism

11-06

Turn data into revenue. Wharton can show you how.

11-06

Data Science in 30 Minutes with Jake Porway of DataKind

11-06

EARL Houston: Interview with Hadley Wickham

11-05

Maps, models, and analytic problem framing

11-05

Distilled News

11-05

The 3Ds of Machine Learning Systems Design

11-05

Linear Regression in Real Life

11-05

Data Science With R Course Series – Week 8

11-05

Distilled News

11-04

New R Cheatsheet: Data Science Workflow with R

11-04

If you did not already know

11-03

Data Mining Book – Chapter Download

11-02

The Most in Demand Skills for Data Scientists

11-02

Learn how machine learning is transforming business

11-02

Learn how machine learning is transforming business, Nov 12 Webinar

11-02

Data Science “Paint by the Numbers” with the Hypothesis Development Canvas

11-02

Document worth reading: “Transfer Metric Learning: Algorithms, Applications and Outlooks”

11-02

The blocks and rows theory of data shaping

11-02

My two talks in Austria next week, on two of your favorite topics!

11-02

How Data Science Is Improving Higher Education

11-01

Multi-Class Text Classification Model Comparison and Selection

11-01

Multithreaded in the Wild

11-01

Communicating results with R Markdown

11-01

Facial feedback: “These findings suggest that minute differences in the experimental protocol might lead to theoretically meaningful changes in the outcomes.”

11-01

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: November and Beyond

11-01

Top KDnuggets tweets, Oct 24-30: Building a Question-Answering System from Scratch

10-31

If you did not already know

10-31

How to Start Learning R for Data Science

10-31

Use Pseudo-Aggregators to Add Safety Checks to Your Data-Wrangling Workflow

10-30

Moody’s Analytics: Machine Learning / NLP – Research Scientist / Engineer [New York, NY]

10-30

Our Favorite Spooky AI & Data Articles

10-30

Data + Art STEAM Project: Final Results

10-30

R Packages worth a look

10-30

How to create useful features for Machine Learning

10-30

Are petrol prices in Australia fair?

10-30

Using deep learning on AWS to lower property damage losses from natural disasters

10-30

Use Pseudo-Aggregators to Add Safety Checks to Your Data-Wrangling Workflow

10-30

Lehigh University: Tenure Track Positions in Foundations of Data Science [Bethlehem, PA]

10-30

Document worth reading: “Resource Management in Fog/Edge Computing: A Survey”

10-30

Federated Learning: Machine Learning with Privacy on the Edge

10-29

Bank of Canada: Data Scientist [Ottawa, Canada]

10-29

Key Takeaways from AI Conference SF, Day 1: Domain Specific Architectures, Emerging China, AI Risks

10-29

The Future of Management: Human Resource Analytics

10-29

Open Source Deep Dive with Olivier Grisel

10-29

Data Science With R Course Series – Week 7

10-29

Top Obstacles to Overcome when Implementing Predictive Maintenance

10-29

How to be an Artificial Intelligence (AI) Expert?

10-29

American Association of Colleges of Osteopathic Medicine: Data Analyst [Bethesda, Maryland]

10-29

Distilled News

10-28

Distilled News

10-28

Data Science Interview Questions with Answers

10-28

Can we do better than using averaged measurements?

10-26

RConsortium — Building an R Certification

10-26

The Final Data Science Roadshow is Just the Beginning

10-26

CRAN’s New Missing Data Task View

10-26

Spotlight on Julia Silge, Keynote Speaker EARL Seattle 7th November

10-26

Are you buying an apartment? How to hack competition in the real estate market

10-26

Implementing Automated Machine Learning Systems with Open Source Tools

10-25

How to be an Artificial Intelligence (AI) Expert?

10-25

Distilled News

10-25

11 Design Tips for Data Visualization

10-25

If you did not already know

10-25

Drawing beautiful maps programmatically with R, sf and ggplot2 — Part 1: Basics

10-25

U. of Zurich: Professorship in Big Data Science (Open Rank) [Zurich, Switzerland]

10-24

Join us at the EARL US Roadshow – a conference dedicated to the real-world usage of R

10-24

When the numbers don't tell the whole story

10-24

R Packages worth a look

10-24

Google, Microsoft & Fraunhofer at the First European Edition of Deep Learning World – 12 Nov, 2018

10-23

Introducing gratia

10-23

Whats new on arXiv

10-23

Cross-over study design with a major constraint

10-23

Amazon SageMaker Batch Transform now supports Amazon VPC and AWS KMS-based encryption

10-22

Don’t miss Big Data LDN 2018

10-22

Distilled News

10-22

University of Rhode Island: Assistant Professor of Data Science [Kingston, RI]

10-22

Data Science With R Course Series – Week 6

10-22

Residential Property Investment Visualization and Analysis Shiny App

10-22

Distilled News

10-21

Dr. Data Show Video: How Can You Trust AI?

10-20

A Thorough Introduction to Boltzmann Machines

10-20

McKinsey Datathon: The City Cup17 November, Amsterdam, Stockholm and Zurich. Apply Now

10-19

Loops and Pizzas

10-19

Will Models Rule the World? Data Science Salon Miami, Nov 6-7

10-19

R Packages worth a look

10-18

Graphs Are The Next Frontier In Data Science

10-18

BI to AI: Getting Intelligent Insights to Everyone

10-18

Blockchain applications in the Federal Government sector

10-17

Building a data warehouse

10-17

Citizen Data Scientists | Why Not DIY AI?

10-17

Use R with Excel: Importing and Exporting Data

10-17

If you did not already know

10-17

Why you need GPUs for your deep learning platform

10-16

Self-Service Analytics or Operationalization: Which Should I Implement?

10-16

Applied Data Science: Solving a Predictive Maintenance Business Problem Part 3

10-16

R Packages worth a look

10-15

Distilled News

10-15

Choose Your Own Adventure – Analytics On-boarding

10-15

Piping into ggplot2

10-13

Prophets of gloom: Using NLP to analyze Radiohead lyrics

10-13

Piping into ggplot2

10-13

Review: Excel TV’s Data Science with Power BI and R

10-12

Limitations of “Limitations of Bayesian Leave-One-Out Cross-Validation for Model Selection”

10-12

Distilled News

10-12

Stan on the web! (thanks to RStudio)

10-12

Top Blockchain Applications Making Waves in Commercial Real Estate

10-12

Machine learning — Is the emperor wearing clothes?

10-12

We Sized Washington’s Edible Marijuana Market Using AI

10-12

Business Analysis (BA) Career Path

10-11

The One reason you should learn Python

10-11

Distilled News

10-11

Top KDnuggets tweets, Oct 3–9: 5 Reasons Logistic Regression should be the first thing you learn when becoming a Data Scientist

10-10

TDWI In-Person and Virtual Data and Analytics Training

10-10

a4 Media: Manager, Machine Learning Data Engineer [Long Island City, NY]

10-10

Announcing Ursa Labs's partnership with NVIDIA

10-10

Life in Madrid seen through BiciMAD

10-10

10 Best Mobile Apps for Data Scientist / Data Analysts

10-10

Data Mining Book: Chapter Download.

10-10

Shopper Sentiment: Analyzing in-store customer experience

10-09

TEXATA Data Analytics Summit 2018 – Exclusive 30% KDnuggets Discount.

10-09

Distilled News

10-09

Leading the Charge 🔌 🚘: 10 Charts on Electric Vehicles in Plotly

10-09

Distilled News

10-08

Running the Same Task in Python and R

10-08

The economic consequences of MOOCs

10-08

BIG, small or Right Data: Which is the proper focus?

10-08

Job: Postdoctoral Researcher in Small Data Deep Learning and Explainable Machine Learning, Livermore, CA

10-08

Distilled News

10-06

Here is How You Can build a data science team from scratch in 2018 - The Definitive Guide

10-05

Online Master’s in Applied Data Science From Syracuse

10-05

Multithreaded in the Wild

10-05

Distilled News

10-05

Why do I Call Myself a Data Scientist?

10-05

Colorado State University: Assistant Professor in Industrial and Organizational (IO) Psychology [Fort Collins, CO]

10-05

“Ivy League Football Saw Large Reduction in Concussions After New Kickoff Rules”

10-05

Big Data Day Camp: Big Data Tools & Techniques (October 25-26)

10-04

Linear Regression in the Wild

10-03

Data Science at Northwestern

10-03

Sequence Modeling with Neural Networks – Part I

10-03

Cool postdoc position in Arizona on forestry forecasting using tree ring models!

10-02

The Enterprise AI Lab: Not Your Average AI Lab

10-02

TEXATA Data Analytics Summit 2018 – Exclusive 30% KDnuggets Discount

10-01

Modeling muti-category Outcomes With vtreat

10-01

Chromebook Data Science - a free online data science program for anyone with a web browser.

10-01

A Right to Reasonable Inferences

10-01

Up your open source game with Hacktoberfest at Locke Data!

10-01

Reinforcement Learning: Super Mario, AlphaGo and beyond

10-01

Dr. Data Show Video: Why Machine Learning Is the Coolest Science

10-01

Document worth reading: “Generative Adversarial Nets for Information Retrieval: Fundamentals and Advances”

10-01

Robust Quality – Powerful Integration of Data Science and Process Engineering

10-01

Distilled News

09-30

R Packages worth a look

09-29

Functions and Packages

09-29

If you did not already know

09-27

(People are missing the point on Wansink, so) what’s the lesson we should be drawing from this story?

09-27

Your Guide to AI and Machine Learning at re:Invent 2018

09-27

Document worth reading: “Data Science vs. Statistics: Two Cultures”

09-27

Advantages of Online Data Science Courses

09-26

3-D shadow maps in R: the rayshader package

09-26

Distilled News

09-26

Morph, an open-source tool for data-driven art without code

09-26

If you did not already know

09-25

One Drink Per Day, Your Chances of Developing an Alcohol-Related Condition

09-25

Distilled News

09-25

Distilled News

09-24

R Packages worth a look

09-24

Dataquest helped me get my dream job at Noodle.ai

09-24

Distilled News

09-23

R Packages worth a look

09-22

Distilled News

09-22

A psychology researcher uses Stan, multiverse, and open data exploration to explore human memory

09-21

Using a Column as a Column Index

09-21

How Pol Brigneti got a Data Analyst job using Dataquest at Belgrave Valley

09-21

AI-Based Virtual Tutors – The Future of Education?

09-21

Distilled News

09-20

PyConUK 2018

09-19

Distilled News

09-19

Three Mighty Good Reasons to Learn R for Data Science

09-19

Variety is the Secret Sauce for Big Discoveries in Big Data

09-18

Cuisine Ingredients

09-18

Cuisine Ingredients

09-18

Deep learning made easier with transfer learning

09-17

Distilled News

09-17

Distilled News

09-14

R Packages worth a look

09-13

Distilled News

09-12

Data Science Glossary

09-12

Document worth reading: “Analytics for the Internet of Things: A Survey”

09-12

Distilled News

09-11

R Packages worth a look

09-10

Distilled News

09-08

R Tip: Give data.table a Try

09-08

Being at the Center

09-07

Multithreaded in the Wild

09-07

Document worth reading: “Putting Data Science In Production”

09-07

Magister Dixit

09-07

Who wrote that anonymous NYT op-ed? Text similarity analyses with R

09-07

How Foundations Student Russell Martin got into The Data Incubator’s Fellowship

09-06

R Packages worth a look

09-06

In case you missed it: August 2018 roundup

09-06

Distilled News

09-06

R Packages worth a look

09-05

Distilled News

09-04

Book review: SQL Server 2017 Machine Learning Services with R

09-04

“We continuously increased the number of animals until statistical significance was reached to support our conclusions” . . . I think this is not so bad, actually!

09-04

The Data Science Roadshow is ON!

09-03

R Packages worth a look

09-03

Magister Dixit

09-02

If you did not already know

09-01

R Packages worth a look

09-01

Distilled News

08-31

Distilled News

08-31

Document worth reading: “PMLB: A Large Benchmark Suite for Machine Learning Evaluation and Comparison”

08-31

R Packages worth a look

08-30

R Tip: Put Your Values in Columns

08-30

Document worth reading: “Idealised Bayesian Neural Networks Cannot Have Adversarial Examples: Theoretical and Empirical Study”

08-29

Document worth reading: “A Comparative Study on using Principle Component Analysis with Different Text Classifiers”

08-29

Distilled News

08-28

Videos from NYC R Conference

08-28

“To get started, I suggest coming up with a simple but reasonable model for missingness, then simulate fake complete data followed by a fake missingness pattern, and check that you can recover your missing-data model and your complete data model in that fake-data situation. You can then proceed from there. But if you can’t even do it with fake data, you’re sunk.”

08-27

World map shows aerosol billowing in the wind

08-24

Constructing a Data Analysis

08-24

If you did not already know

08-23

Video: Azure Machine Learning in plain English

08-23

Distilled News

08-23

Getting Started with Competitions - A Peer to Peer Guide

08-22

Distilled News

08-22

R Packages worth a look

08-22

Distilled News

08-21

The scandal isn’t what’s retracted, the scandal is what’s not retracted.

08-21

Managing your expenses with Amazon Lex

08-21

Distilled News

08-21

The scandal isn’t what’s retracted, the scandal is what’s not retracted.

08-21

What is Data Science?

08-20

If you did not already know

08-19

More Practical Data Science with R Book News

08-19

AI, Machine Learning and Data Science Roundup: August 2018

08-17

Distilled News

08-17

Distilled News

08-16

Document worth reading: “Sequences, yet Functions: The Dual Nature of Data-Stream Processing”

08-16

R Packages worth a look

08-16

The Law and Order of Data Science

08-15

Announcing Practical Data Science with R, 2nd Edition

08-15

If you did not already know

08-15

data.table is Really Good at Sorting

08-14

Distilled News

08-14

R Packages worth a look

08-12

Shared items

08-11

Distilled News

08-11

Distilled News

08-10

Magister Dixit

08-09

DIFFERENCE BETWEEN DATA SCIENCE, DATA ANALYTICS AND MACHINE LEARNING

08-09

The Trillion Dollar Question

08-09

Distilled News

08-08

If you did not already know

08-08

The Hidden Costs of Data Silos

08-07

Distilled News

08-07

R Packages worth a look

08-07

“The most important aspect of a statistical analysis is not what you do with the data, it’s what data you use” (survey adjustment edition)

08-07

Magister Dixit

08-04

Distributed Deep Learning on AZTK and HDInsight Spark Clusters

08-02

Distilled News

08-02

Download 3 million Russian troll tweets

08-02

If you did not already know

08-02

Magister Dixit

07-31

Facilitate Proactive Cybersecurity Operations with Big Data Analytics and Machine Intelligence

07-30

Four Ways to Harness Big Data in the Energy Sector

07-30

Machine Learning Making Big Moves in Marketing

07-30

Keynote at EuroPython 2018 on “Citizen Science”

07-27

How to think about an accelerating string of research successes?

07-26

Top 20 Python AI and Machine Learning Open Source Projects

07-23

Year 3 of Data, Beer, & Inspiration

07-23

Python数据分析之pandas

07-18

Scanning Office 365 documents

07-16

CRN: The 10 Coolest Machine-Learning And AI Startups Of 2018 (So Far)

07-16

Data Science in 30 Minutes: Using Data Science to Predict the Future with Kirk Borne

07-11

John Mount speaking on rquery and rqdatatable

07-11

Top-Down vs. Bottom-Up Approaches to Data Science

07-10

Joint inference or modular inference? Pierre Jacob, Lawrence Murray, Chris Holmes, Christian Robert discuss conditions on the strength and weaknesses of these choices

07-09

Divisibility in statistics: Where is it needed?

07-08

How to update your scikit-learn code for 2018

07-04

Basic Statistics in Python: Descriptive Statistics

07-03

Here’s How to Survive the Rise of A.I. – Become a Data Facilitator

07-03

seplyr 0.5.8 Now Available on CRAN

07-02

Announcement – The Data Incubator Partnership with MRI Network

06-28

Can Lessons from Data Science Help Journalism?

06-27

Supercharging Classification - The Value of Multi-task Learning

06-26

What Is Machine Learning and How Is It Making Our World a Better Place?

06-23

Using Clustering Algorithms to Analyze Golf Shots from the U.S. Open

06-19

U.S. Open Data — Gathering and Understanding the Data from 2018 Shinnecock

06-15

Data Science in 30 Minutes: Holden Karau – A Quick Introduction to PySpark

06-13

Engineering a New Career in Data Science: Alumni Spotlight on Abhishek Mishra

06-06

Bulk Loading Shapefiles Into Postgres/Postgis

06-01

Python and Tidyverse

06-01

Data Science in 30 Minutes: Deep Learning to Detect Fake News with Uber ATG Head of Data Science, Mike Tamir

05-30

“Creating correct and capable classifiers” at PyDataAmsterdam 2018

05-26

ML beyond Curve Fitting: An Intro to Causal Inference and do-Calculus

05-24

SQLite vs Pandas: Performance Benchmarks

05-23

Enterprise Deployment Tips for Azure Data Science Virtual Machine (DSVM)

05-21

Life-cycle of a Data Science Project

05-18

Using Linear Regression for Predictive Modeling in R

05-16

Learn D3.js in 5 minutes

05-16

Multithreaded in the Wild

05-07

Ffa1ea00fdab31b3b44b87839c503629

05-06

Cambridge Analytica, Facebook, and user data – Monthly Media Review with the AYLIEN News API, April

05-03

A particles-arly fun book draw

05-02

Technology and Information: Data Science and UX

05-01

How many CRAN package maintainers have been pwned?

04-18

Multithreaded in the Wild

04-09

Automated machine learning is coming... and it won't matter

04-04

Learn to R blog series - R and RStudio

03-29

Introducing Python for data scientists - Pt2

03-23

Crossing Your Data Science Chasm

03-22

Time Series for scikit-learn People (Part II): Autoregressive Forecasting Pipelines

03-22

Introducing Python for data scientists - Pt1

03-15

Using RSiteCatalyst With Microsoft PowerBI Desktop

03-13

Understanding rolling calculations in R

03-07

How Americans make a living based on their age

03-06

Multithreaded in the Wild

03-02

Production Recommendation Systems with Cloudera

02-20

Fast Company's 2018 World's Most Innovative Companies List

02-20

It’s okay to not be a data scientist

02-20

PyData Conference & AHL Hackathon

02-16

How to maraaverickfy a blog post without even reading it

02-12

Getting Started With MapD, Part 1: Docker Install and Loading Data

02-01

9 new pandas updates that will save you time

01-25

Machine Learning Trick of the Day (7): Density Ratio Trick

01-14

Linked Lists

12-28

Weekly Review: 12/23/2017

12-23

Data professional definitions: Data analyst vs data scientist vs data engineer

12-14

Weekly Review: 11/18/2017

11-18

Recommender System With Implicit Feedback

11-18

8 Important Python Interview Questions and Answers

11-17

PyDataBudapest and “Machine Learning Libraries You’d Wish You’d Known About”

11-15

Data Science in Healthcare

11-14

PyConUK 2017, PyDataCardiff and “Machine Learning Libraries You’d Wish You’d Known About”

11-05

New in Cloudera Data Science Workbench 1.2: Usage Monitoring for Administrators

10-31

Recommender System

10-30

JUnit,Integration,End to End Tests

10-22

Weekly Review: 10/21/2017

10-21

Data Science for Managers and Directors (DS4MAD)

10-10

Talk like a pirate day 2017

09-19

Customizing Docker Images in Cloudera Data Science Workbench

09-14

Joining ASAPP

09-09

Making Smart Phones Dumb Again

09-07

What Killed the Curse of Dimensionality?

09-06

Inferring data loss (and correcting for it) from fundamental relationships

09-01

A.I. 'Bias' Doesn't Mean What Journalists Say It Means

08-30

My 10-step path to becoming a remote data scientist with Automattic

07-29

Prophecy Fulfilled: Keras and Cloudera Data Science Workbench

07-25

Kaggle’s Mercedes-Benz Greener Manufacturing

07-01

Random Effects Neural Networks in Edward and Keras

06-15

Machine Learning Fraud Detection: A Simple Machine Learning Approach

06-15

JMP Publishes Exercises to Accompany Data Mining Techniques (3rd Edition)

05-31

Python Deep Learning tutorial: Elman RNN implementation in Tensorflow

05-17

Getting Started with Cloudera Data Science Workbench

05-08

The Benefits of Migrating HPC Workloads To Apache Spark

05-04

Announcement

04-27

Deep Learning Frameworks on CDH and Cloudera Data Science Workbench

04-20

Retrospective on leaving academia for industry data science

04-09

Random-Walk Bayesian Deep Networks: Dealing with Non-Stationary Data

03-14

Does the Muslim ban make us safer?

03-10

Principle Component Analysis in Regression

03-08

Self-Service Adobe Analytics Data Feeds!

03-03

Introduction to Random forest

02-28

Writing Effective Amazon Machine Learning

02-19

Accelerating Apache Spark MLlib with Intel® Math Kernel Library (Intel® MKL)

02-08

Analyzing US flight data on Amazon S3 with sparklyr and Apache Spark 2.0

02-06

Machine Learning Madden NFL: The best player position switches for Madden 17

01-20

Hello, world!

01-16

Data Readiness Levels: Turning Data from Palid to Vivid

01-12

Our R package roundup

12-31

Generating World Flags with Sparse Auto-Encoders

12-14

Post NIPS Reflections

12-13

Don't Panic: Deep Learning will be Mostly Harmless

11-29

Respecting Boundaries with Inhomogeneous Kernels

11-29

Docker and Kaggle with Ernie and Bert

11-22

Data-Informed vs Data-Driven

11-20

Becoming a Data Scientist Podcast Special Episode

11-14

Two papers released on arXiv, "Operator Variational Inference" and "Model Criticism for Bayesian Causal Inference"

10-30

How to Use t-SNE Effectively

10-13

Introducing sparklyr, an R Interface for Apache Spark

09-30

Unevenly Spaced Data

09-26

Smart Cities at the Nexus of Emerging Data Technologies and You

09-18

k-Nearest Neighbors & Anomaly Detection Tutorial

09-14

Outside a train rumbles by

09-09

Republican-leaning states tend to have more traffic deaths

09-04

Python 2.7 still reigns supreme in pip installs

09-03

“Becoming a Data Scientist” Survey Results 1: Jobs & Education

08-22

Evolution of active categorical image classification via saccadic eye movement

08-13

Podcast Episodes 0 to 3

08-13

How to score 0.8134 in Titanic Kaggle Challenge

08-10

Boosting (in Machine Learning) as a Metaphor for Diverse Teams

08-07

Is Data Scientist a useless job title?

08-04

Re-work Interview Questions

07-26

Learning in Brains and Machines (4): Episodic and Interactive Memory

07-24

Bulk Downloading Adobe Analytics Data

07-21

I'm all about ML, but let's talk about OR

07-20

Instagram’s Blind Spot

07-19

Becoming a Data Scientist Podcast Episode 13: Debbie Berebichez

07-15

12 Ways To Cultivate A Data-Savvy Workforce

07-15

Build your own offshore company

07-06

Bayesian Deep Learning Part II: Bridging PyMC3 and Lasagne to build a Hierarchical Neural Network

07-05

Data Science Challenges

07-01

Big Data Technology Trends in Banking

06-24

Becoming a Data Scientist Podcast Episode 12: Data Science Learning Club Members

06-15

Bayesian Deep Learning

06-01

Becoming a Data Scientist Podcast Episode 11: Stephanie Rivera

05-31

Data Trusts

05-29

German Temperature Data

05-12

Becoming a Data Scientist Podcast Episode 10: Trey Causey

05-01

Data Analysis, NHS and Industrial Partners

04-28

First 3rd party notebook for Databricks Community Edition

04-14

Becoming a Data Scientist Podcast Episode 09: Justin Kiggins

04-12

Step by step Kaggle competition tutorial

04-10

Genome Analysis Toolkit: Now Using Apache Spark for Data Processing

04-06

Sheffield University Life

04-05

Becoming a Data Scientist Podcast Episode 08: Sebastian Raschka

03-29

Top content from two years of Data School

03-24

Sense is now part of Cloudera!

03-22

Large Data with Scikit-learn - Boston Meetup

03-16

Quora Q&A Session Answers

03-09

Analyzing Customer Churn – Competing Risks

03-08

Second Annual Data Science Bowl – Part 2

03-07

Deep Learning, Pachinko, and James Watt: Efficiency is the Driver of Uncertainty

03-04

Sheffield Advertises Posts in Machine Learning

03-01

Discovering and understanding patterns in highly dimensional data

02-28

Data Science Learning Club Update

02-21

Becoming a Data Scientist Podcast Episode 05: Clare Corthell

02-15

Developing effective data scientists

02-11

Why Today’s Big Data is Not Yesterday’s Big Data — Exponential and Combinatorial Growth

01-26

Becoming a Data Scientist Podcast Episode 02: Safia Abdalla

01-04

Our R package roundup

12-30

Podcast Available on Stitcher

12-21

Becoming A Data Scientist Podcast Episode 0: Me!

12-14

OpenAI won't benefit humanity without data-sharing

12-14

Some Observations on Winsorization and Trimming

12-03

A Challenge to Data Scientists

11-22

It's not an Internet of Things, It's an Internet of People

11-17

The Information Barons Threaten our Autonomy and Our Privacy

11-16

Association rule analysis beyond transaction data

11-11

MCMC sampling for dummies

11-10

Books for Data Science Beginners, and Data Sources

10-26

Prototyping Long Term Time Series Storage with Kafka and Parquet

10-25

Data Science Tutorials Flipboard Magazine

10-21

Recurrent Neural Networks

10-20

Analyzing Pronto CycleShare Data with Python and Pandas

10-18

Denoising Dirty Documents: Part 9

10-15

Yet Another PhD to Data Science Post (Part III)

10-06

Rebuilding Map Example With Apply Functions

09-30

Yet Another PhD to Data Science Post (Part II)

09-29

Yet Another PhD to Data Science Post (Part I)

09-23

R Tip: Use seqi() For Indexes

01-11

R Tip: Use seqi() For Indexes

01-11

MS in Applied Data Science Online – which track is right for you?

01-10

Top 5 Data Science Courses in 2019

01-09

Do something for yourself in 2019

01-08

Miami University: Director of the Center for Analytics & Data Science (CADS) [Oxford, OH]

12-20

Kent State University: Assistant/Associate Professor – Business Analytics/Information Systems [Kent, OH]

12-19

Yeshiva University: Data Science Program Director [New York, NY]

11-30

8 Reasons to Take Data Analytics Certification Courses

11-28

3 Challenges for Companies Tackling Data Science

11-26

UnitedHealth Group: Senior Principal Data Scientist [Telecommute, Central or Eastern Time Zones]

11-16

Carlos: ‘Everything Dataquest showed me, I use in my new job’

11-08

EARL Houston: Interview with Hadley Wickham

11-05

The role of academia in data science education

11-01

Explainable ML versus Interpretable ML

10-30

Distilled News

10-22

The economic consequences of MOOCs

10-08

If you did not already know

09-22

The Best Programming Languages for Data Science and Machine Learning in 2018

09-20

If you did not already know

08-15

Highlights from the useR! 2018 conference in Brisbane

07-18

An Updated Review of The Data Incubator Data Science Bootcamp

05-29

What's New in Dataquest v1.85: Takeaways, Intermediate R, and More

05-25

Weekly Review: 12/10/2017

12-10

Python Tutorial: Learn Python in one Day

11-28

Review of The Data Incubator data science bootcamp

05-29

Kinesis Savant Elite 2 Foot pedals

06-14

Yet Another PhD to Data Science Post (Part I)

09-23

If you did not already know

12-14

Whats new on arXiv

12-10

ML Methods for Prediction and Personalization

11-30

ML Methods for Prediction and Personalization

11-19

An Overview of Recommendation Systems

05-23

A Gentle Introduction to Recommender Systems with Implicit Feedback

05-30

Yet Another PhD to Data Science Post (Part I)

09-23

Document worth reading: “The importance of being dissimilar in Recommendation”

12-30

If you did not already know

12-14

Whats new on arXiv

12-10

ML Methods for Prediction and Personalization

11-30

ML Methods for Prediction and Personalization

11-19

Challenges & Solutions for Production Recommendation Systems

10-05

If you did not already know

09-30

Weighing the risk of moderate alcohol consumption

08-24

Fake News and Filter Bubbles

08-21

An Overview of Recommendation Systems

05-23

Production Recommendation Systems with Cloudera

02-20

Python Data Science jobs list into 2018

12-31

A Gentle Introduction to Recommender Systems with Implicit Feedback

05-30

Yet Another PhD to Data Science Post (Part I)

09-23

Gravity Variations

09-27

Gravity Variations

09-27

Co-integration and Mean Reverting Portfolio

01-06

Structural Analisys of Bayesian VARs with an example using the Brazilian Development Bank

01-05

A Guide to Decision Trees for Machine Learning and Data Science

12-24

Six Steps to Master Machine Learning with Data Preparation

12-21

Feature engineering, Explained

12-21

Day 17 – little helper to_na

12-17

Reusable Pipelines in R

12-13

Reusable Pipelines in R

12-13

von Neumann Poker Analysis

12-13

Four Techniques for Outlier Detection

12-06

Marginal Effects for (mixed effects) regression models

11-28

Using a genetic algorithm for the hyperparameter optimization of a SARIMA model

11-16

The 5 Basic Statistics Concepts Data Scientists Need to Know

11-13

More on sigr

11-06

Linear Regression in Real Life

11-05

Building a neighbour matrix with python

11-04

Simple Feed Ranking Algorithm

10-28

Data Science Interview Questions with Answers

10-28

Maps with pie charts on top of each administrative division: an example with Luxembourg’s elections data

10-27

Notes on Feature Preprocessing: The What, the Why, and the How

10-26

Beginner Data Visualization & Exploration Using Pandas

10-22

A Lazy Function

10-20

Implement Simple Convolution with Java

09-27

If you did not already know

09-21

What is P-value?

09-20

How I got in the top 1 % on Kaggle.

08-28

If you did not already know

08-27

What is a Box Plot?

08-24

Build a model to predict the impact of weather on urban air quality using Amazon SageMaker

08-16

Building a Linear Regression Model for Real World Problems, in R

08-14

What is a p-value

08-09

Basic Statistics in Python: Descriptive Statistics

07-03

seplyr 0.5.8 Now Available on CRAN

07-02

R Tip: Be Wary of “…”

06-15

R Tip: use isTRUE()

06-11

“If you deprive the robot of your intuition about cause and effect, you’re never going to communicate meaningfully.” – Pearl ’18

06-08

Image Compression using K-means Clustering.

05-28

How digital cameras work

05-25

Generating Climate Temperature Spirals in Python

05-21

Understanding rolling calculations in R

03-07

TSrepr - Time Series Representations in R

01-26

Java Handwritten Digit Recognition with Convolutional Neural Networks

12-13

Weekly Review: 12/03/2017

12-03

How to Build Your Own Blockchain Part 4.1 — Bitcoin Proof of Work Difficulty Explained

11-13

Recommender System

10-30

Using regression trees for forecasting double-seasonal time series with trend in R

08-22

Why Machine Learning Is A Metaphor For Life

08-16

Minimizing the Negative Log-Likelihood, in English

05-18

Why hierarchical models are awesome, tricky, and Bayesian

02-08

Doing magic and analyzing seasonal time series with GAM (Generalized Additive Model) in R

01-24

Customer lifetime value and the proliferation of misinformation on the internet

01-08

Deep Learning Research Review Week 2: Reinforcement Learning

11-16

Enernoc smart meter data - forecast electricity consumption with similar day approach in R

11-12

Turning Distances into Distributions

09-19

Keras plays catch, a single file Reinforcement Learning example

03-17

Continuous Bayes’ Theorem

01-20

Explicit Matrix Factorization: ALS, SGD, and All That Jazz

01-09

Agnez, analytics for deep learning research

12-24

A New Library for Analyzing Time-Series Data with Apache Spark

12-14

Some Observations on Winsorization and Trimming

12-03

Q-learning with Neural Networks

10-30

Bayes Primer

10-17

Clustering debates from UK politicians

10-16

Generating Fibonacci Numbers

10-16

Gravity Variations

09-27

Gravity Variations

09-27

Multilevel models for multiple comparisons! Varying treatment effects!

11-28

R Packages worth a look

11-24

R Packages worth a look

10-23

Document worth reading: “The Risk of Machine Learning”

10-11

Don’t get fooled by observational correlations

09-16

Discussion of effects of growth mindset: Let’s not demand unrealistic effect sizes.

09-13

When anyone claims 80% power, I’m skeptical.

08-24

R Packages worth a look

08-24

Distilled News

08-23

No, I don’t think it’s the file drawer effect

08-16

No, I don’t think it’s the file drawer effect

08-16

Let’s be open about the evidence for the benefits of open science

08-06

R Packages worth a look

08-01

“The idea of replication is central not just to scientific practice but also to formal statistics . . . Frequentist statistics relies on the reference set of repeated experiments, and Bayesian statistics relies on the prior distribution which represents the population of effects.”

07-19

Gravity Variations

09-27

AzureR packages now on CRAN

01-08

running plot [and simulated annealing]

12-14

RcppGetconf 0.0.3

11-17

UI Update — Datazar

11-07

Speeding up TRPO through parallelization and parameter adaptation

12-09

A intuitive explanation of natural gradient descent

08-07

First Order Optimization Methods

07-02

Q-learning with Neural Networks

10-30

Hogwild Stochastic Gradient Descent

10-27

Beginner Tutorial: Neural Nets in Theano

09-29

Papers with Code: A Fantastic GitHub Resource for Machine Learning

12-31

RcppGetconf 0.0.3

11-17

UI Update — Datazar

11-07

Production Recommendation Systems with Cloudera

02-20

Useful External Resources

04-27

Beginner Tutorial: Neural Nets in Theano

09-29

Making sense of the METS and ALTO XML standards

01-13

So you want to play a pRank in R…?

12-18

Document worth reading: “Can Machines Design An Artificial General Intelligence Approach”

12-10

An Utility Function For Monotonic Binning

12-03

A tutorial on tidy cross-validation with R

11-25

Improving Binning by Bootstrap Bumping

11-25

R Packages worth a look

11-23

Creating List with Iterator

11-23

Growing List vs Growing Queue

11-18

Convert Data Frame to Dictionary List in R

11-17

4 ways to be more efficient using RStudio’s Code Snippets, with 11 ready to use examples

11-10

The Gamification Of Fitbit: How an API Provided the Next Level of tRaining

11-10

Introduction to Deep Learning with Keras

10-29

How to Define a Machine Learning Problem Like a Detective

10-22

Getting the data from the Luxembourguish elections out of Excel

10-21

On Tensor Networks and the Nature of Non-Linearity

06-20

Python Tutorial: Learn Python in one Day

11-28

Approximating Implicit Matrix Factorization with Shallow Neural Networks

04-07

Integrating D3.js into R Shiny

03-13

Second Annual Data Science Bowl – Part 1

03-06

Continuous Bayes’ Theorem

01-20

Beginner Tutorial: Neural Nets in Theano

09-29

Import AI 127: Why language AI advancements may make Google more competitive; COCO image captioning systems don’t live up to the hype, and Amazon sees 3X growth in voice shopping via Alexa

12-31

How to use Keras fit and fit_generator (a hands-on tutorial)

12-24

Amazon SageMaker adds Scikit-Learn support

12-20

Easily train models using datasets labeled by Amazon SageMaker Ground Truth

12-20

Keras – Save and Load Your Deep Learning Models

12-10

How Different are Conventional Programming and Machine Learning?

12-10

Deep Learning and Medical Image Analysis with Keras

12-03

Semantic Segmentation algorithm is now available in Amazon SageMaker

11-28

Filter Clickbait from News Content with our custom Natural Language Processing Model

11-28

Introduction to Amazon SageMaker Object2Vec

11-08

Now use Pipe mode with CSV datasets for faster training on Amazon SageMaker built-in algorithms

11-01

Naive Bayes from Scratch using Python only – No Fancy Frameworks

10-25

If you did not already know

10-14

Amazon SageMaker Neural Topic Model now supports auxiliary vocabulary channel, new topic evaluation metrics, and training subsampling

10-04

How to Create a Simple Neural Network in Python

10-02

Segmenting brain tissue using Apache MXNet with Amazon SageMaker and AWS Greengrass ML Inference – Part 1

09-26

Classifying high-resolution chest x-ray medical images with Amazon SageMaker

09-13

Distilled News

09-11

New speed record set for training deep learning models on AWS

08-22

Import AI: 108: Learning language with fake sentences, Chinese researchers use RL to train prototype warehouse robots; and what the implications are of scaled-up Neural Architecture Search

08-20

Distributed Deep Learning on AZTK and HDInsight Spark Clusters

08-02

Transfer learning for custom labels using a TensorFlow container and “bring your own algorithm” in Amazon SageMaker

07-27

AWS Deep Learning AMIs now with optimized TensorFlow 1.9 and Apache MXNet 1.2 with Keras 2 support to accelerate deep learning on Amazon EC2 instances

07-23

Scalable multi-node deep learning training using GPUs in the AWS Cloud

07-20

Classify your own images using Amazon SageMaker

07-20

Using Siamese Networks and Pre-Trained Convolutional Neural Networks (CNNs) for Fashion Similarity Matching

07-10

Deep Learning Vendor Update: Hyperparameter Tuning Systems

06-29

How to Do Distributed Deep Learning for Object Detection Using Horovod on Azure

06-20

World Models Experiments

06-09

Synthetic Gradients with Tensorflow

04-08

Semantic trees for training word embeddings with hierarchical softmax

09-07

Deep Learning with Intel’s BigDL and Apache Spark

09-06

How I Used Deep Learning To Train A Chatbot To Talk Like Me (Sorta)

07-25

Prophecy Fulfilled: Keras and Cloudera Data Science Workbench

07-25

Deep learning on Apache Spark and Apache Hadoop with Deeplearning4j

06-27

Sentiment Analysis model deployed!

04-17

A fastText-based hybrid recommender

09-27

The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3)

08-24

Generative Adversarial Networks Explained

06-29

Translating W2v Embedding From One Space To Another

06-06

First Convergence Bias

04-11

Generating Large Images from Latent Vectors

04-01

Representational Power of Deeper Layers

03-30

Implementing Batch Normalization in Tensorflow

03-29

Deep Learning for Visual Question Answering

11-02

Beginner Tutorial: Neural Nets in Theano

09-29

Auto-Keras and AutoML: A Getting Started Guide

01-07

Import AI 127: Why language AI advancements may make Google more competitive; COCO image captioning systems don’t live up to the hype, and Amazon sees 3X growth in voice shopping via Alexa

12-31

Good Feature Building Techniques and Tricks for Kaggle

12-31

Document worth reading: “Learnable: Theory vs Applications”

12-28

BERT: State of the Art NLP Model, Explained

12-26

How to use Keras fit and fit_generator (a hands-on tutorial)

12-24

Distilled News

12-21

Amazon SageMaker adds Scikit-Learn support

12-20

Easily train models using datasets labeled by Amazon SageMaker Ground Truth

12-20

“My advisor and I disagree on how we should carry out repeated cross-validation. We would love to have a third expert opinion…”

12-15

Amazon SageMaker Automatic Model Tuning now supports early stopping of training jobs

12-13

Scaling Multi-Agent Reinforcement Learning

12-12

Keras – Save and Load Your Deep Learning Models

12-10

Deep Learning and Medical Image Analysis with Keras

12-03

Semantic Segmentation algorithm is now available in Amazon SageMaker

11-28

Filter Clickbait from News Content with our custom Natural Language Processing Model

11-28

Introducing Dynamic Training for deep learning with Amazon EC2

11-27

Import AI: 122: Google obtains new ImageNet state-of-the-art with GPipe; drone learns to land more effectively than PD controller policy; and Facebook releases its ‘CherryPi’ StarCraft bot

11-26

If you did not already know

11-20

Multi-Class Text Classification with Doc2Vec & Logistic Regression

11-09

Introduction to Amazon SageMaker Object2Vec

11-08

Now easily perform incremental learning on Amazon SageMaker

11-07

Document worth reading: “A User’s Guide to Support Vector Machines”

11-03

If you did not already know

11-01

Now use Pipe mode with CSV datasets for faster training on Amazon SageMaker built-in algorithms

11-01

Naive Bayes from Scratch using Python only – No Fancy Frameworks

10-25

Obtaining the number of components from cross validation of principal components regression

10-15

If you did not already know

10-14

If you did not already know

10-12

TDWI In-Person and Virtual Data and Analytics Training

10-10

Accelerate model training using faster Pipe mode on Amazon SageMaker

10-05

Amazon SageMaker Neural Topic Model now supports auxiliary vocabulary channel, new topic evaluation metrics, and training subsampling

10-04

Top 3 Trends in Deep Learning

10-03

PyTorch 1.0 preview now available in Amazon SageMaker and the AWS Deep Learning AMIs

10-03

Segmenting brain tissue using Apache MXNet with Amazon SageMaker and AWS Greengrass ML Inference – Part 1

09-26

Amazon SageMaker automatic model tuning produces better models, faster

09-25

Classifying high-resolution chest x-ray medical images with Amazon SageMaker

09-13

If you did not already know

09-11

Distilled News

09-11

Amazon SageMaker runtime now supports the CustomAttributes header

08-31

New speed record set for training deep learning models on AWS

08-22

Import AI: 108: Learning language with fake sentences, Chinese researchers use RL to train prototype warehouse robots; and what the implications are of scaled-up Neural Architecture Search

08-20

Document worth reading: “A Survey on Resilient Machine Learning”

08-19

Distributed Deep Learning on AZTK and HDInsight Spark Clusters

08-02

Transfer learning for custom labels using a TensorFlow container and “bring your own algorithm” in Amazon SageMaker

07-27

Classify your own images using Amazon SageMaker

07-20

How to Do Distributed Deep Learning for Object Detection Using Horovod on Azure

06-20

Weekly Review: 12/03/2017

12-03

mixup: Data-Dependent Data Augmentation

11-02

Understanding how Deep Learning learns to play SET®

10-12

Semantic trees for training word embeddings with hierarchical softmax

09-07

Deep Learning with Intel’s BigDL and Apache Spark

09-06

How much compute do we need to train generative models?

08-31

How I Used Deep Learning To Train A Chatbot To Talk Like Me (Sorta)

07-25

Prophecy Fulfilled: Keras and Cloudera Data Science Workbench

07-25

Deep learning on Apache Spark and Apache Hadoop with Deeplearning4j

06-27

Is BackPropagation Necessary?

08-23

Exploring convolutional neural networks with DL4J

04-14

First Convergence Bias

04-11

Generating Large Images from Latent Vectors

04-01

Representational Power of Deeper Layers

03-30

First Steps With Neural Nets in Keras

03-04

Implementing a CNN for Text Classification in TensorFlow

12-11

Deep Learning for Visual Question Answering

11-02

What a Deep Neural Network thinks about your

10-25

Beginner Tutorial: Neural Nets in Theano

09-29

R Packages worth a look

01-06

How to Write a Great Data Science Resume

01-03

Best Data Visualization Projects of 2018

12-27

At Year's End: 2018

12-25

Your AI journey… and Happy Holidays!

12-20

University of Rhode Island: Data Scientist, DataSpark (2 Positions) [Kingston, RI]

12-18

New public course on Successfully Delivering Data Science Projects for Feb 1st

12-18

R Packages worth a look

12-12

One-stop-learning-shop for data pros – get exclusive access for less than a cup of coffee

12-06

Community Call – Governance strategies for open source research software projects

12-05

8 Data Science Projects to Build your Portfolio

12-03

Request for Proposal: Topical Projects for January 2019

11-29

Students Combat MS with Data Science

11-29

Dealing with failed projects

11-22

What is the Best Python IDE for Data Science?

11-14

Don’t use AI when BI will suffice!

11-05

Peter Bull discusses the importance of human-centered design in data science.

11-05

From Project Manager to Data Champion — Conquer Your Data Projects

10-18

a4 Media: Manager, Machine Learning Data Engineer [Long Island City, NY]

10-10

Track the number of coffees consumed using AWS DeepLens

10-09

UnitedHealth Group: UHC Digital Director of Project Management [Minnetonka, MN]

10-04

UnitedHealth Group: UHC Digital Project Manager [Minnetonka, MN]

10-04

The Microsoft AI Idea Challenge – Breakthrough Ideas Wanted!

08-14

How to Build a Data Science Portfolio

08-13

2018 Data Sources for Cool Data Science Projects, provided by Thinknum

08-06

A Certification for R Package Quality

07-30

First Data Project? Go Tandem! (AVISIA at Play)

07-27

RAIN Project: evolution of the game development dream

07-13

Data Science Project Style Guide

07-09

Summer of Data Science 2018

05-28

Data Science for Managers and Directors (DS4MAD)

10-10

Guest Post – Learning R as an MBA Student

07-12

Writing Effective Amazon Machine Learning

02-19

How to make a good data-driven web app

05-25

Introducing Apache Arrow: A Fast, Interoperable In-Memory Columnar Data Structure Standard

02-18

Yet Another PhD to Data Science Post (Part II)

09-29

Some Thoughts on Meditation

10-22

Yet Another PhD to Data Science Post (Part II)

09-29

The brain as a neural network: this is why we can’t get along

12-19

Top 10 Advantages of a Data Science Certification

12-17

Should you become a data scientist?

12-10

Getting Started with Amazon Comprehend custom entities

11-17

5 Steps to Prepare for a Data Science Job

10-23

5 Steps to Prepare for a Data Science Job

10-22

University of San Francisco: Assistant Professor, Tenure Track, Mathematics and Statistics [San Francisco, CA]

10-17

An Updated Review of The Data Incubator Data Science Bootcamp

05-29

“Should I get a PhD to be a data scientist/analytics professional?”

11-19

Yet Another PhD to Data Science Post (Part II)

09-29

10 Companies to Work with After a Data Science Course

01-10

The cold start problem: how to build your machine learning portfolio

01-04

Magister Dixit

12-01

University of Tennessee Knoxville: Assistant or Associate Professor in Data Science [Knoxville, TN]

11-30

How to Build a Machine Learning Team When You Are Not Google or Facebook

11-28

Machine Learning. In conversation with Jelena Ilic, Senior Data Scientist at Mango Solutions

11-21

Telling Truth from Hype When Hunting for Data Science Work

11-05

Distilled News

10-01

How to Implement AI-First Business Models at Scale

09-21

Vulcan Post: This AI Startup Is Run By The World’s Top Data Scientists – Lets Anyone Build Predictive Models

09-04

“Seeding trials”: medical marketing disguised as science

08-01

AHL Python Data Hackathon

04-22

Fact over Fiction

04-22

Similarity in the Wild

02-19

Your First Job

11-15

Build your own offshore company

07-06

Google's NHS deal does not bode well for the future of data-sharing

05-05

Data Analysis, NHS and Industrial Partners

04-28

Yet Another PhD to Data Science Post (Part II)

09-29

Document worth reading: “Deep Neural Network Approximation Theory”

01-12

Long-awaited updates to htmlTable

01-07

If you did not already know

01-06

If you did not already know

01-06

Purr yourself into a math genius

01-03

Seeing the wood for the trees

01-01

Simulating Multi-state Models with R

01-01

R Packages worth a look

12-31

Will Julia Replace Python and R for Data Science?

12-26

The Need for Speed Part 2: C++ vs. Fortran vs. C

12-24

Objects types and some useful R functions for beginners

12-24

Custom JavaScript, CSS and HTML in Shiny

12-23

Using the tidyverse for more than data manipulation: estimating pi with Monte Carlo methods

12-21

November 2018: “Top 40” New Packages

12-21

R Packages worth a look

12-19

Learning R: A gentle introduction to higher-order functions

12-14

Sharing Modeling Pipelines in R

12-11

Sharing Modeling Pipelines in R

12-11

The Need for Speed Part 1: Building an R Package with Fortran (or C)

12-10

Day 05 – little helper get_network

12-05

One Recipe Step to Rule Them All

12-03

How to get the homology of a antibody using R

12-02

Why R for data science – and not Python?

12-02

Document worth reading: “A Tutorial on Bayesian Optimization”

12-01

Math in Data Science

11-30

R Packages worth a look

11-28

Building Blocks of Decision Tree

11-26

Scrapping data about Australian politicians with RSelenium

11-21

epubr 0.5.0 CRAN release

11-18

Using a genetic algorithm for the hyperparameter optimization of a SARIMA model

11-16

Rdew Valley: Optimizing Farming with R

11-14

R Packages worth a look

11-14

R Packages worth a look

11-13

Those “other” apply functions…

11-13

Detailed introduction of “myprettyreport” R package

11-10

Using httr to Detect HTTP(s) Redirects

11-06

Automated Email Reports with R

11-01

R Packages worth a look

10-30

RcppRedis 0.1.9

10-27

Naive Bayes from Scratch using Python only – No Fancy Frameworks

10-25

RApiDatetime 0.0.4: Updates and Extensions

10-21

A Lazy Function

10-20

R Packages worth a look

10-16

Voice Control your Shiny Apps

10-15

Running R scripts within in-database SQL Server Machine Learning

10-14

Piping into ggplot2

10-13

RcppNLoptExample 0.0.1: Use NLopt from C/C++

10-13

Piping into ggplot2

10-13

Writing Code to Read Quotes About Writing Code

10-12

Why are functional programming languages so popular in the programming languages community?

10-11

Processing complicated package outputs

10-09

R Packages worth a look

10-08

Tidyverse 'Starts_with' in M/Power Query

10-08

Functions and Packages

09-29

R Packages worth a look

09-24

R Packages worth a look

09-15

Announcing wrapr 1.6.2

09-13

Against Winner-Take-All Attribution

09-05

Distilled News

09-04

R Packages worth a look

09-01

Tips for analyzing Excel data in R

08-30

If you did not already know

08-20

If you did not already know

08-15

Document worth reading: “Are Efficient Deep Representations Learnable”

07-31

R Tip: Be Wary of “…”

06-15

From Gaussian Algebra to Gaussian Processes, Part 2

06-12

R Tip: use isTRUE()

06-11

How to use an R interface with Airtable API

05-23

From Gaussian Algebra to Gaussian Processes, Part 1

03-31

Natural and Artificial Intelligence

02-06

Gaussian Processes

11-25

Why Machine Learning Is A Metaphor For Life

08-16

Parallelizing Distance Calculations Using A GPU With CUDAnative.jl

08-14

What is Machine Learning?

07-17

Time Series Analysis with Generalized Additive Models

04-04

Ordered Categorical GLMs for Product Feedback Scores

03-17

Vestigial trigonometry functions

03-08

Data Cleaning, Categorization and Normalization

01-30

RNNs in Tensorflow, a Practical Guide and Undocumented Features

08-21

Learning in Brains and Machines (4): Episodic and Interactive Memory

07-24

A tour of Factor: 2

05-27

A tour of Factor: 1

05-23

Vanilla Neural Nets

05-16

RSiteCatalyst Version 1.4.8 Release Notes

04-04

Learning in Brains and Machines (1): Temporal Differences

02-21

How-to: Train Models in R and Python using Apache Spark MLlib and H2O

01-29

Theano Tutorial

01-25

The Mathematics Behind: Rejection Sampling

01-24

Explicit Matrix Factorization: ALS, SGD, and All That Jazz

01-09

Rebuilding Map Example With Apply Functions

09-30

R Packages worth a look

08-18

Rebuilding Map Example With Apply Functions

09-30

Rebuilding Map Example With Apply Functions

09-30

Creating List with Iterator

11-23

How to import a directory of csvs at once with base R and data.table. Can you guess which way is the fastest?

10-13

Rebuilding Map Example With Apply Functions

09-30

Adding Firebase Authentication to Shiny

01-03

Cartoon: Thanksgiving, Big Data, and Turkey Data Science.

11-22

AI, Machine Learning and Data Science Roundup: November 2018

11-21

Anticipating the next move in data science – my interview with Thomson Reuters

11-17

Best Deals in Deep Learning Cloud Providers: From CPU to GPU to TPU

11-15

What is Cloud Computing & Which is Better, AWS or GCP

11-15

Building statues of hope in augmented reality

10-22

Document worth reading: “Review of Deep Learning”

10-19

Sketchnotes from TWiML&AI: Evaluating Model Explainability Methods with Sara Hooker

10-12

R Packages worth a look

09-08

Bad headlines distract from real AI problems

08-20

Distilled News

08-16

Quick and Dirty Serverless Integer Programming

08-06

Distilled News

07-31

Using Entity-level Sentiment Analysis to understand News Content

07-30

DIY AI for the Future

06-27

Gsutil cheatsheet

11-02

Using Xcode with Github

05-25

AI and ML Futures 2: The Quiet Revolution

01-17

Conference on the Economics of Machine Intelligence-Dec 15

12-01

Long Short-Term Memory dramatically improves Google Voice etc – now available to a billion users

09-30

Whats new on arXiv

01-12

My presentations on ‘Elements of Neural Networks & Deep Learning’ -Part1,2,3

01-10

Whats new on arXiv

01-06

Distilled News

01-05

Whats new on arXiv

01-04

Whats new on arXiv

01-04

Whats new on arXiv

01-03

Whats new on arXiv

01-03

Distilled News

12-30

If you did not already know

12-26

Whats new on arXiv

12-25

If you did not already know

12-24

Whats new on arXiv

12-22

If you did not already know

12-21

Document worth reading: “Coupled Ensembles of Neural Networks”

12-16

Whats new on arXiv

12-14

Distilled News

12-14

If you did not already know

12-13

Distilled News

12-10

Whats new on arXiv

12-09

Whats new on arXiv

12-08

If you did not already know

12-07

Whats new on arXiv

12-01

Whats new on arXiv

11-30

If you did not already know

11-30

Whats new on arXiv

11-29

Whats new on arXiv

11-29

Whats new on arXiv

11-28

Whats new on arXiv

11-28

Whats new on arXiv

11-27

Whats new on arXiv

11-26

If you did not already know

11-25

Whats new on arXiv

11-25

Whats new on arXiv

11-23

If you did not already know

11-22

An Introduction to AI

11-21

If you did not already know

11-20

Document worth reading: “A Learning Approach to Secure Learning”

11-19

If you did not already know

11-17

Whats new on arXiv

11-16

Whats new on arXiv

11-14

If you did not already know

11-13

Whats new on arXiv

11-11

If you did not already know

11-10

If you did not already know

11-07

Whats new on arXiv

11-07

Whats new on arXiv

11-06

Whats new on arXiv

11-06

Building Surveillance System Using USB Camera and Wireless-Connected Raspberry Pi

11-06

Whats new on arXiv

11-04

Whats new on arXiv

11-04

Whats new on arXiv

11-03

“Simulations are not scalable but theory is scalable”

11-02

Whats new on arXiv

11-02

If you did not already know

11-01

Join AI experts from Google Brain, Open AI & Uber AI Labs in San Francisco

11-01

Whats new on arXiv

10-30

Whats new on arXiv

10-30

If you did not already know

10-29

Learning to learn in a model-agnostic way

10-29

Document worth reading: “Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks”

10-25

Google, Microsoft & Fraunhofer at the First European Edition of Deep Learning World – 12 Nov, 2018

10-23

Whats new on arXiv

10-23

Whats new on arXiv

10-23

Whats new on arXiv

10-22

Whats new on arXiv

10-22

Whats new on arXiv

10-22

If you did not already know

10-21

Whats new on arXiv

10-21

Whats new on arXiv

10-19

If you did not already know

10-19

Data Notes: The Secret of Academic Success

10-17

Distilled News

10-16

If you did not already know

10-11

Whats new on arXiv

10-09

If you did not already know

10-08

If you did not already know

10-07

Distilled News

10-06

Document worth reading: “Detecting Dead Weights and Units in Neural Networks”

10-05

Whats new on arXiv

10-05

Whats new on arXiv

10-04

Whats new on arXiv

10-04

Distilled News

10-04

If you did not already know

10-03

Sequence Modeling with Neural Networks – Part I

10-03

A Review of the Neural History of Natural Language Processing

10-01

If you did not already know

09-29

Whats new on arXiv

09-28

Whats new on arXiv

09-27

Whats new on arXiv

09-25

Whats new on arXiv

09-21

Whats new on arXiv

09-14

Java Home Made Face Recognition Application

09-12

If you did not already know

09-11

Distilled News

09-11

Whats new on arXiv

09-11

Whats new on arXiv

09-11

Document worth reading: “Quantizing deep convolutional networks for efficient inference: A whitepaper”

09-10

Whats new on arXiv

09-08

Whats new on arXiv

09-07

Whats new on arXiv

09-07

If you did not already know

09-05

Whats new on arXiv

08-28

Document worth reading: “A Tutorial on Network Embeddings”

08-26

Why you can't have privacy on the internet

08-22

Whats new on arXiv

08-21

If you did not already know

08-20

If you did not already know

08-18

Whats new on arXiv

08-17

Document worth reading: “How Important Is a Neuron”

08-15

Whats new on arXiv

08-15

Whats new on arXiv

08-13

R Packages worth a look

08-12

Whats new on arXiv

08-10

Whats new on arXiv

08-09

Whats new on arXiv

08-08

Whats new on arXiv

08-07

Whats new on arXiv

08-03

Whats new on arXiv

08-01

Whats new on arXiv

07-31

Facilitate Proactive Cybersecurity Operations with Big Data Analytics and Machine Intelligence

07-30

What Data Scientists should focus on in 2018?

06-27

Import AI:

06-25

AI Lab: Learn to Code with the Cutting-Edge Microsoft AI Platform

06-19

Transfer Your Font Style with GANs

03-13

Weekly Review: 12/16/2017

12-16

Weekly Review: 12/03/2017

12-03

Building a Visual Search Algorithm

10-13

XOR Revisited: Keras and TensorFlow

04-24

Deep Learning without Backpropagation

03-21

Tutorial: Deep Learning in PyTorch

01-15

Deep Learning Research Review Week 3: Natural Language Processing

01-10

Post NIPS Reflections

12-13

Artificial Neural Networks Introduction (Part II)

11-03

Recurrent Neural Network Gradients, and Lessons Learned Therein

10-18

The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3)

08-24

Is BackPropagation Necessary?

08-23

A Beginner's Guide To Understanding Convolutional Neural Networks Part 2

07-29

Bayesian Deep Learning Part II: Bridging PyMC3 and Lasagne to build a Hierarchical Neural Network

07-05

Generating Large Images from Latent Vectors - Part Two

06-02

Neural Network Evolution Playground with Backprop NEAT

05-07

The structure of Mafia syndacates

05-04

Rolling and Unrolling RNNs

04-28

Interactive Abstract Pattern Generation Javascript Demo

04-24

Generating Large Images from Latent Vectors

04-01

Introduction to Semi-Supervised Learning with Ladder Networks

01-19

Attention and Memory in Deep Learning and NLP

01-03

Recurrent Neural Networks

10-20

Long Short-Term Memory dramatically improves Google Voice etc – now available to a billion users

09-30

Whats new on arXiv

01-13

Distilled News

01-13

Whats new on arXiv

01-12

Practical Apache Spark in 10 Minutes

01-11

If you did not already know

01-11

Whats new on arXiv

01-10

Whats new on arXiv

01-09

KDnuggets™ News 19:n02, Jan 9: The cold start problem: how to build your machine learning portfolio; 5 Best Data Visualization Libraries

01-09

How do Convolutional Neural Nets (CNNs) learn? + Keras example

01-09

Distilled News

01-09

Top KDnuggets tweets, Jan 02-08: 10 Free Must-Read Books for Machine Learning and Data Science

01-09

Industry leaders to speak at Mega-PAW, Las Vegas – June 16-20

01-09

Learn Python for Data Science From Scratch

01-09

Document worth reading: “Recent Advances in Deep Learning: An Overview”

01-08

Whats new on arXiv

01-08

AI Gotchas (& How to Avoid Them)

01-08

NLP Overview: Modern Deep Learning Techniques Applied to Natural Language Processing

01-08

5 things that happened in Data Science in 2018

01-08

The Data Science Event You Need in 2019

01-07

Whats new on arXiv

01-06

R Packages worth a look

01-06

Distilled News

01-05

Document worth reading: “Recent Research Advances on Interactive Machine Learning”

01-05

Whats new on arXiv

01-04

Whats new on arXiv

01-04

Improve your AI and Machine Learning skills at AI NEXTCon in Seattle, Jan 23-27

01-03

Whats new on arXiv

01-03

Notebooks from the Practical AI Workshop

01-03

KDnuggets™ News 19:n01, Jan 3: The Essence of Machine Learning; A Guide to Decision Trees for Machine Learning and Data Science

01-03

The Backpropagation Algorithm Demystified

01-02

Whats new on arXiv

01-01

Distilled News

12-31

Papers with Code: A Fantastic GitHub Resource for Machine Learning

12-31

Distilled News

12-30

If you did not already know

12-30

Whats new on arXiv

12-29

Deep Learning for Media Content

12-28

The business case for federated learning

12-28

Whats new on arXiv

12-28

The Essence of Machine Learning

12-28

How AI Will Change Brick-and-Mortar Retail in 2019

12-26

If you did not already know

12-26

Data Science & ML : A Complete Interview Guide

12-26

If you did not already know

12-25

Distilled News

12-25

Document worth reading: “Learning to Reason”

12-22

Does imputing model labels using the model predictions can improve it’s performance?

12-21

If you did not already know

12-21

Distilled News

12-20

Whats new on arXiv

12-20

Whats new on arXiv

12-20

If you did not already know

12-20

Hackathon Winner Interview: Penn State | Kaggle University Club

12-19

Top KDnuggets tweets, Dec 12-18: Deep Learning Cheat Sheets; The Nate Silver vs. Nassim Taleb Twitter War

12-19

Distilled News

12-19

Distilled News

12-19

Data Science & ML : A Complete Interview Guide

12-19

Whats new on arXiv

12-19

AI, Machine Learning and Data Science Roundup: December 2018

12-19

Highlights of 2018

12-18

Whats new on arXiv

12-18

Distilled News

12-18

eBook: An Introduction to Active Learning

12-17

Top 10 Advantages of a Data Science Certification

12-17

Distilled News

12-17

NLP Breakthrough Imagenet Moment has arrived

12-14

Top Stories of 2018: 9 Must-have skills you need to become a Data Scientist, updated; Python eats away at R: Top Software for Analytics, Data Science, Machine Learning in 2018

12-14

Spark + AI Summit: learn best practices in ML and DL, latest frameworks, and more – special KDnuggets offer

12-14

Whats new on arXiv

12-14

Four Real-Life Machine Learning Use Cases

12-13

State of Deep Learning and Major Advances: H2 2018 Review

12-13

Whats new on arXiv

12-13

Keras Hyperparameter Tuning in Google Colab Using Hyperas

12-12

10 Data Science Skills to Land your Dream Job in 2019

12-12

Distilled News

12-12

KDnuggets™ News 18:n47, Dec 12: Common mistakes when doing machine learning; Here are the most popular Python IDEs / Editors

12-12

P&G: Data Scientist – Machine Learning/NLP [Cincinnati, OH]

12-11

A Machine Learning Deep Dive [Webinar, Dec 13]

12-11

Top November Stories: The Most in Demand Skills for Data Scientists; What is the Best Python IDE for Data Science?

12-11

Machine Learning & AI Main Developments in 2018 and Key Trends for 2019

12-11

Learning Machine Learning vs Learning Data Science

12-11

Whats new on arXiv

12-11

Whats new on arXiv

12-10

Whats new on arXiv

12-10

Whats new on arXiv

12-10

Whats new on arXiv

12-09

Distilled News

12-09

Whats new on arXiv

12-08

Whats new on arXiv

12-07

Must-Have Resources to Become a Data Scientist

12-06

Distilled News

12-05

KDnuggets™ News 18:n46, Dec 5: AI, Data Science, Analytics 2018 Main Developments, 2019 Key Trends; Deep Learning Cheat Sheets

12-05

If you did not already know

12-04

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: December and Beyond

12-04

Whats new on arXiv

12-04

Whats new on arXiv

12-03

Ronin: Sr Machine Learning and AI Data Scientist [San Mateo, CA]

12-03

If you did not already know

12-03

A Programmer’s Introduction to Mathematics

12-01

If you did not already know

12-01

Variational Autoencoders Explained in Detail

11-30

A Complete Guide to Choosing the Best Machine Learning Course

11-30

Deep Learning for the Masses (… and The Semantic Layer)

11-30

Math in Data Science

11-30

Visual Model-Based Reinforcement Learning as a Path towards Generalist Robots

11-30

Distilled News

11-30

ML Methods for Prediction and Personalization

11-30

Distilled News

11-29

Whats new on arXiv

11-29

Whats new on arXiv

11-29

Combating Customer Churn with AI

11-29

Amazon SageMaker notebooks now support Git integration for increased persistence, collaboration, and reproducibility

11-29

How to Find Mentors for Data Science?

11-29

How to Engineer Your Way Out of Slow Models

11-27

Amazon Launches Machine Learning University

11-27

Distilled News

11-26

3 Challenges for Companies Tackling Data Science

11-26

If you did not already know

11-26

Data Pro Cyber Monday – Choose Your Savings

11-26

Whats new on arXiv

11-26

Whats new on arXiv

11-25

Whats new on arXiv

11-25

Document worth reading: “Learning From Positive and Unlabeled Data: A Survey”

11-23

KNNs (K-Nearest-Neighbours) in Python

11-22

Join the World’s Biggest Deep Learning Summit – KDnuggets Early Cyber Monday

11-21

AI, Machine Learning and Data Science Roundup: November 2018

11-21

Whats new on arXiv

11-20

Machine Learning in Action: Going Beyond Decision Support Data Science

11-20

Word Morphing – an original idea

11-20

Distilled News

11-20

Import AI 121: Sony researchers make ultra-fast ImageNet training breakthrough; Berkeley researchers tackle StarCraft II with modular RL system; and Germany adds €3bn for AI research

11-19

Insights on the role data can play in your organization

11-19

ML Methods for Prediction and Personalization

11-19

If you did not already know

11-19

Whats new on arXiv

11-17

Anticipating the next move in data science – my interview with Thomson Reuters

11-17

Document worth reading: “Multi-Agent Reinforcement Learning: A Report on Challenges and Approaches”

11-17

If you did not already know

11-17

Distilled News

11-16

Whats new on arXiv

11-16

Using Uncertainty to Interpret your Model

11-16

Whats new on arXiv

11-15

Free Reinforcement Learning Textbook

11-14

Paris ML E#2 S#6: Conscience, Code Analysis, Can a machine learn like a child?

11-14

Document worth reading: “Deep Reinforcement Learning: An Overview”

11-14

Distilled News

11-14

Federated learning: distributed machine learning with data locality and privacy

11-14

Whats new on arXiv

11-14

All About Scikit-Learn, with Olivier Grisel

11-13

If you did not already know

11-13

Whats new on arXiv

11-13

Distilled News

11-12

To get hired as a data scientist, don’t follow the herd

11-12

Whats new on arXiv

11-12

Distilled News

11-12

If you did not already know

11-12

Whats new on arXiv

11-11

Distilled News

11-11

Top October Stories: 9 Must-have skills you need to become a Data Scientist, updated; 10 Best Mobile Apps for Data Scientist / Data Analysts

11-09

Deep Learning Performance Cheat Sheet

11-08

Carlos: ‘Everything Dataquest showed me, I use in my new job’

11-08

Whats new on arXiv

11-07

KDnuggets™ News 18:n42, Nov 7: The Most in Demand Skills for Data Scientists; How Machines Understand Our Language: Intro to NLP

11-07

Whats new on arXiv

11-06

Distilled News

11-05

Machine Learning Classification: A Dataset-based Pictorial

11-05

Import AI 119: How to benefit AI research in Africa; German politician calls for billions in spending to prevent country being left behind; and using deep learning to spot thefts

11-05

Data Science With R Course Series – Week 8

11-05

Distilled News

11-04

Whats new on arXiv

11-04

Whats new on arXiv

11-04

If you did not already know

11-03

Whats new on arXiv

11-03

The Most in Demand Skills for Data Scientists

11-02

Top 13 Python Deep Learning Libraries

11-02

Distilled News

11-02

Distilled News

10-31

Top KDnuggets tweets, Oct 24-30: Building a Question-Answering System from Scratch

10-31

How to Start Learning R for Data Science

10-31

Moody’s Analytics: Machine Learning / NLP – Research Scientist / Engineer [New York, NY]

10-30

Whats new on arXiv

10-30

How to create useful features for Machine Learning

10-30

Machine Learning Basics – Random Forest

10-30

Federated Learning: Machine Learning with Privacy on the Edge

10-29

The quest continues: a look at a new initiative to explore human and machine intelligence

10-29

Please vote

10-29

Data Science With R Course Series – Week 7

10-29

Learning to learn in a model-agnostic way

10-29

How to be an Artificial Intelligence (AI) Expert?

10-29

Distilled News

10-28

Document worth reading: “Restricted Boltzmann Machines: Introduction and Review”

10-27

Distilled News

10-27

Whats new on arXiv

10-26

Whats new on arXiv

10-25

Implementing Automated Machine Learning Systems with Open Source Tools

10-25

How to be an Artificial Intelligence (AI) Expert?

10-25

Distilled News

10-25

AI, Machine Learning and Data Science Roundup: October 2018

10-25

U. of Zurich: Assistant Professorship in AI and Machine Learning (Non-tenure Track) [Zurich, Switzerland]

10-24

Google, Microsoft & Fraunhofer at the First European Edition of Deep Learning World – 12 Nov, 2018

10-23

Computer Vision for Model Assessment

10-23

5 Steps to Prepare for a Data Science Job

10-23

Computer Vision for Model Assessment

10-23

Introduction to Active Learning

10-23

Whats new on arXiv

10-22

Distilled News

10-22

Speak at Mega-PAW Vegas 2019 – on Machine Learning Deployment (Apply by Nov 15)

10-22

5 Steps to Prepare for a Data Science Job

10-22

If you did not already know

10-21

Whats new on arXiv

10-21

Distilled News

10-21

automl package: part 1/2 why and how

10-21

Start your journey into data science today

10-19

If you did not already know

10-19

If you did not already know

10-19

Distilled News

10-18

Distilled News

10-18

Building a data warehouse

10-17

Four machine learning strategies for solving real-world problems

10-17

Mindstrong Health: Sr Data Scientist / Machine Learning, Statistics, Coding [Palo Alto, CA]

10-17

Distilled News

10-16

Accelerating Your Algorithms in Production [Webinar Replay]

10-16

Whats new on arXiv

10-16

University of San Francisco: Postdoctoral Fellowship, Data Institute [San Francisco, CA]

10-16

Exploring college major and income: a live data analysis in R

10-16

Document worth reading: “A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress”

10-15

Distilled News

10-15

Whats new on arXiv

10-13

Sketchnotes from TWiML&AI: Evaluating Model Explainability Methods with Sara Hooker

10-12

Whats new on arXiv

10-11

Distilled News

10-11

Top KDnuggets tweets, Oct 3–9: 5 Reasons Logistic Regression should be the first thing you learn when becoming a Data Scientist

10-10

Top 10 Mistakes to Avoid to Master Data Science

10-10

If you did not already know

10-10

How to get a Data Science Job in 6 Months

10-10

Top September Stories: Essential Math for Data Science: Why and How; Machine Learning Cheat Sheets

10-10

10 Best Mobile Apps for Data Scientist / Data Analysts

10-10

If you did not already know

10-09

Distilled News

10-09

Whats new on arXiv

10-09

Top 8 Python Machine Learning Libraries

10-09

Distilled News

10-08

Job: Postdoctoral Researcher in Small Data Deep Learning and Explainable Machine Learning, Livermore, CA

10-08

Document worth reading: “Learning Tree Distributions by Hidden Markov Models”

10-07

Distilled News

10-06

A Concise Explanation of Learning Algorithms with the Mitchell Paradigm

10-05

Journey from Non-Technical background to an expert in Data Science

10-05

Why do I Call Myself a Data Scientist?

10-05

Semantic Segmentation: Wiki, Applications and Resources

10-04

Distilled News

10-04

Top 10 Mistakes to Avoid to Master Data Science

10-04

Big Data Day Camp: Big Data Tools & Techniques (October 25-26)

10-04

Top KDnuggets tweets, Sep 26 – Oct 2: Why building your own Deep Learning Computer is 10x cheaper than AWS; 6 Steps To Write Any Machine Learning Algorithm

10-03

KDnuggets™ News 18:n37, Oct 3: Mathematics of Machine Learning; Effective Transfer Learning for NLP; Path Analysis with R

10-03

PyTorch 1.0 preview now available in Amazon SageMaker and the AWS Deep Learning AMIs

10-03

Whats new on arXiv

10-03

Sequence Modeling with Neural Networks – Part I

10-03

Whats new on arXiv

10-02

Distilled News

10-01

Distilled News

09-30

Document worth reading: “Physically optimizing inference”

09-29

Python Vs R : The Eternal Question for Data Scientists

09-29

If you did not already know

09-29

Whats new on arXiv

09-28

Machine Learning and Deep Learning : Differences

09-28

Whats new on arXiv

09-28

Whats new on arXiv

09-27

Distilled News

09-26

Whats new on arXiv

09-25

Whats new on arXiv

09-25

Distilled News

09-25

If you did not already know

09-24

Python Vs R : The Eternal Question for Data Scientists

09-24

Dataquest helped me get my dream job at Noodle.ai

09-24

Distilled News

09-23

Distilled News

09-22

If you did not already know

09-21

How Pol Brigneti got a Data Analyst job using Dataquest at Belgrave Valley

09-21

Whats new on arXiv

09-20

AI, Machine Learning and Data Science Roundup: September 2018

09-20

PyConUK 2018

09-19

Whats new on arXiv

09-19

How to generalize (algorithmically)

09-18

Distilled News

09-18

If you did not already know

09-17

Distilled News

09-17

Distilled News

09-17

Whats new on arXiv

09-14

Distilled News

09-14

Distilled News

09-12

Whats new on arXiv

09-12

Data Center Scale Computing and Artificial Intelligence with Matei Zaharia, Inventor of Apache Spark

09-12

Distilled News

09-11

Distilled News

09-11

R Packages worth a look

09-11

Whats new on arXiv

09-11

Distilled News

09-07

Whats new on arXiv

09-07

Welcome to Dataiku University!

09-07

If you did not already know

09-06

Distilled News

09-06

Visual Reinforcement Learning with Imagined Goals

09-06

If you did not already know

09-05

Whats new on arXiv

09-03

Magister Dixit

09-02

Whats new on arXiv

09-01

If you did not already know

09-01

Use Kaggle to start (and guide) your ML/ Data Science journey — Why and How

08-29

Whats new on arXiv

08-28

If you did not already know

08-27

If you did not already know

08-25

Whats new on arXiv

08-24

Microsoft Weekly Data Science News for August 24, 2018

08-24

Whats new on arXiv

08-24

Whats new on arXiv

08-22

Whats new on arXiv

08-21

Bad headlines distract from real AI problems

08-20

Document worth reading: “A Survey on Resilient Machine Learning”

08-19

Whats new on arXiv

08-17

AI, Machine Learning and Data Science Roundup: August 2018

08-17

Whats new on arXiv

08-17

Distilled News

08-16

Whats new on arXiv

08-16

Whats new on arXiv

08-15

Whats new on arXiv

08-14

Distilled News

08-09

Whats new on arXiv

08-07

Essential Tips and Tricks for Starting Machine Learning with Python

08-05

Distilled News

08-04

Document worth reading: “Attend Before you Act: Leveraging human visual attention for continual learning”

08-03

Amazon Polly adds bilingual Indian English/Hindi language support

08-02

Three flavors of data scientist

08-02

Distilled News

08-02

Whats new on arXiv

08-01

Recent top-selling books in AI and Machine Learning

07-31

Whats new on arXiv

07-31

New Research on Multi-Task Learning

07-24

Top 20 Python AI and Machine Learning Open Source Projects

07-23

Import AI:

07-23

AI, Machine Learning and Data Science Roundup: July 2018

07-23

One-Shot Imitation from Watching Videos

06-28

Import AI:

06-25

Import AI

06-18

Overview and benchmark of traditional and deep learning models in text classification

06-12

Engineering a New Career in Data Science: Alumni Spotlight on Abhishek Mishra

06-06

My eRum 2018 biggest highlights

05-19

Mother's Day Interview: How Nicole Finnie Became a Competitive Kaggler on Maternity Leave

05-10

Gensim Survey 2018

04-30

TDM: From Model-Free to Model-Based Deep Reinforcement Learning

04-26

Can a Machine Be Racist or Sexist?

04-16

Goals and Principles of Representation Learning

04-12

Graph embeddings in Hyperbolic Space

04-10

Webcam based image processing in Jupyter notebooks

04-09

Automated machine learning is coming... and it won't matter

04-04

Introducing Python for data scientists - Pt1

03-15

It’s okay to not be a data scientist

02-20

Sutton’s Temporal-Difference Learning

02-19

Pervasive Simulator Misuse with Reinforcement Learning

02-14

A Practical Guide to the "Open-Source Machine Learning Masters"

02-03

Neural Networks and the generalisation problem

01-28

Lessons learned in my first year as a data scientist

01-25

New Year's Resolutions 2018

01-05

ML/NLP Publications in 2017

01-02

AI and Deep Learning in 2017 – A Year in Review

12-31

Weekly Review: 12/23/2017

12-23

Weekly Review: 12/16/2017

12-16

Weekly Review: 12/10/2017

12-10

The Last 5 Years In Deep Learning

12-04

Evolving Stable Strategies

11-12

Weekly Review: 11/11/2017

11-11

Weekly Review: 10/21/2017

10-21

Advice for aspiring data scientists and other FAQs

10-15

Deep Learning Dead-End?

09-17

Joining ASAPP

09-09

What Killed the Curse of Dimensionality?

09-06

Python Deep Learning tutorial: Create a GRU (RNN) in TensorFlow

08-27

My Qualifying Exam (Oral)

08-07

What is Machine Learning?

07-17

How to launch your data science career (with Python)

07-12

Machine Learning the Future Class

06-12

COLT 2017 accepted papers

06-03

Python Deep Learning tutorial: Elman RNN implementation in Tensorflow

05-17

Transfer Learning for Flight Delay Prediction via Variational Autoencoders

05-08

Announcement

04-27

How to make the transition from academia to data science

04-23

F beta score for Keras

04-23

Post NIPS Reflections

12-13

Deep Learning Research Review Week 2: Reinforcement Learning

11-16

AI ‘judge’ doesn’t explain why it reaches certain decisions

10-24

Gradient descent learns linear dynamical systems

10-13

Cognitive Machine Learning: Prologue

10-08

Learning Reinforcement Learning (with Code, Exercises and Solutions)

10-02

TensorFlow in a Nutshell — Part Two: Hybrid Learning

09-13

Grokking Deep Learning

08-17

How to score 0.8134 in Titanic Kaggle Challenge

08-10

Boosting (in Machine Learning) as a Metaphor for Diverse Teams

08-07

A intuitive explanation of natural gradient descent

08-07

My Open-Source Machine Learning Masters (in Casablanca, Morocco)

07-29

Talk: Building Machines that Imagine and Reason

07-28

Re-work Interview Questions

07-26

Learning in Brains and Machines (4): Episodic and Interactive Memory

07-24

Occam razor vs. machine learning

07-12

Learning in Brains and Machines (3): Synergistic and Modular Action

07-03

First Order Optimization Methods

07-02

Making use of the model

06-20

The Power of IPython Notebook + Pandas + and Scikit-learn

06-11

Model-Free Prediction and Control

06-07

Deep Learning Trends @ ICLR 2016

06-01

Deep Reinforcement Learning: Pong from Pixels

05-31

Becoming a Data Scientist Podcast Episode 08: Sebastian Raschka

03-29

Large Data with Scikit-learn - Boston Meetup

03-16

Deep Learning, Pachinko, and James Watt: Efficiency is the Driver of Uncertainty

03-04

Data Science Learning Club Update

02-21

Learning in Brains and Machines (1): Temporal Differences

02-21

The Best of Unpublished Machine Learning and Statistics Books

02-09

Introduction to Semi-Supervised Learning with Ladder Networks

01-19

AI and ML Futures 1: Background

01-17

The Deep Learning Gold Rush of 2015

11-07

Reinforcement Learning - Monte Carlo Methods

10-25

Theoretical Motivations for Deep Learning

10-18

7 tools in every data scientist’s toolbox

10-15

Deep Learning Startups, Applications and Acquisitions – A Summary

10-13

Long Short-Term Memory dramatically improves Google Voice etc – now available to a billion users

09-30

Long Short-Term Memory dramatically improves Google Voice etc – now available to a billion users

09-30

My presentations on ‘Elements of Neural Networks & Deep Learning’ -Part1,2,3

01-10

If you did not already know

01-07

Document worth reading: “Neural Style Transfer: A Review”

01-02

Fine-tuning for Natural Language Processing

12-28

If you did not already know

12-09

Deep Learning Cheat Sheets

11-28

Document worth reading: “A Learning Approach to Secure Learning”

11-19

Document worth reading: “Deep Reinforcement Learning: An Overview”

11-14

Slides from my talk at the R-Ladies Meetup about Interpretable Deep Learning with R, Keras and LIME

10-17

Open Workshop: Deep Learning in R and Keras, November 14th in Frankfurt

10-13

Top September Stories: Essential Math for Data Science: Why and How; Machine Learning Cheat Sheets

10-10

Document worth reading: “Detecting Dead Weights and Units in Neural Networks”

10-05

Distilled News

09-12

If you did not already know

08-27

If you did not already know

08-20

Neural reinterpretations of movie trailers

07-31

Understanding deep Convolutional Neural Networks with a practical use-case in Tensorflow and Keras

11-13

Deep Learning without Backpropagation

03-21

Learning in Brains and Machines (2): The Dogma of Sparsity

04-07

Beer reviews with Recurrent Neural Networks (RNN)

10-08

Long Short-Term Memory dramatically improves Google Voice etc – now available to a billion users

09-30

Cognitive Services in Containers

11-19

Cognitive Services in Containers

11-19

T-mobile uses R for Customer Service AI

11-09

Transfer learning for custom labels using a TensorFlow container and “bring your own algorithm” in Amazon SageMaker

07-27

Why Julia’s DataFrames are Still Slow

11-28

A Brief Guide to the Docker Ecosystem

10-01

ShinyProxy Christmas Release

12-23

R Packages worth a look

11-20

Distilled News

08-22

Getting Started With MapD, Part 1: Docker Install and Loading Data

02-01

Docker for AWS

06-27

Creating a PageRank Analytics Platform Using Spring Boot Microservices

01-03

A Brief Guide to the Docker Ecosystem

10-01

What is Cloud Computing & Which is Better, AWS or GCP

11-15

A Brief Guide to the Docker Ecosystem

10-01

How simpleshow uses Amazon Polly to voice stories in their explainer videos

01-11

ShinyProxy Christmas Release

12-23

AI, Machine Learning and Data Science Roundup: December 2018

12-19

AI, Machine Learning and Data Science Roundup: November 2018

11-21

Cognitive Services in Containers

11-19

Cognitive Services in Containers

11-19

What is Cloud Computing & Which is Better, AWS or GCP

11-15

AWS expands HIPAA eligible machine learning services for healthcare customers

11-08

The Definitive Guide to AI’s “Black Box” Problem

10-17

AI, Machine Learning and Data Science Announcements from Microsoft Ignite

10-02

Putting the Power of Kafka into the Hands of Data Scientists

09-05

Microsoft Weekly Data Science News for August 24, 2018

08-24

Why you can't have privacy on the internet

08-22

AI, Machine Learning and Data Science Roundup: August 2018

08-17

Distilled News

08-17

Make R speak

08-16

Aella Credit empowers underbanked individuals by using Amazon Rekognition for identity verification

08-15

Amazon Rekognition is now available in the Asia Pacific (Seoul) and Asia Pacific (Mumbai) Regions

08-09

Deep Learning for Emojis with VS Code Tools for AI – Part 2

06-05

Enterprise Deployment Tips for Azure Data Science Virtual Machine (DSVM)

05-21

Microsoft Weekly Data Science News for May 18, 2018

05-18

What Do Data Scientists Need to Know about Containerization? As Little as Possible.

02-22

JUnit,Integration,End to End Tests

10-22

Docker for AWS

06-27

Writing Effective Amazon Machine Learning

02-19

A Brief Guide to the Docker Ecosystem

10-01

A Brief Guide to the Docker Ecosystem

10-01

Here’s why 2019 is a great year to start with R: A story of 10 year old R code then and now

01-05

Custom JavaScript, CSS and HTML in Shiny

12-23

Search labels and IDs from IAB-QAG and IPTC Subject Codes taxonomies

11-01

Distilled News

09-28

Finding Similar Sounding Names – Some Basics

05-26

David MacKay Symposium

03-15

The Julia language for Scientific Computing

10-02

Will Julia Replace Python and R for Data Science?

12-26

Parallelizing Distance Calculations Using A GPU With CUDAnative.jl

08-14

The Julia language for Scientific Computing

10-02

The Five Best Data Visualization Libraries

01-07

Code for case study – Customer Churn with Keras/TensorFlow and H2O

12-12

KDnuggets™ News 18:n44, Nov 21: What is the Best Python IDE for Data Science?; Anticipating the next move in data science

11-21

Tis the Season to Check your SSL/TLS Cipher List Thrice (RCurl/curl/openssl)

11-17

Using a genetic algorithm for the hyperparameter optimization of a SARIMA model

11-16

Top 13 Python Deep Learning Libraries

11-02

GitHub Python Data Science Spotlight: High Level Machine Learning & NLP, Ensembles, Command Line Viz & Docker Made Easy

10-16

How R gets built on Windows

10-11

How R gets built on Windows

10-11

Top 8 Python Machine Learning Libraries

10-09

Dataiku 5.0: Enterprise AI Within Reach

09-12

Guide to a high-performance, powerful R installation

08-31

Deep Learning Frameworks on CDH and Cloudera Data Science Workbench

04-20

In Praise Of Reinventing The Wheel

08-23

How to score 0.8134 in Titanic Kaggle Challenge

08-10

The Julia language for Scientific Computing

10-02

The Five Best Data Visualization Libraries

01-07

2018 Volatility Recap

01-06

My Activities in 2018 with R and ShinyApp

01-04

Certifiably Gone Phishing

12-23

Examining the Tweeting Patterns of Prominent Crossfit Gyms

12-20

Pdftools 2.0: powerful pdf text extraction tools

12-14

Code for case study – Customer Churn with Keras/TensorFlow and H2O

12-12

Automated Dashboard with various correlation visualizations in R

12-05

Create 3D County Maps Using Density as Z-Axis

11-29

Plotting wind highways using rWind

11-26

A tutorial on tidy cross-validation with R

11-25

Tis the Season to Check your SSL/TLS Cipher List Thrice (RCurl/curl/openssl)

11-17

Using a genetic algorithm for the hyperparameter optimization of a SARIMA model

11-16

R Packages worth a look

11-10

coalesce with wrapr

11-03

coalesce with wrapr

11-03

Top 13 Python Deep Learning Libraries

11-02

How to Highlight 3D Brain Regions

10-31

Stop Installing Tensorflow Using pip for Performance Sake!

10-30

Use Pseudo-Aggregators to Add Safety Checks to Your Data-Wrangling Workflow

10-30

Use Pseudo-Aggregators to Add Safety Checks to Your Data-Wrangling Workflow

10-30

Scatterplot matrices (pair plots) with cdata and ggplot2

10-28

Scatterplot matrices (pair plots) with cdata and ggplot2

10-28

Popular Halloween Candy on US State Grid Map

10-25

RcppNLoptExample 0.0.1: Use NLopt from C/C++

10-13

Prophets of gloom: Using NLP to analyze Radiohead lyrics

10-13

How R gets built on Windows

10-11

How R gets built on Windows

10-11

Top 8 Python Machine Learning Libraries

10-09

Speed Up With Microsoft

10-04

Better R Code with wrapr Dot Arrow

09-15

Dataiku 5.0: Enterprise AI Within Reach

09-12

Package Paths in R

03-31

Connect to Google Sheets in Power BI using R

03-06

In Praise Of Reinventing The Wheel

08-23

Interactive association rules exploration app

11-30

The Julia language for Scientific Computing

10-02

Import AI 127: Why language AI advancements may make Google more competitive; COCO image captioning systems don’t live up to the hype, and Amazon sees 3X growth in voice shopping via Alexa

12-31

Comparison of the Top Speech Processing APIs

12-28

Advent of Code: Most Popular Languages

12-15

NLP Breakthrough Imagenet Moment has arrived

12-14

Best Machine Learning languages, Data Visualization Tools, DL Frameworks, and Big Data Tools

12-03

6 Goals Every Wannabe Data Scientist Should Make for 2019

11-22

EMNLP 2018 Highlights: Inductive bias, cross-lingual learning, and more

11-07

Amazon Translate now offers 113 new language pairs

10-29

Why are functional programming languages so popular in the programming languages community?

10-11

Amazon Comprehend introduces new Region availability and language support for French, German, Italian, and Portuguese

10-10

A Review of the Neural History of Natural Language Processing

10-01

Meet Zhiyu—the first Mandarin Chinese voice for Amazon Polly

09-14

Document worth reading: “Putting Data Science In Production”

09-07

Redmonk Language Rankings, June 2018

08-10

Amazon Polly adds bilingual Indian English/Hindi language support

08-02

Learn to R blog series - R and RStudio

03-29

Introducing Python for data scientists - Pt2

03-23

The Two Tribes of Language Researchers

11-19

A Billion Words and The Limits of Language Modeling

09-23

Why Scala?

07-17

A tour of Factor: 1

05-23

Talking to Machines – The Rise of Conversational Interfaces and NLP

11-17

The Julia language for Scientific Computing

10-02

Timing Grouped Mean Calculation in R

12-08

Creating List with Iterator

11-23

How Data Science (+ Friends) Helped Me Learn French

11-01

How to scrape data from a website using Python

09-07

Timings of a Grouped Rank Filter Task

08-23

Denoising Dirty Documents: Part 8

10-02

Denoising Dirty Documents: Part 8

10-02

How do Convolutional Neural Nets (CNNs) learn? + Keras example

01-09

What to do when your training and testing data come from different distributions

01-04

Distilled News

12-31

Import AI 127: Why language AI advancements may make Google more competitive; COCO image captioning systems don’t live up to the hype, and Amazon sees 3X growth in voice shopping via Alexa

12-31

Deep learning in Satellite imagery

12-26

Bubble Packed Chart with R using packcircles package

12-22

Image Stitching with OpenCV and Python

12-17

Distilled News

12-05

Deep learning in Satellite imagery

12-04

If you did not already know

12-02

Variational Autoencoders Explained in Detail

11-30

Java Object Tracking for Cars

11-30

Semantic Segmentation algorithm is now available in Amazon SageMaker

11-28

Bringing Machine Learning Research to Product Commercialization

11-27

Physics-Based Learned Design: Teaching a Microscope How to Image

11-26

Stereograms

11-26

If you did not already know

11-25

Amazon Rekognition announces updates to its face detection, analysis, and recognition capabilities

11-22

Mask R-CNN with OpenCV

11-19

If you did not already know

11-17

The Long Tail of Medical Data

11-12

Image segmentation based on Superpixels and Clustering

11-09

Egg-Not-Egg Deep Learning Model

11-08

Tesseract 4 is here! State of the art OCR in R!

11-06

Creating GIFs with OpenCV

11-05

Why AI will not replace radiologists

11-01

How to Highlight 3D Brain Regions

10-31

Using deep learning on AWS to lower property damage losses from natural disasters

10-30

Distilled News

10-30

AI Masterpieces: But is it Art?

10-27

Join us at the EARL US Roadshow – a conference dedicated to the real-world usage of R

10-24

Drilling Down on Depth Sensing and Deep Learning

10-23

Distilled News

10-18

Document worth reading: “Deep Facial Expression Recognition: A Survey”

10-17

Slides from my talk at the R-Ladies Meetup about Interpretable Deep Learning with R, Keras and LIME

10-17

Will Compression Be Machine Learning’s Killer App?

10-16

Even more images as x-axis labels

10-16

Track the number of coffees consumed using AWS DeepLens

10-09

Basic Image Data Analysis Using Python – Part 4

10-05

Semantic Segmentation: Wiki, Applications and Resources

10-04

Deep Learning Without Labels

10-03

R Packages worth a look

10-03

“Snip Insights” – An Open Source Cross-Platform AI Tool for Intelligent Screen Capture

10-03

If you did not already know

09-30

Implement Simple Convolution with Java

09-27

R Packages worth a look

09-26

New Engen improves customer acquisition marketing campaigns using Amazon Rekognition

09-19

Save time and money by filtering faces during indexing with Amazon Rekognition

09-18

Not Hotdog: A Shiny app using the Custom Vision API

09-18

Classifying high-resolution chest x-ray medical images with Amazon SageMaker

09-13

Java Home Made Face Recognition Application

09-12

Visual Reinforcement Learning with Imagined Goals

09-06

Visual search on AWS—Part 1: Engine implementation with Amazon SageMaker

08-30

Document worth reading: “Idealised Bayesian Neural Networks Cannot Have Adversarial Examples: Theoretical and Empirical Study”

08-29

Whats new on arXiv

08-28

TINT uses Amazon Comprehend to find and aggregate the best social media content for customers

08-15

Amazon Rekognition is now available in the Asia Pacific (Seoul) and Asia Pacific (Mumbai) Regions

08-09

If you did not already know

08-05

Document worth reading: “Attend Before you Act: Leveraging human visual attention for continual learning”

08-03

Differentiable Image Parameterizations

07-25

Scalable multi-node deep learning training using GPUs in the AWS Cloud

07-20

Classify your own images using Amazon SageMaker

07-20

Using Siamese Networks and Pre-Trained Convolutional Neural Networks (CNNs) for Fashion Similarity Matching

07-10

Building a Diabetic Retinopathy Prediction Application using Azure Machine Learning

06-25

BDD100K Blog Update

06-18

Import AI:

05-29

3 Things We Can Do About Fake News

05-18

How analog TV worked

05-01

Webcam based image processing in Jupyter notebooks

04-09

Java Art Generation with Neural Style Transfer

02-24

Java Autonomous driving – Car detection

01-18

Java Image Cat&Dog Recognition with Deep Neural Networks

01-03

Java Handwritten Digit Recognition with Convolutional Neural Networks

12-13

Using Artificial Intelligence to Augment Human Intelligence

12-04

The Last 5 Years In Deep Learning

12-04

Understanding deep Convolutional Neural Networks with a practical use-case in Tensorflow and Keras

11-13

Building a Visual Search Algorithm

10-13

Run some cool GitHubs on Azure (Python)

09-26

Deep learning on Apache Spark and Apache Hadoop with Deeplearning4j

06-27

Rec-a-Sketch: a Flask App for Interactive Sketchfab Recommendations

02-05

Avoiding overfitting in object detection problem

12-19

Colorizing the DRAW Model

12-06

T-Shirt Design Contest!

11-19

Deep Learning Research Review Week 1: Generative Adversarial Nets

09-30

Analyzing The Papers Behind Facebook's Computer Vision Approach

09-01

The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3)

08-24

A Beginner's Guide To Understanding Convolutional Neural Networks Part 2

07-29

Visualizing Features from a Convolutional Neural Network

06-15

Generating Large Images from Latent Vectors - Part Two

06-02

TensorFlow Implementation of "A Neural Algorithm of Artistic Style"

05-31

Piano Keyboards

05-22

Create a Chrome extension to modify a website’s HTML or CSS

04-14

The Frog of CIFAR 10

04-06

Generating Large Images from Latent Vectors

04-01

Generating Abstract Patterns with TensorFlow

03-25

Diagnosing Heart Diseases with Deep Neural Networks

03-15

Second Annual Data Science Bowl – Part 3 – Automatically Finding the Heart Location in an MRI Image

03-08

Second Annual Data Science Bowl – Part 1

03-06

Mini AI app using TensorFlow and Shiny

01-15

ICCV 2015, Day 3

12-16

ICCV 2015, Day 1

12-14

An Even Dozen – Denoising Dirty Documents: Part 12

11-15

What a Deep Neural Network thinks about your

10-25

Experiments with style transfer

10-02

Denoising Dirty Documents: Part 8

10-02

R Packages worth a look

01-06

Day 04 – little helper evenstrings

12-04

Shortest Crease Problem

01-14

Efficient Guttering

07-29

Denoising Dirty Documents: Part 8

10-02

Experiments with style transfer

10-02

Experiments with style transfer

10-02

Center for Ultrasound Research and Translation, Massachusetts General Hospital: Post-Doctoral Scholar / Research Scientist [Boston, MA]

12-31

How will automation tools change data science?

12-18

Distilled News

11-30

Humana: Principal Data Scientist/Informatics Principal [Chicago, IL, Dallas, TX and Louisville, KY]

11-27

Community Call – Working with images in R

10-24

Adversarial Examples, Explained

10-16

If you did not already know

10-05

If you did not already know

08-29

If you did not already know

08-10

Exercise and weight loss: long-term follow-up

07-10

Building a Data Science Portfolio: Storytelling with Data (Part 2: Data Exploration)

07-14

Experiments with style transfer

10-02

Experiments with style transfer

10-02

R Packages worth a look

12-21

R Packages worth a look

11-04

Similarity in the Wild

02-19

Using Keras' Pretrained Neural Networks for Visual Similarity Recommendations

12-05

Travel Recommendations with Jaccard Similarities

10-03

Travel Recommendations with Jaccard Similarities

10-03

Some fun with {gganimate}

12-27

Number of births in the twentieth century by @ellis2013nz

11-30

You’ve got data on 35 countries, but it’s really just N=3 groups.

09-25

Data concerns when interpreting comparisons of gender equality between countries

08-21

Data concerns when interpreting comparisons of gender equality between countries

08-21

Travel Recommendations with Jaccard Similarities

10-03

Some fun with {gganimate}

12-27

Manipulate dates easily with {lubridate}

12-15

Interactive panel EDA with 3 lines of code

12-09

Document worth reading: “Review of Deep Learning”

10-19

The Economist's Big Mac Index is calculated with R

10-12

The Economist's Big Mac Index is calculated with R

10-12

You’ve got data on 35 countries, but it’s really just N=3 groups.

09-25

Data concerns when interpreting comparisons of gender equality between countries

08-21

Data concerns when interpreting comparisons of gender equality between countries

08-21

Travel Recommendations with Jaccard Similarities

10-03

KDnuggets™ News 19:n01, Jan 3: The Essence of Machine Learning; A Guide to Decision Trees for Machine Learning and Data Science

01-03

KDnuggets™ News 18:n48, Dec 19: Why You Shouldn’t be a Data Science Generalist; Industry Data Science & Machine Learning 2019 Predictions

12-19

Top KDnuggets tweets, Dec 12-18: Deep Learning Cheat Sheets; The Nate Silver vs. Nassim Taleb Twitter War

12-19

Top KDnuggets tweets, Dec 5-11: How to build a data science project from scratch; NeurIPS 2018 video talk collection

12-13

Top KDnuggets tweets, Nov 28 – Dec 4: Deep Learning Cheat Sheets; Amazon opens its internal

12-05

Top KDnuggets tweets, Nov 21-27: Intro to

11-28

KDnuggets™ News 18:n44, Nov 21: What is the Best Python IDE for Data Science?; Anticipating the next move in data science

11-21

Top KDnuggets tweets, Oct 31 – Nov 6: 10 More Free Must-Read Books for Machine Learning and Data Science

11-07

KDnuggets™ News 18:n42, Nov 7: The Most in Demand Skills for Data Scientists; How Machines Understand Our Language: Intro to NLP

11-07

Top KDnuggets tweets, Oct 24-30: Building a Question-Answering System from Scratch

10-31

Top KDnuggets tweets, Oct 3–9: 5 Reasons Logistic Regression should be the first thing you learn when becoming a Data Scientist

10-10

Top KDnuggets tweets, Sep 26 – Oct 2: Why building your own Deep Learning Computer is 10x cheaper than AWS; 6 Steps To Write Any Machine Learning Algorithm

10-03

KDnuggets™ News 18:n37, Oct 3: Mathematics of Machine Learning; Effective Transfer Learning for NLP; Path Analysis with R

10-03

Scrape Tweets from Twitter using Python and Tweepy

02-24

Twitter, Social Bots, and the US Presidential Elections!

11-07

Emoticons decoder for social media sentiment analysis in R

10-16

Travel Recommendations with Jaccard Similarities

10-03

Craft Software

10-05

Craft Software

10-05

Craft Software

10-05

Craft Software

10-05

Craft Software

10-05

Lychrel Numbers

10-05

Lychrel Numbers

10-05

Six Steps to Master Machine Learning with Data Preparation

12-21

Scalable multi-node training with TensorFlow

12-17

The Machine Learning Project Checklist

12-07

Now easily perform incremental learning on Amazon SageMaker

11-07

Document worth reading: “Toward a System Building Agenda for Data Integration”

11-06

Multilevel Modeling Solves the Multiple Comparison Problem: An Example with R

10-31

Save time and money by filtering faces during indexing with Amazon Rekognition

09-18

Use Kaggle to start (and guide) your ML/ Data Science journey — Why and How

08-29

Build an automatic alert system to easily moderate content at scale with Amazon Rekognition Video

08-15

3 Steps to Build Your First Intelligent App – Conference Buddy

07-31

A particles-arly fun book draw

05-02

Introduction to XGBoost

02-17

Lychrel Numbers

10-05

Le Monde puzzle [#1075]

12-11

Variational Autoencoders Explained in Detail

11-30

National Pi Day

03-15

Weird Number Bases

12-16

Lychrel Numbers

10-05

Data Feminism

11-06

Lychrel Numbers

10-05

Automated and continuous deployment of Amazon SageMaker models with AWS Step Functions

01-10

Use AWS Machine Learning to Analyze Customer Calls from Contact Centers (Part 2): Automate, Deploy, and Visualize Analytics using Amazon Transcribe, Amazon Comprehend, AWS CloudFormation, and Amazon QuickSight

12-28

Transcribe speech in three new languages: French, Italian, and Brazilian Portuguese

12-21

Build a serverless Twitter reader using AWS Fargate

12-06

Announcing the Winners of the 2018 AWS AI Hackathon

12-05

Amazon SageMaker notebooks now support Git integration for increased persistence, collaboration, and reproducibility

11-29

Amazon Launches Machine Learning University

11-27

Building a conversational business intelligence bot with Amazon Lex

11-21

New Features For Amazon SageMaker: Workflows, Algorithms, and Accreditation

11-21

Analyze live video at scale in real time using Amazon Kinesis Video Streams and Amazon SageMaker

11-19

What is Cloud Computing & Which is Better, AWS or GCP

11-15

AWS expands HIPAA eligible machine learning services for healthcare customers

11-08

Direct access to Amazon SageMaker notebooks from Amazon VPC by using an AWS PrivateLink endpoint

11-06

Amazon SageMaker Batch Transform now supports Amazon VPC and AWS KMS-based encryption

10-22

Amazon Comprehend introduces new Region availability and language support for French, German, Italian, and Portuguese

10-10

Track the number of coffees consumed using AWS DeepLens

10-09

Segmenting brain tissue using Apache MXNet with Amazon SageMaker and AWS Greengrass ML Inference – Part 2

10-04

How to use common workflows on Amazon SageMaker notebook instances

10-03

Discovering and indexing podcast episodes using Amazon Transcribe and Amazon Comprehend

09-20

Limit access to a Jupyter notebook instance by IP address

09-14

Get started with automated metadata extraction using the AWS Media Analysis Solution

09-07

Access Amazon S3 data managed by AWS Glue Data Catalog from Amazon SageMaker notebooks

08-29

R Packages worth a look

08-23

Managing your expenses with Amazon Lex

08-21

Build an automatic alert system to easily moderate content at scale with Amazon Rekognition Video

08-15

Announcing the Artificial Intelligence (AI) Hackathon: Build Intelligent Applications using machine learning APIs and serverless

08-15

Amazon Rekognition is now available in the Asia Pacific (Seoul) and Asia Pacific (Mumbai) Regions

08-09

Build a document search bot using Amazon Lex and Amazon Elasticsearch Service

08-01

AWS Machine Learning Big Data NYC

10-24

Writing Effective Amazon Machine Learning

02-19

The Unbundling of AWS

10-06

R Packages worth a look

12-28

The Unbundling of AWS

10-06

The Unbundling of AWS

10-06

The Unbundling of AWS

10-06

Distilled News

01-09

Whats new on arXiv

01-06

Distilled News

12-30

Leaf Plant Classification: An Exploratory Analysis – Part 1

12-29

Distilled News

12-21

Feature engineering, Explained

12-21

Solve any Image Classification Problem Quickly and Easily

12-13

Common mistakes when carrying out machine learning and data science

12-06

How to build a data science project from scratch

12-05

My secret sauce to be in top 2% of a Kaggle competition

11-26

If you did not already know

11-22

If you did not already know

11-19

How to create useful features for Machine Learning

10-30

Distilled News

10-28

If you did not already know

10-23

Applied Data Science: Solving a Predictive Maintenance Business Problem Part 3

10-16

Challenges & Solutions for Production Recommendation Systems

10-05

Document worth reading: “On the Learning Dynamics of Deep Neural Networks”

09-23

Distilled News

09-20

Distilled News

09-11

If you did not already know

09-05

Document worth reading: “Examining the Use of Neural Networks for Feature Extraction: A Comparative Analysis using Deep Learning, Support Vector Machines, and K-Nearest Neighbor Classifiers”

08-08

If you did not already know

08-08

Essential Tips and Tricks for Starting Machine Learning with Python

08-05

If you did not already know

08-01

Using Siamese Networks and Pre-Trained Convolutional Neural Networks (CNNs) for Fashion Similarity Matching

07-10

Feature-wise transformations

07-09

My Thoughts on Synthetic Data

06-27

From Gaussian Algebra to Gaussian Processes, Part 2

06-12

Kernel Feature Selection via Conditional Covariance Minimization

01-23

Weekly Review: 10/21/2017

10-21

Deep learning with Apache MXNet on Cloudera Data Science Workbench

10-19

Introductory Machine Learning Terminology with Food

07-18

Kaggle’s Mercedes-Benz Greener Manufacturing

07-01

Kaggle’s Quora Question Pairs Competition

06-07

Sentiment Analysis model deployed!

04-17

Deconstruction with Discrete Embeddings

02-15

Using Keras' Pretrained Neural Networks for Visual Similarity Recommendations

12-05

Random forest interpretation – conditional feature contributions

10-24

Visualizing Features from a Convolutional Neural Network

06-15

A Guide to Gradient Boosted Trees with XGBoost in Python

06-05

Step by step Kaggle competition tutorial

04-10

Guide to an in-depth understanding of logistic regression

02-22

A Torch autoencoder example

11-06

7 tools in every data scientist’s toolbox

10-15

The Unbundling of AWS

10-06

How to Learn Python in 30 days

01-12

How to Learn Python in 30 days

01-02

Why Learning Data Science Live is Better than Self-Paced Learning

01-02

Vanguard: Senior AI Architect [Malvern, PA]

12-17

8 Data Science Projects to Build your Portfolio

12-11

8 Data Science Projects to Build your Portfolio

12-03

Distilled News

11-27

What I Learned About Machine Learning at ODSC West 2018

11-19

Distilled News

11-05

Data Science Interview Questions with Answers

10-28

Advantages of Online Data Science Courses

09-26

Learning Statistics Online for Data Science

09-20

Distilled News

09-19

Forbes: 25 Machine Learning Startups to Watch in 2018

08-26

Yet Another PhD to Data Science Post (Part III)

10-06

Principles of Database Management: The Practical Guide to Storing, Managing and Analyzing Big and Small Data

01-10

New Year's Resolution: Help Data Scientists Help You

12-31

I Spy with my Graphing Eye 📊 👁️

12-12

Le Monde puzzle [#1078]

11-28

The “probability to win” is hard to estimate…

11-07

Talk: How Do We Support Under-represented Groups To Put Themselves Forward?

11-01

Building a Question-Answering System from Scratch

10-24

Machine Reading at Scale – Transfer Learning for Large Text Corpuses

10-17

Things you should know when traveling via the Big Data Engineering hype-train

10-08

Document worth reading: “A Survey on Expert Recommendation in Community Question Answering”

09-28

Constructing a Data Analysis

08-24

The Law and Order of Data Science

08-15

3 Steps to Build Your First Intelligent App – Conference Buddy

07-31

The Data Incubator Unofficial Frequently Asked Questions

05-30

Sock Puzzle Revisited

03-07

How To Write, Deploy, and Interact with Ethereum Smart Contracts on a Private Blockchain

12-15

8 Important Python Interview Questions and Answers

11-17

Two Recent Results in Transfer Learning for Music and Speech

11-01

Sales Automation Through a Deep Learning Platform

09-22

Is BackPropagation Necessary?

08-23

Decision Trees Tutorial

07-27

The Definitive Q&A Guide for Aspiring Data Scientists

01-25

“Becoming a Data Scientist” Learning Club?

11-09

Yet Another PhD to Data Science Post (Part III)

10-06

Yet Another PhD to Data Science Post (Part III)

10-06

The Right Kind of Internal Motivation Can Improve Your Studies

01-08

Forget Motivation and Double Your Chances of Learning Success

11-20

Does Sharing Goals Help or Hurt Your Chances of Success?

10-22

Why do I Call Myself a Data Scientist?

10-05

A potential big problem with placebo tests in econometrics: they’re subject to the “difference between significant and non-significant is not itself statistically significant” issue

09-26

How feminism has made me a better scientist

08-13

How feminism has made me a better scientist

08-13

Occam razor vs. machine learning

07-12

Yet Another PhD to Data Science Post (Part III)

10-06

Predicting Fantasy Football Points

10-07

Predicting Fantasy Football Points

10-07

In statistics, we talk about uncertainty without it being viewed as undesirable

08-25

Predicting Fantasy Football Points

10-07

Linguistic Signals of Album Quality: A Predictive Analysis of Pitchfork Review Scores Using Quanteda

01-10

All About Scikit-Learn, with Olivier Grisel

11-13

Document worth reading: “Resource Management in Fog/Edge Computing: A Survey”

10-30

Document worth reading: “Analytics for the Internet of Things: A Survey”

09-12

Mouse Among the Cats

09-11

ICML Board and Reviewer profiles

03-05

Beer reviews with Recurrent Neural Networks (RNN)

10-08

Stereograms

11-26

Implement Simple Convolution with Java

09-27

Using gganimate to illustrate the luminance illusion

08-22

Deepcolor: automatic coloring and shading of manga-style lineart

03-01

Colorizing the DRAW Model

12-06

Beer reviews with Recurrent Neural Networks (RNN)

10-08

Beer reviews with Recurrent Neural Networks (RNN)

10-08

Beer reviews with Recurrent Neural Networks (RNN)

10-08

On the consistency of ordinal regression methods

10-08

On the consistency of ordinal regression methods

10-08

On the consistency of ordinal regression methods

10-08

On the consistency of ordinal regression methods

10-08

On the consistency of ordinal regression methods

10-08

Safe Crime Detection

06-05

Random Forest Tutorial: Predicting Crime in San Francisco

08-25

Step by step Kaggle competition tutorial

04-10

How-to: Build a Machine-Learning App Using Sparkling Water and Apache Spark

10-08

Using Siamese Networks and Pre-Trained Convolutional Neural Networks (CNNs) for Fashion Similarity Matching

07-10

How-to: Build a Machine-Learning App Using Sparkling Water and Apache Spark

10-08

Discourse Network Analysis: Undertaking Literature Reviews in R

11-15

Build a Predictive Maintenance Engine with GIS Data

10-03

CES 2018

01-12

How-to: Build a Machine-Learning App Using Sparkling Water and Apache Spark

10-08

Cloudera Enterprise 5.12 is Now Available

07-13

Getting Started with Cloudera Data Science Workbench

05-08

Sense is now part of Cloudera!

03-22

How-to: Build a Machine-Learning App Using Sparkling Water and Apache Spark

10-08

How-to: Build a Machine-Learning App Using Sparkling Water and Apache Spark

10-08

LOCF and Linear Imputation with PostgreSQL

10-11

LOCF and Linear Imputation with PostgreSQL

10-11

R Packages worth a look

01-03

Interactive panel EDA with 3 lines of code

12-09

If you did not already know

11-03

Measuring Bernoulli Probabilities in the Presence of Delayed Reactions

08-11

LOCF and Linear Imputation with PostgreSQL

10-11

Considering sensitivity to unmeasured confounding: part 1

01-02

Matching (and discarding non-matches) to deal with lack of complete overlap, then regression to adjust for imbalance between treatment and control groups

11-10

If you did not already know

11-03

Measuring Bernoulli Probabilities in the Presence of Delayed Reactions

08-11

Discovering and understanding patterns in highly dimensional data

02-28

LOCF and Linear Imputation with PostgreSQL

10-11

Plotting Scottish census data with some tidyverse magic

11-28

How to Highlight 3D Brain Regions

10-31

LOCF and Linear Imputation with PostgreSQL

10-11

Whats new on arXiv

01-12

Whats new on arXiv

01-10

5 things that happened in Data Science in 2018

01-08

Whats new on arXiv

01-01

Whats new on arXiv

12-31

The Essence of Machine Learning

12-28

Data Science & ML : A Complete Interview Guide

12-26

Distilled News

12-21

Whats new on arXiv

12-19

2018 Year-in-Review: Machine Learning Open Source Projects & Frameworks

12-17

Whats new on arXiv

12-17

Soft Actor Critic—Deep Reinforcement Learning with Real-World Robots

12-14

Whats new on arXiv

12-13

Keras Hyperparameter Tuning in Google Colab Using Hyperas

12-12

8 Data Science Projects to Build your Portfolio

12-11

Whats new on arXiv

12-10

Whats new on arXiv

12-10

Whats new on arXiv

12-09

Whats new on arXiv

12-07

Distilled News

12-06

Whats new on arXiv

12-04

8 Data Science Projects to Build your Portfolio

12-03

Variational Autoencoders Explained in Detail

11-30

Distilled News

11-30

ML Methods for Prediction and Personalization

11-30

Whats new on arXiv

11-25

Whats new on arXiv

11-25

Distilled News

11-24

An Introduction to AI

11-21

Whats new on arXiv

11-20

ML Methods for Prediction and Personalization

11-19

Easily monitor and visualize metrics while training models on Amazon SageMaker

11-19

Whats new on arXiv

11-17

Whats new on arXiv

11-15

Distilled News

11-08

Distilled News

11-02

Distilled News

10-31

Learning to learn in a model-agnostic way

10-29

Whats new on arXiv

10-26

Introduction to Active Learning

10-23

If you did not already know

10-21

Distilled News

10-18

Four machine learning strategies for solving real-world problems

10-17

Distilled News

10-15

Whats new on arXiv

10-10

Learning Acrobatics by Watching YouTube

10-09

A Concise Explanation of Learning Algorithms with the Mitchell Paradigm

10-05

Whats new on arXiv

10-03

Reinforcement Learning: Super Mario, AlphaGo and beyond

10-01

Whats new on arXiv

09-28

Machine Learning and Deep Learning : Differences

09-28

How to Optimise Ad CTR with Reinforcement Learning

09-24

Whats new on arXiv

09-20

Whats new on arXiv

09-19

Whats new on arXiv

09-18

Deep learning made easier with transfer learning

09-17

How to Optimise Ad CTR with Reinforcement Learning

09-17

Distilled News

09-17

Whats new on arXiv

09-14

The Benefits of Active Learning for Data Science Skills

09-12

Data Science Glossary

09-12

Whats new on arXiv

09-03

Whats new on arXiv

08-24

Whats new on arXiv

08-17

Whats new on arXiv

08-16

DIFFERENCE BETWEEN DATA SCIENCE, DATA ANALYTICS AND MACHINE LEARNING

08-09

Whats new on arXiv

08-03

Whats new on arXiv

08-01

Whats new on arXiv

07-31

One-Shot Imitation from Watching Videos

06-28

TDM: From Model-Free to Model-Based Deep Reinforcement Learning

04-26

AlphaGo Zero Is Not A Sign of Imminent Human-Level AI

03-30

AI and Deep Learning in 2017 – A Year in Review

12-31

Deep reinforcement learning, battleship

10-15

Gradient descent learns linear dynamical systems

10-13

A fastText-based hybrid recommender

09-27

Learning in Brains and Machines (3): Synergistic and Modular Action

07-03

Making use of the model

06-20

Deep Learning Trends @ ICLR 2016

06-01

Deep Reinforcement Learning: Pong from Pixels

05-31

Keras plays catch, a single file Reinforcement Learning example

03-17

The Deep Learning Gold Rush of 2015

11-07

Theoretical Motivations for Deep Learning

10-18

Deep Learning Startups, Applications and Acquisitions – A Summary

10-13

Deep Learning Startups, Applications and Acquisitions – A Summary

10-13

Document worth reading: “Deep Neural Network Approximation Theory”

01-12

My presentations on ‘Elements of Neural Networks & Deep Learning’ -Part1,2,3

01-10

Biggest Deep Learning Summit – Special KDnuggets Offer

01-10

Document worth reading: “Recent Advances in Deep Learning: An Overview”

01-08

February 21st & 22nd: End-2-End from a Keras/TensorFlow model to production

01-07

Strata Data SF 2019 KDnuggets Offer

01-04

Top KDnuggets tweets, Dec 12-18: Deep Learning Cheat Sheets; The Nate Silver vs. Nassim Taleb Twitter War

12-19

Heavy Tailed Self Regularization in Deep Neural Nets: 1 year of research

12-18

Spark + AI Summit: learn best practices in ML and DL, latest frameworks, and more – special KDnuggets offer

12-14

My book ‘Deep Learning from first principles:Second Edition’ now on Amazon

12-14

If you did not already know

12-13

A Machine Learning Deep Dive [Webinar, Dec 13]

12-11

Top November Stories: The Most in Demand Skills for Data Scientists; What is the Best Python IDE for Data Science?

12-11

Einops — a new style of deep learning code

12-06

Top KDnuggets tweets, Nov 28 – Dec 4: Deep Learning Cheat Sheets; Amazon opens its internal

12-05

Deep Learning for the Masses (… and The Semantic Layer)

11-30

Document worth reading: “A Learning Approach to Secure Learning”

11-19

Document worth reading: “Deep Reinforcement Learning: An Overview”

11-14

Top 13 Python Deep Learning Libraries

11-02

R Packages worth a look

10-20

If you did not already know

10-17

If you did not already know

09-30

Machine Learning and Deep Learning : Differences

09-28

If you did not already know

09-06

If you did not already know

08-29

Videos from NYC R Conference

08-28

If you did not already know

08-25

Recent top-selling books in AI and Machine Learning

07-31

Deep Reinforcement Learning in Action (Announcement)

06-20

Data Science in 30 Minutes: Deep Learning to Detect Fake News with Uber ATG Head of Data Science, Mike Tamir

05-30

Michael B. Cohen

09-28

What Killed the Curse of Dimensionality?

09-06

Getting Started with Sonnet, Deep Mind’s Deep Learning Library

04-10

From Analytical to Numerical to Universal Solutions

03-20

Deep and Hierarchical Implicit Models

02-28

Grokking Deep Learning

08-17

Talk: Building Machines that Imagine and Reason

07-28

The Best of Unpublished Machine Learning and Statistics Books

02-09

ICCV 2015, Day 2

12-15

The Deep Learning Gold Rush of 2015

11-07

Deep Learning Startups, Applications and Acquisitions – A Summary

10-13

Deep Learning Startups, Applications and Acquisitions – A Summary

10-13

Denoising Dirty Documents: Part 9

10-15

Whats new on arXiv

01-08

Stock Price prediction using ML and DL

01-07

Whats new on arXiv

01-04

How to Learn Python in 30 days

01-02

BERT: State of the Art NLP Model, Explained

12-26

Top 10 Data Science Tools (other than SQL Python R)

12-21

Whats new on arXiv

12-20

Whats new on arXiv

12-20

Distilled News

12-17

NLP Breakthrough Imagenet Moment has arrived

12-14

Whats new on arXiv

12-11

Failure Pressure Prediction Using Machine Learning

12-10

Whats new on arXiv

12-08

Distilled News

12-07

Starspace for NLP

12-04

Whats new on arXiv

11-29

Whats new on arXiv

11-29

Whats new on arXiv

11-28

Whats new on arXiv

11-25

Whats new on arXiv

11-23

Whats new on arXiv

11-20

Distilled News

11-19

Whats new on arXiv

11-17

Whats new on arXiv

11-16

Searching for the optimal hyper-parameters of an ARIMA model in parallel: the tidy gridsearch approach

11-15

Whats new on arXiv

11-12

Whats new on arXiv

11-11

Why would I ever NEED Bayesian Statistics?

11-09

Whats new on arXiv

11-06

Can we predict the crawling of the Google-Bot?

11-06

The 3Ds of Machine Learning Systems Design

11-05

Whats new on arXiv

10-30

Whats new on arXiv

10-30

Whats new on arXiv

10-27

Whats new on arXiv

10-26

Distilled News

10-24

Whats new on arXiv

10-23

Whats new on arXiv

10-22

Whats new on arXiv

10-22

Applied Data Science: Solving a Predictive Maintenance Business Problem Part 3

10-16

Limitations of “Limitations of Bayesian Leave-One-Out Cross-Validation for Model Selection”

10-12

Modeling Airbnb prices

10-12

Whats new on arXiv

10-11

Whats new on arXiv

10-09

Whats new on arXiv

10-03

Whats new on arXiv

10-02

A Review of the Neural History of Natural Language Processing

10-01

Whats new on arXiv

09-27

Understanding Regression Error Metrics

09-26

Whats new on arXiv

09-25

Whats new on arXiv

09-24

Whats new on arXiv

09-21

Whats new on arXiv

09-18

Distilled News

09-17

Whats new on arXiv

09-14

Whats new on arXiv

09-13

Against Arianism 2: Arianism Grande

09-12

Whats new on arXiv

09-12

Whats new on arXiv

09-11

Whats new on arXiv

09-11

Whats new on arXiv

09-07

Whats new on arXiv

09-07

Whats new on arXiv

09-01

Whats new on arXiv

08-31

Whats new on arXiv

08-28

Whats new on arXiv

08-28

Bayesian model comparison in ecology

08-26

Whats new on arXiv

08-24

Whats new on arXiv

08-22

Whats new on arXiv

08-21

Whats new on arXiv

08-21

Whats new on arXiv

08-17

Build a model to predict the impact of weather on urban air quality using Amazon SageMaker

08-16

R Packages worth a look

08-16

Distilled News

08-14

Whats new on arXiv

08-14

Whats new on arXiv

08-13

Whats new on arXiv

08-09

When LOO and other cross-validation approaches are valid

08-03

Data science books - theory and practice

06-29

My Thoughts on Synthetic Data

06-27

57 Summaries of Machine Learning and NLP Research

01-17

Hierarchical Softmax

08-01

Cognitive Machine Learning (2): Uncertain Thoughts

03-12

Cognitive Machine Learning (1): Learning to Explain

02-05

Customer lifetime value and the proliferation of misinformation on the internet

01-08

Recurrent Neural Network Tutorial for Artists

01-01

On Model Mismatch and Bayesian Analysis

12-13

Denoising Dirty Documents: Part 9

10-15

R Packages worth a look

11-22

Building a neighbour matrix with python

11-04

R Packages worth a look

11-04

R Packages worth a look

10-17

Synesthesia: The Sound of Style

08-29

Two cool features of Python NumPy: Mutating by slicing and Broadcasting

03-17

Java Handwritten Digit Recognition with Convolutional Neural Networks

12-13

Square to Hex

03-11

Intro to Implicit Matrix Factorization: Classic ALS with Sketchfab Models

10-19

Collaborative Filtering using Alternating Least Squares

09-17

Creating a Search Engine

08-19

Factorization Machines A Theoretical Introduction

06-26

dotify: Recommending Spotify Music Through Country Arithmetic

04-15

Denoising Dirty Documents: Part 9

10-15

7 tools in every data scientist’s toolbox

10-15

RTest: pretty testing of R packages

01-07

Part 4: Why does bias occur in optimism corrected bootstrapping?

12-28

If you did not already know

12-27

Whats new on arXiv

12-18

Phillips-Ouliaris Test For Cointegration

12-17

covrpage, more information on unit testing

12-10

R Packages worth a look

12-03

The Distribution of Time Between Recessions: Revisited (with MCHT)

11-19

Distilled News

11-16

Gazing into the Abyss of P-Hacking: HARKing vs. Optional Stopping

11-14

Time Series and MCHT

11-12

RATest. A Randomization Tests package is available on CRAN

11-11

Machine Learning Classification: A Dataset-based Pictorial

11-05

Bootstrap Testing with MCHT

10-29

Packages for Testing your R Package

10-22

Maximized Monte Carlo Testing with MCHT

10-22

shinytest – Automated testing for Shiny apps

10-18

5 “Clean Code” Tips That Will Dramatically Improve Your Productivity

10-15

He had a sudden cardiac arrest. How does this change the probability that he has a particular genetic condition?

10-14

A/B Testing: The Definitive Guide to Improving Your Product

10-11

Quick Significance Calculations for A/B Tests in R

10-06

5 Reasons Why You Should Use Cross-Validation in Your Data Science Projects

10-02

What is P-value?

09-20

R Packages worth a look

08-30

R Packages worth a look

08-25

Two things about power

05-14

JUnit,Integration,End to End Tests

10-22

How I was screwing up testing my code

10-15

Facts and Fallacies of Software Engineering - Book Review

02-11

A tour of Factor: 3

06-20

Why Can't Gay Men Donate Blood? A Bayesian Analysis

06-16

A Guide to Gradient Boosted Trees with XGBoost in Python

06-05

Defective Circuit Board Puzzle

01-14

Most Winning A/B Test Results are Illusory

11-01

7 tools in every data scientist’s toolbox

10-15

Dow Jones Stock Market Index (3/4): Log Returns GARCH Model

01-08

RTest: pretty testing of R packages

01-07

Part 4: Why does bias occur in optimism corrected bootstrapping?

12-28

Optimism corrected bootstrapping: a problematic method

12-25

Whats new on arXiv

12-18

Phillips-Ouliaris Test For Cointegration

12-17

covrpage, more information on unit testing

12-10

R Packages worth a look

12-03

The Distribution of Time Between Recessions: Revisited (with MCHT)

11-19

Distilled News

11-16

Gazing into the Abyss of P-Hacking: HARKing vs. Optional Stopping

11-14

Time Series and MCHT

11-12

5 Critical Steps to Predictive Business Analytics

11-08

Building a Repository of Alpine-based Docker Images for R, Part I

11-04

Bootstrap Testing with MCHT

10-29

Packages for Testing your R Package

10-22

Maximized Monte Carlo Testing with MCHT

10-22

shinytest – Automated testing for Shiny apps

10-18

He had a sudden cardiac arrest. How does this change the probability that he has a particular genetic condition?

10-14

A/B Testing: The Definitive Guide to Improving Your Product

10-11

If you did not already know

10-10

R Packages worth a look

09-18

R Packages worth a look

08-30

R Packages worth a look

08-25

Testing code with random output

08-06

Two things about power

05-14

How I was screwing up testing my code

10-15

A tour of Factor: 3

06-20

Why Can't Gay Men Donate Blood? A Bayesian Analysis

06-16

Defective Circuit Board Puzzle

01-14

7 tools in every data scientist’s toolbox

10-15

Emoticons decoder for social media sentiment analysis in R

10-16

Emoticons decoder for social media sentiment analysis in R

10-16

Emoticons decoder for social media sentiment analysis in R

10-16

An even better rOpenSci website with Hugo

01-09

Icon making with ggplot2 and magick

01-03

x-mas tRees with gganimate, ggplot, plotly and friends

01-03

Nimble tweak to use specific python version or virtual environment in RStudio

01-01

My

12-28

R 3.5.2 now available

12-20

Examining the Tweeting Patterns of Prominent Crossfit Gyms

12-20

R community update: announcing sessions for useR Delhi December meetup

12-13

Le Monde puzzle [#1075]

12-11

It was twenty years ago …

12-08

R community update: announcing useR Delhi December meetup and CFP

12-07

Trust in ML models. Slides from TWiML & AI EMEA Meetup + iX Articles

12-05

Using R: the best thing I’ve changed about my code in years

12-01

Simulating dinosaur populations, with R

11-30

Le Monde puzzle [#1078]

11-28

More Robust Monotonic Binning Based on Isotonic Regression

11-24

Scrapping data about Australian politicians with RSelenium

11-21

Checklist Recipe – How we created a template to standardize species data

11-20

More Sandwiches, Anyone?

11-14

TWIMLAI European Online Meetup about Trust in Predictions of ML Models

11-13

anytime – dates in R

11-08

A quick look at GHCN version 4

11-03

Site Migration

10-30

Community Call – Working with images in R

10-24

Cannibus Curve with ggplot2

10-17

Writing Code to Read Quotes About Writing Code

10-12

Download 3 million Russian troll tweets

08-02

Ffa1ea00fdab31b3b44b87839c503629

05-06

Twitter bots for good, and information contagion!

09-27

Diffusion of ISIS propaganda on Twitter

07-28

6279e808ef0c35488ea3a81e9b6d302a

07-06

Millions of social bots invaded Twitter!

03-14

Emotional contagion in Twitter!

11-14

Emoticons decoder for social media sentiment analysis in R

10-16

Generating Fibonacci Numbers

10-16

R Packages worth a look

11-01

Generating Fibonacci Numbers

10-16

Generating Fibonacci Numbers

10-16

What does it mean to write “vectorized” code in R?

01-04

R Packages worth a look

09-06

What is a p-value

08-09

Generating Fibonacci Numbers

10-16

Clustering debates from UK politicians

10-16

Where does .Renviron live on Citrix?

01-08

Displaying our “R – Quality Control Individual Range Chart Made Nice” inside a Java web App using AJAX – How To.

01-04

Looking into 19th century ads from a Luxembourguish newspaper with R

01-04

Dataviz Course Packet Quickstart

01-02

AzureStor: an R package for working with Azure storage

12-18

So you want to play a pRank in R…?

12-18

AzureStor: an R package for working with Azure storage

12-18

RStudio Pandoc – HTML To Markdown

12-15

Teaching and Learning Materials for Data Visualization

12-12

Statistics Sunday: Introduction to Regular Expressions

11-25

Statistics Sunday: Reading and Creating a Data Frame with Multiple Text Files

11-18

Congress Over Time

11-17

UI Update — Datazar

11-07

If you did not already know

11-07

Source and List: Organizing R Shiny Apps

11-06

Beginner Data Visualization & Exploration Using Pandas

10-22

Running R scripts within in-database SQL Server Machine Learning

10-14

R Packages worth a look

09-18

R Packages worth a look

09-02

R Packages worth a look

09-01

Getting Started with Competitions - A Peer to Peer Guide

08-22

R Packages worth a look

08-10

Open Source Datasets with Kaggle

06-21

Top 12 Essential Command Line Tools for Data Scientists

06-20

Why you should start using .npy file more often…

03-20

Top gsutil command lines to get started on Google Cloud Storage

01-01

Gsutil cheatsheet

11-02

Feather format update: Whence and Whither?

10-16

Building a Tic-Tac-Toe web-app in this Webpack tutorial and Babel tutorial

04-07

Extreme IO performance with parallel Apache Parquet in Python

02-10

Simple Stock Ticker App

02-04

Optimizing Split Sizes for Hadoop’s CombineFileInputFormat

05-09

Adobe Analytics Clickstream Data Feed: Loading To Relational Database

03-18

Do average consumers still need Dropbox?

03-13

A Million Text Files And A Single Laptop

01-28

Clustering debates from UK politicians

10-16

Clustering debates from UK politicians

10-16

Scrapping data about Australian politicians with RSelenium

11-21

Getting the data from the Luxembourguish elections out of Excel

10-21

Clustering debates from UK politicians

10-16

If you did not already know

12-18

Morph, an open-source tool for data-driven art without code

09-26

Help! I can’t reproduce a machine learning project!

09-19

If you did not already know

09-09

If you did not already know

08-05

Overview and benchmark of traditional and deep learning models in text classification

06-12

SQLite vs Pandas: Performance Benchmarks

05-23

Kolmogorov and randomness

02-18

Flipping a Coin on a Crazy Plane

05-01

Simple reinforcement learning methods to learn CartPole

07-01

History of Monte Carlo Methods - Part 1

10-16

R Packages worth a look

11-03

Continuous tempering through path sampling

08-02

History of Monte Carlo Methods - Part 1

10-16

Document worth reading: “Recommendation System based on Semantic Scholar Mining and Topic modeling: A behavioral analysis of researchers from six conferences”

01-03

If you did not already know

12-17

“My advisor and I disagree on how we should carry out repeated cross-validation. We would love to have a third expert opinion…”

12-15

R Packages worth a look

12-07

Bayesian Nonparametric Models in NIMBLE, Part 1: Density Estimation

12-04

Gazing into the Abyss of P-Hacking: HARKing vs. Optional Stopping

11-14

If you did not already know

11-01

Additional Strategies for Confronting the Partition Function

10-30

A Thorough Introduction to Boltzmann Machines

10-20

Statistics Sunday: Some Psychometric Tricks in R

10-14

R Packages worth a look

10-14

A/B Testing: The Definitive Guide to Improving Your Product

10-11

Distilled News

08-23

R Packages worth a look

08-15

Data Science Portfolio Project: Is Fandango Still Inflating Ratings?

08-15

Handling Imbalanced Classes in the Dataset

08-03

Gaussian Processes

11-25

From Instance Noise to Gradient Regularisation

06-01

Tic-Tac-AI: A Strong Tic-Tac-Toe AI Opponent using Forward Sampling

03-07

Blending independent estimates

05-25

The Mathematics Behind: Rejection Sampling

01-24

History of Monte Carlo Methods - Part 1

10-16

Top 5 Data Visualization Tools for 2019

01-03

Combining apparently contradictory evidence

12-30

Introduction to Statistics for Data Science

12-17

“Increase sample size until statistical significance is reached” is not a valid adaptive trial design; but it’s fixable.

12-07

How to Gather Your Own Data by Conducting a Great Survey

11-27

Gazing into the Abyss of P-Hacking: HARKing vs. Optional Stopping

11-14

If you did not already know

11-01

Additional Strategies for Confronting the Partition Function

10-30

If you did not already know

10-17

Statistics Sunday: Some Psychometric Tricks in R

10-14

A/B Testing: The Definitive Guide to Improving Your Product

10-11

If you did not already know

09-05

Data Science Portfolio Project: Is Fandango Still Inflating Ratings?

08-15

Handling Imbalanced Classes in the Dataset

08-03

Automated machine learning is coming... and it won't matter

04-04

Incremental means and variances

11-28

Gaussian Processes

11-25

What's new in PyMC3 3.1

07-05

From Instance Noise to Gradient Regularisation

06-01

Normal Distributions

05-14

Hail: Scalable Genomics Analysis with Apache Spark

05-02

Model AUC depends on test set difficulty

03-19

Tic-Tac-AI: A Strong Tic-Tac-Toe AI Opponent using Forward Sampling

03-07

Blending independent estimates

05-25

The Mathematics Behind: Rejection Sampling

01-24

History of Monte Carlo Methods - Part 1

10-16

Reflections on the 10th anniversary of the Revolutions blog

12-10

Reflections on the 10th anniversary of the Revolutions blog

12-10

Very Non-Standard Calling in R

12-03

Very Non-Standard Calling in R

12-03

Document worth reading: “Multi-Agent Reinforcement Learning: A Report on Challenges and Approaches”

11-17

More Sandwiches, Anyone?

11-14

Arnaub Chatterjee discusses artificial intelligence (AI) and machine learning (ML) in healthcare.

10-29

Document worth reading: “Big Data Systems Meet Machine Learning Challenges: Towards Big Data Science as a Service”

10-07

Document worth reading: “Foundations of Complex Event Processing”

08-04

Generative King of Kowloon

01-05

History of Monte Carlo Methods - Part 1

10-16

Bayes Primer

10-17

Bayes Primer

10-17

Bayes Primer

10-17

Bayes Primer

10-17

Analyzing Pronto CycleShare Data with Python and Pandas

10-18

Analyzing Pronto CycleShare Data with Python and Pandas

10-18

Analyzing Pronto CycleShare Data with Python and Pandas

10-18

Analyzing Pronto CycleShare Data with Python and Pandas

10-18

Whats new on arXiv

01-13

How to Learn Python in 30 days

01-12

Whats new on arXiv

01-10

Whats new on arXiv

01-09

How do Convolutional Neural Nets (CNNs) learn? + Keras example

01-09

Learn Python for Data Science From Scratch

01-09

Whats new on arXiv

01-08

5 things that happened in Data Science in 2018

01-08

Auto-Keras and AutoML: A Getting Started Guide

01-07

Whats new on arXiv

01-06

Whats new on arXiv

01-04

How to Learn Python in 30 days

01-02

Whats new on arXiv

12-31

Distilled News

12-29

Whats new on arXiv

12-29

Whats new on arXiv

12-28

Distilled News

12-25

Distilled News

12-20

Whats new on arXiv

12-20

Whats new on arXiv

12-20

Hackathon Winner Interview: Penn State | Kaggle University Club

12-19

Distilled News

12-19

Whats new on arXiv

12-19

vtreat Variable Importance

12-18

vtreat Variable Importance

12-18

Distilled News

12-18

Whats new on arXiv

12-17

Soft Actor Critic—Deep Reinforcement Learning with Real-World Robots

12-14

Whats new on arXiv

12-14

Solve any Image Classification Problem Quickly and Easily

12-13

Whats new on arXiv

12-13

Distilled News

12-12

Machine Learning & AI Main Developments in 2018 and Key Trends for 2019

12-11

Whats new on arXiv

12-11

Whats new on arXiv

12-10

Whats new on arXiv

12-10

Whats new on arXiv

12-10

Whats new on arXiv

12-07

Must-Have Resources to Become a Data Scientist

12-06

If you did not already know

12-04

Whats new on arXiv

12-04

Distilled News

12-03

Whats new on arXiv

12-03

If you did not already know

12-03

Whats new on arXiv

12-01

If you did not already know

12-01

Variational Autoencoders Explained in Detail

11-30

Math in Data Science

11-30

Distilled News

11-30

Distilled News

11-30

Whats new on arXiv

11-29

Whats new on arXiv

11-27

How to Engineer Your Way Out of Slow Models

11-27

Distilled News

11-26

Whats new on arXiv

11-26

Whats new on arXiv

11-25

Distilled News

11-24

An Introduction to AI

11-21

Whats new on arXiv

11-20

Distilled News

11-20

Import AI 121: Sony researchers make ultra-fast ImageNet training breakthrough; Berkeley researchers tackle StarCraft II with modular RL system; and Germany adds €3bn for AI research

11-19

What I Learned About Machine Learning at ODSC West 2018

11-19

If you did not already know

11-19

Online Bayesian Deep Learning in Production at Tencent

11-15

Whats new on arXiv

11-15

Whats new on arXiv

11-14

Distilled News

11-14

Whats new on arXiv

11-13

Whats new on arXiv

11-11

Distilled News

11-11

If you did not already know

11-08

Carlos: ‘Everything Dataquest showed me, I use in my new job’

11-08

10 Free Must-See Courses for Machine Learning and Data Science

11-08

Distilled News

11-08

Whats new on arXiv

11-07

Whats new on arXiv

11-06

Whats new on arXiv

11-05

Machine Learning Classification: A Dataset-based Pictorial

11-05

Import AI 119: How to benefit AI research in Africa; German politician calls for billions in spending to prevent country being left behind; and using deep learning to spot thefts

11-05

Distilled News

11-05

Distilled News

11-04

Whats new on arXiv

11-04

Whats new on arXiv

11-04

Whats new on arXiv

11-03

The Most in Demand Skills for Data Scientists

11-02

Whats new on arXiv

10-30

Machine Learning Basics – Random Forest

10-30

How to be an Artificial Intelligence (AI) Expert?

10-29

Document worth reading: “Opening the black box of deep learning”

10-28

Distilled News

10-27

Whats new on arXiv

10-26

Whats new on arXiv

10-26

How to be an Artificial Intelligence (AI) Expert?

10-25

Distilled News

10-25

Distilled News

10-24

Whats new on arXiv

10-23

Dr. Data Show Video: How Can You Trust AI?

10-20

I’m an Analyst and the software engineers made fun of my code!

10-19

Building a data warehouse

10-17

Exploring college major and income: a live data analysis in R

10-16

Distilled News

10-15

Whats new on arXiv

10-13

Distilled News

10-12

Machine learning — Is the emperor wearing clothes?

10-12

Whats new on arXiv

10-11

If you did not already know

10-11

Distilled News

10-11

Whats new on arXiv

10-10

If you did not already know

10-10

How to get a Data Science Job in 6 Months

10-10

If you did not already know

10-09

How To Learn Data Science If You’re Broke

10-09

Learning Acrobatics by Watching YouTube

10-09

A Concise Explanation of Learning Algorithms with the Mitchell Paradigm

10-05

Why do I Call Myself a Data Scientist?

10-05

Whats new on arXiv

10-05

Whats new on arXiv

10-04

Distilled News

10-04

Reinforcement Learning: Super Mario, AlphaGo and beyond

10-01

Python Vs R : The Eternal Question for Data Scientists

09-29

Whats new on arXiv

09-25

If you did not already know

09-24

Python Vs R : The Eternal Question for Data Scientists

09-24

How to Optimise Ad CTR with Reinforcement Learning

09-24

Whats new on arXiv

09-24

Whats new on arXiv

09-20

PyConUK 2018

09-19

Whats new on arXiv

09-19

How to generalize (algorithmically)

09-18

Whats new on arXiv

09-18

Distilled News

09-18

If you did not already know

09-17

Deep learning made easier with transfer learning

09-17

How to Optimise Ad CTR with Reinforcement Learning

09-17

Whats new on arXiv

09-14

The Benefits of Active Learning for Data Science Skills

09-12

Whats new on arXiv

09-12

Distilled News

09-11

Whats new on arXiv

09-11

Why Would Prosthetic Arms Need to See or Connect to Cloud AI?

09-10

Whats new on arXiv

09-07

Whats new on arXiv

09-07

Visual Reinforcement Learning with Imagined Goals

09-06

Whats new on arXiv

09-01

Dexterous Manipulation with Reinforcement Learning: Efficient, General, and Low-Cost

08-31

If you did not already know

08-30

Whats new on arXiv

08-28

If you did not already know

08-27

Whats new on arXiv

08-24

If you did not already know

08-24

Import AI: 108: Learning language with fake sentences, Chinese researchers use RL to train prototype warehouse robots; and what the implications are of scaled-up Neural Architecture Search

08-20

Document worth reading: “A Survey on Resilient Machine Learning”

08-19

Whats new on arXiv

08-16

Whats new on arXiv

08-15

DIFFERENCE BETWEEN DATA SCIENCE, DATA ANALYTICS AND MACHINE LEARNING

08-09

Whats new on arXiv

08-07

Essential Tips and Tricks for Starting Machine Learning with Python

08-05

Whats new on arXiv

08-03

Tips & Tricks for Starting Your First Data Project

08-01

Whats new on arXiv

08-01

One-Shot Imitation from Watching Videos

06-28

Supercharging Classification - The Value of Multi-task Learning

06-26

Import AI

06-18

Engineering a New Career in Data Science: Alumni Spotlight on Abhishek Mishra

06-06

My eRum 2018 biggest highlights

05-19

Mother's Day Interview: How Nicole Finnie Became a Competitive Kaggler on Maternity Leave

05-10

Profiling Top Kagglers: Bestfitting, Currently

05-07

TDM: From Model-Free to Model-Based Deep Reinforcement Learning

04-26

It’s okay to not be a data scientist

02-20

A Practical Guide to the "Open-Source Machine Learning Masters"

02-03

Lessons learned in my first year as a data scientist

01-25

57 Summaries of Machine Learning and NLP Research

01-17

New Year's Resolutions 2018

01-05

AI and Deep Learning in 2017 – A Year in Review

12-31

Weekly Review: 12/23/2017

12-23

Everything is a Model

12-13

NIPS 2017 Summary

12-11

Weekly Review: 12/10/2017

12-10

The Last 5 Years In Deep Learning

12-04

Python Deep Learning tutorial: Create a GRU (RNN) in TensorFlow

08-27

How to launch your data science career (with Python)

07-12

Guest Post – Learning R as an MBA Student

07-12

How to make the transition from academia to data science

04-23

Deep Learning Research Review Week 2: Reinforcement Learning

11-16

Gradient descent learns linear dynamical systems

10-13

Approaching fairness in machine learning

09-06

Re-work Interview Questions

07-26

Learning in Brains and Machines (3): Synergistic and Modular Action

07-03

Making use of the model

06-20

The Power of IPython Notebook + Pandas + and Scikit-learn

06-11

Deep Reinforcement Learning: Pong from Pixels

05-31

Guide to an in-depth understanding of logistic regression

02-22

“Becoming a Data Scientist” Learning Club?

11-09

The Deep Learning Gold Rush of 2015

11-07

Theoretical Motivations for Deep Learning

10-18

Distilled News

11-29

4 ways to be more efficient using RStudio’s Code Snippets, with 11 ready to use examples

11-10

Text Preprocessing in Python: Steps, Tools, and Examples

11-06

Sharing the Recipe for rOpenSci’s Unconf Ice Breaker

11-01

Distilled News

10-28

R Packages worth a look

10-17

What Does it Take to Train Deep Learning Models On-Device?

10-04

Document worth reading: “Idealised Bayesian Neural Networks Cannot Have Adversarial Examples: Theoretical and Empirical Study”

08-29

Data Notes: Drought and the War in Syria

08-23

Learn to R blog series - Operators and Objects

07-19

R Tip: use isTRUE()

06-11

How To Predict ICU Mortality with Digital Health Data, DL4J, Apache Spark and Cloudera

09-18

Introduction to XGBoost

02-17

No juice for you, CSV format. It just makes you more awful.

09-23

Why I’m Not a Fan of R-Squared

07-24

Theoretical Motivations for Deep Learning

10-18

Pruning Neural Networks: Two Recent Papers

02-06

The Generalization Mystery: Sharp vs Flat Minima

01-18

Theoretical Motivations for Deep Learning

10-18

If you did not already know

12-20

Making a Profit with Henry Wan in Arkham Horror: The Card Game

12-03

Autonomy – Do we have the choice?

11-21

AdaSearch: A Successive Elimination Approach to Adaptive Search

11-14

Learning in Brains and Machines (3): Synergistic and Modular Action

07-03

Simple reinforcement learning methods to learn CartPole

07-01

Making use of the model

06-20

Model-Free Prediction and Control

06-07

Keras plays catch, a single file Reinforcement Learning example

03-17

Q-learning with Neural Networks

10-30

Reinforcement Learning - Part 1

10-19

If you did not already know

11-20

Document worth reading: “A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress”

10-15

Deep reinforcement learning, battleship

10-15

Simple reinforcement learning methods to learn CartPole

07-01

The Policy Gradient

06-16

Learning in Brains and Machines (1): Temporal Differences

02-21

Reinforcement Learning - Monte Carlo Methods

10-25

Reinforcement Learning - Part 1

10-19

Reinforcement Learning - Part 1

10-19

Music listener statistics: last.fm’s last.year as an R package

01-02

If you did not already know

11-12

Slot Machines

10-15

Reinforcement Learning - Part 1

10-19

R Packages worth a look

01-13

R Packages worth a look

12-07

Hitchhiker's guide to Exploratory Data Analysis

10-12

If you did not already know

09-10

Seasonalities: The Near-Term Future for the Market

04-14

The Bull Survived on Friday, but Barely

03-25

Top 10 oldest and youngest industries in the U.S.

03-05

Incremental means and variances

11-28

While We Were Busy with Prosperity

11-10

Reinforcement Learning - Part 1

10-19

Timing the Same Algorithm in R, Python, and C++

01-06

Timing the Same Algorithm in R, Python, and C++

01-06

What does it mean to write “vectorized” code in R?

01-04

Data Representation for Natural Language Processing Tasks

11-02

If you did not already know

10-29

If you did not already know

10-17

Preprocessing for Deep Learning: From covariance matrix to image whitening

10-10

R Packages worth a look

10-03

R Objects

08-24

Can You Read My Mind? Analyzing The Killers’ Discography with NLP

08-09

Crosslingual document comparison

08-31

Approximating Implicit Matrix Factorization with Shallow Neural Networks

04-07

Persistent Homology (Part 4)

02-23

LSTMs

06-04

Rolling and Unrolling RNNs

04-28

dotify: Recommending Spotify Music Through Country Arithmetic

04-15

Explicit Matrix Factorization: ALS, SGD, and All That Jazz

01-09

Recurrent Neural Networks

10-20

What is Neural Network?

09-06

Java Handwritten Digit Recognition with Neural Networks

11-29

Self-Organizing Maps Tutorial

11-02

Beyond Binary: Ternary and One-hot Neurons

02-08

Artificial Neural Networks Introduction (Part II)

11-03

Recurrent Neural Networks

10-20

Master R shiny: One trick to build maintainable and scalable event chains

11-02

R Packages worth a look

09-03

Document worth reading: “A Survey on Resilient Machine Learning”

08-19

Document worth reading: “How Important Is a Neuron”

08-15

Linear Feedback Shift Registers

11-19

Anyone Can Learn To Code an LSTM-RNN in Python (Part 1: RNN)

11-15

Recurrent Neural Networks

10-20

Dropout Ensembling In Neural Nets

10-21

If you did not already know

01-01

My book ‘Deep Learning from first principles:Second Edition’ now on Amazon

12-14

Graph-Powered Machine Learning

12-03

Document worth reading: “Internet of Things: An Overview”

11-25

If you did not already know

11-19

R Packages worth a look

11-16

If you did not already know

09-15

What is Neural Network?

09-06

Class visualization with bilateral filters

02-05

Dropout Ensembling In Neural Nets

10-21

Journal: PLXtrum - realtime machine learning for predicting note onset

01-28

Dropout Ensembling In Neural Nets

10-21

Dropout Ensembling In Neural Nets

10-21

Serial and Parallel bulb puzzle

10-18

Don’t calculate post-hoc power using observed estimate of effect size

09-24

Post-publication peer review: who’s qualified?

09-20

Dropout Ensembling In Neural Nets

10-21

Looking into 19th century ads from a Luxembourguish newspaper with R

01-04

Document worth reading: “Recommendation System based on Semantic Scholar Mining and Topic modeling: A behavioral analysis of researchers from six conferences”

01-03

Document worth reading: “A Review of Theoretical and Practical Challenges of Trusted Autonomy in Big Data”

12-04

Book Review – Sound Analysis and Synthesis with R

11-03

If you did not already know

10-10

Amazon SageMaker Neural Topic Model now supports auxiliary vocabulary channel, new topic evaluation metrics, and training subsampling

10-04

New Dynamics for Topic Models

07-31

The Dynamics of Philippine Senate Bills: Gensim, Topic Modeling and All That Good NLP Stuff

06-09

How to mine newsfeed data and extract interactive insights in Python

03-15

Topic Modeling for Keyword Extraction

02-05

The Real Story Behind Today's Referendum

06-23

Paris Meetup slides Topic Modeling of Twitter Followers

02-08

Data Science Tutorials Flipboard Magazine

10-21

The replication crisis and the political process

08-03

Data Science Tutorials Flipboard Magazine

10-21

Document worth reading: “A Comprehensive Study of Deep Learning for Image Captioning”

10-31

If you did not already know

08-21

Prototyping Long Term Time Series Storage with Kafka and Parquet

10-25

Easily monitor and visualize metrics while training models on Amazon SageMaker

11-19

Prototyping Long Term Time Series Storage with Kafka and Parquet

10-25

Advanced News API search: leveraging DBpedia entity types

12-11

Distilled News

08-13

Data types

05-08

Type Safety and Statistical Computing

12-12

Feather: it's about metadata

04-26

Be Like Water

01-19

Why Julia’s DataFrames are Still Slow

11-28

Prototyping Long Term Time Series Storage with Kafka and Parquet

10-25

Prototyping Long Term Time Series Storage with Kafka and Parquet

10-25

Making a Profit with Henry Wan in Arkham Horror: The Card Game

12-03

Cribbage Scores

02-25

Understanding how Deep Learning learns to play SET®

10-12

Poker Odds

09-22

Baseball Card Collecting

04-29

Reinforcement Learning - Monte Carlo Methods

10-25

Reinforcement Learning - Monte Carlo Methods

10-25

If you did not already know

01-07

Whats new on arXiv

01-06

Whats new on arXiv

01-03

Whats new on arXiv

12-28

Part 2: Optimism corrected bootstrapping is definitely bias, further evidence

12-26

Optimism corrected bootstrapping: a problematic method

12-25

Document worth reading: “Are screening methods useful in feature selection? An empirical study”

12-18

Distilled News

12-14

What's the future of the pandas library?

12-12

Whats new on arXiv

12-11

Whats new on arXiv

12-10

If you did not already know

12-07

Document worth reading: “A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition”

12-05

If you did not already know

11-29

Document worth reading: “An Introductory Survey on Attention Mechanisms in NLP Problems”

11-28

Sales Forecasting Using Facebook’s Prophet

11-28

Whats new on arXiv

11-26

If you did not already know

11-23

If you did not already know

11-19

A more systematic look at suppressed data by @ellis2013nz

11-17

Distilled News

11-16

Scikit-learn Tutorial: Machine Learning in Python

11-15

Discourse Network Analysis: Undertaking Literature Reviews in R

11-15

If you did not already know

11-07

R Packages worth a look

10-23

If you did not already know

10-23

Whats new on arXiv

10-22

Document worth reading: “Machine Learning for Spatiotemporal Sequence Forecasting: A Survey”

10-21

Whats new on arXiv

10-16

Document worth reading: “A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress”

10-15

Document worth reading: “Vector and Matrix Optimal Mass Transport: Theory, Algorithm, and Applications”

10-14

Whats new on arXiv

10-13

Document worth reading: “A Survey on Expert Recommendation in Community Question Answering”

09-28

Document worth reading: “Do Deep Learning Models Have Too Many Parameters An Information Theory Viewpoint”

09-22

Timing Column Indexing in R

09-21

Document worth reading: “Decision-Making with Belief Functions: a Review”

09-19

Document worth reading: “Accelerating CNN inference on FPGAs: A Survey”

09-08

Three Operator Splitting

09-04

Whats new on arXiv

08-22

Document worth reading: “Radial Basis Function Approximations: Comparison and Applications”

08-15

If you did not already know

08-15

“The most important aspect of a statistical analysis is not what you do with the data, it’s what data you use” (survey adjustment edition)

08-07

If you did not already know

08-07

Scale out your Pandas DataFrame operations using Dask

08-05

TSrepr use case - Clustering time series representations in R

03-13

TSrepr - Time Series Representations in R

01-26

Alchemy, Rigour and Engineering

12-07

RSiteCatalyst Version 1.4.10 Release Notes

12-13

Book Review: Computer Age Statistical Inference

11-23

First Order Optimization Methods

07-02

The Best of Unpublished Machine Learning and Statistics Books

02-09

Machine Learning is not BS in Monitoring

01-09

A Year of Approximate Inference: Review of the NIPS 2015 Workshop

12-18

Most Winning A/B Test Results are Illusory

11-01

Reinforcement Learning - Monte Carlo Methods

10-25

What a Deep Neural Network thinks about your

10-25

What a Deep Neural Network thinks about your

10-25

November 2018: “Top 40” New Packages

12-21

Data Mining Book – Chapter Download

12-04

Defining visualization literacy

11-30

Intro to Data Science for Managers

11-23

Data Mining Book – Chapter Download

11-02

SQL, Python, & R in One Platform

10-26

New Course: Interactive Data Visualization with rbokeh

10-19

The Chartmaker Directory: Data visualizations in every tool

08-24

3368a9b98a073e7ba296e1f5f41f6c4f

06-02

Introducing Python for data scientists - Pt2

03-23

The Building Blocks of Interpretability

03-06

Feature Visualization

11-07

Announcing Elemetric

06-23

Interactive association rules exploration app

11-30

What a Deep Neural Network thinks about your

10-25

MS in Applied Data Science Online – which track is right for you?

01-10

Top December Stories: Why You Shouldn’t be a Data Science Generalist

01-09

Learn Python for Data Science From Scratch

01-09

Rev Summit for Data Science Leaders featuring Daniel Kahneman

01-07

KDnuggets™ News 19:n01, Jan 3: The Essence of Machine Learning; A Guide to Decision Trees for Machine Learning and Data Science

01-03

KDnuggets™ News 18:n48, Dec 19: Why You Shouldn’t be a Data Science Generalist; Industry Data Science & Machine Learning 2019 Predictions

12-19

Industry Predictions: AI, Machine Learning, Analytics & Data Science Main Developments in 2018 and Key Trends for 2019

12-18

Surprise-hacking: “the narrative of blindness and illusion sells, and therefore continues to be the central thesis of popular books written by psychologists and cognitive scientists”

12-16

Document worth reading: “Taxonomy of Big Data: A Survey”

12-11

Should you become a data scientist?

12-10

Top KDnuggets tweets, Nov 28 – Dec 4: Deep Learning Cheat Sheets; Amazon opens its internal

12-05

AI, Data Science, Analytics Main Developments in 2018 and Key Trends for 2019

12-03

How to Find Mentors for Data Science?

11-29

Drexel University: 2 Teaching Faculty Positions in Data Science [Philadelphia, PA]

11-27

Magister Dixit

11-23

KDnuggets™ News 18:n44, Nov 21: What is the Best Python IDE for Data Science?; Anticipating the next move in data science

11-21

Address Your Data Science Strategy at DSNY

11-20

Distilled News

11-20

Hey, check this out: Columbia’s Data Science Institute is hiring research scientists and postdocs!

11-16

Report from the Enterprise Applications of the R Language conference

11-16

Report from the Enterprise Applications of the R Language conference

11-16

The State of the Art

11-15

In case you missed it: October 2018 roundup

11-15

Distilled News

11-14

Help us understand your Data Science goals!

11-13

To get hired as a data scientist, don’t follow the herd

11-12

“Law professor Alan Dershowitz’s new book claims that political differences have lately been criminalized in the United States. He has it wrong. Instead, the orderly enforcement of the law has, ludicrously, been framed as political.”

11-12

“Recapping the recent plagiarism scandal”

11-09

New R Cheatsheet: Data Science Workflow with R

11-04

Learn how machine learning is transforming business

11-02

Learn how machine learning is transforming business, Nov 12 Webinar

11-02

Data Science “Paint by the Numbers” with the Hypothesis Development Canvas

11-02

Raghuveer Parthasarathy’s big idea for fixing science

11-01

The Final Data Science Roadshow is Just the Beginning

10-26

Spotlight on Julia Silge, Keynote Speaker EARL Seattle 7th November

10-26

University of San Francisco: Assistant Professor, Tenure Track, Mathematics and Statistics [San Francisco, CA]

10-17

Music for Data Scientists? Music by Data Scientists? …What…?!

10-17

University of San Francisco: Postdoctoral Fellowship, Data Institute [San Francisco, CA]

10-16

How to get a Data Science Job in 6 Months

10-10

All About Open Source

10-09

Distilled News

10-06

Online Master’s in Applied Data Science From Syracuse

10-05

Why do I Call Myself a Data Scientist?

10-05

Chromebook Data Science

10-04

Understand Why ODSC is the Most Recommended Conference for Applied Data Science

10-04

In case you missed it: September 2018 roundup

10-03

Distilled News

09-07

How Foundations Student Russell Martin got into The Data Incubator’s Fellowship

09-06

The Data Science Roadshow is ON!

09-03

What is Data Science?

08-20

R Packages worth a look

08-19

Announcing Practical Data Science with R, 2nd Edition

08-15

Video: How to run R and Python in SQL Server from a Jupyter notebook

08-03

Revisiting “Is the scientific paper a fraud?”

07-29

Data Science in 30 Minutes: Using Data Science to Predict the Future with Kirk Borne

07-11

I Can’t Afford to Hire a Data Scientist. Now What?

07-11

Announcement – The Data Incubator Partnership with MRI Network

06-28

Engineering Data Science at Automattic

03-20

Fast Company's 2018 World's Most Innovative Companies List

02-20

PyData Conference & AHL Hackathon

02-16

Should I do a Data Science bootcamp?

01-03

AWS Machine Learning Big Data NYC

10-24

Advice for aspiring data scientists and other FAQs

10-15

The Advent of Analytics Engineering

09-01

How to launch your data science career (with Python)

07-12

T-Shirt Design Contest!

11-19

Ask Why! Finding motives, causes, and purpose in data science

09-19

Smart Cities at the Nexus of Emerging Data Technologies and You

09-18

Podcast Episodes 0 to 3

08-13

Becoming a Data Scientist Podcast Episode 13: Debbie Berebichez

07-15

Becoming a Data Scientist Podcast Episode 12: Data Science Learning Club Members

06-15

Becoming a Data Scientist Podcast Episode 11: Stephanie Rivera

05-31

Q & A with Meta Brown

05-18

It's not an Internet of Things, It's an Internet of People

11-17

Books for Data Science Beginners, and Data Sources

10-26

Long-awaited updates to htmlTable

01-07

Books for Data Science Beginners, and Data Sources

10-26

R Packages worth a look

12-12

New Features For Amazon SageMaker: Workflows, Algorithms, and Accreditation

11-21

An Overview of the Singapore Hiring Landscape

11-21

Announcing RStudio Package Manager

10-17

UnitedHealth Group: UHC Digital Director of Project Management [Minnetonka, MN]

10-04

Nextgov: DHS Funds Machine Learning Tool to Boost Other Countries’ Airport Security

08-20

Enterprise Deployment Tips for Azure Data Science Virtual Machine (DSVM)

05-21

If you don’t pay attention, data can drive you off a cliff

08-21

Go easy on Volkswagen

10-26

Go easy on Volkswagen

10-26

Go easy on Volkswagen

10-26

Data Science With R Course Series – Week 9

11-12

12 Ways To Cultivate A Data-Savvy Workforce

07-15

Go easy on Volkswagen

10-26

Comparison of the Text Distance Metrics

01-07

Comparison of the Top Speech Processing APIs

12-28

A couple of thoughts regarding the hot hand fallacy fallacy

12-14

Word Morphing – an original idea

11-20

The Long Tail of Medical Data

11-12

Multi-Class Text Classification with Doc2Vec & Logistic Regression

11-09

EMNLP 2018 Highlights: Inductive bias, cross-lingual learning, and more

11-07

Data Representation for Natural Language Processing Tasks

11-02

Data + Art STEAM Project: Final Results

10-30

If you did not already know

10-29

The Main Approaches to Natural Language Processing Tasks

10-17

If you did not already know

10-17

Document worth reading: “An Analysis of Hierarchical Text Classification Using Word Embeddings”

10-06

Sequence Modeling with Neural Networks – Part I

10-03

A Review of the Neural History of Natural Language Processing

10-01

Who wrote that anonymous NYT op-ed? Text similarity analyses with R

09-07

Unfolding Naïve Bayes From Scratch!

09-02

If you did not already know

08-30

Can You Read My Mind? Analyzing The Killers’ Discography with NLP

08-09

Lana Del Rey’s Discography through the Lens of Text Analytics

08-09

Sequence labeling with semi-supervised multi-task learning

06-29

A Study Of Reddit Politics

06-20

Sent2Vec: An unsupervised approach towards learning sentence embeddings

06-19

The Dynamics of Philippine Senate Bills: Gensim, Topic Modeling and All That Good NLP Stuff

06-09

Turning Water into Wine

03-13

57 Summaries of Machine Learning and NLP Research

01-17

Text Segmentation using Word Embeddings

10-16

Hierarchical Softmax

08-01

Emojis Analysis in R

03-24

Getting Rich using Bitcoin stockprices and Twitter!

02-22

Deep Learning Research Review Week 3: Natural Language Processing

01-10

A Billion Words and The Limits of Language Modeling

09-23

Creating a Search Engine

08-19

Recurrent Neural Networks for Beginners

08-13

A tour of Factor: 4

07-04

Document Similarity With Word Movers Distance

06-13

Translating W2v Embedding From One Space To Another

06-06

A tour of Factor: 2

05-27

A tour of Factor: 1

05-23

Hamming Codes

01-30

Understanding Convolutional Neural Networks for NLP

11-07

The Evolution of Pop Lyrics and a tale of two LDA’s

10-27

The Evolution of Pop Lyrics and a tale of two LDA’s

10-27

Can You Read My Mind? Analyzing The Killers’ Discography with NLP

08-09

Lana Del Rey’s Discography through the Lens of Text Analytics

08-09

The Evolution of Pop Lyrics and a tale of two LDA’s

10-27

The Evolution of Pop Lyrics and a tale of two LDA’s

10-27

Distilled News

01-13

Document worth reading: “Deep learning in agriculture: A survey”

01-12

AI in Healthcare (With a case study)

01-10

Top 10 Books on NLP and Text Analysis

01-09

Whats new on arXiv

12-31

Distilled News

12-29

If you did not already know

12-23

Feature engineering, Explained

12-21

10 Data Science Skills to Land your Dream Job in 2019

12-12

ggQC | ggplot Quality Control Charts – New Release

12-05

Ronin: Data Engineer [San Mateo, CA]

12-03

Intro to Data Science for Managers

11-23

Distilled News

11-12

Top 5 Trends in Data Science

11-09

Document worth reading: “Deep Learning for Image Denoising: A Survey”

11-04

AI Masterpieces: But is it Art?

10-27

Get a 2–6x Speed-up on Your Data Pre-processing with Python

10-23

Distilled News

10-05

Document worth reading: “Automatic Language Identification in Texts: A Survey”

09-20

Understanding Different Components & Roles in Data Science

09-18

If you did not already know

09-11

Big Data : Meaning, Components, Collection & Analysis

09-10

Understanding Different Components & Roles in Data Science

08-30

Document worth reading: “Foundations of Complex Event Processing”

08-04

If you did not already know

08-02

Revisiting “Is the scientific paper a fraud?”

07-29

What is Machine Learning?

07-17

Self-Service Adobe Analytics Data Feeds!

03-03

Data Cleaning, Categorization and Normalization

01-30

Learning Reinforcement Learning (with Code, Exercises and Solutions)

10-02

The Evolution of Pop Lyrics and a tale of two LDA’s

10-27

Hogwild Stochastic Gradient Descent

10-27

Akka Stream

03-25

Hogwild Stochastic Gradient Descent

10-27

Notes on the Frank-Wolfe algorithm, Part I

03-20

Hogwild Stochastic Gradient Descent

10-27

Notes on the Frank-Wolfe algorithm, Part I

03-20

Hogwild Stochastic Gradient Descent

10-27

Spying on instance methods with Python's mock module

10-29

Notes on the Frank-Wolfe Algorithm, Part II: A Primal-dual Analysis

11-14

Synthetic Gradients with Tensorflow

04-08

From Gaussian Algebra to Gaussian Processes, Part 1

03-31

Deterministic A/B tests via the hashing trick

03-20

How to Build Your Own Blockchain Part 4.2 — Ethereum Proof of Work Difficulty Explained

11-21

PokerBot: Create your poker AI bot in Python

11-01

How to Build Your Own Blockchain Part 2 — Syncing Chains From Different Nodes

10-27

Hard Examples Mining in Keras

10-22

How to Build Your Own Blockchain Part 1 — Creating, Storing, Syncing, Displaying, Mining, and Proving Work

10-17

Parallel computation with two lines of code

05-18

Beyond Binary: Ternary and One-hot Neurons

02-08

Avoiding overfitting in object detection problem

12-19

Recurrent Neural Networks in Tensorflow II

07-25

Project Euler using Scala: Problem

07-19

Bayesian Deep Learning Part II: Bridging PyMC3 and Lasagne to build a Hierarchical Neural Network

07-05

Concurrent bloom filters

05-30

Sequence prediction using recurrent neural networks(LSTM) with TensorFlow

05-14

Exploring convolutional neural networks with DL4J

04-14

Inverting a Neural Net

04-05

Second Annual Data Science Bowl – Part 3 – Automatically Finding the Heart Location in an MRI Image

03-08

Building a news search engine

01-21

So You Want to Implement a Custom Loss Function?

11-18

Denoising Dirty Documents – Part 10

11-01

Spying on instance methods with Python's mock module

10-29

Spying on instance methods with Python's mock module

10-29

Spying on instance methods with Python's mock module

10-29

Spying on instance methods with Python's mock module

10-29

Production Recommendation Systems with Cloudera

02-20

Speeding up TRPO through parallelization and parameter adaptation

12-09

A intuitive explanation of natural gradient descent

08-07

Q-learning with Neural Networks

10-30

Q-learning with Neural Networks

10-30

History of Monte Carlo Methods - Part 2

10-30

History of Monte Carlo Methods - Part 2

10-30

History of Monte Carlo Methods - Part 2

10-30

Approaches to Text Summarization: An Overview

01-03

R Packages worth a look

12-21

Starspace for NLP

12-04

Distilled News

11-26

Distilled News

11-26

Statistics Sunday: Introduction to Regular Expressions

11-25

Characterizing Online Public Discussions through Patterns of Participant Interactions

11-11

Multi-Class Text Classification with Doc2Vec & Logistic Regression

11-09

Spotlight on Julia Silge, Keynote Speaker EARL Seattle 7th November

10-26

Spam Detection with Natural Language Processing-Part 2

10-18

The Main Approaches to Natural Language Processing Tasks

10-17

Machine Reading at Scale – Transfer Learning for Large Text Corpuses

10-17

Writing Code to Read Quotes About Writing Code

10-12

Amazon Comprehend introduces new Region availability and language support for French, German, Italian, and Portuguese

10-10

Distilled News

09-06

R Packages worth a look

09-03

Document worth reading: “A Comparative Study on using Principle Component Analysis with Different Text Classifiers”

08-29

Distilled News

08-13

R Packages worth a look

08-10

New download API for pretrained NLP models and datasets in Gensim

11-27

Kaggle’s Quora Question Pairs Competition

06-07

Ordered Categorical GLMs for Product Feedback Scores

03-17

Intercausal Reasoning in Bayesian Networks

03-13

Persistent Homology (Part 3)

02-23

History of Monte Carlo Methods - Part 2

10-30

Thanksgiving Special 🦃: GANs are Being Fixed in More than One Way

11-23

History of Monte Carlo Methods - Part 2

10-30

KNNs (K-Nearest-Neighbours) in Python

11-22

Multi-Class Text Classification with Doc2Vec & Logistic Regression

11-09

K-means clustering with Amazon SageMaker

11-08

Introduction to Deep Learning with Keras

10-29

Named Entity Recognition and Classification with Scikit-Learn

10-25

Modeling Airbnb prices

10-12

Hierarchical Bayesian Neural Networks with Informative Priors

08-13

Scale out your Pandas DataFrame operations using Dask

08-05

On Using Hyperopt: Advanced Machine Learning

08-04

What makes the Python Cool.

07-31

How To Remotely Send R and Python Execution to SQL Server from Jupyter Notebooks

07-17

How to update your scikit-learn code for 2018

07-04

Opinion mining on Dutch news articles

06-20

Traveling salesman portrait in Python

04-12

Sleeping Giant Rural Postman Problem

12-01

Python Matplotlib (pyplot), a step-by-step Tutorial

11-15

PokerBot: Create your poker AI bot in Python

11-01

Hard Examples Mining in Keras

10-22

How to use Tensorboard with PyTorch

10-16

What's new in PyMC3 3.1

07-05

Machine Learning Fraud Detection: A Simple Machine Learning Approach

06-15

Make a Profitable Portfolio using Python

06-08

Parallel computation with two lines of code

05-18

F beta score for Keras

04-23

Building a Tic-Tac-Toe web-app in this Webpack tutorial and Babel tutorial

04-07

Random-Walk Bayesian Deep Networks: Dealing with Non-Stationary Data

03-14

Development update: High speed Apache Parquet in Python with Apache Arrow

01-25

Native Hadoop file system (HDFS) connectivity in Python

01-03

Avoiding overfitting in object detection problem

12-19

TensorFlow in a Nutshell — Part Three: All the Models

10-03

GPU-accelerated Theano & Keras with Windows 10

09-23

Bayesian Deep Learning Part II: Bridging PyMC3 and Lasagne to build a Hierarchical Neural Network

07-05

Animate NBA shot events with Paper.js

06-08

Bayesian Deep Learning

06-01

Inverting a Neural Net

04-05

Representational Power of Deeper Layers

03-30

Second Annual Data Science Bowl – Part 3 – Automatically Finding the Heart Location in an MRI Image

03-08

Calling RSiteCatalyst From Python

02-22

MCMC sampling for dummies

11-10

Denoising Dirty Documents – Part 10

11-01

Keras Conv2D and Convolutional Layers

12-31

Solve any Image Classification Problem Quickly and Easily

12-13

Introduction to Deep Learning with Keras

10-29

If you did not already know

10-09

What Does it Take to Train Deep Learning Models On-Device?

10-04

5 Reasons Why You Should Use Cross-Validation in Your Data Science Projects

10-02

Java Home Made Face Recognition Application

09-12

What is Neural Network?

09-06

My notes on (Liang et al., 2017): Generalization and the Fisher-Rao norm

01-25

Java Image Cat&Dog Recognition with Deep Neural Networks

01-03

Two Recent Results in Transfer Learning for Music and Speech

11-01

Deep Learning with Intel’s BigDL and Apache Spark

09-06

Random Dilation Networks for Action Recognition in Videos

07-29

Deep learning on Apache Spark and Apache Hadoop with Deeplearning4j

06-27

Machine learning applied to showers in the OPERA

06-24

Python Deep Learning tutorial: Elman RNN implementation in Tensorflow

05-17

XOR Revisited: Keras and TensorFlow

04-24

Deep and Hierarchical Implicit Models

02-28

Analyzing The Papers Behind Facebook's Computer Vision Approach

09-01

The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3)

08-24

Playing with convolutions in TensorFlow

08-09

A Beginner's Guide To Understanding Convolutional Neural Networks Part 2

07-29

Visualizing Features from a Convolutional Neural Network

06-15

Interactive Abstract Pattern Generation Javascript Demo

04-24

First Convergence Bias

04-11

How to Code and Understand DeepMind's Neural Stack Machine

02-25

Introduction to Semi-Supervised Learning with Ladder Networks

01-19

Does AI stand for Alchemical Intelligence?

12-14

The TensorFlow perspective on neural networks

11-30

Anyone Can Learn To Code an LSTM-RNN in Python (Part 1: RNN)

11-15

Denoising Dirty Documents – Part 10

11-01

Showing a difference in means between two groups

01-13

Leaf Plant Classification: Statistical Learning Model – Part 2

12-31

If you did not already know

12-26

If you did not already know

12-06

How to get the homology of a antibody using R

12-02

“She also observed that results from smaller studies conducted by NGOs – often pilot studies – would often look promising. But when governments tried to implement scaled-up versions of those programs, their performance would drop considerably.”

11-22

Benford’s Law for Fraud Detection with an Application to all Brazilian Presidential Elections from 2002 to 2018

11-17

Use Pseudo-Aggregators to Add Safety Checks to Your Data-Wrangling Workflow

10-30

Use Pseudo-Aggregators to Add Safety Checks to Your Data-Wrangling Workflow

10-30

Quick DB result caching in R

05-05

Implementing Poincaré Embeddings

12-09

Incremental means and variances

11-28

Parallelizing Distance Calculations Using A GPU With CUDAnative.jl

08-14

Retrospective review of my first deep learning competition

07-22

Asynchronous Scraping with Python

10-16

Simulating the Colombian Peace Vote: Did the "No" Really Win?

10-12

Gradient Boosting explained [demonstration]

06-24

Where will Artificial Intelligence come from?

04-20

Denoising Dirty Documents – Part 10

11-01

GPU-accelerated Theano & Keras with Windows 10

09-23

Why is Keras Running So Slow?

12-05

Denoising Dirty Documents – Part 10

11-01

Most Winning A/B Test Results are Illusory

11-01

Most Winning A/B Test Results are Illusory

11-01

Most Winning A/B Test Results are Illusory

11-01

Distilled News

10-11

Neurally Embedded Emojis

06-19

Deep Learning for Visual Question Answering

11-02

Le Monde puzzle [#1078]

11-28

How do I visualise the results of a Bayesian Model: Rugby models in Arviz

10-29

Machine Reading at Scale – Transfer Learning for Large Text Corpuses

10-17

Distilled News

10-11

Document worth reading: “A Survey on Expert Recommendation in Community Question Answering”

09-28

Lucy`s Secret Number puzzle

06-03

How To Write, Deploy, and Interact with Ethereum Smart Contracts on a Private Blockchain

12-15

Three Bag Logic Puzzle

03-23

Baseball Card Collecting

04-29

So You Want to Implement a Custom Loss Function?

11-18

Deep Learning for Visual Question Answering

11-02

Grazing and Calculus Revisited

07-26

Intro to Recommender Systems: Collaborative Filtering

11-02

No, I don’t think it’s the file drawer effect

08-16

No, I don’t think it’s the file drawer effect

08-16

I think they use witchcraft

07-08

Intro to Recommender Systems: Collaborative Filtering

11-02

If you did not already know

01-06

What to do when your training and testing data come from different distributions

01-04

Data Notes: Malaria Detection with FastAI

01-03

Synthetic Data Generation: A must-have skill for new data scientists

12-27

Document worth reading: “The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers”

12-27

If you did not already know

12-25

Document worth reading: “Are screening methods useful in feature selection? An empirical study”

12-18

If you did not already know

12-16

Why You Shouldn’t be a Data Science Generalist

12-14

Keras – Save and Load Your Deep Learning Models

12-10

Shinyfit: Advanced regression modelling in a shiny app

12-07

Designing a Self-Learning Tic-Tac-Toe Player

11-29

Data Notes: Impact of Game of Thrones on US Baby Names

11-15

Metadata Enrichment is Essential to Realize the Value of Open Datasets

11-14

Data Notes: Chinese Tourism's Impact on Taiwan

11-01

Now use Pipe mode with CSV datasets for faster training on Amazon SageMaker built-in algorithms

11-01

If you did not already know

10-31

Distilled News

10-30

Building a Question-Answering System from Scratch

10-24

Import AI: 117: Surveillance search engines; harvesting real-world road data with hovering drones; and improving language with unsupervised pre-training

10-22

If you did not already know

10-17

Data Notes: The Secret of Academic Success

10-17

Distilled News

10-14

If you did not already know

10-14

Data Notes: Are Those Honey Bees Healthy?

10-04

R Packages worth a look

09-24

Data Notes: How Do Autoencoders Work?

09-20

Google Dataset Search : Google’s New Data Search Engine

09-10

If you did not already know

09-06

Data Notes: The Secret to Getting to a Second Date

09-06

Google Dataset Search now in public beta

09-06

Use Kaggle to start (and guide) your ML/ Data Science journey — Why and How

08-29

Access Amazon S3 data managed by AWS Glue Data Catalog from Amazon SageMaker notebooks

08-29

Data Notes: Drought and the War in Syria

08-23

Document worth reading: “Radial Basis Function Approximations: Comparison and Applications”

08-15

Data Notes: From Hate Speech to Russian Troll Tweets

08-09

Import AI:

07-23

Data Notes: How to Forecast the S&P 500 with Prophet

07-12

Data Notes: Your smartphone knows *what*?

06-28

My Thoughts on Synthetic Data

06-27

Open Source Datasets with Kaggle

06-21

BDD100K Blog Update

06-18

Data Notes: Predict the World Cup 2018 Winner

06-14

Import AI:

05-29

Apache Arrow and the "10 Things I Hate About pandas"

09-21

Streaming Columnar Data with Apache Arrow

01-27

Collaborative Filtering using Alternating Least Squares

09-17

Making Python on Apache Hadoop Easier with Anaconda and CDH

02-17

Data Mining with Python on Medical Datasets for Data Mining

01-25

Intro to Recommender Systems: Collaborative Filtering

11-02

Installing Python Packages from a Jupyter Notebook

12-05

Intro to Recommender Systems: Collaborative Filtering

11-02

Intro to Recommender Systems: Collaborative Filtering

11-02

Slides from my talks about Demystifying Big Data and Deep Learning (and how to get started)

11-20

McKinsey Datathon: The City Cup17 November, Amsterdam, Stockholm and Zurich. Apply Now

10-19

Will Models Rule the World? Data Science Salon Miami, Nov 6-7

10-19

How to Solve the ModelOps Challenge

10-18

Citizen Data Scientists | Why Not DIY AI?

10-17

Open Workshop: Deep Learning in R and Keras, November 14th in Frankfurt

10-13

How many college football teams can you watch in-person in one football season?

03-21

Thanksgiving Special 🦃: GANs are Being Fixed in More than One Way

11-23

Data Science in Healthcare

11-14

Analyzing Interactive Brokers XML Flex Statements with pandas

11-02

Analyzing Interactive Brokers XML Flex Statements with pandas

11-02

Hello, world!

01-16

Analyzing Interactive Brokers XML Flex Statements with pandas

11-02

4 Strategies to Deal With Large Datasets Using Pandas

12-19

Introduction to Pandas, NumPy and RegEx in Python

12-17

What's the future of the pandas library?

12-12

Timings of a Grouped Rank Filter Task

08-23

SQLite vs Pandas: Performance Benchmarks

05-23

Time Series for scikit-learn People (Part I): Where's the X Matrix?

01-28

Apache Arrow and the "10 Things I Hate About pandas"

09-21

Streaming Columnar Data with Apache Arrow

01-27

2017 Outlook: pandas, Arrow, Feather, Parquet, Spark, Ibis

12-27

The Power of IPython Notebook + Pandas + and Scikit-learn

06-11

On Software Demos and Potemkin Villages

04-06

Why pandas users should be excited about Apache Arrow

02-22

Calling RSiteCatalyst From Python

02-22

Analyzing Interactive Brokers XML Flex Statements with pandas

11-02

The problem with the data science language wars

11-02

How to scrape data from a website using Python

09-07

Top 12 Essential Command Line Tools for Data Scientists

06-20

No juice for you, CSV format. It just makes you more awful.

09-23

The problem with the data science language wars

11-02

How to deploy a predictive service to Kubernetes with R and the AzureContainers package

12-12

Creating Tables Using R and Pure HTML

12-05

Distilled News

11-19

Creating GIFs with OpenCV

11-05

Automated Email Reports with R

11-01

cransays - Follow your R Package Journey to CRANterbury with our Dashboard!

10-11

Deploy your own TensorFlow object detection model to AWS DeepLens

09-27

Help improve lives through Machine Learning by joining the AWS DeepLens Challenge!

09-25

Deep Learning for Emojis with VS Code Tools for AI – Part 2

06-05

How-to: Automate Your sparklyr Environment with Cloudera Director

12-15

TensorFlow in a Nutshell — Part One: Basics

08-22

The problem with the data science language wars

11-02

R Packages worth a look

01-04

How to deploy a predictive service to Kubernetes with R and the AzureContainers package

12-12

Creating Tables Using R and Pure HTML

12-05

Creating GIFs with OpenCV

11-05

Short Article Reveals the Undeniable Facts About College Essay Writing Service and How It Can Affect You

10-04

Introducing a tensorflow library for deep learning and reinforcement learning

07-17

How-to: Automate Your sparklyr Environment with Cloudera Director

12-15

The problem with the data science language wars

11-02

Document worth reading: “A rational analysis of curiosity”

08-20

Announcing Practical Data Science with R, 2nd Edition

08-15

Do Bayesians Overfit?

07-11

Movie Genre Ratings - Addendum

02-24

Adobe Analytics Clickstream Data Feed: Calculations and Outlier Analysis

05-24

A Torch autoencoder example

11-06

A Torch autoencoder example

11-06

Document worth reading: “Computing the Unique Information”

12-12

Deep Learning Cheat Sheets

11-28

Document worth reading: “Do Deep Learning Models Have Too Many Parameters An Information Theory Viewpoint”

09-22

House Price Prediction using a Random Forest Classifier

11-29

NPR Sunday Puzzle Solving, And Other Baby Name Questions

10-02

How-to: Do Scalable Graph Analytics with Apache Spark

10-03

A Torch autoencoder example

11-06

Introduction to Semi-Supervised Learning with Ladder Networks

01-19

A Torch autoencoder example

11-06

The Deep Learning Gold Rush of 2015

11-07

If you did not already know

01-04

If you did not already know

09-20

If you did not already know

08-27

If you did not already know

08-11

Convolve all the things

05-31

Visualizing Features from a Convolutional Neural Network

06-15

Understanding Convolutional Neural Networks for NLP

11-07

If you did not already know

11-08

Implement Simple Convolution with Java

09-27

Save time and money by filtering faces during indexing with Amazon Rekognition

09-18

Java Handwritten Digit Recognition with Convolutional Neural Networks

12-13

Basic Math on How Bloom Filter Works

08-27

A Beginner's Guide To Understanding Convolutional Neural Networks Part 2

07-29

Understanding Convolutional Neural Networks for NLP

11-07

Day 03 – little helper multiplot

12-03

NYC buses: Cubist regression with more predictors

11-30

Understanding object detection in deep learning

11-19

Avoid unsigned integers in C++ if you can

03-17

Understanding Convolutional Neural Networks for NLP

11-07

Understanding Convolutional Neural Networks for NLP

11-07

vitae: Dynamic CVs with R Markdown

01-10

Don’t reinvent the wheel: making use of shiny extension packages. Join MünsteR for our next meetup!

01-08

BH 1.69.0-1 on CRAN

01-07

Long-awaited updates to htmlTable

01-07

Your and my 2019 R goals

01-01

9 Reasons Excel Users Should Consider Learning Programming

12-27

Finally, You Can Plot H2O Decision Trees in R

12-26

Re-creating a Voronoi-Style Map with R

12-22

BH 1.69.0-0 pre-releases and three required changes

12-20

In case you missed it: November 2018 roundup

12-14

Pdftools 2.0: powerful pdf text extraction tools

12-14

Using ggplot2 for functional time series

12-12

covrpage, more information on unit testing

12-10

The Need for Speed Part 1: Building an R Package with Fortran (or C)

12-10

Interesting packages taken from R/Pharma

12-09

Distilled News

12-05

Detecting spatiotemporal groups in relocation data with spatsoc

12-04

A tutorial on tidy cross-validation with R

11-25

Hacking Bioconductor

11-19

Rcpp now used by 1500 CRAN packages

11-15

Use GitHub Vulnerability Alerts to Keep Users of Your R Packages Safe

11-14

AzureR: R packages to control Azure services

11-08

Rcpp 1.0.0: The Tenth Birthday Release

11-08

AzureR: R packages to control Azure services

11-08

Quick overview on the new Bioconductor 3.8 release

11-02

Python vs R: Head to Head Data Analysis

11-01

Azure ML Studio now supports R 3.4

11-01

What R version do you really need for a package?

11-01

How to Start Learning R for Data Science

10-31

CRAN’s New Missing Data Task View

10-26

Drawing beautiful maps programmatically with R, sf and ggplot2 — Part 1: Basics

10-25

Distilled News

10-24

Packages for Testing your R Package

10-22

Summer Intern Projects

10-22

Examining Inter-Rater Reliability in a Reality Baking Show

10-18

Announcing RStudio Package Manager

10-17

A small logical change with big impact

10-16

A small logical change with big impact

10-16

A small logical change with big impact

10-16

Package support offer

10-15

How we use emojis

10-15

I fell out with tapply and in love with dplyr

10-15

RcppNLoptExample 0.0.1: Use NLopt from C/C++

10-13

Speed Up With Microsoft

10-04

Functions and Packages

09-29

Distilled News

09-28

Three Mighty Good Reasons to Learn R for Data Science

09-19

R Packages worth a look

08-25

Distilled News

08-16

Microsoft R Open 3.5.1 now available

08-14

A Certification for R Package Quality

07-30

rqdatatable: rquery Powered by data.table

06-03

My eRum 2018 biggest highlights

05-19

How many CRAN package maintainers have been pwned?

04-18

Package Paths in R

03-31

Learn to R blog series - R and RStudio

03-29

Introducing Python for data scientists - Pt1

03-15

Installing Python Packages from a Jupyter Notebook

12-05

implyr: R Interface for Apache Impala

07-19

Use your favorite Python library on PySpark cluster with Cloudera Data Science Workbench

04-26

Our R package roundup

12-31

Python 2.7 still reigns supreme in pip installs

09-03

conda-forge and PyData's CentOS moment

04-20

Our R package roundup

12-30

Denoising Dirty Documents: Part 11

11-08

Denoising Dirty Documents: Part 11

11-08

RNNs in Tensorflow, a Practical Guide and Undocumented Features

08-21

Denoising Dirty Documents: Part 11

11-08

Recurrent Neural Networks in Tensorflow III - Variable Length Sequences

11-15

Denoising Dirty Documents: Part 11

11-08

Announcing the ultimate seminar speaker contest: 2019 edition!

01-06

Back by popular demand . . . The Greatest Seminar Speaker contest!

01-04

An Overview of the Singapore Hiring Landscape

11-21

LinkedIn Top Voices 2018: Data Science & Analytics

11-13

Most liked R-bloggers’ posts from last week (2018-10-07 till 2018-10-13 – based on twitter)

10-15

Hi

09-27

Reddit science discussions as a dataset

06-22

Announcement

04-27

Approaching fairness in machine learning

09-06

Neural networks, linear transformations and word embeddings

11-09

“Becoming a Data Scientist” Learning Club?

11-09

A short proof for Nesterov’s momentum

11-21

Cognitive Services in Containers

11-19

Cognitive Services in Containers

11-19

Using Stacking to Average Bayesian Predictive Distributions (with Discussion)

09-26

Visual search on AWS—Part 2: Deployment with AWS DeepLens

09-05

Recurrent Neural Networks for Churn Prediction

02-22

“Becoming a Data Scientist” Learning Club?

11-09

“Becoming a Data Scientist” Learning Club?

11-09

Artificial Stupidity and the Mechanistic Fallacy

11-09

Artificial Stupidity and the Mechanistic Fallacy

11-09

Certifiably Gone Phishing

12-23

R Packages worth a look

10-28

A Right to Reasonable Inferences

10-01

Making Smart Phones Dumb Again

09-07

Artificial Stupidity and the Mechanistic Fallacy

11-09

Artificial Stupidity and the Mechanistic Fallacy

11-09

If you did not already know

01-01

R Packages worth a look

12-20

R Packages worth a look

10-18

How to Optimise Ad CTR with Reinforcement Learning

09-24

How to Optimise Ad CTR with Reinforcement Learning

09-17

Unfolding Naïve Bayes From Scratch!

09-02

Further Exploring Common Probabilistic Models

06-06

An introduction to Bayesian Belief Networks

03-10

Turning Distances into Distributions

09-19

Probability Calibration And Isotonic Regression

09-18

The Probability Monad and Why it's Important for Data Science

09-05

Continuous Bayes’ Theorem

01-20

Estimating known unknowns

12-11

Common Probability Distributions: The Data Scientist’s Crib Sheet

12-03

History of Monte Carlo Methods - Part 3

11-13

Artificial Stupidity and the Mechanistic Fallacy

11-09

Neural networks, linear transformations and word embeddings

11-09

Neural networks, linear transformations and word embeddings

11-09

Enter the

12-10

Neural networks, linear transformations and word embeddings

11-09

Neural networks, linear transformations and word embeddings

11-09

MCMC sampling for dummies

11-10

MCMC sampling for dummies

11-10

MCMC sampling for dummies

11-10

Golf Balls

11-11

Golf Balls

11-11

Golf Balls

11-11

2018.

12-31

The Role of Theory in Data Analysis

12-11

Heatmaps of Mortality Rates

12-04

Model evaluation, model selection, and algorithm selection in machine learning

11-10

Debate about genetics and school performance

10-27

If you did not already know

10-24

What to think about this new study which says that you should limit your alcohol to 5 drinks a week?

10-23

Ask the Question, Visualize the Answer

10-17

Help! I can’t reproduce a machine learning project!

09-19

A couple more papers on genetic diversity as an explanation for why Africa and remote Andean countries are so poor while Europe and North America are so wealthy

09-19

Distilled News

09-17

Distilled News

08-31

What is a Box Plot?

08-24

How feminism has made me a better scientist

08-13

How feminism has made me a better scientist

08-13

Of statistics class and judo class: Beyond the paradigm of sequential education

07-22

Talking about clinical significance

06-01

Sutton’s Temporal-Difference Learning

02-19

Natural and Artificial Intelligence

02-06

Deep Learning Research Review Week 3: Natural Language Processing

01-10

How to Use t-SNE Effectively

10-13

Making Bayesian A/B testing more accessible

06-19

Golf Balls

11-11

Document worth reading: “Advice from the Oracle: Really Intelligent Information Retrieval”

11-10

k-server, part 3: entropy regularization for weighted k-paging

01-29

Basic Math on How Bloom Filter Works

08-27

Golf Balls

11-11

Interactive panel EDA with 3 lines of code

12-09

Association rule analysis beyond transaction data

11-11

Association rule analysis beyond transaction data

11-11

Weekly Review: 10/21/2017

10-21

Building Event-driven Microservices Using CQRS and Serverless

02-01

Building Spring Cloud Microservices That Strangle Legacy Systems

08-30

Association rule analysis beyond transaction data

11-11

Amazing consistency: Largest Dataset Analyzed / Data Mined – Poll Results and Trends

10-29

Business Analysis (BA) Career Path

10-11

Association rule analysis beyond transaction data

11-11

If you did not already know

01-06

If you did not already know

01-01

R Packages worth a look

12-20

“My advisor and I disagree on how we should carry out repeated cross-validation. We would love to have a third expert opinion…”

12-15

The 5 Basic Statistics Concepts Data Scientists Need to Know

11-13

Document worth reading: “Restricted Boltzmann Machines: Introduction and Review”

10-27

R Packages worth a look

10-18

Unfolding Naïve Bayes From Scratch!

09-02

Basic Statistics in Python: Probability

07-18

Kung Fury Review (2015) : Don’t Hassle the Hoff

05-05

Further Exploring Common Probabilistic Models

06-06

An introduction to Bayesian Belief Networks

03-10

Turning Distances into Distributions

09-19

Probability Calibration And Isotonic Regression

09-18

Continuous Bayes’ Theorem

01-20

Estimating known unknowns

12-11

Hamiltonian Monte Carlo

12-10

History of Monte Carlo Methods - Part 3

11-13

History of Monte Carlo Methods - Part 3

11-13

German Temperature Data

05-12

History of Monte Carlo Methods - Part 3

11-13

History of Monte Carlo Methods - Part 3

11-13

Short Story on AI: A Cognitive Discontinuity.

11-14

If you did not already know

01-07

Scaling Multi-Agent Reinforcement Learning

12-12

Import AI: 123: Facebook sees demands for deep learning services in its data centers grow by 3.5X; why advanced AI might require a global policeforce; and diagnosing natural disasters with deep learning

12-03

If you did not already know

11-12

Document worth reading: “Neural Approaches to Conversational AI”

10-29

Society of Machines: The Complex Interaction of Agents

10-04

If you did not already know

09-21

Introduction to Learning to Trade with Reinforcement Learning

02-11

Evolving Stable Strategies

11-12

Cognitive Machine Learning: Prologue

10-08

Short Story on AI: A Cognitive Discontinuity.

11-14

Short Story on AI: A Cognitive Discontinuity.

11-14

Short Story on AI: A Cognitive Discontinuity.

11-14

Short Story on AI: A Cognitive Discontinuity.

11-14

James Bond movies

11-14

James Bond movies

11-14

If you did not already know

12-01

Document worth reading: “Attribute-aware Collaborative Filtering: Survey and Classification”

10-23

R Packages worth a look

09-03

Machine Learning Madden NFL: How Madden player ratings are actually calculated

01-10

Interacting with ML Models

10-26

DynamoDB Learnings

10-23

Wesley Crushes Ratings

04-19

James Bond movies

11-14

Machine Learning Madden NFL: How Madden player ratings are actually calculated

01-10

Interacting with ML Models

10-26

DynamoDB Learnings

10-23

James Bond movies

11-14

Automated Dashboard Visualizations with Ranking in R

12-07

James Bond movies

11-14

Document worth reading: “Psychological State in Text: A Limitation of Sentiment Analysis”

09-03

Emotional contagion in Twitter!

11-14

Document worth reading: “A Temporal Difference Reinforcement Learning Theory of Emotion: unifying emotion, cognition and adaptive behavior”

08-07

Emotional contagion in Twitter!

11-14

Emotional contagion in Twitter!

11-14

Classifying yin and yang using MRI

12-18

If you did not already know

12-15

R Packages worth a look

12-04

“She also observed that results from smaller studies conducted by NGOs – often pilot studies – would often look promising. But when governments tried to implement scaled-up versions of those programs, their performance would drop considerably.”

11-22

Forget Motivation and Double Your Chances of Learning Success

11-20

Facial feedback: “These findings suggest that minute differences in the experimental protocol might lead to theoretically meaningful changes in the outcomes.”

11-01

What to think about this new study which says that you should limit your alcohol to 5 drinks a week?

10-23

Does Sharing Goals Help or Hurt Your Chances of Success?

10-22

How many deaths were caused by the hurricane in Puerto Rico?

09-14

What if a big study is done and nobody reports it?

09-10

Some clues that this study has big big problems

08-29

No, I don’t think it’s the file drawer effect

08-16

No, I don’t think it’s the file drawer effect

08-16

Let’s be open about the evidence for the benefits of open science

08-06

Amelia, it was just a false alarm

07-31

Assorted links

05-30

Emotional contagion in Twitter!

11-14

How to use Keras fit and fit_generator (a hands-on tutorial)

12-24

Carol Nickerson explains what those mysterious diagrams were saying

12-22

Image Stitching with OpenCV and Python

12-17

Instance segmentation with OpenCV

11-26

Project planning with plotly

11-26

Mask R-CNN with OpenCV

11-19

Creating GIFs with OpenCV

11-05

Multi-object tracking with dlib

10-29

About a Curious Feature and Interpretation of Linear Regressions

10-29

Object tracking with dlib

10-22

Ask the Question, Visualize the Answer

10-17

How to scrape data from a website using Python

09-07

Top 12 Essential Command Line Tools for Data Scientists

06-20

How analog TV worked

05-01

Exploring Line Lengths in Python Packages

11-09

Crosslingual document comparison

08-31

Voronoi Diagrams

05-12

Anyone Can Learn To Code an LSTM-RNN in Python (Part 1: RNN)

11-15

Deep Learning without Backpropagation

03-21

Building Safe A.I.

03-17

Anyone Can Learn To Code an LSTM-RNN in Python (Part 1: RNN)

11-15

Anyone Can Learn To Code an LSTM-RNN in Python (Part 1: RNN)

11-15

An Even Dozen – Denoising Dirty Documents: Part 12

11-15

An Even Dozen – Denoising Dirty Documents: Part 12

11-15

An Even Dozen – Denoising Dirty Documents: Part 12

11-15

Stock Price prediction using ML and DL

01-07

If you did not already know

12-29

Best Deals in Deep Learning Cloud Providers: From CPU to GPU to TPU

11-15

Improving model interpretability with LIME

10-31

An Even Dozen – Denoising Dirty Documents: Part 12

11-15

Weird Number Bases

12-16

The Information Barons Threaten our Autonomy and Our Privacy

11-16

The Information Barons Threaten our Autonomy and Our Privacy

11-16

Detect suspicious IP addresses with the Amazon SageMaker IP Insights algorithm

11-19

Millions of social bots invaded Twitter!

03-14

The Information Barons Threaten our Autonomy and Our Privacy

11-16

Download 3 million Russian troll tweets

08-02

Data trusts could allay our privacy fears

06-03

The Information Barons Threaten our Autonomy and Our Privacy

11-16

If you did not already know

11-10

RcppNLoptExample 0.0.1: Use NLopt from C/C++

10-13

A quick tour of AI services in Azure

07-24

Talking to Machines – The Rise of Conversational Interfaces and NLP

11-17

$ vs. votes

11-27

Poker odds with wild cards

09-27

Talking to Machines – The Rise of Conversational Interfaces and NLP

11-17

R Packages worth a look

12-07

10 Best Mobile Apps for Data Scientist / Data Analysts

10-10

No code chatbots: TIBCO uses Amazon Lex to put chat interfaces into the hands of business users

09-05

Habits and Tools, Old and New

01-26

Talking to Machines – The Rise of Conversational Interfaces and NLP

11-17

9 new pandas updates that will save you time

01-25

Talking to Machines – The Rise of Conversational Interfaces and NLP

11-17

Neural Ordinary Differential Equations

12-15

Temple University: Faculty Positions (Assistant/Associate/Full Professor) [Philadelphia, PA]

10-12

It's not an Internet of Things, It's an Internet of People

11-17

Looking back on 2018, looking to 2019

01-07

Here’s why 2019 is a great year to start with R: A story of 10 year old R code then and now

01-05

LightOn: Forward We Go !

12-20

Gender Diversity in the R and Python Communities

12-05

Gender Diversity in the R and Python Communities

12-05

In case you missed it: October 2018 roundup

11-15

Why R? 2018 Conference – After Movie and Summary

11-07

Graphs Are The Next Frontier In Data Science

10-18

Visualising Networks in ASOIAF – Part II

10-14

Paris Machine Learning

10-10

R Consortium grant applications due October 31

10-09

R Consortium grant applications due October 31

10-09

A few upcoming R conferences

10-05

A few upcoming R conferences

10-05

Year 3 of Data, Beer, & Inspiration

07-23

RSiteCatalyst Version 1.4.14 Release Notes

02-16

Smart Cities at the Nexus of Emerging Data Technologies and You

09-18

Data Analysis, NHS and Industrial Partners

04-28

It's not an Internet of Things, It's an Internet of People

11-17

Don’t Peek: Deep Learning without looking … at test data

10-08

Fact over Fiction

04-22

It's not an Internet of Things, It's an Internet of People

11-17

So You Want to Implement a Custom Loss Function?

11-18

Good Feature Building Techniques and Tricks for Kaggle

12-31

Don’t Peek part 2: Predictions without Test Data

11-18

Basic Image Data Analysis Using Python – Part 4

10-05

From Gaussian Algebra to Gaussian Processes, Part 1

03-31

Avoiding overfitting in object detection problem

12-19

László Babai's New Proof

12-16

So You Want to Implement a Custom Loss Function?

11-18

If you did not already know

01-01

My book ‘Deep Learning from first principles:Second Edition’ now on Amazon

12-14

Document worth reading: “The Dynamics of Learning: A Random Matrix Approach”

12-04

If you did not already know

11-17

Gradient optimisation on the Poincaré disc

04-10

Matrix Factorization in PyTorch

06-20

What is the natural gradient, and how does it work?

12-30

Hyperparameter optimization with approximate gradient

05-24

So You Want to Implement a Custom Loss Function?

11-18

Visualizing the 2015 NL Cy Young Race

11-19

Building a neighbour matrix with python

11-04

Modeling Airbnb prices

10-12

Data Science Portfolio Project: Is Fandango Still Inflating Ratings?

08-15

Synthetic Gradients with Tensorflow

04-08

A Neural Network for predicting Restaurant Reservations

11-30

House Price Prediction using a Random Forest Classifier

11-29

Make a Profitable Portfolio using Python

06-08

Hail: Scalable Genomics Analysis with Apache Spark

05-02

Audio Signals in Python

04-17

First Steps With Neural Nets in Keras

03-04

Visualizing the 2015 NL Cy Young Race

11-19

GARCH and a rudimentary application to Vol Trading

12-03

Visualizing the 2015 NL Cy Young Race

11-19

Day 05 – little helper get_network

12-05

Voronoi diagram with ggvoronoi package with Train Station data

11-10

Visualizing the 2015 NL Cy Young Race

11-19

Visualizing the 2015 NL Cy Young Race

11-19

Datascope Promotes Brian Lange to Partner

11-19

Datascope Promotes Bo Peng to Partner

11-19

Datascope Promotes Brian Lange to Partner

11-19

Datascope Promotes Bo Peng to Partner

11-19

GitHub Streak: Round Five

10-13

Lending Club Data Analysis Revisited with Python

11-22

Lending Club Data Analysis Revisited with Python

11-22

Lending Club Data Analysis Revisited with Python

11-22

Lending Club Data Analysis Revisited with Python

11-22

A couple of thoughts regarding the hot hand fallacy fallacy

12-14

Document worth reading: “Causal inference and the data-fusion problem”

10-26

Why AI Isn’t A Black Box (And Its Business Value)

07-17

Rules to Learn By

05-31

A.I. 'Bias' Doesn't Mean What Journalists Say It Means

08-30

Goals of Interpretability

11-17

A Challenge to Data Scientists

11-22

How to Remove Unfair Bias From Your AI

01-11

A Case For Explainable AI & Machine Learning

12-27

A.I. 'Bias' Doesn't Mean What Journalists Say It Means

08-30

Goals of Interpretability

11-17

A Challenge to Data Scientists

11-22

Whats new on arXiv

01-13

Whats new on arXiv

01-12

Linguistic Signals of Album Quality: A Predictive Analysis of Pitchfork Review Scores Using Quanteda

01-10

Distilled News

01-09

“The Book of Why” by Pearl and Mackenzie

01-08

Whats new on arXiv

01-08

If you did not already know

01-08

Dow Jones Stock Market Index (3/4): Log Returns GARCH Model

01-08

On deck for the first half of 2019

01-07

Whats new on arXiv

01-04

Whats new on arXiv

01-04

Whats new on arXiv

01-03

Whats new on arXiv

01-03

Whats new on arXiv

01-01

If you did not already know

01-01

Simulating Multi-state Models with R

01-01

Whats new on arXiv

12-31

Leaf Plant Classification: Statistical Learning Model – Part 2

12-31

Whats new on arXiv

12-30

Distilled News

12-29

Whats new on arXiv

12-29

A Case For Explainable AI & Machine Learning

12-27

Finally, You Can Plot H2O Decision Trees in R

12-26

Whats new on arXiv

12-25

Text classification with tidy data principles

12-24

Top 10 Data Science Tools (other than SQL Python R)

12-21

Distilled News

12-20

Whats new on arXiv

12-20

Distilled News

12-19

Whats new on arXiv

12-19

Distilled News

12-18

If you did not already know

12-17

Meta-Learning For Better Machine Learning

12-17

Whats new on arXiv

12-17

Data Scientist’s Dilemma – The Cold Start Problem

12-15

NLP Breakthrough Imagenet Moment has arrived

12-14

Distilled News

12-14

R Packages worth a look

12-13

R Packages worth a look

12-13

If you did not already know

12-12

R Packages worth a look

12-11

Whats new on arXiv

12-10

Whats new on arXiv

12-10

Distilled News

12-10

Whats new on arXiv

12-10

Whats new on arXiv

12-08

If you did not already know

12-08

Whats new on arXiv

12-07

If you did not already know

12-07

Distilled News

12-07

The Machine Learning Project Checklist

12-07

Bayesian Nonparametric Models in NIMBLE, Part 2: Nonparametric Random Effects

12-07

Distilled News

12-06

Building Gene Expression Atlases with Deep Generative Models for Single-cell Transcriptomics

12-05

Data Science Projects Employers Want To See: How To Show A Business Impact

12-04

Bayesian Nonparametric Models in NIMBLE, Part 1: Density Estimation

12-04

Whats new on arXiv

12-03

Best Machine Learning languages, Data Visualization Tools, DL Frameworks, and Big Data Tools

12-03

R Packages worth a look

12-03

If you did not already know

12-03

Interpretability is crucial for trusting AI and machine learning

12-01

NYC buses: C5.0 classification with R; more than 20 minute delay?

12-01

Whats new on arXiv

12-01

Deep Learning for the Masses (… and The Semantic Layer)

11-30

Whats new on arXiv

11-29

Whats new on arXiv

11-29

Whats new on arXiv

11-28

Multilevel models for multiple comparisons! Varying treatment effects!

11-28

Whats new on arXiv

11-27

“Economic predictions with big data” using partial pooling

11-26

Whats new on arXiv

11-26

Data Pro Cyber Monday – Choose Your Savings

11-26

Whats new on arXiv

11-26

Whats new on arXiv

11-23

R Packages worth a look

11-21

Machine Learning in Action: Going Beyond Decision Support Data Science

11-20

Quantcast: Sr Applied Scientist, Audience Platform [Seattle, WA]

11-20

Build Your Own Natural Language Models on AWS (no ML experience required)

11-19

Detect suspicious IP addresses with the Amazon SageMaker IP Insights algorithm

11-19

Distilled News

11-19

If you did not already know

11-19

Whats new on arXiv

11-17

Whats new on arXiv

11-16

Using Uncertainty to Interpret your Model

11-16

Whats new on arXiv

11-15

Whats new on arXiv

11-15

Distilled News

11-14

Whats new on arXiv

11-14

Whats new on arXiv

11-13

Preview my new book: Introduction to Reproducible Science in R

11-12

Whats new on arXiv

11-12

Whats new on arXiv

11-11

Practical statistics books for software engineers

11-08

Now easily perform incremental learning on Amazon SageMaker

11-07

EMNLP 2018 Highlights: Inductive bias, cross-lingual learning, and more

11-07

Whats new on arXiv

11-07

Whats new on arXiv

11-06

Turn data into revenue. Wharton can show you how.

11-06

Whats new on arXiv

11-05

Maps, models, and analytic problem framing

11-05

Visualize the Business Value of your Predictive Models with modelplotr

11-03

Whats new on arXiv

11-03

If you did not already know

10-31

Multilevel Modeling Solves the Multiple Comparison Problem: An Example with R

10-31

Whats new on arXiv

10-30

Whats new on arXiv

10-27

Whats new on arXiv

10-26

Marketing Analytics and Data Science

10-26

Notes on Feature Preprocessing: The What, the Why, and the How

10-26

Whats new on arXiv

10-25

AI, Machine Learning and Data Science Roundup: October 2018

10-25

Distilled News

10-24

Whats new on arXiv

10-23

Whats new on arXiv

10-23

Whats new on arXiv

10-22

Whats new on arXiv

10-22

Data Science With R Course Series – Week 6

10-22

Whats new on arXiv

10-21

Distilled News

10-21

Multilevel models with group-level predictors

10-21

Whats new on arXiv

10-19

Distilled News

10-18

Four machine learning strategies for solving real-world problems

10-17

RStudio 1.2 Preview: Stan

10-16

Whats new on arXiv

10-16

Whats new on arXiv

10-11

Evaluating the Business Value of Predictive Models in Python and R

10-11

Whats new on arXiv

10-10

Whats new on arXiv

10-09

Whats new on arXiv

10-09

Whats new on arXiv

10-09

Whats new on arXiv

10-05

Amazon SageMaker Neural Topic Model now supports auxiliary vocabulary channel, new topic evaluation metrics, and training subsampling

10-04

Whats new on arXiv

10-04

Whats new on arXiv

10-03

Whats new on arXiv

10-02

Modeling muti-category Outcomes With vtreat

10-01

Whats new on arXiv

09-28

Whats new on arXiv

09-27

Distilled News

09-26

Whats new on arXiv

09-25

Whats new on arXiv

09-25

Whats new on arXiv

09-24

Distilled News

09-22

Whats new on arXiv

09-21

Whats new on arXiv

09-21

Whats new on arXiv

09-20

Whats new on arXiv

09-19

Whats new on arXiv

09-18

Whats new on arXiv

09-17

Distilled News

09-15

If you did not already know

09-15

If you did not already know

09-15

Whats new on arXiv

09-14

Whats new on arXiv

09-14

Divergent and Convergent Phases of Data Analysis

09-14

Whats new on arXiv

09-13

Whats new on arXiv

09-12

Whats new on arXiv

09-11

Distilled News

09-08

Whats new on arXiv

09-08

Whats new on arXiv

09-07

Whats new on arXiv

09-07

Bothered by non-monotonicity? Here’s ONE QUICK TRICK to make you happy.

09-07

R Packages worth a look

09-05

Distilled News

09-04

Whats new on arXiv

09-03

If you did not already know

09-02

Whats new on arXiv

09-01

R Tip: How to Pass a formula to lm

09-01

Whats new on arXiv

08-31

Whats new on arXiv

08-29

Whats new on arXiv

08-28

Whats new on arXiv

08-28

Bayesian model comparison in ecology

08-26

R Packages worth a look

08-25

Whats new on arXiv

08-24

Whats new on arXiv

08-24

Whats new on arXiv

08-22

Distilled News

08-21

Whats new on arXiv

08-21

Whats new on arXiv

08-21

Whats new on arXiv

08-21

Whats new on arXiv

08-17

Whats new on arXiv

08-17

Whats new on arXiv

08-16

Document worth reading: “Sequences, yet Functions: The Dual Nature of Data-Stream Processing”

08-16

Whats new on arXiv

08-15

Distilled News

08-14

Whats new on arXiv

08-14

Hierarchical Bayesian Neural Networks with Informative Priors

08-13

Whats new on arXiv

08-13

Document worth reading: “Theoretical Impediments to Machine Learning With Seven Sparks from the Causal Revolution”

08-11

Distilled News

08-11

Whats new on arXiv

08-09

Distilled News

08-08

Whats new on arXiv

08-08

When Recurrent Models Don't Need to be Recurrent

08-06

Distilled News

08-04

Whats new on arXiv

08-03

When LOO and other cross-validation approaches are valid

08-03

Whats new on arXiv

08-02

Whats new on arXiv

08-01

New Dynamics for Topic Models

07-31

Whats new on arXiv

07-31

Quantum Computing: Cats, Crushes, and Chemistry

07-30

He wants to model a proportion given some predictors that sum to 1

07-10

Design Patterns for Production NLP Systems

07-09

Add Constrained Optimization To Your Toolbelt

06-21

The Role of Resources in Data Analysis

06-18

The Dynamics of Philippine Senate Bills: Gensim, Topic Modeling and All That Good NLP Stuff

06-09

Forbes: DataRobot Puts the Power of Machine Learning in the Hands of Business Analysts

06-04

Using Linear Regression for Predictive Modeling in R

05-16

Multithreaded in the Wild

04-09

Time Series for scikit-learn People (Part II): Autoregressive Forecasting Pipelines

03-22

When Men and Women talk to Siri

03-09

Fast Company's 2018 World's Most Innovative Companies List

02-20

57 Summaries of Machine Learning and NLP Research

01-17

AutoML on AWS

12-04

Sequence Modeling with CTC

11-27

NIPS 2017 Workshop on Approximate Inference

09-25

How To Predict ICU Mortality with Digital Health Data, DL4J, Apache Spark and Cloudera

09-18

When (not) to use Deep Learning for NLP

09-04

The Advent of Analytics Engineering

09-01

How much compute do we need to train generative models?

08-31

Prophecy Fulfilled: Keras and Cloudera Data Science Workbench

07-25

Random Effects Neural Networks in Edward and Keras

06-15

Transfer Learning for Flight Delay Prediction via Variational Autoencoders

05-08

The Benefits of Migrating HPC Workloads To Apache Spark

05-04

Sentiment analysis on Twitter using word2vec and keras

04-20

Sentiment Analysis model deployed!

04-17

Time Series Analysis with Generalized Additive Models

04-04

Ordered Categorical GLMs for Product Feedback Scores

03-17

Random-Walk Bayesian Deep Networks: Dealing with Non-Stationary Data

03-14

Intercausal Reasoning in Bayesian Networks

03-13

Deep and Hierarchical Implicit Models

02-28

Why hierarchical models are awesome, tricky, and Bayesian

02-08

RescueTime Inference via the "Poor Man's Dirichlet"

02-03

Doing magic and analyzing seasonal time series with GAM (Generalized Additive Model) in R

01-24

Attending to characters in neural sequence labeling models

01-06

On Model Mismatch and Bayesian Analysis

12-13

3D printing glass and bronze: Lost-PLA casting

12-11

Two papers released on arXiv, "Operator Variational Inference" and "Model Criticism for Bayesian Causal Inference"

10-30

A Billion Words and The Limits of Language Modeling

09-23

Discussion of "Fast Approximate Inference for Arbitrarily Large Semiparametric Regression Models via Message Passing"

09-19

Why I’m Not a Fan of R-Squared

07-24

Linear regression can be understood in many ways (optimization, probabilistic, bayesian)

07-20

Translating W2v Embedding From One Space To Another

06-06

Bayesian Deep Learning

06-01

How-to: Train Models in R and Python using Apache Spark MLlib and H2O

01-29

A Challenge to Data Scientists

11-22

Easy CI/CD of GPU applications on Google Cloud including bare-metal using Gitlab and Kubernetes

12-14

Document worth reading: “Deep Learning for Image Denoising: A Survey”

11-04

If you did not already know

08-23

Thanks, NVIDIA

08-01

Parallelizing Distance Calculations Using A GPU With CUDAnative.jl

08-14

Building a Data Science Workstation (2017)

01-18

Build your own Deep Learning Box

05-19

How to Setup Theano to Run on GPU on Ubuntu 14.04 with Nvidia Geforce GTX 780

11-24

How to Setup Theano to Run on GPU on Ubuntu 14.04 with Nvidia Geforce GTX 780

11-24

Where does .Renviron live on Citrix?

01-08

OpenCPU 2.1 Release: Scalable R Services

11-22

Hacking Bioconductor

11-19

Building a Repository of Alpine-based Docker Images for R, Part II

11-14

Tesseract 4 is here! State of the art OCR in R!

11-06

Stop Installing Tensorflow Using pip for Performance Sake!

10-30

New package in CRAN: PkgsFromFiles

10-13

Creating a MapD ODBC Connection in RStudio Server

08-21

How To Remotely Send R and Python Execution to SQL Server from Jupyter Notebooks

07-17

Using WSL Linux on Windows 10 for Deep Learning Development.

07-04

How to use an R interface with Airtable API

05-23

Static Blog: Jekyll, Hyde and GitHub Pages

02-01

Create conda recipe to use C extended Python library on PySpark cluster with Cloudera Data Science Workbench

05-15

How-to: Automate Your sparklyr Environment with Cloudera Director

12-15

How to Setup Theano to Run on GPU on Ubuntu 14.04 with Nvidia Geforce GTX 780

11-24

Where does .Renviron live on Citrix?

01-08

Hacking Bioconductor

11-19

Building a Repository of Alpine-based Docker Images for R, Part II

11-14

Stop Installing Tensorflow Using pip for Performance Sake!

10-30

How R gets built on Windows

10-11

How R gets built on Windows

10-11

Functions and Packages

09-29

Guide to a high-performance, powerful R installation

08-31

Creating a MapD ODBC Connection in RStudio Server

08-21

Static Blog: Jekyll, Hyde and GitHub Pages

02-01

Create conda recipe to use C extended Python library on PySpark cluster with Cloudera Data Science Workbench

05-15

Getting Started with Sonnet, Deep Mind’s Deep Learning Library

04-10

Tutorial: Deep Learning in PyTorch

01-15

How-to: Automate Your sparklyr Environment with Cloudera Director

12-15

Build your own Deep Learning Box

05-19

How to Setup Theano to Run on GPU on Ubuntu 14.04 with Nvidia Geforce GTX 780

11-24

Hacking Bioconductor

11-19

Using OSX? Compiling an R package from source? Issues with ‘-fopenmp’? Try this.

11-18

Building a Repository of Alpine-based Docker Images for R, Part II

11-14

Stop Installing Tensorflow Using pip for Performance Sake!

10-30

New package in CRAN: PkgsFromFiles

10-13

Creating a MapD ODBC Connection in RStudio Server

08-21

How To Remotely Send R and Python Execution to SQL Server from Jupyter Notebooks

07-17

Using WSL Linux on Windows 10 for Deep Learning Development.

07-04

Setting Up Selenium on RaspberryPi 2/3

12-22

Getting Started with Sonnet, Deep Mind’s Deep Learning Library

04-10

Tutorial: Deep Learning in PyTorch

01-15

How to Setup Theano to Run on GPU on Ubuntu 14.04 with Nvidia Geforce GTX 780

11-24

Scale out your Pandas DataFrame operations using Dask

08-05

Python Pandas Tutorial: The Basics

11-23

Streaming Columnar Data with Apache Arrow

01-27

Clustering Zeppelin on Zeppelin

10-23

Easier data analysis in Python with pandas (video series)

05-10

Compiling DataFrame code is harder than it looks

03-16

Why Julia’s DataFrames are Still Slow

11-28

Data types

05-08

Why Julia’s DataFrames are Still Slow

11-28

R Tip: Use seqi() For Indexes

01-11

R Tip: Use seqi() For Indexes

01-11

Role of Computer Science in Data Science World

01-07

Check Machin-like formulae with arbitrary-precision arithmetic

01-03

Purr yourself into a math genius

01-03

3 More Google Colab Environment Management Tips

01-02

My book ‘Practical Machine Learning in R and Python: Third edition’ on Amazon

01-02

Papers with Code: A Fantastic GitHub Resource for Machine Learning

12-31

“Check yourself before you wreck yourself: Assessing discrete choice models through predictive simulations”

12-29

Zak David expresses critical views of some published research in empirical quantitative finance

12-24

Day 15 – little helper sci_palette

12-15

5½ Reasons to Ditch Spreadsheets for Data Science: Code is Poetry

12-10

The Need for Speed Part 1: Building an R Package with Fortran (or C)

12-10

Distilled News

12-06

Creating Tables Using R and Pure HTML

12-05

Graph-Powered Machine Learning

12-03

Tribes.ai: Sr Data Scientist [Remote, India / Eastern Europe]

12-01

Using R: the best thing I’ve changed about my code in years

12-01

Community Call Summary – Code Review in the Lab

11-29

styler 1.1.0

11-27

New version of pqR, with major speed improvements

11-25

RcppMsgPack 0.2.3

11-18

Paris ML E#2 S#6: Conscience, Code Analysis, Can a machine learn like a child?

11-14

4 ways to be more efficient using RStudio’s Code Snippets, with 11 ready to use examples

11-10

Best Practices for Using Notebooks for Data Science

11-08

xts 0.11-2 on CRAN

11-06

India vs US – Kaggle Users & Data Scientists

11-05

Distilled News

10-30

Conway’s Game of Life in R: Or On the Importance of Vectorizing Your R Code

10-28

Conway’s Game of Life in R: Or On the Importance of Vectorizing Your R Code

10-28

Visualizing The Catholic Lectionary – Part 1

10-27

RApiDatetime 0.0.4: Updates and Extensions

10-21

5 “Clean Code” Tips That Will Dramatically Improve Your Productivity

10-15

Distilled News

10-14

RcppNLoptExample 0.0.1: Use NLopt from C/C++

10-13

Join us for DataTech19, Scotland’s first technical data science conference as part of DataFest

10-12

Distilled News

10-01

Timing Column Indexing in R

09-21

If not Notebooks, then what? Look to Literate Programming

09-12

Distilled News

09-04

Turn Whiteboard UX Sketches into Working HTML in Seconds – Introducing Sketch2Code

08-30

R Packages worth a look

08-27

R Objects

08-24

Azure Functions for Data Science

08-06

What makes the Python Cool.

07-31

Why I Indent My Code 8 Spaces

07-27

Data Science Project Style Guide

07-09

Programming Best Practices For Data Science

06-08

Some web API package development lessons from HIBPwned

04-19

Why you should start using .npy file more often…

03-20

RSiteCatalyst Version 1.4.14 Release Notes

02-16

Optimization of Scientific Code with Cython: Ising Model

12-11

Python Tutorial: Learn Python in one Day

11-28

Python List Comprehension + Set + Dict Comprehension

11-16

Engineering is the bottleneck in (Deep Learning) Research

01-17

Likes Out! Guerilla Dataset!

10-09

Project Euler using Scala: Problem

07-19

Top 8 resources for learning data analysis with pandas

05-16

Feather: it's about metadata

04-26

Calling RSiteCatalyst From Python

02-22

Why Blog?

02-18

Give me five

12-04

Why Julia’s DataFrames are Still Slow

11-28

Tom Wolfe

11-19

Magister Dixit

11-07

The hot hand—in darts!

09-18

Jeremy Freese was ahead of the curve

08-10

No juice for you, CSV format. It just makes you more awful.

09-23

Ten Tips for Writing CS Papers, Part 2

12-10

Ten Tips for Writing CS Papers, Part 1

11-29

Ten Tips for Writing CS Papers, Part 1

11-29

Class visualization with bilateral filters

02-05

Ten Tips for Writing CS Papers, Part 1

11-29

Ten Tips for Writing CS Papers, Part 1

11-29

Ten Tips for Writing CS Papers, Part 1

11-29

Mazes

11-29

Comparison of the Text Distance Metrics

01-07

My book ‘Practical Machine Learning in R and Python: Third edition’ on Amazon

01-02

Sudoku Solver

12-30

Whats new on arXiv

12-22

10 More Must-See Free Courses for Machine Learning and Data Science

12-20

R Packages worth a look

12-14

If you did not already know

12-14

Machine Learning (ML) Essentials

12-11

Learning Machine Learning vs Learning Data Science

12-11

Document worth reading: “A Short Introduction to Local Graph Clustering Methods and Software”

12-10

Multithreaded in the Wild

12-03

If you did not already know

12-01

Math in Data Science

11-30

Distilled News

11-30

Whats new on arXiv

11-29

Distilled News

11-26

You Can’t Do AI Without Augmented Analytics and AutoML

11-26

Cathy O’Neil discusses the current lack of fairness in artificial intelligence and much more.

11-26

Building Blocks of Decision Tree

11-26

Improving Binning by Bootstrap Bumping

11-25

Distilled News

11-24

Magister Dixit

11-23

If you did not already know

11-23

Word Morphing – an original idea

11-20

Understanding object detection in deep learning

11-19

Easily monitor and visualize metrics while training models on Amazon SageMaker

11-19

Mastering The New Generation of Gradient Boosting

11-15

R Packages worth a look

11-09

Deep Learning Performance Cheat Sheet

11-08

Distilled News

11-07

If you did not already know

11-07

Quantum Machine Learning: A look at myths, realities, and future projections

11-05

If you did not already know

11-04

If you did not already know

11-04

Distilled News

10-30

Whats new on arXiv

10-22

Distilled News

10-18

Machine learning — Is the emperor wearing clothes?

10-12

Multithreaded in the Wild

10-05

Big Data Day Camp: Big Data Tools & Techniques (October 25-26)

10-04

Machine Learning and Deep Learning : Differences

09-28

Document worth reading: “Closing the AI Knowledge Gap”

09-14

Distilled News

09-08

Unfolding Naïve Bayes From Scratch!

09-02

If you did not already know

09-01

If you did not already know

09-01

R Packages worth a look

08-27

If you did not already know

08-24

DIFFERENCE BETWEEN DATA SCIENCE, DATA ANALYTICS AND MACHINE LEARNING

08-09

On Using Hyperopt: Advanced Machine Learning

08-04

Distilled News

08-02

R Packages worth a look

07-31

Where do I learn about log_sum_exp, log1p, lccdf, and other numerical analysis tricks?

07-12

A Real World Reinforcement Learning Research Program

07-06

Using Clustering Algorithms to Analyze Golf Shots from the U.S. Open

06-19

Differentiable Dynamic Programs and SparseMAP Inference

05-15

Multithreaded in the Wild

05-07

Multithreaded in the Wild

04-09

Notes on the Frank-Wolfe algorithm, Part I

03-20

Turning Water into Wine

03-13

Multithreaded in the Wild

03-02

Fast Company's 2018 World's Most Innovative Companies List

02-20

Weekly Review: 12/10/2017

12-10

A Visual Guide to Evolution Strategies

10-29

Data-Informed vs Data-Driven

11-20

Discussion of "Fast Approximate Inference for Arbitrarily Large Semiparametric Regression Models via Message Passing"

09-19

László Babai's New Proof

12-16

Adaptive data analysis

12-14

Conference on the Economics of Machine Intelligence-Dec 15

12-01

Mazes

11-29

Training models with unequal economic error costs using Amazon SageMaker

09-18

Can Lessons from Data Science Help Journalism?

06-27

Deep Learning Research Review Week 3: Natural Language Processing

01-10

Mazes

11-29

A tour of Factor: 2

05-27

A tour of Factor: 1

05-23

Mazes

11-29

Statistics in Glaucoma: Part I

12-03

Russian Roulette

02-06

Mazes

11-29

February 21st & 22nd: End-2-End from a Keras/TensorFlow model to production

01-07

2018 Year-in-Review: Machine Learning Open Source Projects & Frameworks

12-17

Keras vs. TensorFlow – Which one is better and which one should I learn?

10-08

Deploy a TensorFlow trained image classification model to AWS DeepLens

08-15

Getting Started with Sonnet, Deep Mind’s Deep Learning Library

04-10

Recurrent Neural Networks in Tensorflow I

07-11

Implementing a CNN for Text Classification in TensorFlow

12-11

The TensorFlow perspective on neural networks

11-30

Matrix Factorization in PyTorch

06-20

Quick reference to Python in a single script (and notebook)

10-13

The TensorFlow perspective on neural networks

11-30

The TensorFlow perspective on neural networks

11-30

The TensorFlow perspective on neural networks

11-30

Interactive association rules exploration app

11-30

2018.

12-31

November 2018: “Top 40” New Packages

12-21

Historic Wildfire Data: Exploratory Visualization in R

12-11

When the numbers don’t tell the whole story

10-24

✚ Chart Components and Working On Your Graphics Piece-wise

09-20

Top 8 Viz features in Excel 2016 !

01-02

Interactive association rules exploration app

11-30

What to do when your training and testing data come from different distributions

01-04

Icon making with ggplot2 and magick

01-03

R Packages worth a look

12-13

Introducing the New Zealand Trade Intelligence Dashboard

10-14

Interactive association rules exploration app

11-30

Maps of the issues mentioned most in election advertising

11-05

Conference on the Economics of Machine Intelligence-Dec 15

12-01

Scaling H2O analytics with AWS and p(f)urrr (Part 1)

01-06

Distilled News

01-05

10 More Must-See Free Courses for Machine Learning and Data Science

12-20

A parable regarding changing standards on the presentation of statistical evidence

12-06

Magister Dixit

12-05

10 Free Must-See Courses for Machine Learning and Data Science

11-08

Distilled News

11-07

KDnuggets™ News 18:n42, Nov 7: The Most in Demand Skills for Data Scientists; How Machines Understand Our Language: Intro to NLP

11-07

Coding Gradient boosted machines in 100 lines of code

11-05

Document worth reading: “A User’s Guide to Support Vector Machines”

11-03

Distilled News

11-02

Document worth reading: “Restricted Boltzmann Machines: Introduction and Review”

10-27

Distilled News

10-27

Machine Reading at Scale – Transfer Learning for Large Text Corpuses

10-17

Distilled News

10-16

GitHub Python Data Science Spotlight: High Level Machine Learning & NLP, Ensembles, Command Line Viz & Docker Made Easy

10-16

Distilled News

10-04

Distilled News

10-01

Dr. Data Show Video: Why Machine Learning Is the Coolest Science

10-01

Distilled News

08-25

AI, Machine Learning and Data Science Roundup: August 2018

08-17

Distilled News

08-14

Document worth reading: “Theoretical Impediments to Machine Learning With Seven Sparks from the Causal Revolution”

08-11

Document worth reading: “Examining the Use of Neural Networks for Feature Extraction: A Comparative Analysis using Deep Learning, Support Vector Machines, and K-Nearest Neighbor Classifiers”

08-08

Essential Tips and Tricks for Starting Machine Learning with Python

08-05

Recent top-selling books in AI and Machine Learning

07-31

What Is Machine Learning and How Is It Making Our World a Better Place?

06-23

Multi Armed Bandit

10-26

Future Debates: This House Believes An Artificial Intelligence will Benefit Society

02-29

Conference on the Economics of Machine Intelligence-Dec 15

12-01

Forbes: 25 Machine Learning Startups to Watch in 2018

08-26

Conference on the Economics of Machine Intelligence-Dec 15

12-01

Whats new on arXiv

12-14

Bootstrap Testing with MCHT

10-29

Maximized Monte Carlo Testing with MCHT

10-22

If you did not already know

09-12

If you did not already know

08-26

Distilled News

08-04

Exploring Line Lengths in Python Packages

11-09

Minimizing the Negative Log-Likelihood, in English

05-18

Normal Distributions

05-14

Hamiltonian Monte Carlo explained

12-19

conda-forge and PyData's CentOS moment

04-20

Histogram intersection for change detection

02-28

Common Probability Distributions: The Data Scientist’s Crib Sheet

12-03

Histogram intersection for change detection

02-28

Common Probability Distributions: The Data Scientist’s Crib Sheet

12-03

Common Probability Distributions: The Data Scientist’s Crib Sheet

12-03

Delayed Impact of Fair Machine Learning

05-17

Common Probability Distributions: The Data Scientist’s Crib Sheet

12-03

Four Techniques for Outlier Detection

12-06

Some Observations on Winsorization and Trimming

12-03

If you did not already know

10-23

Some Observations on Winsorization and Trimming

12-03

Using WSL Linux on Windows 10 for Deep Learning Development.

07-04

Using RSiteCatalyst With Microsoft PowerBI Desktop

03-13

Understanding rolling calculations in R

03-07

How to score 0.8134 in Titanic Kaggle Challenge

08-10

Set up Sublime Text for light-weight all-in-one data science IDE

12-23

Some Observations on Winsorization and Trimming

12-03

Timing the Same Algorithm in R, Python, and C++

01-06

Timing the Same Algorithm in R, Python, and C++

01-06

3 More Google Colab Environment Management Tips

01-02

9 Reasons Excel Users Should Consider Learning Programming

12-27

The Need for Speed Part 1: Building an R Package with Fortran (or C)

12-10

Shinyfit: Advanced regression modelling in a shiny app

12-07

India vs US – Kaggle Users & Data Scientists

11-05

5 “Clean Code” Tips That Will Dramatically Improve Your Productivity

10-15

Distilled News

09-04

R Objects

08-24

Azure Functions for Data Science

08-06

Why I Indent My Code 8 Spaces

07-27

Data Science Project Style Guide

07-09

Python Tutorial: Learn Python in one Day

11-28

Why Blog?

02-18

Give me five

12-04

Give me five

12-04

Give me five

12-04

gganimation for the nation

01-06

Introducing the First AI / Machine Learning Course With a Job Guarantee

11-30

The Decentralized Web

10-29

How to import a directory of csvs at once with base R and data.table. Can you guess which way is the fastest?

10-13

Up your open source game with Hacktoberfest at Locke Data!

10-01

✚ Visualization Away from the Computer, Developing Ideas, Bring in the Constraints

08-16

R Generation: 25 Years of R

08-01

PyDataLondon 2018 and “Creating Correct and Capable Classifiers”

04-30

Give me five

12-04

AlphaGo Zero Is Not A Sign of Imminent Human-Level AI

03-30

Decision Making and Diversity

11-15

Future of AI 6. Discussion of 'Superintelligence: Paths, Dangers, Strategies'

05-09

System Zero: What Kind of AI have we Created?

12-04

Roll Your Own Federal Government Shutdown-caused SSL Certificate Expiration Monitor in R

01-10

AI in Healthcare (With a case study)

01-10

Document worth reading: “A Review of Theoretical and Practical Challenges of Trusted Autonomy in Big Data”

12-04

Import AI: 123: Facebook sees demands for deep learning services in its data centers grow by 3.5X; why advanced AI might require a global policeforce; and diagnosing natural disasters with deep learning

12-03

Document worth reading: “Legible Normativity for AI Alignment: The Value of Silly Rules”

11-28

Anticipating the next move in data science – my interview with Thomson Reuters

11-17

Document worth reading: “Saliency Prediction in the Deep Learning Era: An Empirical Investigation”

11-16

The ultimate guide to starting AI

11-13

Melanie Miller says, “As someone who has worked in A.I. for decades, I’ve witnessed the failure of similar predictions of imminent human-level A.I., and I’m certain these latest forecasts will fall short as well. “

11-08

Melanie Mitchell says, “As someone who has worked in A.I. for decades, I’ve witnessed the failure of similar predictions of imminent human-level A.I., and I’m certain these latest forecasts will fall short as well. “

11-08

Why AI will not replace radiologists

11-01

How I Learned to Stop Worrying and Love Uncertainty

10-24

How AI Can Help Cope with Data Scientists’ Boredom

10-24

Distilled News

10-22

Distilled News

10-21

Ethical AI for Data Scientists

08-15

Artificial Intelligence in the Workplace

08-03

Document worth reading: “Attend Before you Act: Leveraging human visual attention for continual learning”

08-03

AI Lab: Learn to Code with the Cutting-Edge Microsoft AI Platform

06-19

Highlights of NAACL-HLT 2018: Generalization, Test-of-time, and Dialogue Systems

06-12

Shared Autonomy via Deep Reinforcement Learning

04-18

AlphaGo Zero Is Not A Sign of Imminent Human-Level AI

03-30

Natural and Artificial Intelligence

02-06

Using Artificial Intelligence to Augment Human Intelligence

12-04

Decision Making and Diversity

11-15

Future of AI 6. Discussion of 'Superintelligence: Paths, Dangers, Strategies'

05-09

Machine Learning is not BS in Monitoring

01-09

System Zero: What Kind of AI have we Created?

12-04

Distilled News

09-28

In statistics, we talk about uncertainty without it being viewed as undesirable

08-25

Reply-all loop

07-03

Summer of Data Science 2018

05-28

System Zero: What Kind of AI have we Created?

12-04

CES 2019

01-12

7 Reasons for Policy Professionals to Get Pumped About R Programming in 2019

01-07

Exploring the Gender Pay Gap with Publicly Available Data

12-12

The ultimate guide to starting AI

11-13

Divergent and Convergent Phases of Data Analysis

09-14

Old school

08-28

The Trillion Dollar Question

08-09

Understanding Latent Style

06-28

Recommender System With Implicit Feedback

11-18

Recommender System

10-30

How I was screwing up testing my code

10-15

The Two Tribes of Language Researchers

11-19

System Zero: What Kind of AI have we Created?

12-04

R Packages worth a look

12-08

If you did not already know

11-27

BI to AI: Getting Intelligent Insights to Everyone

10-18

Society of Machines: The Complex Interaction of Agents

10-04

Document worth reading: “Model-free, Model-based, and General Intelligence”

08-10

A quick tour of AI services in Azure

07-24

Free E-Book: A Developer’s Guide to Building AI Applications

06-04

Future of AI 5: The Singularians

05-09

Similar pages for Wikipedia

05-03

Quora Q&A Session Answers

03-09

Future Debates: This House Believes An Artificial Intelligence will Benefit Society

02-29

A Seasonal Test of AI

12-21

Does AI stand for Alchemical Intelligence?

12-14

OpenAI won't benefit humanity without data-sharing

12-14

System Zero: What Kind of AI have we Created?

12-04

Why is Keras Running So Slow?

12-05

How things float

06-20

Why is Keras Running So Slow?

12-05

Auto-Keras and AutoML: A Getting Started Guide

01-07

Keras Conv2D and Convolutional Layers

12-31

How to use Keras fit and fit_generator (a hands-on tutorial)

12-24

Keras Hyperparameter Tuning in Google Colab Using Hyperas

12-12

Deep Learning and Medical Image Analysis with Keras

12-03

Keras vs PyTorch:谁是「第一」深度学习框架?

06-30

XOR Revisited: Keras and TensorFlow

04-24

Recurrent Neural Networks for Churn Prediction

02-22

GPU-accelerated Theano & Keras with Windows 10

09-23

Why is Keras Running So Slow?

12-05

Alibaba acquires Data Artisans?

01-10

Top KDnuggets tweets, Jan 02-08: 10 Free Must-Read Books for Machine Learning and Data Science

01-09

Top Stories, Dec 24 – Jan 6: The Essence of Machine Learning; Papers with Code: A Fantastic GitHub Resource for Machine Learning

01-08

The Data Science Event You Need in 2019

01-07

Top KDnuggets tweets, Dec 19 – Jan 1: Deep Learning Cheat Sheets

01-02

World’s Biggest Deep Learning Summit 3 weeks away

12-27

Top Stories, Dec 17-23: Why You Shouldn’t be a Data Science Generalist; 10 More Must-See Free Courses for Machine Learning and Data Science

12-24

Custom JavaScript, CSS and HTML in Shiny

12-23

Blogdown – shortcode for radix-like Bibtex

12-21

How to Scrape Data from a JavaScript Website with R

12-20

Top Stories, Dec 10-16: Why You Shouldn’t be a Data Science Generalist; Machine Learning & AI Main Developments in 2018 and Key Trends for 2019

12-17

2018-13 Rendering HTML Content in R Graphics

12-16

Are you ready to tackle the data-driven revolution?

12-13

Top KDnuggets tweets, Dec 5-11: How to build a data science project from scratch; NeurIPS 2018 video talk collection

12-13

My introductory course on Bayesian statistics

12-12

Top Stories, Dec 3-9: Common mistakes when carrying out machine learning and data science; AI, Data Science, Analytics Main Developments in 2018 and Key Trends for 2019

12-10

It was twenty years ago …

12-08

confint3: 2-Sided Confidence Interval (Extended Moodle Version)

12-08

XGBoost on GPUs: Unlocking Machine Learning Performance and Productivity

12-07

Take a Look at Looker, Demo/Webinar Dec 13

12-07

Top Stories, Nov 26 – Dec 2: Deep Learning Cheat Sheets; A Complete Guide to Choosing the Best Machine Learning Course

12-03

Serve yourself. The Next-Generation of Data Analytics. Dec 6 Webinar

11-29

What Python editors or IDEs you used the most in 2018?

11-27

Top Stories, Nov 19-25: What is the Best Python IDE for Data Science?; Intro to Data Science for Managers

11-26

Top KDnuggets tweets, Nov 14-20: 10 Free Must-See Courses for Machine Learning and Data Science; Great list of

11-21

How Important is that Machine Learning Model be Understandable? We analyze poll results

11-19

Top Stories, Nov 12-18: What is the Best Python IDE for Data Science?; To get hired as a data scientist, don’t follow the herd

11-19

Top KDnuggets tweets, Nov 07-13: 10 Free Must-See Courses for Machine Learning and Data Science

11-14

Windows Clipboard Access with R

11-14

Top Stories, Nov 5-11: The Most in Demand Skills for Data Scientists; 10 Free Must-See Courses for Machine Learning and Data Science

11-13

7 Best Practices for Machine Learning on a Data Lake

11-07

Top Stories, Oct 29 – Nov 4: The Most in Demand Skills for Data Scientists; How Machines Understand Our Language

11-05

KDnuggets™ News 18:n41, Oct 31: Introduction to Deep Learning with Keras; Easy Named Entity Recognition with Scikit-Learn

10-31

New Poll: How Important is Understanding Machine Learning Models?

10-30

Top Stories, Oct 22-28: 9 Must-have skills you need to become a Data Scientist, updated; Named Entity Recognition and Classification with Scikit-Learn

10-29

Learn how to create data-driven marketing team

10-25

KDnuggets™ News 18:n40, Oct 24: Graphs Are The Next Frontier In Data Science; Apache Spark Intro for Beginners

10-24

Top KDnuggets tweets, Oct 17-23: Machine Learning Cheat Sheets

10-24

Top Stories, Oct 15-21: Graphs Are The Next Frontier In Data Science; The Main Approaches to Natural Language Processing Tasks

10-22

Start your journey into data science today

10-19

How to Solve the ModelOps Challenge

10-18

Top KDnuggets tweets, Oct 10-16: 7 Books to Grasp Mathematical Foundations of Data Science and Machine Learning; 6 Books Every Data Scientist Should Keep Nearby

10-17

KDnuggets™ News 18:n39, Oct 17: 10 Best Mobile Apps for Data Scientist; Vote in new poll: Largest dataset you analyzed?

10-17

New Poll: What was the largest dataset you analyzed / data mined?

10-12

KDnuggets™ News 18:n38, Oct 10: Concise Explanation of Learning Algorithms; Why I Call Myself a Data Scientist; Linear Regression in the Wild

10-10

Understand Why ODSC is the Most Recommended Conference for Applied Data Science

10-04

Top KDnuggets tweets, Sep 26 – Oct 2: Why building your own Deep Learning Computer is 10x cheaper than AWS; 6 Steps To Write Any Machine Learning Algorithm

10-03

Unleash a Faster Python on Your Data

10-02

Turn Whiteboard UX Sketches into Working HTML in Seconds – Introducing Sketch2Code

08-30

CRN: The 10 Coolest Machine-Learning And AI Startups Of 2018 (So Far)

07-16

Markdown Language Reference

11-24

Web scraping the President's lies in 16 lines of Python

07-27

Looking for exceptional postdoc candidates in Computational Social Sciences

09-09

Making Deep Networks Probabilistic via Test-time Dropout

06-17

Building a news search engine

01-21

Agnez, analytics for deep learning research

12-24

ICCV 2015: Twenty one hottest research papers

12-09

Making Deep Networks Probabilistic via Test-time Dropout

06-17

ICCV 2015: Twenty one hottest research papers

12-09

Top KDnuggets tweets, Jan 02-08: 10 Free Must-Read Books for Machine Learning and Data Science

01-09

Top Stories, Dec 24 – Jan 6: The Essence of Machine Learning; Papers with Code: A Fantastic GitHub Resource for Machine Learning

01-08

The Data Science Event You Need in 2019

01-07

Top KDnuggets tweets, Dec 19 – Jan 1: Deep Learning Cheat Sheets

01-02

World’s Biggest Deep Learning Summit 3 weeks away

12-27

Top Stories, Dec 17-23: Why You Shouldn’t be a Data Science Generalist; 10 More Must-See Free Courses for Machine Learning and Data Science

12-24

Top Stories, Dec 10-16: Why You Shouldn’t be a Data Science Generalist; Machine Learning & AI Main Developments in 2018 and Key Trends for 2019

12-17

Are you ready to tackle the data-driven revolution?

12-13

Top KDnuggets tweets, Dec 5-11: How to build a data science project from scratch; NeurIPS 2018 video talk collection

12-13

Top Stories, Dec 3-9: Common mistakes when carrying out machine learning and data science; AI, Data Science, Analytics Main Developments in 2018 and Key Trends for 2019

12-10

Take a Look at Looker, Demo/Webinar Dec 13

12-07

Top Stories, Nov 26 – Dec 2: Deep Learning Cheat Sheets; A Complete Guide to Choosing the Best Machine Learning Course

12-03

Serve yourself. The Next-Generation of Data Analytics. Dec 6 Webinar

11-29

What Python editors or IDEs you used the most in 2018?

11-27

Top Stories, Nov 19-25: What is the Best Python IDE for Data Science?; Intro to Data Science for Managers

11-26

Top KDnuggets tweets, Nov 14-20: 10 Free Must-See Courses for Machine Learning and Data Science; Great list of

11-21

How Important is that Machine Learning Model be Understandable? We analyze poll results

11-19

Top Stories, Nov 12-18: What is the Best Python IDE for Data Science?; To get hired as a data scientist, don’t follow the herd

11-19

Top KDnuggets tweets, Nov 07-13: 10 Free Must-See Courses for Machine Learning and Data Science

11-14

Top Stories, Nov 5-11: The Most in Demand Skills for Data Scientists; 10 Free Must-See Courses for Machine Learning and Data Science

11-13

7 Best Practices for Machine Learning on a Data Lake

11-07

Top Stories, Oct 29 – Nov 4: The Most in Demand Skills for Data Scientists; How Machines Understand Our Language

11-05

New Poll: How Important is Understanding Machine Learning Models?

10-30

Top Stories, Oct 22-28: 9 Must-have skills you need to become a Data Scientist, updated; Named Entity Recognition and Classification with Scikit-Learn

10-29

Learn how to create data-driven marketing team

10-25

Top KDnuggets tweets, Oct 17-23: Machine Learning Cheat Sheets

10-24

Top Stories, Oct 15-21: Graphs Are The Next Frontier In Data Science; The Main Approaches to Natural Language Processing Tasks

10-22

Start your journey into data science today

10-19

How to Solve the ModelOps Challenge

10-18

Top KDnuggets tweets, Oct 10-16: 7 Books to Grasp Mathematical Foundations of Data Science and Machine Learning; 6 Books Every Data Scientist Should Keep Nearby

10-17

Understand Why ODSC is the Most Recommended Conference for Applied Data Science

10-04

Top KDnuggets tweets, Sep 26 – Oct 2: Why building your own Deep Learning Computer is 10x cheaper than AWS; 6 Steps To Write Any Machine Learning Algorithm

10-03

Unleash a Faster Python on Your Data

10-02

CRN: The 10 Coolest Machine-Learning And AI Startups Of 2018 (So Far)

07-16

Looking for exceptional postdoc candidates in Computational Social Sciences

09-09

Making Deep Networks Probabilistic via Test-time Dropout

06-17

ICCV 2015: Twenty one hottest research papers

12-09

ICCV 2015: Twenty one hottest research papers

12-09

StanCon 2018 Helsinki talk slides, notebooks and code online

12-03

ICML 2019: Some Changes and Call for Papers

11-28

Why R? 2018 Conference – After Movie and Summary

11-07

Google, Microsoft & Fraunhofer at the First European Edition of Deep Learning World – 12 Nov, 2018

10-23

Graphs Are The Next Frontier In Data Science

10-18

Highlights from the useR! 2018 conference in Brisbane

07-18

Video: R for AI, and the Not Hotdog workshop

07-17

“Creating correct and capable classifiers” at PyDataAmsterdam 2018

05-26

Best practices with pandas (video series)

05-23

Evolution of active categorical image classification via saccadic eye movement

08-13

10 ways you might be able to tell when an area of research is undergoing rapid expansion and society's expectations may be somewhat unrealistic ...

12-13

ICCV 2015: Twenty one hottest research papers

12-09

Here are the most popular Python IDEs / Editors

12-07

Superbowl Helmet Puzzle

02-04

Voronoi Diagrams

05-12

Hamiltonian Monte Carlo

12-10

Hamiltonian Monte Carlo

12-10

What to do when your training and testing data come from different distributions

01-04

Introduction to Statistics for Data Science

12-17

Prior distributions for covariance matrices

12-10

R Packages worth a look

12-10

October 2018: “Top 40” New Packages

11-29

Distilled News

11-12

Bootstrap Testing with MCHT

10-29

Maximized Monte Carlo Testing with MCHT

10-22

R Packages worth a look

10-10

If you did not already know

08-26

R Packages worth a look

08-05

My Thoughts on Synthetic Data

06-27

ML beyond Curve Fitting: An Intro to Causal Inference and do-Calculus

05-24

An intuitive, visual guide to copulas

05-03

Minimizing the Negative Log-Likelihood, in English

05-18

Normal Distributions

05-14

What is the natural gradient, and how does it work?

12-30

Hamiltonian Monte Carlo explained

12-19

Respecting Boundaries with Inhomogeneous Kernels

11-29

Turning Distances into Distributions

09-19

Linear regression can be understood in many ways (optimization, probabilistic, bayesian)

07-20

Learning in Brains and Machines (2): The Dogma of Sparsity

04-07

Hamiltonian Monte Carlo

12-10

World map shows aerosol billowing in the wind

08-24

Machine learning applied to showers in the OPERA

06-24

Hamiltonian Monte Carlo

12-10

Ten Tips for Writing CS Papers, Part 2

12-10

Best Practices for Using Notebooks for Data Science

11-08

Ten Tips for Writing CS Papers, Part 2

12-10

The year in AI/Machine Learning advances: Xavier Amatriain 2018 Roundup

01-11

ML and NLP Publications in 2018

01-09

Supervised Learning: Model Popularity from Past to Present

12-28

Zak David expresses critical views of some published research in empirical quantitative finance

12-24

State of Deep Learning and Major Advances: H2 2018 Review

12-13

EMNLP 2018 Highlights: Inductive bias, cross-lingual learning, and more

11-07

Generative Adversarial Networks – Paper Reading Road Map

10-24

Short Article Reveals the Undeniable Facts About College Essay Writing Service and How It Can Affect You

10-04

Echo Chamber Incites Online Mob to Attack Math Profs

09-14

Mouse Among the Cats

09-11

Software as an academic publication

05-03

ICML Board and Reviewer profiles

03-05

AI and Deep Learning in 2017 – A Year in Review

12-31

Everything is a Model

12-13

The Last 5 Years In Deep Learning

12-04

At NIPS 2017

12-04

Two Recent Results in Transfer Learning for Music and Speech

11-01

COLT 2018 call for papers

10-24

My Qualifying Exam (Oral)

08-07

A Research to Engineering Workflow

06-03

Engineering is the bottleneck in (Deep Learning) Research

01-17

NLP and ML Publications – Looking Back at 2016

01-04

Properties of Interpretability

12-06

A Survival Guide to a PhD

09-07

A Beginner's Guide To Understanding Convolutional Neural Networks Part 2

07-29

Deep Learning Trends @ ICLR 2016

06-01

ICCV 2015, Day 2

12-15

ICCV 2015, Day 1

12-14

Ten Tips for Writing CS Papers, Part 2

12-10

New Year's Resolutions 2019

01-01

“We are reluctant to engage in post hoc speculation about this unexpected result, but it does not clearly support our hypothesis”

11-03

No juice for you, CSV format. It just makes you more awful.

09-23

Ten Tips for Writing CS Papers, Part 2

12-10

How Different are Conventional Programming and Machine Learning?

12-10

Synthetic Gradients with Tensorflow

04-08

Python Deep Learning tutorial: Create a GRU (RNN) in TensorFlow

08-27

Random Effects Neural Networks in Edward and Keras

06-15

Beyond Binary: Ternary and One-hot Neurons

02-08

Non-Zero Initial States for Recurrent Neural Networks

11-20

Recurrent Neural Networks in Tensorflow III - Variable Length Sequences

11-15

Deep reinforcement learning, battleship

10-15

TensorFlow in a Nutshell — Part Three: All the Models

10-03

Binary Stochastic Neurons in Tensorflow

09-24

TensorFlow in a Nutshell — Part Two: Hybrid Learning

09-13

TensorFlow in a Nutshell — Part One: Basics

08-22

RNNs in Tensorflow, a Practical Guide and Undocumented Features

08-21

Playing with convolutions in TensorFlow

08-09

Recurrent Neural Networks in Tensorflow II

07-25

Styles of Truncated Backpropagation

07-19

Recurrent Neural Networks in Tensorflow I

07-11

Deep Learning for Chatbots, Part 2 – Implementing a Retrieval-Based Model in Tensorflow

07-04

TensorFlow Implementation of "A Neural Algorithm of Artistic Style"

05-31

First Convergence Bias

04-11

Inverting a Neural Net

04-05

Representational Power of Deeper Layers

03-30

Implementing Batch Normalization in Tensorflow

03-29

Implementing a CNN for Text Classification in TensorFlow

12-11

“Thus, a loss aversion principle is rendered superfluous to an account of the phenomena it was introduced to explain.”

12-25

Handling Imbalanced Datasets in Deep Learning

12-04

If you did not already know

10-17

Online Hard Example Mining on PyTorch

10-22

Implementing a CNN for Text Classification in TensorFlow

12-11

Auto-Keras and AutoML: A Getting Started Guide

01-07

If you did not already know

12-02

OneR – fascinating insights through simple rules

11-25

OneR – fascinating insights through simple rules

11-24

Using Confusion Matrices to Quantify the Cost of Being Wrong

10-11

AI-Based Virtual Tutors – The Future of Education?

09-21

Document worth reading: “Quantizing deep convolutional networks for efficient inference: A whitepaper”

09-10

Sentiment Analysis model deployed!

04-17

Implementing a CNN for Text Classification in TensorFlow

12-11

What to do when your training and testing data come from different distributions

01-04

The Backpropagation Algorithm Demystified

01-02

Request for comments on planned features for futile.logger 1.5

12-15

Sales Forecasting Using Facebook’s Prophet

11-28

Watch out for naively (because implicitly based on flat-prior) Bayesian statements based on classical confidence intervals! (Comptroller of the Currency edition)

11-08

Master R shiny: One trick to build maintainable and scalable event chains

11-02

Understanding Regression Error Metrics

09-26

“Tweeking”: The big problem is not where you think it is.

09-23

R Packages worth a look

09-21

“We continuously increased the number of animals until statistical significance was reached to support our conclusions” . . . I think this is not so bad, actually!

09-04

The scandal isn’t what’s retracted, the scandal is what’s not retracted.

08-21

The scandal isn’t what’s retracted, the scandal is what’s not retracted.

08-21

Some web API package development lessons from HIBPwned

04-19

Guest Post – Learning R as an MBA Student

07-12

Facts and Fallacies of Software Engineering - Book Review

02-11

Tutorial: Deep Learning in PyTorch

01-15

RSiteCatalyst Version 1.4.7 (and 1.4.6.) Release Notes

02-01

Hamming Codes

01-30

Understanding the Pseudo-Truth as an Optimal Approximation

01-11

Estimating known unknowns

12-11

Estimating known unknowns

12-11

New Year's Resolutions 2019

01-01

Data Science Project Style Guide

07-09

Profiling Top Kagglers: Bestfitting, Currently

05-07

iPhone addiction? Get a grip!

02-06

New Year's Resolutions 2018

01-05

DynamoDB Learnings

10-23

The Real Story Behind Today's Referendum

06-23

Meet the Authors: “Data Analytics with Hadoop” from O’Reilly Media

03-01

Estimating known unknowns

12-11

10 ways you might be able to tell when an area of research is undergoing rapid expansion and society's expectations may be somewhat unrealistic ...

12-13

How to give money to the R project

12-11

Reduced privacy risk in exchange for accuracy in the Census count

12-06

10 ways you might be able to tell when an area of research is undergoing rapid expansion and society's expectations may be somewhat unrealistic ...

12-13

10 ways you might be able to tell when an area of research is undergoing rapid expansion and society's expectations may be somewhat unrealistic ...

12-13

Niall Ferguson and the perils of playing to your audience

12-05

KDnuggets™ News 18:n43, Nov 14: To get hired as a data scientist, don’t follow the herd; LinkedIn Top Voices in Data Science & Analytics

11-14

Distilled News

09-23

Last academic results

06-23

Reasons I left academia

02-12

10 ways you might be able to tell when an area of research is undergoing rapid expansion and society's expectations may be somewhat unrealistic ...

12-13

Adaptive data analysis

12-14

Adaptive data analysis

12-14

Adaptive data analysis

12-14

Whats new on arXiv

01-01

Whats new on arXiv

12-22

10 More Must-See Free Courses for Machine Learning and Data Science

12-20

Document worth reading: “Gaussian Processes and Kernel Methods: A Review on Connections and Equivalences”

12-16

If you did not already know

12-14

Learning Machine Learning vs Learning Data Science

12-11

Document worth reading: “A Short Introduction to Local Graph Clustering Methods and Software”

12-10

Distilled News

11-26

Cathy O’Neil discusses the current lack of fairness in artificial intelligence and much more.

11-26

Distilled News

11-24

Mastering The New Generation of Gradient Boosting

11-15

Distilled News

11-05

Document worth reading: “Closing the AI Knowledge Gap”

09-14

Unfolding Naïve Bayes From Scratch!

09-02

Goals of Interpretability

11-17

Stability as a foundation of machine learning

03-14

Adaptive data analysis

12-14

Data Links

06-03

OpenAI: A new non-profit AI company

12-20

OpenAI won't benefit humanity without data-sharing

12-14

Distilled News

01-07

Document worth reading: “Computational Power and the Social Impact of Artificial Intelligence”

12-27

Day 22 – little helper get_files

12-22

Day 20 – little helper char_replace

12-20

Whats new on arXiv

12-18

Day 17 – little helper to_na

12-17

Day 15 – little helper sci_palette

12-15

Day 14 – little helper print_fs

12-14

Document worth reading: “Small Sample Learning in Big Data Era”

12-14

Day 12 – little helper dive

12-12

Day 10 – little helper %nin%

12-10

Day 08 – little helper intersect2

12-08

Day 07 – little helper count_na

12-07

Day 06 – little helper statusbar

12-06

Day 02 – little helper na_omitlist

12-02

If you did not already know

11-27

Open Workshop: Data Visualization in R and ggplot2, January 25th in Munich

11-26

Document worth reading: “An Introduction to Probabilistic Programming”

11-12

How Can Autonomous Drones Help the Energy and Utilities Industry?

10-23

Document worth reading: “Review of Deep Learning”

10-19

BI to AI: Getting Intelligent Insights to Everyone

10-18

Society of Machines: The Complex Interaction of Agents

10-04

If you did not already know

09-11

Document worth reading: “Cogniculture: Towards a Better Human-Machine Co-evolution”

08-18

Announcing the Artificial Intelligence (AI) Hackathon: Build Intelligent Applications using machine learning APIs and serverless

08-15

Document worth reading: “Model-free, Model-based, and General Intelligence”

08-10

Recent top-selling books in AI and Machine Learning

07-31

3 Steps to Build Your First Intelligent App – Conference Buddy

07-31

Weekly Review: 12/03/2017

12-03

Deep Learning Dead-End?

09-17

Where will Artificial Intelligence come from?

04-20

A Seasonal Test of AI

12-21

Does AI stand for Alchemical Intelligence?

12-14

OpenAI won't benefit humanity without data-sharing

12-14

OpenAI won't benefit humanity without data-sharing

12-14

A New Library for Analyzing Time-Series Data with Apache Spark

12-14

A New Library for Analyzing Time-Series Data with Apache Spark

12-14

A New Library for Analyzing Time-Series Data with Apache Spark

12-14

Data, movies and ggplot2

12-19

Compare population age structures of Europe NUTS-3 regions and the US counties using ternary color-coding

12-03

Stereograms

11-26

Zero Counts in dplyr

11-19

11 Design Tips for Data Visualization

10-25

R Packages worth a look

10-03

Implement Simple Convolution with Java

09-27

What to Consider When Choosing Colors for Data Visualization

08-22

Generating Climate Temperature Spirals in Python

05-21

A particles-arly fun book draw

05-02

How analog TV worked

05-01

Deepcolor: automatic coloring and shading of manga-style lineart

03-01

Colorizing the DRAW Model

12-06

ICCV 2015, Day 1

12-14

ICCV 2015, Day 1

12-14

ICCV 2015, Day 1

12-14

AI in Healthcare (With a case study)

01-10

AI Gotchas (& How to Avoid Them)

01-08

Distilled News

01-05

Improve your AI and Machine Learning skills at AI NEXTCon in Seattle, Jan 23-27

01-03

Notebooks from the Practical AI Workshop

01-03

Your AI journey… and Happy Holidays!

12-20

AI, Machine Learning and Data Science Roundup: December 2018

12-19

Industry Predictions: AI, Machine Learning, Analytics & Data Science Main Developments in 2018 and Key Trends for 2019

12-18

Vanguard: Senior AI Architect [Malvern, PA]

12-17

In case you missed it: November 2018 roundup

12-14

Document worth reading: “AI Reasoning Systems: PAC and Applied Methods”

12-13

Distilled News

12-11

Let Automation Carry You from BI to AI in 2019

12-11

Join the World’s Biggest Deep Learning Summit – KDnuggets Early Cyber Monday

11-21

AI, Machine Learning and Data Science Roundup: November 2018

11-21

LinkedIn Top Voices 2018: Data Science & Analytics

11-13

AI for Good: slides and notebooks from the ODSC workshop

11-13

Melanie Miller says, “As someone who has worked in A.I. for decades, I’ve witnessed the failure of similar predictions of imminent human-level A.I., and I’m certain these latest forecasts will fall short as well. “

11-08

Melanie Mitchell says, “As someone who has worked in A.I. for decades, I’ve witnessed the failure of similar predictions of imminent human-level A.I., and I’m certain these latest forecasts will fall short as well. “

11-08

Document worth reading: “Artificial Intelligence for Long-Term Robot Autonomy: A Survey”

11-04

Distilled News

11-02

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: November and Beyond

11-01

Labeling Unstructured Text for Meaning to Achieve Predictive Lift

10-31

Fringe FM conversation on AI Ethics

10-30

AI Masterpieces: But is it Art?

10-27

The Final Data Science Roadshow is Just the Beginning

10-26

ITWire: VIDEO Interview with a DataRobot: Greg Michaelson talks AI, banking, machine learning and more

10-24

How Can Autonomous Drones Help the Energy and Utilities Industry?

10-23

Distilled News

10-16

Self-Service Analytics or Operationalization: Which Should I Implement?

10-16

How AI Will Change Healthcare

10-15

Learn the top things to look for in an AI Vendor

10-12

TEXATA Data Analytics Summit 2018 – Exclusive 30% KDnuggets Discount.

10-09

Understand Why ODSC is the Most Recommended Conference for Applied Data Science

10-04

“Snip Insights” – An Open Source Cross-Platform AI Tool for Intelligent Screen Capture

10-03

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: October and Beyond

10-03

TEXATA Data Analytics Summit 2018 – Exclusive 30% KDnuggets Discount

10-01

Distilled News

09-25

Dataquest helped me get my dream job at Noodle.ai

09-24

This New [AI] Software Constantly Improves – and that Makes all the Difference

09-21

How to Implement AI-First Business Models at Scale

09-21

Distilled News

09-19

Document worth reading: “Closing the AI Knowledge Gap”

09-14

Why Would Prosthetic Arms Need to See or Connect to Cloud AI?

09-10

Connected Arms – Can AI Revolutionize Prosthetic Devices & Make them More Affordable?

09-07

Distilled News

08-31

Ethical AI for Data Scientists

08-15

The Microsoft AI Idea Challenge – Breakthrough Ideas Wanted!

08-14

AI, Machine Learning and Data Science Roundup: July 2018

07-23

Video: R for AI, and the Not Hotdog workshop

07-17

Free E-Book: A Developer’s Guide to Building AI Applications

06-04

Microsoft Weekly Data Science News for May 18, 2018

05-18

AlphaGo Zero Is Not A Sign of Imminent Human-Level AI

03-30

Moving On, Looking Back

07-28

White House launches workshops to prepare for Artificial Intelligence

05-04

Does AI stand for Alchemical Intelligence?

12-14

Understanding the maths of Computed Tomography (CT) scans

01-09

Does AI stand for Alchemical Intelligence?

12-14

Object tracking with dlib

10-22

Learning Acrobatics by Watching YouTube

10-09

In case you missed it: August 2018 roundup

09-06

Build an automatic alert system to easily moderate content at scale with Amazon Rekognition Video

08-15

Import AI

06-05

9 new pandas updates that will save you time

01-25

How to launch your data science career (with Python)

07-12

Top content from two years of Data School

03-24

Becoming a Data Scientist Podcast Episode 03: Shlomo Argamon

01-18

Becoming A Data Scientist Podcast Episode 01: Will Kurt

12-21

Becoming A Data Scientist Podcast Episode 0: Me!

12-14

Fringe FM conversation on AI Ethics

10-30

What data scientists really do

08-21

Podcast Listens Analysis

10-02

Becoming a Data Scientist Podcast Episode 16: Randy Olson

03-22

Becoming a Data Scientist Podcast Episode 15: David Meza

01-30

Becoming a Data Scientist Podcast Episode 14: Jasmine Dumas

01-11

Becoming a Data Scientist Podcast Special Episode

11-14

Becoming a Data Scientist Podcast Episode 13: Debbie Berebichez

07-15

Becoming a Data Scientist Podcast Episode 12: Data Science Learning Club Members

06-15

Becoming a Data Scientist Podcast Episode 11: Stephanie Rivera

05-31

Becoming a Data Scientist Podcast Episode 10: Trey Causey

05-01

Becoming a Data Scientist Podcast Episode 09: Justin Kiggins

04-12

Becoming a Data Scientist Podcast Episode 08: Sebastian Raschka

03-29

Becoming a Data Scientist Podcast Episode 05: Clare Corthell

02-15

Becoming a Data Scientist Podcast Episode 03: Shlomo Argamon

01-18

Becoming a Data Scientist Podcast Episode 02: Safia Abdalla

01-04

Podcast Available on Stitcher

12-21

Becoming A Data Scientist Podcast Episode 01: Will Kurt

12-21

Becoming A Data Scientist Podcast Episode 0: Me!

12-14

Rotary

12-19

Becoming a Data Scientist Podcast Episode 16: Randy Olson

03-22

Becoming a Data Scientist Podcast Episode 15: David Meza

01-30

Becoming a Data Scientist Podcast Episode 14: Jasmine Dumas

01-11

Becoming a Data Scientist Podcast Episode 13: Debbie Berebichez

07-15

Becoming a Data Scientist Podcast Episode 10: Trey Causey

05-01

Becoming a Data Scientist Podcast Episode 09: Justin Kiggins

04-12

Becoming a Data Scientist Podcast Episode 08: Sebastian Raschka

03-29

Becoming a Data Scientist Podcast Episode 05: Clare Corthell

02-15

Becoming A Data Scientist Podcast Episode 0: Me!

12-14

Becoming a Data Scientist Podcast Episode 16: Randy Olson

03-22

Becoming a Data Scientist Podcast Episode 15: David Meza

01-30

Becoming a Data Scientist Podcast Episode 14: Jasmine Dumas

01-11

Becoming a Data Scientist Podcast Special Episode

11-14

Becoming A Data Scientist Podcast Episode 0: Me!

12-14

If you did not already know

12-01

TDM: From Model-Free to Model-Based Deep Reinforcement Learning

04-26

How to Solve a Problem In 3 Steps -- Define It, Redefine It, Repeat

08-29

Data Science Learning Club

12-14

Data Science Learning Club

12-14

Data Science Learning Club

12-14

A hex sticker wall, created with R

07-20

Data Science Learning Club

12-14

ICCV 2015, Day 2

12-15

ICCV 2015, Day 2

12-15

On receiving the Community Leadership Award at the NumFOCUS Summit 2018

11-11

Music for Data Scientists? Music by Data Scientists? …What…?!

10-17

ICCV 2015, Day 2

12-15

ICCV 2015, Day 4

12-16

ICCV 2015, Day 4

12-16

ML and NLP Publications in 2018

01-09

Improve your AI and Machine Learning skills at AI NEXTCon in Seattle, Jan 23-27

01-03

EARL conference recap: Seattle 2018

11-24

Join us at the EARL US Roadshow – a conference dedicated to the real-world usage of R

10-24

Speak at Mega-PAW Vegas 2019 – on Machine Learning Deployment (Apply by Nov 15)

10-22

A few upcoming R conferences

10-05

A few upcoming R conferences

10-05

PyImageConf 2018 Recap

10-01

3 Steps to Build Your First Intelligent App – Conference Buddy

07-31

ICCV 2015, Day 4

12-16

ICCV 2015, Day 4

12-16

The year in AI/Machine Learning advances: Xavier Amatriain 2018 Roundup

01-11

Autonomy – Do we have the choice?

11-21

What is DRAW (Deep Recurrent Attentive Writer)?

10-02

ICCV 2015, Day 4

12-16

Whats new on arXiv

12-10

Distilled News

11-08

Whats new on arXiv

11-07

Whats new on arXiv

09-28

If you did not already know

09-10

Whats new on arXiv

09-07

Whats new on arXiv

09-07

Whats new on arXiv

08-29

Whats new on arXiv

08-08

Whats new on arXiv

08-02

Weird Number Bases

12-16

Weird Number Bases

12-16

Weird Number Bases

12-16

ICCV 2015, Day 3

12-16

ICCV 2015, Day 3

12-16

ICCV 2015, Day 3

12-16

ICCV 2015, Day 3

12-16

László Babai's New Proof

12-16

Discussion of effects of growth mindset: Let’s not demand unrealistic effect sizes.

09-13

László Babai's New Proof

12-16

László Babai's New Proof

12-16

The most practical causal inference book I’ve read (is still a draft)

12-24

Hey! Here’s what to do when you have two or more surveys on the same population!

11-11

A Right to Reasonable Inferences

10-01

Joint inference or modular inference? Pierre Jacob, Lawrence Murray, Chris Holmes, Christian Robert discuss conditions on the strength and weaknesses of these choices

07-09

Discussion of "Fast Approximate Inference for Arbitrarily Large Semiparametric Regression Models via Message Passing"

09-19

A Year of Approximate Inference: Review of the NIPS 2015 Workshop

12-18

If you did not already know

01-03

If you did not already know

12-17

If you did not already know

09-20

A Year of Approximate Inference: Review of the NIPS 2015 Workshop

12-18

A Year of Approximate Inference: Review of the NIPS 2015 Workshop

12-18

R Packages worth a look

10-13

Three informal case studies: (1) Monte Carlo EM, (2) a new approach to C++ matrix autodiff with closures, (3) C++ serialization via parameter packs

08-17

Three informal case studies: (1) Monte Carlo EM, (2) a new approach to C++ matrix autodiff with closures, (3) C++ serialization via parameter packs

08-17

NIPS 2017 Workshop on Approximate Inference

09-25

Eiffel Tower

04-12

Understanding the Pseudo-Truth as an Optimal Approximation

01-11

A Year of Approximate Inference: Review of the NIPS 2015 Workshop

12-18

OpenAI: A new non-profit AI company

12-20

In case you missed it: December 2018 roundup

01-04

Day 04 – little helper evenstrings

12-04

How to work with strings in base R – An overview of 20+ methods for daily use.

11-24

WoRkshop in ToRonto

10-16

OpenAI: A new non-profit AI company

12-20

Artificial Intelligence to replace staff at O2

02-28

OpenAI: A new non-profit AI company

12-20

Demystifying Data Science

11-10

OpenAI: A new non-profit AI company

12-20

Becoming A Data Scientist Podcast Episode 01: Will Kurt

12-21

Becoming A Data Scientist Podcast Episode 01: Will Kurt

12-21

Becoming A Data Scientist Podcast Episode 01: Will Kurt

12-21

Join the World’s Biggest Deep Learning Summit – KDnuggets Early Cyber Monday

11-21

Blockchain applications in the Federal Government sector

10-17

Video: Azure Machine Learning in plain English

08-23

Podcast Available on Stitcher

12-21

Running an R script on heroku

12-06

AI for Good: slides and notebooks from the ODSC workshop

11-13

Cool tennis-tracking app

08-15

Cool tennis-tracking app

08-15

Google Calendar should prevent spam by default

02-22

Agnez, analytics for deep learning research

12-24

Podcast Available on Stitcher

12-21

A Seasonal Test of AI

12-21

A Seasonal Test of AI

12-21

5½ Reasons to Ditch Spreadsheets for Data Science: Code is Poetry

12-10

Autonomy – Do we have the choice?

11-21

If you did not already know

10-29

Document worth reading: “Weighted Abstract Dialectical Frameworks: Extended and Revised Report”

08-13

Document worth reading: “Foundations of Complex Event Processing”

08-04

Preliminary Note on the Complexity of a Neural Network

08-16

Becoming a Data Scientist Podcast Episode 10: Trey Causey

05-01

A Seasonal Test of AI

12-21

Who are the best MMA fighters of all time. A Bayesian study

12-22

Who are the best MMA fighters of all time. A Bayesian study

12-22

Who are the best MMA fighters of all time. A Bayesian study

12-22

Hackers beware: Bootstrap sampling may be harmful

01-07

Statistical Assessments of AUC

12-26

These 3 problems destroy many clinical trials (in context of some papers on problems with non-inferiority trials, or problems with clinical trials in general)

11-25

5 Critical Steps to Predictive Business Analytics

11-08

Watch out for naively (because implicitly based on flat-prior) Bayesian statements based on classical confidence intervals! (Comptroller of the Currency edition)

11-08

Is the answer to everything Gaussian?

10-29

Distilled News

09-28

Don’t calculate post-hoc power using observed estimate of effect size

09-24

“We continuously increased the number of animals until statistical significance was reached to support our conclusions” . . . I think this is not so bad, actually!

09-04

China air pollution regression discontinuity update

08-02

Type Safety and Statistical Computing

12-12

Who are the best MMA fighters of all time. A Bayesian study

12-22

ROCK 'n' ROLL TRAFFIC ROUTING, WITH NEO4J, PART 2

12-05

When Traditional Programming Meets Machine Learning

11-05

How Data Science Fueled the Largest Outreach Effort in the History of New York City

01-08

Who are the best MMA fighters of all time. A Bayesian study

12-22

Set up Sublime Text for light-weight all-in-one data science IDE

12-23

Using R: the best thing I’ve changed about my code in years

12-01

What is the natural gradient, and how does it work?

12-30

Oil Changes, Gas Mileage, and my Unreliable Gut

02-24

Set up Sublime Text for light-weight all-in-one data science IDE

12-23

Set up Sublime Text for light-weight all-in-one data science IDE

12-23

Django and Elastic Beanstalk, a perfect combination

11-28

Set up Sublime Text for light-weight all-in-one data science IDE

12-23

An R Shiny app to recognize flower species

12-17

Extract data from a PNG/TIFF

12-05

R Packages worth a look

12-02

Monitoring your cluster in just a few minutes using ISA

07-18

March journal club

12-24

Using httr to Detect HTTP(s) Redirects

11-06

March journal club

12-24

Import AI:

06-25

March journal club

12-24

Agnez, analytics for deep learning research

12-24

Introduction to Pandas, NumPy and RegEx in Python

12-17

R Packages worth a look

11-22

Kolmogorov and randomness

02-18

From Python Hero to Java Rockstar

06-30

Data Cleaning, Categorization and Normalization

01-30

A tour of Factor: 4

07-04

Agnez, analytics for deep learning research

12-24

The Best Programming Languages for Data Science and Machine Learning in 2018

09-20

Why Scala?

07-17

Be Like Water

01-19

Managing managed libraries with Scala and Eclipse

12-29

Managing managed libraries with Scala and Eclipse

12-29

How to use common workflows on Amazon SageMaker notebook instances

10-03

“Snip Insights” – An Open Source Cross-Platform AI Tool for Intelligent Screen Capture

10-03

Managing managed libraries with Scala and Eclipse

12-29

Automated and continuous deployment of Amazon SageMaker models with AWS Step Functions

01-10

The Role of the Data Engineer is Changing

01-10

4 Myths of Big Data and 4 Ways to Improve with Deep Data

01-09

Tribes.ai: Sr Data Scientist [Remote, India / Eastern Europe]

12-01

RQuantLib 0.4.6: Updated upstream, and calls for help

11-25

Building a Repository of Alpine-based Docker Images for R, Part I

11-04

Document worth reading: “The Three Pillars of Machine-Based Programming”

09-18

Not Hotdog: A Shiny app using the Custom Vision API

09-18

My steps into Data Science

05-21

Joining ASAPP

09-09

Is Data Scientist a useless job title?

08-04

Managing managed libraries with Scala and Eclipse

12-29

Managing managed libraries with Scala and Eclipse

12-29

Teaching and Learning Materials for Data Visualization

12-12

Use GitHub Vulnerability Alerts to Keep Users of Your R Packages Safe

11-14

crfsuite for natural language processing

10-29

rqdatatable: rquery Powered by data.table

06-03

Our R package roundup

12-31

Our R package roundup

12-30

I Spy with my Graphing Eye 📊 👁️

12-12

Create 3D County Maps Using Density as Z-Axis

11-29

The Bull Survived on Friday, but Barely

03-25

Our R package roundup

12-30

How to combine Multiple ggplot Plots to make Publication-ready Plots

01-12

Tutorial: Time Series Analysis with Pandas

01-10

Marketing analytics with greybox

01-07

Dow Jones Stock Market Index (2/4): Trade Volume Exploratory Analysis

01-07

I Spy with my Graphing Eye 📊 👁️

12-12

Shinyfit: Advanced regression modelling in a shiny app

12-07

Day 05 – little helper get_network

12-05

Automated Dashboard with various correlation visualizations in R

12-05

Creating and saving multiple plots to Powerpoint

11-30

Top 10 Python Data Science Libraries

11-16

Drawing beautiful maps programmatically with R, sf and ggplot2 — Part 3: Layouts

10-25

New Course: Visualization Best Practices in R

10-19

R Packages worth a look

10-09

Linear Regression in the Wild

10-03

How to graph a function of 4 variables using a grid

09-20

Distilled News

08-30

What is a Box Plot?

08-24

Python and Tidyverse

06-01

Generating Climate Temperature Spirals in Python

05-21

Python Matplotlib (pyplot), a step-by-step Tutorial

11-15

Cinderella science

08-05

WordPress to Jekyll: A 30x Speedup

10-10

Our R package roundup

12-30

These 3 problems destroy many clinical trials (in context of some papers on problems with non-inferiority trials, or problems with clinical trials in general)

11-25

Using OSX? Compiling an R package from source? Issues with ‘-fopenmp’? Try this.

11-18

R tip: Make Your Results Clear with sigr

11-04

R tip: Make Your Results Clear with sigr

11-04

R Packages worth a look

10-30

Watch Tiny Neural Nets Learn

03-04

First Steps With Neural Nets in Keras

03-04

The Mathematics Behind: Polynomial Curve Fitting (MATLAB)

01-20

21st Century C: Error 64 on OSX When Using Make

12-31

Sudoku Solver

12-30

Alternative approaches to scaling Shiny with RStudio Shiny Server, ShinyProxy or custom architecture.

12-18

LoyaltyOne: Associate Director, CPG [Westborough, MA]

12-17

LoyaltyOne: Manager, CPG [Westborough, MA]

12-14

Cummins: Data Engineering Apps and Solutions Architect [Columbus, IN]

12-13

WNS Hackathon Solutions by Top Finishers

12-13

Cummins: Advanced Analytics Platform Principle Engineer [Columbus, IN]

12-12

Self Avoiding Walks

12-08

My talk tomorrow (Tues) noon at the Princeton University Psychology Department

12-03

Rdew Valley: Optimizing Farming with R

11-14

Metadata Enrichment is Essential to Realize the Value of Open Datasets

11-14

Top Data Science Hacks

11-05

Top Data Science Hacks

11-05

If you did not already know

10-31

Document worth reading: “Data Curation with Deep Learning [Vision]: Towards Self Driving Data Curation”

10-13

Top 10 Mistakes to Avoid to Master Data Science

10-10

Top 10 Mistakes to Avoid to Master Data Science

10-04

Timing Column Indexing in R

09-21

Get started with automated metadata extraction using the AWS Media Analysis Solution

09-07

Pixm takes on phishing attacks with deep learning using Apache MXNet on AWS

08-28

Aella Credit empowers underbanked individuals by using Amazon Rekognition for identity verification

08-15

Thorn partners with Amazon Rekognition to help fight child sexual abuse and trafficking

08-04

rqdatatable: rquery Powered by data.table

06-03

A Visual Guide to Evolution Strategies

10-29

Square to Hex

03-11

Moscow Math Olympiad Puzzle

08-07

Linear regression can be understood in many ways (optimization, probabilistic, bayesian)

07-20

21st Century C: Error 64 on OSX When Using Make

12-31

Microsoft Weekly Data Science News for August 24, 2018

08-24

21st Century C: Error 64 on OSX When Using Make

12-31

Advent of Code: Most Popular Languages

12-15

R > Python: a Concrete Example

11-21

Exploring Models with lime

11-09

Sharing the Recipe for rOpenSci’s Unconf Ice Breaker

11-01

“Demystifying Data Science” remote notes

10-24

21st Century C: Error 64 on OSX When Using Make

12-31

21st Century C: Error 64 on OSX When Using Make

12-31

XmR Chart | Step-by-Step Guide by Hand and with R

01-13

The Five Best Data Visualization Libraries

01-07

‘data:’ Scraping & Chart Reproduction : Arrows of Environmental Destruction

01-03

Learn to do Data Viz in R

12-05

Lessons from posting a fake map about pies

11-28

Ask the Question, Visualize the Answer

10-17

Estimating Control Chart Constants with R

10-17

✚ This is Misleading, This is Not Really Misleading

10-04

The Chartmaker Directory: Data visualizations in every tool

08-24

Top 8 Viz features in Excel 2016 !

01-02

The Five Best Data Visualization Libraries

01-07

Data Notes: Malaria Detection with FastAI

01-03

My R take on Advent of Code – Day 1

12-17

Smartly select and mutate data frame columns, using dict

12-09

Must-Have Resources to Become a Data Scientist

12-06

Community Call – Governance strategies for open source research software projects

12-05

Distilled News

12-03

Designing a Self-Learning Tic-Tac-Toe Player

11-29

Using OSX? Compiling an R package from source? Issues with ‘-fopenmp’? Try this.

11-18

Data Notes: Impact of Game of Thrones on US Baby Names

11-15

RATest. A Randomization Tests package is available on CRAN

11-11

India vs US – Kaggle Users & Data Scientists

11-05

Data Notes: Chinese Tourism's Impact on Taiwan

11-01

How to be an Artificial Intelligence (AI) Expert?

10-29

RConsortium — Building an R Certification

10-26

automl package: part 2/2 first steps how to

10-24

How we use emojis

10-15

Data Notes: Are Those Honey Bees Healthy?

10-04

Data Notes: How Do Autoencoders Work?

09-20

Data Notes: The Secret to Getting to a Second Date

09-06

Data Notes: Drought and the War in Syria

08-23

Data Notes: From Hate Speech to Russian Troll Tweets

08-09

Data Notes: Winning Solutions of Kaggle Competitions

07-26

Data Notes: How to Forecast the S&P 500 with Prophet

07-12

What I’ve learned from competing in machine learning contests on Kaggle

07-06

Data Notes: Your smartphone knows *what*?

06-28

Why mere Machine Learning cannot predict Bitcoin price

12-18

Michael B. Cohen

09-28

Neurally Embedded Emojis

06-19

Top 8 Viz features in Excel 2016 !

01-02

Dreaming of a white Christmas – with ggmap in R

12-24

Re-creating a Voronoi-Style Map with R

12-22

Canada Map

12-09

If you did not already know

11-07

7 Awesome Things You Can Do in Dataiku Without Coding

11-02

Distilled News

10-27

Drawing beautiful maps programmatically with R, sf and ggplot2 — Part 3: Layouts

10-25

Drawing beautiful maps programmatically with R, sf and ggplot2 — Part 1: Basics

10-25

Getting started Stamen maps with ggmap

10-25

3-D shadow maps in R: the rayshader package

09-26

R Packages worth a look

09-12

Document worth reading: “Attend Before you Act: Leveraging human visual attention for continual learning”

08-03

Data Notes: Your smartphone knows *what*?

06-28

The Probability Monad and Why it's Important for Data Science

09-05

Top 8 Viz features in Excel 2016 !

01-02

Top 8 Viz features in Excel 2016 !

01-02

Faster garbage collection in pqR

11-30

Announcing Ursa Labs's partnership with NVIDIA

10-10

If you did not already know

08-23

Linked Lists

12-28

Deep Learning Research Review Week 3: Natural Language Processing

01-10

From Arrow to pandas at 10 Gigabytes Per Second

12-27

Attention and Augmented Recurrent Neural Networks

09-08

Recurrent Neural Networks for Beginners

08-13

LSTMs

06-04

Feather and Apache Arrow: Grokking file formats vs. in-memory representations

04-21

Large Data with Scikit-learn - Boston Meetup

03-16

Attention and Memory in Deep Learning and NLP

01-03

Colorizing the DRAW Model

12-06

Attention and Memory in Deep Learning and NLP

01-03

R Packages worth a look

09-20

Amazon Translate now available in the Memsource translation management system

08-14

If you did not already know

08-12

Attention and Memory in Deep Learning and NLP

01-03

Power your website with on-demand translated reviews using Amazon Translate

12-20

Introducing Amazon Translate Custom Terminology

11-28

Amazon Translate now offers 113 new language pairs

10-29

Amazon Translate now available in the Memsource translation management system

08-14

The Real Problems with Neural Machine Translation

07-21

Attention and Memory in Deep Learning and NLP

01-03

Document worth reading: “Learning to Succeed while Teaching to Fail: Privacy in Closed Machine Learning Systems”

08-10

Creating a PageRank Analytics Platform Using Spring Boot Microservices

01-03

Creating a PageRank Analytics Platform Using Spring Boot Microservices

01-03

gganimate has transitioned to a state of release

01-03

Comparison of the Top Speech Processing APIs

12-28

How to build a data science project from scratch

12-05

Lifecycle configuration update for Amazon SageMaker notebook instances

11-06

Model Server for Apache MXNet v1.0 released

10-31

R Packages worth a look

08-24

Make R speak

08-16

Distilled News

08-13

Some web API package development lessons from HIBPwned

04-19

Time Series for Spark: 0.2.0 Released

01-22

Creating a PageRank Analytics Platform Using Spring Boot Microservices

01-03

pinp 0.0.7: More small YAML options

01-11

Characterizing Online Public Discussions through Patterns of Participant Interactions

11-11

The “Carl Sagan effect”

07-16

The Ponzi threshold and the Armstrong principle

07-02

Google's NHS deal does not bode well for the future of data-sharing

05-05

Creating a PageRank Analytics Platform Using Spring Boot Microservices

01-03

Becoming a Data Scientist Podcast Episode 02: Safia Abdalla

01-04

Becoming a Data Scientist Podcast Episode 02: Safia Abdalla

01-04

Advanced Jupyter Notebooks: A Tutorial

01-02

R Packages worth a look

12-13

Here are the most popular Python IDEs / Editors

12-07

Technoslavia 2.5: Open Source Topography

11-07

The Economist's Big Mac Index is calculated with R

10-12

The Economist's Big Mac Index is calculated with R

10-12

Distributed Deep Learning with Polyaxon

03-18

Jupyter notebooks and tensorboard on Polyaxon

03-04

Setting up Jupyter for Deep Learning on EC2

02-15

Top 8 resources for learning data analysis with pandas

05-16

Becoming a Data Scientist Podcast Episode 02: Safia Abdalla

01-04

Generative King of Kowloon

01-05

Generative King of Kowloon

01-05

R Packages worth a look

01-11

R Packages worth a look

11-23

More on Bias Corrected Standard Deviation Estimates

11-14

Document worth reading: “A Tutorial on Modeling and Inference in Undirected Graphical Models for Hyperspectral Image Analysis”

11-09

R Packages worth a look

10-06

“If you deprive the robot of your intuition about cause and effect, you’re never going to communicate meaningfully.” – Pearl ’18

06-08

Martingales

10-20

The Policy Gradient

06-16

Generative King of Kowloon

01-05

Hexagon Geometry Puzzle

06-27

Bisecting a triangular cake

05-09

Koch Snowflake

01-05

Attractive Mathematical Properties Of The Roc Curve

09-27

Koch Snowflake

01-05

Koch Snowflake

01-05

Koch Snowflake

01-05

Statistics in Glaucoma: Part I

12-03

R Packages worth a look

10-06

Build this media monitoring Slack bot in 20 minutes without writing code

07-04

Last academic results

06-23

GANs are Broken in More than One Way: The Numerics of GANs

10-05

Grazing and Calculus

02-29

How Data Science Fueled the Largest Outreach Effort in the History of New York City

01-08

R Packages worth a look

12-20

Matching (and discarding non-matches) to deal with lack of complete overlap, then regression to adjust for imbalance between treatment and control groups

11-10

When Traditional Programming Meets Machine Learning

11-05

How Data Science Fueled the Largest Outreach Effort in the History of New York City

01-08

How Data Science Fueled the Largest Outreach Effort in the History of New York City

01-08

How Data Science Fueled the Largest Outreach Effort in the History of New York City

01-08

R Packages worth a look

01-11

Dataviz Course Packet Quickstart

01-02

“Principles of posterior visualization”

01-01

2018.

12-31

Best Data Visualization Projects of 2018

12-27

Teaching kids data visualization

11-29

R Packages worth a look

11-26

✚ Chart Components and Working On Your Graphics Piece-wise

09-20

✚ Visualization Away from the Computer, Developing Ideas, Bring in the Constraints

08-16

Life-cycle of a Data Science Project

05-18

Ffa1ea00fdab31b3b44b87839c503629

05-06

WordPress to Jekyll: A 30x Speedup

10-10

IMDB Data Visualizations with D3 + Dimple.js

08-10

Who's at the Center of the Star Trek Universe?

07-22

Machine Learning is not BS in Monitoring

01-09

Machine Learning is not BS in Monitoring

01-09

Document worth reading: “Machine Learning in Official Statistics”

01-11

Alibaba acquires Data Artisans?

01-10

Document worth reading: “Recent Research Advances on Interactive Machine Learning”

01-05

Magister Dixit

01-03

My book ‘Practical Machine Learning in R and Python: Third edition’ on Amazon

01-02

The Essence of Machine Learning

12-28

Does imputing model labels using the model predictions can improve it’s performance?

12-21

Industry Predictions: AI, Machine Learning, Analytics & Data Science Main Developments in 2018 and Key Trends for 2019

12-18

Day 12 – little helper dive

12-12

P&G: Data Scientist – Machine Learning/NLP [Cincinnati, OH]

12-11

Learning Machine Learning vs Learning Data Science

12-11

Day 10 – little helper %nin%

12-10

Day 07 – little helper count_na

12-07

Day 02 – little helper na_omitlist

12-02

The Future of AI is the Enterprise

11-30

A Complete Guide to Choosing the Best Machine Learning Course

11-30

Document worth reading: “An Introductory Survey on Attention Mechanisms in NLP Problems”

11-28

R now supported in Azure SQL Database

11-28

Amazon Launches Machine Learning University

11-27

Distilled News

11-26

Distilled News

11-24

Document worth reading: “Learning From Positive and Unlabeled Data: A Survey”

11-23

Magister Dixit

11-23

Distilled News

11-14

T-mobile uses R for Customer Service AI

11-09

T-mobile uses R for Customer Service AI

11-09

“Statistical and Machine Learning forecasting methods: Concerns and ways forward”

11-06

Distilled News

11-05

R Packages worth a look

10-29

Please vote

10-29

Top KDnuggets tweets, Oct 17-23: Machine Learning Cheat Sheets

10-24

Distilled News

10-15

Job: Postdoctoral Researcher in Small Data Deep Learning and Explainable Machine Learning, Livermore, CA

10-08

In case you missed it: September 2018 roundup

10-03

AI, Machine Learning and Data Science Roundup: September 2018

09-20

Magister Dixit

08-31

If you did not already know

08-02

Facilitate Proactive Cybersecurity Operations with Big Data Analytics and Machine Intelligence

07-30

My steps into Data Science

05-21

Can a Machine Be Racist or Sexist?

04-16

Occam razor vs. machine learning

07-12

Machine Learning is not BS in Monitoring

01-09

Notes on the Frank-Wolfe Algorithm, Part II: A Primal-dual Analysis

11-14

Intro to Implicit Matrix Factorization: Classic ALS with Sketchfab Models

10-19

Explicit Matrix Factorization: ALS, SGD, and All That Jazz

01-09

What is the natural gradient, and how does it work?

12-30

Explicit Matrix Factorization: ALS, SGD, and All That Jazz

01-09

A Guide to Decision Trees for Machine Learning and Data Science

12-24

Building Blocks of Decision Tree

11-26

“Using numbers to replace judgment”

11-17

Turn data into revenue. Wharton can show you how.

11-06

Explainable ML versus Interpretable ML

10-30

Machine Learning Basics – Random Forest

10-30

Cassie Kozyrkov discusses decision making and decision intelligence!

10-22

Distilled News

10-14

Response to Rafa: Why I don’t think ROC [receiver operating characteristic] works as a model for science

08-05

Defining data science in 2018

07-22

Why AI Isn’t A Black Box (And Its Business Value)

07-17

Can Lessons from Data Science Help Journalism?

06-27

3 Things We Can Do About Fake News

05-18

Hacking A Hackaton

04-30

Cognitive Machine Learning (2): Uncertain Thoughts

03-12

The Fair Price to Pay a Spy: An Introduction to the Value of Information

01-09

Document worth reading: “A second-quantised Shannon theory”

12-20

The Role of Theory in Data Analysis

12-11

Melanie Miller says, “As someone who has worked in A.I. for decades, I’ve witnessed the failure of similar predictions of imminent human-level A.I., and I’m certain these latest forecasts will fall short as well. “

11-08

Melanie Mitchell says, “As someone who has worked in A.I. for decades, I’ve witnessed the failure of similar predictions of imminent human-level A.I., and I’m certain these latest forecasts will fall short as well. “

11-08

Document worth reading: “Fractal AI: A fragile theory of intelligence”

10-22

A psychology researcher uses Stan, multiverse, and open data exploration to explore human memory

09-21

If you did not already know

09-18

If you did not already know

08-18

The fallacy of the excluded middle — statistical philosophy edition

08-18

The fallacy of the excluded middle — statistical philosophy edition

08-18

Document worth reading: “A Reliability Theory of Truth”

08-02

The Fair Price to Pay a Spy: An Introduction to the Value of Information

01-09

The Role of Theory in Data Analysis

12-11

If you did not already know

09-18

The fallacy of the excluded middle — statistical philosophy edition

08-18

The fallacy of the excluded middle — statistical philosophy edition

08-18

Document worth reading: “A Reliability Theory of Truth”

08-02

The Fair Price to Pay a Spy: An Introduction to the Value of Information

01-09

Document worth reading: “Big Data and Fog Computing”

11-29

If you did not already know

11-27

Open Source Deep Dive with Olivier Grisel

10-29

Why you need GPUs for your deep learning platform

10-16

Data Center Scale Computing and Artificial Intelligence with Matei Zaharia, Inventor of Apache Spark

09-12

New speed record set for training deep learning models on AWS

08-22

The Two Tribes of Language Researchers

11-19

The Fair Price to Pay a Spy: An Introduction to the Value of Information

01-09

Magister Dixit

01-09

How Different are Conventional Programming and Machine Learning?

12-10

Document worth reading: “Big Data and Fog Computing”

11-29

Document worth reading: “Resource Management in Fog/Edge Computing: A Survey”

10-30

Open Source Deep Dive with Olivier Grisel

10-29

Data Center Scale Computing and Artificial Intelligence with Matei Zaharia, Inventor of Apache Spark

09-12

R Packages worth a look

08-30

If you did not already know

08-25

Document worth reading: “Fog Computing: Survey of Trends, Architectures, Requirements, and Research Directions”

08-22

The Trillion Dollar Question

08-09

Quantum Computing: Cats, Crushes, and Chemistry

07-30

5 Tips To Learn Machine Learning

06-17

Moravec's Paradox

01-31

Weekly Review: 11/11/2017

11-11

The Fair Price to Pay a Spy: An Introduction to the Value of Information

01-09

Life in Madrid seen through BiciMAD

10-10

CES 2016

01-11

iPhone addiction? Get a grip!

02-06

Making Smart Phones Dumb Again

09-07

CES 2016

01-11

CES 2019

01-12

RTutor: Driving Electric or Gasoline Cars? Comparing the Pollution Damages

11-21

Hitchhiker’s guide to Used Car Prices Estimation

12-04

CES 2016

01-11

Document worth reading: “Saliency Prediction in the Deep Learning Era: An Empirical Investigation”

11-16

If you did not already know

10-21

Announcing the Artificial Intelligence (AI) Hackathon: Build Intelligent Applications using machine learning APIs and serverless

08-15

CES 2016

01-11

R Packages worth a look

10-13

Document worth reading: “Radial Basis Function Approximations: Comparison and Applications”

08-15

Eiffel Tower

04-12

Understanding the Pseudo-Truth as an Optimal Approximation

01-11

Understanding the Pseudo-Truth as an Optimal Approximation

01-11

If you did not already know

12-30

R Packages worth a look

12-27

Part 3: Two more implementations of optimism corrected bootstrapping show shocking bias

12-27

Four Real-Life Machine Learning Use Cases

12-13

Document worth reading: “A Theory of Diagnostic Interpretation in Supervised Classification”

12-08

Shinyfit: Advanced regression modelling in a shiny app

12-07

Linear Regression in Real Life

11-05

How to create useful features for Machine Learning

10-30

Sketchnotes from TWiML&AI: Evaluating Model Explainability Methods with Sara Hooker

10-12

R Packages worth a look

10-12

If you did not already know

09-21

R Packages worth a look

09-19

Welcome to Dataiku University!

09-07

“We continuously increased the number of animals until statistical significance was reached to support our conclusions” . . . I think this is not so bad, actually!

09-04

R Tip: Put Your Values in Columns

08-30

World map shows aerosol billowing in the wind

08-24

If you did not already know

08-23

Multiple Linear Regression & Assumptions of Linear Regression: A-Z

08-17

Should the points in this scatterplot be binned?

07-11

Deep Learning Vendor Update: Hyperparameter Tuning Systems

06-29

PyDataLondon 2018 and “Creating Correct and Capable Classifiers”

04-30

Performance metrics aren't everything

02-09

PyDataBudapest and “Machine Learning Libraries You’d Wish You’d Known About”

11-15

Hard Examples Mining in Keras

10-22

XOR Revisited: Keras and TensorFlow

04-24

A fastText-based hybrid recommender

09-27

GPU-accelerated Theano & Keras with Windows 10

09-23

Sales Automation Through a Deep Learning Platform

09-22

TensorFlow Implementation of "A Neural Algorithm of Artistic Style"

05-31

Watch Tiny Neural Nets Learn

03-04

Understanding the Pseudo-Truth as an Optimal Approximation

01-11

RTest: pretty testing of R packages

01-07

Part 2: Optimism corrected bootstrapping is definitely bias, further evidence

12-26

Optimism corrected bootstrapping: a problematic method

12-25

covrpage, more information on unit testing

12-10

Bringing Machine Learning Research to Product Commercialization

11-27

Chocolate milk! Another stunning discovery from an experiment on 24 people!

11-13

RATest. A Randomization Tests package is available on CRAN

11-11

Model evaluation, model selection, and algorithm selection in machine learning

11-10

5 Critical Steps to Predictive Business Analytics

11-08

Building a Repository of Alpine-based Docker Images for R, Part I

11-04

shinytest – Automated testing for Shiny apps

10-18

5 “Clean Code” Tips That Will Dramatically Improve Your Productivity

10-15

5 Reasons Why You Should Use Cross-Validation in Your Data Science Projects

10-02

Testing code with random output

08-06

Two things about power

05-14

JUnit,Integration,End to End Tests

10-22

Facts and Fallacies of Software Engineering - Book Review

02-11

Ten Ways Your Data Project is Going to Fail

11-01

Defective Circuit Board Puzzle

01-14

Defective Circuit Board Puzzle

01-14

Defective Circuit Board Puzzle

01-14

Mini AI app using TensorFlow and Shiny

01-15

Statistical Assessments of AUC

12-26

Optimism corrected bootstrapping: a problematic method

12-25

What's the future of the pandas library?

12-12

Automated Dashboard Visualizations with Ranking in R

12-07

Plotting Scottish census data with some tidyverse magic

11-28

Project planning with plotly

11-26

Beautiful Chaos: The Double Pendulum

11-22

Zero Counts in dplyr

11-19

K-means clustering with Amazon SageMaker

11-08

Named Entity Recognition and Classification with Scikit-Learn

10-25

Beginner Data Visualization & Exploration Using Pandas

10-22

Analyzing English Team of the Year Data Since 1973

10-18

Examining Inter-Rater Reliability in a Reality Baking Show

10-18

I fell out with tapply and in love with dplyr

10-15

Monotonic Binning with Equal-Sized Bads for Scorecard Development

10-14

How to import a directory of csvs at once with base R and data.table. Can you guess which way is the fastest?

10-13

Using a Column as a Column Index

09-21

Scale out your Pandas DataFrame operations using Dask

08-05

A glass shattering book draw with gganimate

08-01

The Bull Survived on Friday, but Barely

03-25

WordPress to Jekyll: A 30x Speedup

10-10

Feather: A Fast On-Disk Format for Data Frames for R and Python, powered by Apache Arrow

03-29

Mini AI app using TensorFlow and Shiny

01-15

Mini AI app using TensorFlow and Shiny

01-15

My Top 10% Solution for Kaggle Rossman Store Sales Forecasting Competition

01-16

R plus Magento 2 REST API revisited: part 3 – more complex samples of use

12-02

Understanding Regression Error Metrics

09-26

My Top 10% Solution for Kaggle Rossman Store Sales Forecasting Competition

01-16

How AI Will Change Brick-and-Mortar Retail in 2019

12-26

My Top 10% Solution for Kaggle Rossman Store Sales Forecasting Competition

01-16

Dr. Data Show Video: Five Reasons Computers Predict When You’ll Die

12-08

Data Science Projects Employers Want To See: How To Show A Business Impact

12-04

Top 5 domains Big Data analytics helps to transform

11-23

How I Learned to Stop Worrying and Love Uncertainty

10-24

Logistic Regression: Concept & Application

09-03

Performance metrics aren't everything

02-09

AutoML on AWS

12-04

House Price Prediction using a Random Forest Classifier

11-29

Where Predictive Modeling Goes Astray

01-27

Artificial Neural Networks Introduction (Part II)

11-03

Random forest interpretation – conditional feature contributions

10-24

k-Nearest Neighbors & Anomaly Detection Tutorial

09-14

Random Forest Tutorial: Predicting Crime in San Francisco

08-25

My Top 10% Solution for Kaggle Rossman Store Sales Forecasting Competition

01-16

AI and ML Futures 3: The Trojan Wars of Machine Learning

01-17

AI and ML Futures 3: The Trojan Wars of Machine Learning

01-17

AI and ML Futures 3: The Trojan Wars of Machine Learning

01-17

R Packages worth a look

10-10

AI and ML Futures 3: The Trojan Wars of Machine Learning

01-17

AI and ML Futures 3: The Trojan Wars of Machine Learning

01-17

AI and ML Futures 2: The Quiet Revolution

01-17

AI and ML Futures 2: The Quiet Revolution

01-17

“discover feature relationships” – new EDA tool

01-10

Top December Stories: Why You Shouldn’t be a Data Science Generalist

01-09

If you did not already know

01-08

Distilled News

01-07

Scaling H2O analytics with AWS and p(f)urrr (Part 1)

01-06

Supervised Learning: Model Popularity from Past to Present

12-28

Document worth reading: “Computational Power and the Social Impact of Artificial Intelligence”

12-27

10 More Must-See Free Courses for Machine Learning and Data Science

12-20

The importance of Data Analytics skills in today’s MBA roles

12-19

2018 Year-in-Review: Machine Learning Open Source Projects & Frameworks

12-17

Why You Shouldn’t be a Data Science Generalist

12-14

Distilled News

12-14

Distilled News

12-11

Magister Dixit

12-05

Distilled News

12-02

How to Build a Machine Learning Team When You Are Not Google or Facebook

11-28

Whats new on arXiv

11-27

Machine Learning. In conversation with Jelena Ilic, Senior Data Scientist at Mango Solutions

11-21

Distilled News

11-16

Distilled News

11-08

Distilled News

11-07

Webinar: Transform Your Stagnant Data Swamp into a Pristine Data Lake, Nov 8

11-01

Tomorrow, Nov 8 Webinar: Transform Your Stagnant Data Swamp into a Pristine Data Lake

11-01

Document worth reading: “Machine Learning for Wireless Networks with Artificial Intelligence: A Tutorial on Neural Networks”

10-28

How to Define a Machine Learning Problem Like a Detective

10-22

Dr. Data Show Video: How Can You Trust AI?

10-20

GitHub Python Data Science Spotlight: High Level Machine Learning & NLP, Ensembles, Command Line Viz & Docker Made Easy

10-16

Reinforcement Learning: Super Mario, AlphaGo and beyond

10-01

Document worth reading: “Importance of the Mathematical Foundations of Machine Learning Methods for Scientific and Engineering Applications”

09-30

Your Guide to AI and Machine Learning at re:Invent 2018

09-27

Distilled News

09-19

In case you missed it: August 2018 roundup

09-06

Document worth reading: “PMLB: A Large Benchmark Suite for Machine Learning Evaluation and Comparison”

08-31

Document worth reading: “Theoretical Impediments to Machine Learning With Seven Sparks from the Causal Revolution”

08-11

Distilled News

08-11

DIFFERENCE BETWEEN DATA SCIENCE, DATA ANALYTICS AND MACHINE LEARNING

08-09

AI Meets Mail Processing (Automation for Admin Tasks)

08-09

Why AI Isn’t A Black Box (And Its Business Value)

07-17

5 Tips To Learn Machine Learning

06-17

Data professional definitions: Data analyst vs data scientist vs data engineer

12-14

PyConUK 2017, PyDataCardiff and “Machine Learning Libraries You’d Wish You’d Known About”

11-05

Don't Panic: Deep Learning will be Mostly Harmless

11-29

Becoming a Data Scientist Podcast Episode 11: Stephanie Rivera

05-31

Sheffield University Life

04-05

Quora Q&A Session Answers

03-09

Guide to an in-depth understanding of logistic regression

02-22

AI and ML Futures 2: The Quiet Revolution

01-17

AI and ML Futures 1: Background

01-17

AI and ML Futures 1: Background

01-17

Waffle House index as a storm indicator

09-14

Waffle House index as a storm indicator

09-14

Podcast Special Episode 2 – The Future of AI with Dr. Ed Felten

12-29

White House launches workshops to prepare for Artificial Intelligence

05-04

AI and ML Futures 1: Background

01-17

“Do you have any recommendations for useful priors when datasets are small?”

12-11

Federated Learning: Machine Learning with Privacy on the Edge

10-29

Outside a train rumbles by

09-09

AI and ML Futures 1: Background

01-17

Becoming a Data Scientist Podcast Episode 03: Shlomo Argamon

01-18

Becoming a Data Scientist Podcast Episode 03: Shlomo Argamon

01-18

Top 10 Books on NLP and Text Analysis

01-09

R 3.5.2 now available

12-20

Power your website with on-demand translated reviews using Amazon Translate

12-20

If you did not already know

11-17

Report from the Enterprise Applications of the R Language conference

11-16

Report from the Enterprise Applications of the R Language conference

11-16

T-mobile uses R for Customer Service AI

11-09

T-mobile uses R for Customer Service AI

11-09

Melanie Miller says, “As someone who has worked in A.I. for decades, I’ve witnessed the failure of similar predictions of imminent human-level A.I., and I’m certain these latest forecasts will fall short as well. “

11-08

Melanie Mitchell says, “As someone who has worked in A.I. for decades, I’ve witnessed the failure of similar predictions of imminent human-level A.I., and I’m certain these latest forecasts will fall short as well. “

11-08

How Data Science (+ Friends) Helped Me Learn French

11-01

If you did not already know

10-12

Document worth reading: “Automatic Language Identification in Texts: A Survey”

09-20

In case you missed it: August 2018 roundup

09-06

If you did not already know

08-17

Recently in the sister blog

07-24

Markdown Language Reference

11-24

My Video Game Playlists in Japanese for Immersion

06-07

Becoming a Data Scientist Podcast Episode 03: Shlomo Argamon

01-18

Introduction to Semi-Supervised Learning with Ladder Networks

01-19

Be Like Water

01-19

Be Like Water

01-19

Are you buying an apartment? How to hack competition in the real estate market

10-26

Be Like Water

01-19

Comparison of the Text Distance Metrics

01-07

Comparison of the Top Speech Processing APIs

12-28

Clustering the Bible

12-27

Text classification with tidy data principles

12-24

If you did not already know

12-03

Data Representation for Natural Language Processing Tasks

11-02

Multi-Class Text Classification Model Comparison and Selection

11-01

AI ‘judge’ doesn’t explain why it reaches certain decisions

10-24

Solving Real-Life Mysteries with Big Data and Apache Spark

09-13

Learn How To Implement a Simple E-mail Spam Detector in Python

01-20

Learn How To Implement a Simple E-mail Spam Detector in Python

01-20

Safe Crime Detection

06-05

Learn How To Implement a Simple E-mail Spam Detector in Python

01-20

Deep learning, hydroponics, and medical marijuana

10-15

Keras vs. TensorFlow – Which one is better and which one should I learn?

10-08

Linear Regression in the Wild

10-03

The Power of IPython Notebook + Pandas + and Scikit-learn

06-11

Learn How To Implement a Simple E-mail Spam Detector in Python

01-20

Learn How To Implement a Simple E-mail Spam Detector in Python

01-20

Continuous Bayes’ Theorem

01-20

linl 0.0.3: Micro release

12-15

Proof that 1/7 is a repeated decimal

10-05

Tic-Tac-AI: A Strong Tic-Tac-Toe AI Opponent using Forward Sampling

03-07

Our R package roundup

12-31

Summing the Fibonacci Sequence

07-24

Project Euler using Scala: Problem

07-19

The Mathematics Behind: Rejection Sampling

01-24

The Mathematics Behind: Polynomial Curve Fitting (MATLAB)

01-20

Why Momentum Really Works

04-04

The Mathematics Behind: Polynomial Curve Fitting (MATLAB)

01-20

The Mathematics Behind: Polynomial Curve Fitting (MATLAB)

01-20

Document worth reading: “Are screening methods useful in feature selection? An empirical study”

12-18

R Packages worth a look

12-13

The Mathematics Behind: Polynomial Curve Fitting (MATLAB)

01-20

Building a news search engine

01-21

Introducing Octoparse New Version 7.1 – web scraping for dummies is official

11-20

Building a news search engine

01-21

High-performance mathematical paradigms in Python

11-22

Sequence prediction using recurrent neural networks(LSTM) with TensorFlow

05-14

Building a news search engine

01-21

Otoro Blog Migration

01-22

“Increase sample size until statistical significance is reached” is not a valid adaptive trial design; but it’s fixable.

12-07

Day 03 – little helper multiplot

12-03

R Packages worth a look

12-01

More Sandwiches, Anyone?

11-14

Remembering Michael

10-08

Speed Up With Microsoft

10-04

Introducing Python for data scientists - Pt2

03-23

Otoro Blog Migration

01-22

gganimation for the nation

01-06

Creating and saving multiple plots to Powerpoint

11-30

Blog has migrated from Ghost to Jekyll

08-11

Otoro Blog Migration

01-22

R community update: announcing sessions for useR Delhi December meetup

12-13

A comprehensive list of Machine Learning Resources: Open Courses, Textbooks, Tutorials, Cheat Sheets and more

12-07

R now supported in Azure SQL Database

11-28

T-mobile uses R for Customer Service AI

11-09

RcppTOML 0.1.5: Small extensions

11-01

A small logical change with big impact

10-16

A small logical change with big impact

10-16

A small logical change with big impact

10-16

Up your open source game with Hacktoberfest at Locke Data!

10-01

Connected Arms – Can AI Revolutionize Prosthetic Devices & Make them More Affordable?

09-07

Data Makes Possible Many Things: Insights Discovery, Innovation, and Better Decisions

08-01

Exercise and weight loss: long-term follow-up

07-10

The Data Incubator Unofficial Frequently Asked Questions

05-30

Variational Autoencoders Explained

08-06

Why Blog?

02-18

Otoro Blog Migration

01-22

Visual search on AWS—Part 2: Deployment with AWS DeepLens

09-05

Because it's Friday: The Curiosity Show

08-31

Forecasting financial time series with dynamic deep learning on AWS

08-20

R Packages worth a look

08-19

Data Notes: Winning Solutions of Kaggle Competitions

07-26

Python数据分析之pandas

07-18

BD reviews

07-11

Time Series for Spark: 0.2.0 Released

01-22

From Python Hero to Java Rockstar

06-30

Why Scala?

07-17

Time Series for Spark: 0.2.0 Released

01-22

What is the Best Python IDE for Data Science?

11-14

The One reason you should learn Python

10-11

Cloudera Enterprise 5.12 is Now Available

07-13

Time Series for Spark: 0.2.0 Released

01-22

Objects types and some useful R functions for beginners

12-24

Interesting packages taken from R/Pharma

12-09

Handling Imbalanced Datasets in Deep Learning

12-04

Deep Learning Performance Cheat Sheet

11-08

If you did not already know

09-25

Naive Bayes Classifier: A Geometric Analysis of the Naivete. Part 1

09-07

Handling Imbalanced Classes in the Dataset

08-03

What makes Robin Pemantle’s bag of tricks for teaching math so great?

07-27

Machine Learning Fraud Detection: A Simple Machine Learning Approach

06-15

Time Series for Spark: 0.2.0 Released

01-22

Simulating Persian Monarchs gameplay by @ellis2013nz

12-22

R Packages worth a look

12-13

Your Client Engagement Program Isn't Doing What You Think It Is.

11-08

R Packages worth a look

08-24

Multiple Raffle Strategy

10-09

Skill vs Strategy

01-23

Simulating Persian Monarchs gameplay by @ellis2013nz

12-22

Designing Turbofan Tycoon

12-06

Data Science Strategy Safari: Aligning Data Science Strategy to Org Strategy

11-26

Your Client Engagement Program Isn't Doing What You Think It Is.

11-08

R Packages worth a look

10-20

R Packages worth a look

08-24

Don’t call it a bandit

08-04

Multiple Raffle Strategy

10-09

d20 stopping puzzle

02-11

Skill vs Strategy

01-23

Recurrent Neural Network Gradients, and Lessons Learned Therein

10-18

Skill vs Strategy

01-23

Styles of Truncated Backpropagation

07-19

Skill vs Strategy

01-23

If you did not already know

11-26

Document worth reading: “The Risk of Machine Learning”

10-11

Skill vs Strategy

01-23

Approaches to Text Summarization: An Overview

01-03

If you did not already know

11-30

Building a Question-Answering System from Scratch

10-24

Spam Detection with Natural Language Processing (NLP) – Part 1

10-15

Simulating Twitch chat with a Recurrent Neural Network

07-21

Implement spelling correction using Language Models

02-08

Online Representation Learning in Recurrent Neural Language Models

01-24

Online Representation Learning in Recurrent Neural Language Models

01-24

If you did not already know

12-18

Online Representation Learning in Recurrent Neural Language Models

01-24

Neural Ordinary Differential Equations

12-15

If you did not already know

12-03

Machine Learning Classification: A Dataset-based Pictorial

11-05

Document worth reading: “Vector and Matrix Optimal Mass Transport: Theory, Algorithm, and Applications”

10-14

Overview and benchmark of traditional and deep learning models in text classification

06-12

Gradient optimisation on the Poincaré disc

04-10

Online Representation Learning in Recurrent Neural Language Models

01-24

Data Science Portfolio Project: Is Fandango Still Inflating Ratings?

08-15

Hierarchical Bayesian Neural Networks with Informative Priors

08-13

From Gaussian Algebra to Gaussian Processes, Part 1

03-31

Gaussian Processes

11-25

Model AUC depends on test set difficulty

03-19

The Mathematics Behind: Rejection Sampling

01-24

Generating Synthetic Data Sets with ‘synthpop’ in R

01-13

3368a9b98a073e7ba296e1f5f41f6c4f

06-02

Transfer Learning for Flight Delay Prediction via Variational Autoencoders

05-08

The Definitive Q&A Guide for Aspiring Data Scientists

01-25

The cold start problem: how to build your machine learning portfolio

01-04

The gaps between 1, 2, and 3 are just too large.

09-06

The Definitive Q&A Guide for Aspiring Data Scientists

01-25

The Definitive Q&A Guide for Aspiring Data Scientists

01-25

RcppArmadillo 0.9.200.5.0

11-28

RcppTOML 0.1.4: Now with TOML v0.5.0

10-23

R Packages worth a look

09-16

Why Indian companies should take on different projects than competing Valley companies - an application of Cobb-Douglas

11-07

The Definitive Q&A Guide for Aspiring Data Scientists

01-25

Theano Tutorial

01-25

Building a neighbour matrix with python

11-04

Why you should start using .npy file more often…

03-20

Two cool features of Python NumPy: Mutating by slicing and Broadcasting

03-17

From Arrow to pandas at 10 Gigabytes Per Second

12-27

Feather and Apache Arrow: Grokking file formats vs. in-memory representations

04-21

Theano Tutorial

01-25

Updated Review: jamovi User Interface to R

01-09

Marketing analytics with greybox

01-07

Structural Analisys of Bayesian VARs with an example using the Brazilian Development Bank

01-05

Carol Nickerson explains what those mysterious diagrams were saying

12-22

Machine Learning (ML) Essentials

12-11

Data Mining Book – Chapter Download

12-04

One Recipe Step to Rule Them All

12-03

NYC buses: simple Cubist regression

11-29

R Packages worth a look

11-14

Exploring Models with lime

11-09

Best Practices for Using Notebooks for Data Science

11-08

Data Mining Book – Chapter Download

11-02

If you did not already know

10-27

Marketing Analytics and Data Science

10-26

Applied Data Science: Solving a Predictive Maintenance Business Problem Part 3

10-16

Machine Learning Trick of the Day (8): Instrumental Thinking

10-15

Data Mining Book: Chapter Download.

10-10

Data Science Glossary

09-12

R Tip: How to Pass a formula to lm

09-01

Tips for analyzing Excel data in R

08-30

Distilled News

08-25

Multiple Linear Regression & Assumptions of Linear Regression: A-Z

08-17

Building a Linear Regression Model for Real World Problems, in R

08-14

If you did not already know

08-10

R Packages worth a look

07-31

Using Linear Regression for Predictive Modeling in R

05-16

Hitchhiker’s guide to Used Car Prices Estimation

12-04

What is an Interaction Effect?

02-25

Engineering is the bottleneck in (Deep Learning) Research

01-17

Forecast double seasonal time series with multiple linear regression in R

12-03

TensorFlow in a Nutshell — Part One: Basics

08-22

Principal Component Analysis Tutorial

06-14

A wild dataset has appeared! Now what?

05-02

Discovering and understanding patterns in highly dimensional data

02-28

Theano Tutorial

01-25

If you did not already know

11-26

If you did not already know

08-18

Where Will Your Country Stand in World War III?

04-12

Data Mining with Python on Medical Datasets for Data Mining

01-25

Data Mining with Python on Medical Datasets for Data Mining

01-25

Data Mining with Python on Medical Datasets for Data Mining

01-25

Data Mining with Python on Medical Datasets for Data Mining

01-25

Why Today’s Big Data is Not Yesterday’s Big Data — Exponential and Combinatorial Growth

01-26

Why Today’s Big Data is Not Yesterday’s Big Data — Exponential and Combinatorial Growth

01-26

Why Today’s Big Data is Not Yesterday’s Big Data — Exponential and Combinatorial Growth

01-26

Murmuration: Data Scientist [New York, NY]

01-10

10 Companies to Work with After a Data Science Course

01-10

KDnuggets™ News 19:n02, Jan 9: The cold start problem: how to build your machine learning portfolio; 5 Best Data Visualization Libraries

01-09

Industry leaders to speak at Mega-PAW, Las Vegas – June 16-20

01-09

Who is a Data Scientist?

12-27

Gold-Mining Week 16 (2018)

12-21

UnitedHealth Group: Director, Data Science [Minnetonka, MN]

12-19

The 2019 PAW Business Agenda is Live – Super Early Bird expires this Friday

12-17

Gold-Mining Week 15 (2018)

12-13

Cummins: Reliability Analytics Leader [Columbus, IN]

12-13

Intuit: Staff Data Scientist – Business Analytics [Mountain View, CA]

12-12

CBH Group: Data Scientist [Perth, Australia]

12-11

AI, Data Science, Analytics Main Developments in 2018 and Key Trends for 2019

12-03

University of Tennessee Knoxville: Assistant or Associate Professor in Data Science [Knoxville, TN]

11-30

Gold-Mining Week 13 (2018)

11-29

3 Challenges for Companies Tackling Data Science

11-26

Gold-Mining Week 12 (2018)

11-22

Mega-PAW Las Vegas Registration is Live & Super Early Bird Pricing is Now Available!

11-20

Address Your Data Science Strategy at DSNY

11-20

Insights on the role data can play in your organization

11-19

Predictive Analytics in 2018: Salaries & Industry Shifts

11-19

UnitedHealth Group: Sr Manager, Data Science [Telecommute, Central or Eastern Time Zones]

11-19

UnitedHealth Group: Sr Director, Decision Analytics [Minnetonka, MN]

11-19

UnitedHealth Group: Director, Omni-Channel Analytics [Minnetonka, MN]

11-19

Gold-Mining Week 11 (2018)

11-15

URI: Director, Data Analytics/DataSpark [Kingston, RI]

11-15

Strategy: Customer Analytics: Are you Profiting from your Data?

11-14

Bright Lights, Bright Future. TDWI Is Back in Vegas

11-14

Gold-Mining Week 10 (2018)

11-10

Dr. Data Show Video: What the Hell Does “Data Science” Really Mean?

11-10

Gold-Mining Week 9 (2018)

10-31

Bank of Canada: Data Scientist [Ottawa, Canada]

10-29

Gold-Mining Week 8 (2018)

10-26

Don’t miss Big Data LDN 2018

10-22

Speak at Mega-PAW Vegas 2019 – on Machine Learning Deployment (Apply by Nov 15)

10-22

McKinsey Datathon: The City Cup17 November, Amsterdam, Stockholm and Zurich. Apply Now

10-19

Gold-Mining Week 7 (2018)

10-19

Choose Your Own Adventure – Analytics On-boarding

10-15

Gold-Mining Week 6 (2018)

10-12

TEXATA Data Analytics Summit 2018 – Exclusive 30% KDnuggets Discount.

10-09

3 Stages of Creating Smart

10-04

Magister Dixit

10-02

TEXATA Data Analytics Summit 2018 – Exclusive 30% KDnuggets Discount

10-01

Robust Quality – Powerful Integration of Data Science and Process Engineering

10-01

Distilled News

09-24

Understanding Different Components & Roles in Data Science

09-18

Document worth reading: “Analytics for the Internet of Things: A Survey”

09-12

Understanding Different Components & Roles in Data Science

08-30

Machine Learning Making Big Moves in Marketing

07-30

Data Science in 30 Minutes: Using Data Science to Predict the Future with Kirk Borne

07-11

I Can’t Afford to Hire a Data Scientist. Now What?

07-11

Preparing for the Data Science Job Hunt

07-11

Crossing Your Data Science Chasm

03-22

It’s okay to not be a data scientist

02-20

Self-Service Adobe Analytics Data Feeds!

03-03

Q & A with Meta Brown

05-18

RSiteCatalyst Version 1.4.8 Release Notes

04-04

Why Today’s Big Data is Not Yesterday’s Big Data — Exponential and Combinatorial Growth

01-26

Bitcoin and Taxes: What You May Not Know

12-06

Top Blockchain Applications Making Waves in Commercial Real Estate

10-12

Cryptocurrency: Your Current Options

08-10

Top-Down vs. Bottom-Up Approaches to Data Science

07-10

How I built a receipt chatbot over a weekend

06-23

A Million Text Files And A Single Laptop

01-28

The Need for Speed Part 2: C++ vs. Fortran vs. C

12-24

Speed up your R Work

07-08

Extreme IO performance with parallel Apache Parquet in Python

02-10

A Million Text Files And A Single Laptop

01-28

Parallelize a For-Loop by Rewriting it as an Lapply Call

01-11

The Need for Speed Part 2: C++ vs. Fortran vs. C

12-24

Speed up your R Work

07-08

A Million Text Files And A Single Laptop

01-28

Parallelize a For-Loop by Rewriting it as an Lapply Call

01-11

The Need for Speed Part 2: C++ vs. Fortran vs. C

12-24

Reading List Faster With parallel, doParallel, and pbapply

12-12

Speed up your R Work

07-08

A Million Text Files And A Single Laptop

01-28

Document worth reading: “Machine Learning in Official Statistics”

01-11

“The Book of Why” by Pearl and Mackenzie

01-08

Document worth reading: “I can see clearly now: reinterpreting statistical significance”

01-08

On deck for the first half of 2019

01-07

If you did not already know

01-04

Office for Students report on “grade inflation”

01-02

Data Science & ML : A Complete Interview Guide

12-19

Document worth reading: “A Short Introduction to Local Graph Clustering Methods and Software”

12-10

A comprehensive list of Machine Learning Resources: Open Courses, Textbooks, Tutorials, Cheat Sheets and more

12-07

R Packages worth a look

12-05

My talk tomorrow (Tues) noon at the Princeton University Psychology Department

12-03

Defining visualization literacy

11-30

October 2018: “Top 40” New Packages

11-29

Polished statistical analysis chapters in evidence-based software engineering

11-24

Graphs and tables, tables and graphs

11-18

“Using numbers to replace judgment”

11-17

Robustness checks are a joke

11-14

Top KDnuggets tweets, Nov 07-13: 10 Free Must-See Courses for Machine Learning and Data Science

11-14

The 5 Basic Statistics Concepts Data Scientists Need to Know

11-13

If you did not already know

11-03

Document worth reading: “Declarative Statistics”

10-22

Ethics in statistical practice and communication: Five recommendations.

10-18

Document worth reading: “An Introduction to Inductive Statistical Inference — from Parameter Estimation to Decision-Making”

10-09

(People are missing the point on Wansink, so) what’s the lesson we should be drawing from this story?

09-27

What is P-value?

09-20

Learning Statistics Online for Data Science

09-20

Three Mighty Good Reasons to Learn R for Data Science

09-19

Don’t get fooled by observational correlations

09-16

R Packages worth a look

09-02

If you did not already know

09-01

John Hattie’s “Visible Learning”: How much should we trust this influential review of education research?

09-01

Distilled News

08-22

The fallacy of the excluded middle — statistical philosophy edition

08-18

The fallacy of the excluded middle — statistical philosophy edition

08-18

Distilled News

08-07

Top 20 Python AI and Machine Learning Open Source Projects

07-23

Basic Statistics in Python: Probability

07-18

Joint inference or modular inference? Pierre Jacob, Lawrence Murray, Chris Holmes, Christian Robert discuss conditions on the strength and weaknesses of these choices

07-09

Data Readiness Levels: Turning Data from Palid to Vivid

01-12

Book Review: Computer Age Statistical Inference

11-23

The Shuttle Challenger Disaster: Reflections and Connections to Data Science

01-28

How to Learn Python in 30 days

01-12

Top Skills Needed to Work as Data Scientist in iGaming

01-10

How to Write a Great Data Science Resume

01-03

How to Learn Python in 30 days

01-02

How to Land a Job As a Data Scientist in 2019

12-24

How Miguel Got 3 Data Science Job Offers Fast With Dataquest

12-24

Alternative approaches to scaling Shiny with RStudio Shiny Server, ShinyProxy or custom architecture.

12-18

How will automation tools change data science?

12-18

Why You Shouldn’t be a Data Science Generalist

12-14

Distilled News

12-10

6 Step Plan to Starting Your Data Science Career

12-05

Learn to do Data Viz in R

12-05

Why R for data science – and not Python?

12-02

Deep Learning for the Masses (… and The Semantic Layer)

11-30

Data Science Strategy Safari: Aligning Data Science Strategy to Org Strategy

11-26

Intro to Data Science for Managers

11-23

Anticipating the next move in data science – my interview with Thomson Reuters

11-17

Technoslavia 2.5: Open Source Topography

11-07

Top Data Science Hacks

11-05

Top Data Science Hacks

11-05

Azure ML Studio now supports R 3.4

11-01

The role of academia in data science education

11-01

Marketing Analytics and Data Science

10-26

5 Steps to Prepare for a Data Science Job

10-23

Cassie Kozyrkov discusses decision making and decision intelligence!

10-22

5 Steps to Prepare for a Data Science Job

10-22

Magister Dixit

10-05

The Best Programming Languages for Data Science and Machine Learning in 2018

09-20

Learning Statistics Online for Data Science

09-20

Understanding Different Components & Roles in Data Science

09-18

Divergent and Convergent Phases of Data Analysis

09-14

Google Dataset Search : Google’s New Data Search Engine

09-10

Welcome to Dataiku University!

09-07

Data Science Portfolio Project: Where to Advertise an E-learning Product

09-05

Understanding Different Components & Roles in Data Science

08-30

Microsoft Weekly Data Science News for August 24, 2018

08-24

How to Build a Data Science Portfolio

08-13

TechTarget: Data science in healthcare demands dual focus, expert says

08-01

Preparing for the Data Science Job Hunt

07-11

What I’ve learned from competing in machine learning contests on Kaggle

07-06

Data Science at Scale: Six Major Trends

07-05

What Data Scientists should focus on in 2018?

06-27

Programming Best Practices For Data Science

06-08

The Data Incubator Unofficial Frequently Asked Questions

05-30

Announcing Ursa Labs: an innovation lab for open source data science

04-19

Hiring Data Scientists

02-04

Lessons learned in my first year as a data scientist

01-25

QuantConnect – the only Game in Town

09-10

T-Shirt Contest Finalists

01-17

Demystifying Data Science

11-10

Ten Ways Your Data Project is Going to Fail

11-01

How To Become A Machine Learning Expert In One Simple Step

03-20

The Shuttle Challenger Disaster: Reflections and Connections to Data Science

01-28

The Shuttle Challenger Disaster: Reflections and Connections to Data Science

01-28

How to Learn Python in 30 days

01-12

Top 5 Data Science Courses in 2019

01-09

From a Night of Insomnia to Competition Winner | An Interview with Martin Barron

01-08

Role of Computer Science in Data Science World

01-07

Why Learning Data Science Live is Better than Self-Paced Learning

01-02

Machine Learning & AI Main Developments in 2018 and Key Trends for 2019

12-11

Yeshiva University: Data Science Program Director [New York, NY]

11-30

Monash University: Lecturer/Sr Lecturer – Digital Health [Melbourne, Australia]

11-22

6 Goals Every Wannabe Data Scientist Should Make for 2019

11-22

NYU Stern: 2019-20 Asst. Professor of Information, Operations & Management Sciences – Information Systems, tenure-track [New York City, NY]

11-14

Angela Bassa discusses managing data science teams and much more.

11-12

Dr. Data Show Video: What the Hell Does “Data Science” Really Mean?

11-10

DePaul University: Professor of Practice position in Data Science [Chicago, IL]

11-07

DePaul University: Two tenure-track/tenured positions in Data Science/Computer Science [Chicago, IL]

11-07

The role of academia in data science education

11-01

Monash University: Academic Opportunities in Dialogue Research [Melbourne, Australia]

10-25

Temple University: Faculty Positions (Assistant/Associate/Full Professor) [Philadelphia, PA]

10-12

University of Nebraska at Omaha: Faculty Position in Computer Science [Omaha, NE]

10-05

“Six Signs of Scientism”: where I disagree with Haack

10-04

Response to Rafa: Why I don’t think ROC [receiver operating characteristic] works as a model for science

08-05

Data Science in 30 Minutes: Deep Learning to Detect Fake News with Uber ATG Head of Data Science, Mike Tamir

05-30

The Shuttle Challenger Disaster: Reflections and Connections to Data Science

01-28

The Shuttle Challenger Disaster: Reflections and Connections to Data Science

01-28

Simulating Multi-state Models with R

01-01

Will Julia Replace Python and R for Data Science?

12-26

How to use Keras fit and fit_generator (a hands-on tutorial)

12-24

Statistics in Glaucoma: Part III

12-18

TSstudio 0.1.3

12-02

Time Series and MCHT

11-12

R Packages worth a look

10-30

Introducing gratia

10-23

Functions and Packages

09-29

Against Winner-Take-All Attribution

09-05

R Packages worth a look

08-29

Parallelizing Distance Calculations Using A GPU With CUDAnative.jl

08-14

RNNs in Tensorflow, a Practical Guide and Undocumented Features

08-21

RSiteCatalyst Version 1.4.8 Release Notes

04-04

How-to: Train Models in R and Python using Apache Spark MLlib and H2O

01-29

How-to: Train Models in R and Python using Apache Spark MLlib and H2O

01-29

Thinking is not something that goes on entirely, or even mostly, inside people’s heads. Little...

01-30

Kent State University: Assistant/Associate Professor – Business Analytics/Information Systems [Kent, OH]

12-19

CBH Group: Sr Data Engineer [Perth, Australia]

12-14

Cummins: Advanced Analytics Systems Architect Principle [Columbus, IN]

12-12

If you did not already know

12-01

Document worth reading: “Legible Normativity for AI Alignment: The Value of Silly Rules”

11-28

Document worth reading: “Multi-Agent Reinforcement Learning: A Report on Challenges and Approaches”

11-17

Quantum Machine Learning: A look at myths, realities, and future projections

11-05

The 3Ds of Machine Learning Systems Design

11-05

Why AI will not replace radiologists

11-01

If you did not already know

10-12

Distilled News

10-09

Document worth reading: “Closing the AI Knowledge Gap”

09-14

Big Data : Meaning, Components, Collection & Analysis

09-10

If you did not already know

08-23

Document worth reading: “A Survey on Visual Query Systems in the Web Era (extended version)”

08-20

The Real Problems with Neural Machine Translation

07-21

Design Patterns for Production NLP Systems

07-09

The Rise of Social Bots!

06-28

Thinking is not something that goes on entirely, or even mostly, inside people’s heads. Little...

01-30

Thinking is not something that goes on entirely, or even mostly, inside people’s heads. Little...

01-30

Thinking is not something that goes on entirely, or even mostly, inside people’s heads. Little...

01-30

Document worth reading: “A Survey on Trust Modeling from a Bayesian Perspective”

11-22

A study fails to replicate, but it continues to get referenced as if it had no problems. Communication channels are blocked.

10-24

A Neural Architecture for Bayesian CompressiveSensing over the Simplex via Laplace Techniques

10-08

Document worth reading: “The Three Pillars of Machine-Based Programming”

09-18

“Dynamically Rescaled Hamiltonian Monte Carlo for Bayesian Hierarchical Models”

09-06

Bayesian model comparison in ecology

08-26

Data concerns when interpreting comparisons of gender equality between countries

08-21

Data concerns when interpreting comparisons of gender equality between countries

08-21

It should be ok to just publish the data.

08-15

It should be ok to just publish the data.

08-15

China air pollution regression discontinuity update

08-02

AI ‘judge’ doesn’t explain why it reaches certain decisions

10-24

Thinking is not something that goes on entirely, or even mostly, inside people’s heads. Little...

01-30

Faster garbage collection in pqR

11-30

Kolmogorov and randomness

02-18

Linear Feedback Shift Registers

11-19

Basic Math on How Bloom Filter Works

08-27

Hamming Codes

01-30

Hamming Codes

01-30

Hamming Codes

01-30

RcppMsgPack 0.2.3

11-18

Maps of the issues mentioned most in election advertising

11-05

Document worth reading: “Deep Facial Expression Recognition: A Survey”

10-17

David Weakliem points out that both economic and cultural issues can be more or less “moralized.”

10-03

Hey—take this psychological science replication quiz!

09-02

Harmonizing and emojifying our GitHub issue trackers

07-12

GitHub's one-dimensional view of open source contributions

11-07

RSiteCatalyst Version 1.4.7 (and 1.4.6.) Release Notes

02-01

Bulk Downloading Adobe Analytics Data

07-21

Adobe: Give Credit. You DID NOT Write RSiteCatalyst.

05-09

RSiteCatalyst Version 1.4.7 (and 1.4.6.) Release Notes

02-01

collateral

11-02

Programming Best Practices For Data Science

06-08

RSiteCatalyst Version 1.4.7 (and 1.4.6.) Release Notes

02-01

Analysis of South African Funds

01-08

Deterministic A/B tests via the hashing trick

03-20

RSiteCatalyst Version 1.4.7 (and 1.4.6.) Release Notes

02-01

Six Roll Dice Game

02-04

Six Roll Dice Game

02-04

Six Roll Dice Game

02-04

Top KDnuggets tweets, Oct 31 – Nov 6: 10 More Free Must-Read Books for Machine Learning and Data Science

11-07

Six Roll Dice Game

02-04

Six Roll Dice Game

02-04

June is applied regression exam month!

12-24

Those “other” apply functions…

11-13

Guest Post – Learning R as an MBA Student

07-12

Class visualization with bilateral filters

02-05

Biggest Deep Learning Summit – Special KDnuggets Offer

01-10

Class visualization with bilateral filters

02-05

Class visualization with bilateral filters

02-05

Distilled News

12-19

Distilled News

11-12

If you did not already know

10-11

Distilled News

09-20

Run SQL queries from your SageMaker notebooks using Amazon Athena

09-12

Document worth reading: “A Survey on Visual Query Systems in the Web Era (extended version)”

08-20

Quick DB result caching in R

05-05

Getting Started With MapD, Part 1: Docker Install and Loading Data

02-01

Google F1 Server Reading Summary

11-26

“Redshift View Materializer” Now on Github

04-14

Amazon Redshift Performance – Bigger Clusters, or Bigger Nodes?

02-05

Distilled News

12-19

If you did not already know

12-09

If you did not already know

12-06

Run SQL queries from your SageMaker notebooks using Amazon Athena

09-12

Document worth reading: “A Survey on Visual Query Systems in the Web Era (extended version)”

08-20

Document worth reading: “Sequences, yet Functions: The Dual Nature of Data-Stream Processing”

08-16

Quick DB result caching in R

05-05

Getting Started With MapD, Part 1: Docker Install and Loading Data

02-01

Google F1 Server Reading Summary

11-26

Amazon Redshift Performance – Bigger Clusters, or Bigger Nodes?

02-05

R Packages worth a look

01-09

If you did not already know

01-05

If you did not already know

10-23

If you did not already know

09-25

Sleeping Giant Rural Postman Problem

12-01

50 states Rural Postman Problem

11-19

How to Build Your Own Blockchain Part 3 — Writing Nodes that Mine and Talk

11-02

Docker for AWS

06-27

An introduction to Bayesian Belief Networks

03-10

Monitoring your cluster in just a few minutes using ISA

07-18

Amazon Redshift Performance – Bigger Clusters, or Bigger Nodes?

02-05

Introducing RcppDynProg

12-31

Introducing RcppDynProg

12-31

Training models with unequal economic error costs using Amazon SageMaker

09-18

Streamlining Production with Predictive Maintenance and Essilor

09-04

Machine learning mega-benchmark: GPU providers (part 2)

02-08

Why Indian companies should take on different projects than competing Valley companies - an application of Cobb-Douglas

11-07

Why I'm bullish on Uber - the customer acquisition trough

04-20

Amazon Redshift Performance – Bigger Clusters, or Bigger Nodes?

02-05

Amazon Redshift Performance – Bigger Clusters, or Bigger Nodes?

02-05

Russian Roulette

02-06

Russian Roulette

02-06

LoyaltyOne: Consultant Category Manager / Analyst, Client Services [Westborough, MA]

12-17

LoyaltyOne: Associate Director, CPG [Westborough, MA]

12-17

LoyaltyOne: Consultant Category Manager / Analyst, Client Services [Westborough, MA]

12-14

Russian Roulette

02-06

Russian Roulette

02-06

Add a static pdf vignette to an R package

01-11

Robin Pemantle’s updated bag of tricks for math teaching!

01-04

Starspace for NLP

12-04

Communicating results with R Markdown

11-01

namer, Automatic Labelling of R Markdown Chunks

10-31

Spam Detection with Natural Language Processing-Part 2

10-18

Build a document search bot using Amazon Lex and Amazon Elasticsearch Service

08-01

Pivoted document length normalisation

06-19

Docstrings in open source Python

06-18

Crosslingual document comparison

08-31

A fastText-based hybrid recommender

09-27

Creating a Search Engine

08-19

Document Similarity With Word Movers Distance

06-13

How to make a good data-driven web app

05-25

Do average consumers still need Dropbox?

03-13

Paris Meetup slides Topic Modeling of Twitter Followers

02-08

Paris Meetup slides Topic Modeling of Twitter Followers

02-08

Journal: PLXtrum - realtime machine learning for predicting note onset

01-28

Paris Meetup slides Topic Modeling of Twitter Followers

02-08

Paris Meetup slides Topic Modeling of Twitter Followers

02-08

Looking into 19th century ads from a Luxembourguish newspaper with R

01-04

Implement spelling correction using Language Models

02-08

Implement spelling correction using Language Models

02-08

Implement spelling correction using Language Models

02-08

Implement spelling correction using Language Models

02-08

Free Reinforcement Learning Textbook

11-14

The Best of Unpublished Machine Learning and Statistics Books

02-09

A comprehensive list of Machine Learning Resources: Open Courses, Textbooks, Tutorials, Cheat Sheets and more

12-07

The Best of Unpublished Machine Learning and Statistics Books

02-09

Simulating simple dice games by @ellis2013nz

10-26

Chuck-a-Luck

12-26

d20 stopping puzzle

02-11

Simulating simple dice games by @ellis2013nz

10-26

Chuck-a-Luck

12-26

d20 stopping puzzle

02-11

d20 stopping puzzle

02-11

If you did not already know

01-06

R Packages worth a look

12-18

More on Bias Corrected Standard Deviation Estimates

11-14

Bob Erikson on the 2018 Midterms

10-01

Don’t call it a bandit

08-04

A Gentle Introduction to Bloom Filter

06-05

d20 stopping puzzle

02-11

Developing effective data scientists

02-11

Developing effective data scientists

02-11

Developing effective data scientists

02-11

Becoming a Data Scientist Podcast Episode 05: Clare Corthell

02-15

Linear Regression in the Wild

10-03

How to Build a Data Science Portfolio

08-13

The 2018 Best Picture Nominees Ranked, Reviewed, and Reflected Upon

03-03

Hiring Data Scientists

02-04

Becoming a Data Scientist Podcast Episode 05: Clare Corthell

02-15

Confluent Platform

02-15

Confluent Platform

02-15

Big Data : Meaning, Components, Collection & Analysis

09-10

Putting the Power of Kafka into the Hands of Data Scientists

09-05

Confluent Platform

02-15

Confluent Platform

02-15

Confluent Platform

02-15

Making Python on Apache Hadoop Easier with Anaconda and CDH

02-17

Making Python on Apache Hadoop Easier with Anaconda and CDH

02-17

Why Blog?

02-18

Why Blog?

02-18

Announcing Ursa Labs's partnership with NVIDIA

10-10

Some comments to Daniel Abadi's blog about Apache Arrow

11-01

Feather format update: Whence and Whither?

10-16

Apache Arrow and the "10 Things I Hate About pandas"

09-21

Feather: A Fast On-Disk Format for Data Frames for R and Python, powered by Apache Arrow

03-29

Introducing Apache Arrow: A Fast, Interoperable In-Memory Columnar Data Structure Standard

02-18

Introducing Apache Arrow: A Fast, Interoperable In-Memory Columnar Data Structure Standard

02-18

Data Notes: Predict the World Cup 2018 Winner

06-14

Introducing Apache Arrow: A Fast, Interoperable In-Memory Columnar Data Structure Standard

02-18

Two nugget problem

02-21

If you did not already know

12-01

R tip: Use Radix Sort

08-21

If you did not already know

08-15

data.table is Really Good at Sorting

08-14

Two nugget problem

02-21

R Packages worth a look

10-06

Two nugget problem

02-21

If you did not already know

12-01

Two nugget problem

02-21

Two nugget problem

02-21

Learning in Brains and Machines (1): Temporal Differences

02-21

Learning in Brains and Machines (1): Temporal Differences

02-21

Document worth reading: “Recent Research Advances on Interactive Machine Learning”

01-05

R Packages worth a look

11-26

Because it's Friday: Parable of the Polygons

10-26

New Course: Interactive Data Visualization with rbokeh

10-19

R Packages worth a look

10-18

R Packages worth a look

09-19

Gradient Boosting Interactive Playground

07-05

Data Science Learning Club Update

02-21

Redmonk Language Rankings, June 2018

08-10

Becoming a Data Scientist Podcast Episode 12: Data Science Learning Club Members

06-15

Data Science Learning Club Update

02-21

What makes Robin Pemantle’s bag of tricks for teaching math so great?

07-27

Becoming a Data Scientist Podcast Episode 12: Data Science Learning Club Members

06-15

Data Science Learning Club Update

02-21

SAGA algorithm in the lightning library

02-21

State of Deep Learning and Major Advances: H2 2018 Review

12-13

New version of pqR, with major speed improvements

11-25

Learn the top things to look for in an AI Vendor

10-12

Feather format update: Whence and Whither?

10-16

SAGA algorithm in the lightning library

02-21

New version of pqR, with major speed improvements

11-25

R Packages worth a look

11-20

R Packages worth a look

09-30

Distilled News

09-07

Feather format update: Whence and Whither?

10-16

SAGA algorithm in the lightning library

02-21

SAGA algorithm in the lightning library

02-21

SAGA algorithm in the lightning library

02-21

Adam: “It would have been much harder without Dataquest”

10-18

Calling RSiteCatalyst From Python

02-22

Calling RSiteCatalyst From Python

02-22

Plotting wind highways using rWind

11-26

Why pandas users should be excited about Apache Arrow

02-22

implyr: R Interface for Apache Impala

07-19

How-to: Use Impala and Kudu Together for Analytic Workloads

04-20

Why pandas users should be excited about Apache Arrow

02-22

Guide to an in-depth understanding of logistic regression

02-22

R Packages worth a look

01-13

A deep dive into glmnet: offset

01-09

June is applied regression exam month!

12-24

An 8-hour course on R and Data Mining

12-09

An 8-hour course on R and Data Mining

12-09

Shinyfit: Advanced regression modelling in a shiny app

12-07

If you did not already know

12-04

R Packages worth a look

11-30

R Packages worth a look

11-05

If you did not already know

10-24

R Packages worth a look

09-27

R Packages worth a look

09-16

R Packages worth a look

09-05

R Packages worth a look

08-12

Intercausal Reasoning in Bayesian Networks

03-13

Probability Calibration And Isotonic Regression

09-18

MLHEP 2016 lectures slides

07-12

Guide to an in-depth understanding of logistic regression

02-22

Science Week Talk 2016

02-24

Science Week Talk 2016

02-24

Science Week Talk 2016

02-24

Hierarchical Bayesian Neural Networks with Informative Priors

08-13

What's new in PyMC3 3.1

07-05

Why hierarchical models are awesome, tricky, and Bayesian

02-08

Bayesian Deep Learning

06-01

Science Week Talk 2016

02-24

Science Week Talk 2016

02-24

Oil Changes, Gas Mileage, and my Unreliable Gut

02-24

Oil Changes, Gas Mileage, and my Unreliable Gut

02-24

Nemirovski’s acceleration

01-09

Whats new on arXiv

12-04

The Intuitions Behind Bayesian Optimization with Gaussian Processes

10-19

Document worth reading: “Vector and Matrix Optimal Mass Transport: Theory, Algorithm, and Applications”

10-14

Whats new on arXiv

10-09

R Packages worth a look

10-06

Document worth reading: “Physically optimizing inference”

09-29

Whats new on arXiv

08-10

Do Bayesians Overfit?

07-11

Bayesian Inference via Simulated Annealing

02-07

Learning Reinforcement Learning (with Code, Exercises and Solutions)

10-02

Hyperparameter optimization with approximate gradient

05-24

Watch Tiny Neural Nets Learn

03-04

Oil Changes, Gas Mileage, and my Unreliable Gut

02-24

JMP Publishes Exercises to Accompany Data Mining Techniques (3rd Edition)

05-31

Oil Changes, Gas Mileage, and my Unreliable Gut

02-24

How to Code and Understand DeepMind's Neural Stack Machine

02-25

R Tip: Use seqi() For Indexes

01-11

R Tip: Use seqi() For Indexes

01-11

restez: Query GenBank locally

12-03

BRUNO: A Deep Recurrent Model for Exchangeable Data

09-17

When Recurrent Models Don't Need to be Recurrent

08-06

Convolve all the things

05-31

Linus Sequence

02-06

Genome Analysis Toolkit: Now Using Apache Spark for Data Processing

04-06

How to Code and Understand DeepMind's Neural Stack Machine

02-25

How to Code and Understand DeepMind's Neural Stack Machine

02-25

How to Code and Understand DeepMind's Neural Stack Machine

02-25

A Variant on “Statistically Controlling for Confounding Constructs is Harder than you Think”

02-25

R Packages worth a look

01-13

If you did not already know

12-15

Magister Dixit

10-05

If you did not already know

08-12

Two papers released on arXiv, "Operator Variational Inference" and "Model Criticism for Bayesian Causal Inference"

10-30

A Variant on “Statistically Controlling for Confounding Constructs is Harder than you Think”

02-25

A Variant on “Statistically Controlling for Confounding Constructs is Harder than you Think”

02-25

A Variant on “Statistically Controlling for Confounding Constructs is Harder than you Think”

02-25

A Variant on “Statistically Controlling for Confounding Constructs is Harder than you Think”

02-25

Improving Binning by Bootstrap Bumping

11-25

More Robust Monotonic Binning Based on Isotonic Regression

11-24

Source and List: Organizing R Shiny Apps

11-06

Monotonic Binning with Equal-Sized Bads for Scorecard Development

10-14

Integration method to map model scores to conversion rates from example data

03-04

Histogram intersection for change detection

02-28

Histogram intersection for change detection

02-28

Histogram intersection for change detection

02-28

Interspeech 2018: Highlights for Data Scientists

12-24

Quoting in R

11-15

Quoting in R

11-15

Import AI 114: Synthetic images take a big leap forward with BigGANs; US lawmakers call for national AI strategy; researchers probe language reasoning via HotspotQA

10-01

Awesome postdoc opportunities in computational genomics at JHU

05-17

Webcam based image processing in Jupyter notebooks

04-09

Discovering and understanding patterns in highly dimensional data

02-28

“Dissolving the Fermi Paradox”

01-05

Discovering and understanding patterns in highly dimensional data

02-28

Similar pages for Wikipedia

05-03

Future Debates: This House Believes An Artificial Intelligence will Benefit Society

02-29

Future Debates: This House Believes An Artificial Intelligence will Benefit Society

02-29

4 Myths of Big Data and 4 Ways to Improve with Deep Data

01-09

Document worth reading: “Computational Power and the Social Impact of Artificial Intelligence”

12-27

If you did not already know

12-06

If you did not already know

11-23

Introduction to PyTorch for Deep Learning

11-07

If you did not already know

11-01

Open Source Deep Dive with Olivier Grisel

10-29

Document worth reading: “A Temporal Difference Reinforcement Learning Theory of Emotion: unifying emotion, cognition and adaptive behavior”

08-07

Quarterly product update: Create your data science projects on Kaggle

04-04

Moravec's Paradox

01-31

The Two Tribes of Language Researchers

11-19

Cognitive Machine Learning: Prologue

10-08

Build your own Deep Learning Box

05-19

Future Debates: This House Believes An Artificial Intelligence will Benefit Society

02-29

Grazing and Calculus

02-29

Grazing and Calculus

02-29

Grazing and Calculus

02-29

Grazing and Calculus

02-29

University of Virginia: Faculty, Open Rank Model and Simulation at the Human-Technology Frontier [Charlottesville, VA]

12-24

The Semantic Web: Where is it now?

12-23

Ronin: Data Engineer [San Mateo, CA]

12-03

Document worth reading: “An Overview of Blockchain Integration with Robotics and Artificial Intelligence”

11-08

MVP for Data Projects

10-22

Inside Higher Ed: Pushing the Boundaries of Learning With AI

09-26

Document worth reading: “Artificial Intelligence and Robotics”

09-04

Distilled News

08-21

Legal Tech: How Can Lawyers Benefit?

08-13

CES 2017

01-09

Data Science Challenges

07-01

Sheffield Advertises Posts in Machine Learning

03-01

The Semantic Web: Where is it now?

12-23

Ronin: Data Engineer [San Mateo, CA]

12-03

The Big Data Game Board™

11-19

New Jobs Sure to Emerge Alongside Artificial Intelligence

10-18

The Definitive Guide to AI’s “Black Box” Problem

10-17

Ethical AI for Data Scientists

08-15

Legal Tech: How Can Lawyers Benefit?

08-13

Moravec's Paradox

01-31

Questions on Artificial Intelligence

01-16

Data Science Challenges

07-01

AI and ML Futures 4: The Future of AI Meeting

04-22

Sheffield Advertises Posts in Machine Learning

03-01

Sheffield Advertises Posts in Machine Learning

03-01

Designing a Deep Learning Project

08-23

Sheffield Advertises Posts in Machine Learning

03-01

Deep Learning for Media Content

12-28

Meet the Authors: “Data Analytics with Hadoop” from O’Reilly Media

03-01

Meet the Authors: “Data Analytics with Hadoop” from O’Reilly Media

03-01

Meet the Authors: “Data Analytics with Hadoop” from O’Reilly Media

03-01

Meet the Authors: “Data Analytics with Hadoop” from O’Reilly Media

03-01

Wind Turbine Efficiency

06-19

Deep Learning, Pachinko, and James Watt: Efficiency is the Driver of Uncertainty

03-04

Top Python Libraries in 2018 in Data Science, Deep Learning, Machine Learning

12-19

If you did not already know

11-10

Distilled News

09-17

Wind Turbine Efficiency

06-19

Deep Learning, Pachinko, and James Watt: Efficiency is the Driver of Uncertainty

03-04

First Steps With Neural Nets in Keras

03-04

First Steps With Neural Nets in Keras

03-04

Watch Tiny Neural Nets Learn

03-04

Watch Tiny Neural Nets Learn

03-04

AI Gotchas (& How to Avoid Them)

01-08

Global Legal Entity Identifier Foundation (GLEIF): Data Analyst [Frankfurt, Germany]

11-26

Will Compression Be Machine Learning’s Killer App?

10-16

If you did not already know

10-02

Robust Quality – Powerful Integration of Data Science and Process Engineering

10-01

If you did not already know

09-11

scikit-learn-contrib, an umbrella for scikit-learn related projects.

03-05

scikit-learn-contrib, an umbrella for scikit-learn related projects.

03-05

scikit-learn-contrib, an umbrella for scikit-learn related projects.

03-05

R Packages worth a look

08-11

“Bayesian Meta-Analysis with Weakly Informative Prior Distributions”

07-13

Q & A with Meta Brown

05-18

scikit-learn-contrib, an umbrella for scikit-learn related projects.

03-05

scikit-learn-contrib, an umbrella for scikit-learn related projects.

03-05

Second Annual Data Science Bowl – Part 1

03-06

Second Annual Data Science Bowl – Part 1

03-06

Avoiding overfitting in object detection problem

12-19

Playing with convolutions in TensorFlow

08-09

Second Annual Data Science Bowl – Part 1

03-06

Second Annual Data Science Bowl – Part 2

03-07

Carol Nickerson explains what those mysterious diagrams were saying

12-22

Second Annual Data Science Bowl – Part 3 – Automatically Finding the Heart Location in an MRI Image

03-08

Second Annual Data Science Bowl – Part 2

03-07

Second Annual Data Science Bowl – Part 2

03-07

Second Annual Data Science Bowl – Part 2

03-07

Second Annual Data Science Bowl – Part 3 – Automatically Finding the Heart Location in an MRI Image

03-08

Analyzing Customer Churn – Competing Risks

03-08

Analyzing Customer Churn – Competing Risks

03-08

A Case For Explainable AI & Machine Learning

12-27

Intuit: Staff Data Scientist [Woodland Hills, CA and Mountain View, CA]

12-11

A Right to Reasonable Inferences

10-01

If you did not already know

09-01

“Should I get a PhD to be a data scientist/analytics professional?”

11-19

Analyzing Customer Churn – Competing Risks

03-08

Understanding the maths of Computed Tomography (CT) scans

01-09

Introduction to Pandas, NumPy and RegEx in Python

12-17

In case you missed it: October 2018 roundup

11-15

A Deep (But Jargon and Math Free) Dive Into Deep Learning

08-31

Weekly Review: 10/28/2017

10-28

Analyzing Customer Churn – Competing Risks

03-08

How Data Scientists Think - A Mini Case Study

01-09

Feature engineering, Explained

12-21

Here are the most popular Python IDEs / Editors

12-07

Angela Bassa discusses managing data science teams and much more.

11-12

EARL Houston: Interview with Hadley Wickham

11-05

How to be an Artificial Intelligence (AI) Expert?

10-29

How to be an Artificial Intelligence (AI) Expert?

10-25

He’s a history teacher and he has a statistics question

10-20

Overlapping Disks

09-30

Learning Statistics Online for Data Science

09-20

“The most important aspect of a statistical analysis is not what you do with the data, it’s what data you use” (survey adjustment edition)

08-07

Decision Making and Diversity

11-15

Quora Q&A Session Answers

03-09

Don’t reinvent the wheel: making use of shiny extension packages. Join MünsteR for our next meetup!

01-08

Adding Firebase Authentication to Shiny

01-03

x-mas tRees with gganimate, ggplot, plotly and friends

01-03

Alternative approaches to scaling Shiny with RStudio Shiny Server, ShinyProxy or custom architecture.

12-18

R Packages worth a look

12-01

R Packages worth a look

11-28

Interactive Graphics with R Shiny

11-23

Building a Repository of Alpine-based Docker Images for R, Part II

11-14

Communicating results with R Markdown

11-01

Summer Intern Projects

10-22

Voice Control your Shiny Apps

10-15

R Packages worth a look

09-21

R Packages worth a look

09-20

Distilled News

09-06

Integrating D3.js into R Shiny

03-13

R Packages worth a look

10-22

Document worth reading: “Applications of Artificial Intelligence to Network Security”

08-22

Quick reference to Python in a single script (and notebook)

10-13

Integrating D3.js into R Shiny

03-13

Integrating D3.js into R Shiny

03-13

Towards optimal personalization: synthesisizing machine learning and operations research

08-30

Simulating Twitch chat with a Recurrent Neural Network

07-21

Integrating D3.js into R Shiny

03-13

LoyaltyOne: Associate Director, Client Services [Westborough, MA]

12-17

Announcing RStudio Package Manager

10-17

Distilled News

09-26

Nextgov: DHS Funds Machine Learning Tool to Boost Other Countries’ Airport Security

08-20

Distilled News

08-16

Announcing the Amazon SageMaker MXNet 1.2 container

08-06

Data Science for Managers and Directors (DS4MAD)

10-10

Do average consumers still need Dropbox?

03-13

Customize your notebook volume size, up to 16 TB, with Amazon SageMaker

11-06

Build a document search bot using Amazon Lex and Amazon Elasticsearch Service

08-01

Docstrings in open source Python

06-18

How to make a good data-driven web app

05-25

Do average consumers still need Dropbox?

03-13

BIG, small or Right Data: Which is the proper focus?

10-08

Scanning Office 365 documents

07-16

How I was screwing up testing my code

10-15

Applying Machine Learning To March Madness

03-12

Do average consumers still need Dropbox?

03-13

Stability as a foundation of machine learning

03-14

Stability as a foundation of machine learning

03-14

Stability as a foundation of machine learning

03-14

How things float

06-20

Stability as a foundation of machine learning

03-14

Analyzing Golden State Warriors' passing network using GraphFrames in Spark

03-15

French Baccalaureate Results

01-08

Analyzing Golden State Warriors' passing network using GraphFrames in Spark

03-15

Analyzing Golden State Warriors' passing network using GraphFrames in Spark

03-15

How-to: Do Scalable Graph Analytics with Apache Spark

10-03

Analyzing Golden State Warriors' passing network using GraphFrames in Spark

03-15

AzureRMR: an R interface to Azure Resource Manager

11-26

AzureRMR: an R interface to Azure Resource Manager

11-26

Analyzing Golden State Warriors' passing network using GraphFrames in Spark

03-15

Announcing the ultimate seminar speaker contest: 2019 edition!

01-06

EARL conference recap: Seattle 2018

11-24

Neural networks to generate music

11-19

Report from the Enterprise Applications of the R Language conference

11-16

Report from the Enterprise Applications of the R Language conference

11-16

High school statistics class builds election prediction model

10-23

David MacKay Symposium

03-15

ICML 2017 Workshop on Implicit Models

06-02

David MacKay Symposium

03-15

David MacKay Symposium

03-15

Le Monde puzzle [#1076]

12-26

Diagnosing Heart Diseases with Deep Neural Networks

03-15

Tutorial: An app in R shiny visualizing biopsy data —  in a pharmaceutical company

01-07

Simulating Multi-state Models with R

01-01

Ronin: Sr Machine Learning and AI Data Scientist [San Mateo, CA]

12-03

AWS expands HIPAA eligible machine learning services for healthcare customers

11-08

Young Investigator Special Competition for Time-Sharing Experiment for the Social Sciences

10-19

How AI Will Change Healthcare

10-15

“Seeding trials”: medical marketing disguised as science

08-01

TechTarget: Data science in healthcare demands dual focus, expert says

08-01

Large-Scale Health Data Analytics with OHDSI

12-21

Data Trusts

05-29

Diagnosing Heart Diseases with Deep Neural Networks

03-15

MLHEP 2016 lectures slides

07-12

Diagnosing Heart Diseases with Deep Neural Networks

03-15

Using the tidyverse for more than data manipulation: estimating pi with Monte Carlo methods

12-21

Is the answer to everything Gaussian?

10-29

Estimating Pi

10-16

Building an Image Classifier Running on Raspberry Pi

10-09

National Pi Day

03-15

National Pi Day

03-15

If you did not already know

11-10

Visual search on AWS—Part 1: Engine implementation with Amazon SageMaker

08-30

Distilled News

08-21

National Pi Day

03-15

Advent of Code: Most Popular Languages

12-15

Amazon SageMaker now comes with new capabilities for accelerating machine learning experimentation

11-29

If you did not already know

09-12

Google Dataset Search : Google’s New Data Search Engine

09-10

Distilled News

08-21

Amazon Rekognition is now available in the Asia Pacific (Seoul) and Asia Pacific (Mumbai) Regions

08-09

Topic Modeling for Keyword Extraction

02-05

National Pi Day

03-15

Compiling DataFrame code is harder than it looks

03-16

Compiling DataFrame code is harder than it looks

03-16

Compiling DataFrame code is harder than it looks

03-16

MINDBODY: Business Intelligence Analyst II [San Luis Obispo, CA]

12-13

R now supported in Azure SQL Database

11-28

Book review: SQL Server 2017 Machine Learning Services with R

09-04

Clustering Zeppelin on Zeppelin

10-23

Compiling DataFrame code is harder than it looks

03-16

Large Data with Scikit-learn - Boston Meetup

03-16

Amazon Transcribe now supports real-time transcriptions

11-20

Top 5 Trends in Data Science

11-09

Apache Spark Introduction for Beginners

10-18

Use AWS DeepLens to give Amazon Alexa the power to detect objects via Alexa skills

10-17

Distilled News

08-21

Streaming Columnar Data with Apache Arrow

01-27

Akka Stream

05-06

Akka Stream

03-25

Large Data with Scikit-learn - Boston Meetup

03-16

Stereograms

11-26

How digital cameras work

05-25

Keras plays catch, a single file Reinforcement Learning example

03-17

How to Optimise Ad CTR with Reinforcement Learning

09-24

How to Optimise Ad CTR with Reinforcement Learning

09-17

Keras plays catch, a single file Reinforcement Learning example

03-17

Avoid unsigned integers in C++ if you can

03-17

Avoid unsigned integers in C++ if you can

03-17

Recently in the sister blog

07-24

Avoid unsigned integers in C++ if you can

03-17

Avoid unsigned integers in C++ if you can

03-17

Exploring 2018 R-bloggers & R Weekly Posts with Feedly & the ‘seymour’ package

12-31

AI for Good: slides and notebooks from the ODSC workshop

11-13

Document worth reading: “A Comprehensive Study of Deep Learning for Image Captioning”

10-31

Trapped in the spam folder? Here’s what to do.

08-07

F beta score for Keras

04-23

Two Bingo Ball Puzzle

03-18

Two Bingo Ball Puzzle

03-18

Two Bingo Ball Puzzle

03-18

Two Bingo Ball Puzzle

03-18

Two Bingo Ball Puzzle

03-18

Document worth reading: “Universality of Deep Convolutional Neural Networks”

01-10

Document worth reading: “I can see clearly now: reinterpreting statistical significance”

01-08

Document worth reading: “Recent Research Advances on Interactive Machine Learning”

01-05

Document worth reading: “The importance of being dissimilar in Recommendation”

12-30

R Packages worth a look

12-29

R Packages worth a look

12-27

Magister Dixit

12-24

Document worth reading: “A second-quantised Shannon theory”

12-20

Rotary

12-19

Distilled News

12-15

R Packages worth a look

11-28

Magister Dixit

11-27

R Packages worth a look

11-26

Magister Dixit

11-23

Document worth reading: “To Cluster, or Not to Cluster: An Analysis of Clusterability Methods”

11-23

Document worth reading: “The Algorithm Selection Competition Series 2015-17”

11-21

Free Reinforcement Learning Textbook

11-14

R Packages worth a look

11-13

R Packages worth a look

11-10

R Packages worth a look

11-05

Document worth reading: “Artificial Intelligence for Long-Term Robot Autonomy: A Survey”

11-04

Document worth reading: “Resource Management in Fog/Edge Computing: A Survey”

10-30

Document worth reading: “Neural Approaches to Conversational AI”

10-29

Document worth reading: “Machine Learning for Wireless Networks with Artificial Intelligence: A Tutorial on Neural Networks”

10-28

R Packages worth a look

10-24

R Packages worth a look

10-23

R Packages worth a look

10-20

R Packages worth a look

10-08

R Packages worth a look

10-06

Document worth reading: “Detecting Dead Weights and Units in Neural Networks”

10-05

Magister Dixit

10-02

R Packages worth a look

09-26

A Better Example of the Confused By The Environment Issue

09-25

Document worth reading: “Graph-based Ontology Summarization: A Survey”

09-23

A Quick Appreciation of the R transform Function

09-10

R Tip: Give data.table a Try

09-08

R Packages worth a look

09-08

R Packages worth a look

08-19

Document worth reading: “How Important Is a Neuron”

08-15

R Packages worth a look

08-01

Getting Started With MapD, Part 1: Docker Install and Loading Data

02-01

Adobe Analytics Clickstream Data Feed: Loading To Relational Database

03-18

Adobe Analytics Clickstream Data Feed: Loading To Relational Database

03-18

Adobe Analytics Clickstream Data Feed: Loading To Relational Database

03-18

Adobe Analytics Clickstream Data Feed: Loading To Relational Database

03-18

Announcing Kaggle integration with Google Data Studio

12-05

New: Maintained Datasets

11-06

Data Notes: Are Those Honey Bees Healthy?

10-04

Use Kaggle to start (and guide) your ML/ Data Science journey — Why and How

08-29

Data Notes: How to Forecast the S&P 500 with Prophet

07-12

What I’ve learned from competing in machine learning contests on Kaggle

07-06

Mother's Day Interview: How Nicole Finnie Became a Competitive Kaggler on Maternity Leave

05-10

Profiling Top Kagglers: Bestfitting, Currently

05-07

Kaggle’s Quora Question Pairs Competition

06-07

Docker and Kaggle with Ernie and Bert

11-22

How To Become A Machine Learning Expert In One Simple Step

03-20

Slot Machines

10-15

Probability and Tennis

08-13

Exploiting Daily Fantasy Football for Fun and Profit

09-22

Tic-Tac-AI: A Strong Tic-Tac-Toe AI Opponent using Forward Sampling

03-07

NSA Easter Egg Puzzle

03-05

Chuck-a-Luck

12-26

How To Become A Machine Learning Expert In One Simple Step

03-20

How To Become A Machine Learning Expert In One Simple Step

03-20

NYU Stern Fubon Center for Technology, Business and Innovation: Fubon Center Faculty Fellow [New York, NY]

01-08

Southern Illinois University Edwardsville: Director of the Center for Predictive Analytics/(Associate) Professor of Mathematics and Statistics [Edwardsville, IL]

01-04

Distilled News

12-31

University of Rhode Island: Data Scientist, DataSpark (2 Positions) [Kingston, RI]

12-18

DePaul University: Professor of Practice position in Data Science [Chicago, IL]

11-07

DePaul University: Two tenure-track/tenured positions in Data Science/Computer Science [Chicago, IL]

11-07

Vanderbilt University: Lecturer in Data and Analytics [Nashville, TN]

11-05

Vanderbilt University’s Peabody College: Lecturer in Data and Analytics [Online Teaching]

11-05

How to Mitigate Open Source License Risks

10-30

The quest continues: a look at a new initiative to explore human and machine intelligence

10-29

Distilled News

10-28

Monash University: Academic Opportunities in Dialogue Research [Melbourne, Australia]

10-25

University of Rhode Island: Assistant Professor of Data Science [Kingston, RI]

10-22

SQL, Python, & R: All in One Platform

10-11

10 Best Mobile Apps for Data Scientist / Data Analysts

10-10

Distilled News

10-06

University of Nebraska at Omaha: Faculty Position in Computer Science [Omaha, NE]

10-05

Document worth reading: “Fog Computing: Survey of Trends, Architectures, Requirements, and Research Directions”

08-22

Distilled News

08-16

Free E-Book: A Developer’s Guide to Building AI Applications

06-04

Algorithms, Machine Learning, and Optimization: we are hiring!

11-12

NIPS 2017 Workshop on Approximate Inference

09-25

Outside a train rumbles by

09-09

How To Become A Machine Learning Expert In One Simple Step

03-20

Dealing with Corrupt Files in Hadoop

03-21

Dealing with Corrupt Files in Hadoop

03-21

NLP for Log Analysis – Tokenization

11-13

Rec-a-Sketch: a Flask App for Interactive Sketchfab Recommendations

02-05

Dealing with Corrupt Files in Hadoop

03-21

Dealing with Corrupt Files in Hadoop

03-21

Dealing with Corrupt Files in Hadoop

03-21

R 101

12-24

How tall is that tree?

03-22

Optimal Picture Viewing Distance

12-19

R Packages worth a look

12-18

Reflections on remote data science work

11-03

If you did not already know

10-24

k-server, part 2: continuous time mirror descent

12-20

Document Similarity With Word Movers Distance

06-13

How tall is that tree?

03-22

How Important is that Machine Learning Model be Understandable? We analyze poll results

11-19

Multilevel models with group-level predictors

10-21

Mapping opportunity for children, based on where they grew up

10-03

If you did not already know

08-26

It’s okay to not be a data scientist

02-20

How tall is that tree?

03-22

Optimal Picture Viewing Distance

12-19

Freudenstein’s Equation

12-07

Ackerman Steering

12-03

Efficient Guttering

07-29

How things float

06-20

How tall is that tree?

03-22

Stock Price prediction using ML and DL

01-07

x-mas tRees with gganimate, ggplot, plotly and friends

01-03

A Guide to Decision Trees for Machine Learning and Data Science

12-24

R Packages worth a look

12-08

R Packages worth a look

12-05

Building Blocks of Decision Tree

11-26

R Packages worth a look

11-14

R Packages worth a look

09-27

R Packages worth a look

09-21

R Packages worth a look

09-05

Using Linear Regression for Predictive Modeling in R

05-16

Semantic trees for training word embeddings with hierarchical softmax

09-07

Using regression trees for forecasting double-seasonal time series with trend in R

08-22

Hierarchical Softmax

08-01

Boosting (in Machine Learning) as a Metaphor for Diverse Teams

08-07

Decision Trees Tutorial

07-27

Gradient Boosting explained [demonstration]

06-24

How tall is that tree?

03-22

If you did not already know

10-10

Light FM Recommendation System Explained

05-24

Sense is now part of Cloudera!

03-22

How to build a data science project from scratch

12-05

Document worth reading: “Examining the Use of Neural Networks for Feature Extraction: A Comparative Analysis using Deep Learning, Support Vector Machines, and K-Nearest Neighbor Classifiers”

08-08

Sense is now part of Cloudera!

03-22

Sense is now part of Cloudera!

03-22

Top 5 Data Science Courses in 2019

01-09

Learn Python for Data Science From Scratch

01-09

How to Meet Your New Years Resolutions in 2019 (Udemy Coupons $9.99)

01-01

New public course on Successfully Delivering Data Science Projects for Feb 1st

12-18

Learn to do Data Viz in R

12-05

GARCH and a rudimentary application to Vol Trading

12-03

A Complete Guide to Choosing the Best Machine Learning Course

11-30

8 Reasons to Take Data Analytics Certification Courses

11-28

Data Pro Cyber Monday – Choose Your Savings

11-26

KDnuggets™ News 18:n43, Nov 14: To get hired as a data scientist, don’t follow the herd; LinkedIn Top Voices in Data Science & Analytics

11-14

How DataCamp Handles Course Quality

10-25

Review: Excel TV’s Data Science with Power BI and R

10-12

DataCamp: Part-time Contract Instructors [Remote]

10-11

KDnuggets™ News 18:n37, Oct 3: Mathematics of Machine Learning; Effective Transfer Learning for NLP; Path Analysis with R

10-03

Chromebook Data Science - a free online data science program for anyone with a web browser.

10-01

Advantages of Online Data Science Courses

09-26

Learning Statistics Online for Data Science

09-20

How Foundations Student Russell Martin got into The Data Incubator’s Fellowship

09-06

Of statistics class and judo class: Beyond the paradigm of sequential education

07-22

Top content from two years of Data School

03-24

MRP (or RPP) with non-census variables

10-28

High school statistics class builds election prediction model

10-23

What data scientists really do

08-21

Top content from two years of Data School

03-24

Drexel University: 2 Teaching Faculty Positions in Data Science [Philadelphia, PA]

11-27

Vanderbilt University: Lecturer in Data and Analytics [Nashville, TN]

11-05

Vanderbilt University: Sr Lecturer in Data and Analytics [Nashville, TN]

11-05

Vanderbilt University’s Peabody College: Lecturer in Data and Analytics [Online Teaching]

11-05

Vanderbilt University’s Peabody College: Sr. Lecturer in Data and Analytics [Nashville, TN]

11-05

University of Rhode Island: Assistant Professor of Data Science [Kingston, RI]

10-22

Colorado State University: Assistant Professor in Industrial and Organizational (IO) Psychology [Fort Collins, CO]

10-05

Top content from two years of Data School

03-24

Request for Proposal: Topical Projects for January 2019

11-29

Examining Your Presence on Twitter with Python

03-24

A visual analysis of jean pockets and their lack of practicality

08-16

Examining Your Presence on Twitter with Python

03-24

Examining Your Presence on Twitter with Python

03-24

Examining Your Presence on Twitter with Python

03-24

Examining Your Presence on Twitter with Python

03-24

If you did not already know

11-20

If you did not already know

08-14

Deconstruction with Discrete Embeddings

02-15

Generating Large Images from Latent Vectors - Part Two

06-02

Interactive Abstract Pattern Generation Javascript Demo

04-24

Generating Abstract Patterns with TensorFlow

03-25

Satellite imagery generation with Generative Adversarial Networks (GANs)

01-11

Generating data to explore the myriad causal effects that can be estimated in observational data analysis

11-20

Generative Adversarial Networks – Paper Reading Road Map

10-24

If you did not already know

09-26

Python Generators Tutorial

06-13

Transfer Your Font Style with GANs

03-13

Teaching Machines to Draw

05-19

Deconstruction with Discrete Embeddings

02-15

Hyper Networks

09-29

Generating Large Images from Latent Vectors - Part Two

06-02

Generating Abstract Patterns with TensorFlow

03-25

Satellite imagery generation with Generative Adversarial Networks (GANs)

01-11

Synthetic Data Generation: A must-have skill for new data scientists

12-27

If you did not already know

11-20

Document worth reading: “Generative Adversarial Nets for Information Retrieval: Fundamentals and Advances”

10-01

Morph, an open-source tool for data-driven art without code

09-26

If you did not already know

08-14

Transfer Your Font Style with GANs

03-13

Superresolution with semantic guide

08-09

Hyper Networks

09-29

The Frog of CIFAR 10

04-06

Generating Abstract Patterns with TensorFlow

03-25

Satellite imagery generation with Generative Adversarial Networks (GANs)

01-11

Synthetic Data Generation: A must-have skill for new data scientists

12-27

Generative Adversarial Networks – Paper Reading Road Map

10-24

If you did not already know

10-17

Document worth reading: “Generative Adversarial Nets for Information Retrieval: Fundamentals and Advances”

10-01

If you did not already know

09-26

Morph, an open-source tool for data-driven art without code

09-26

Distilled News

09-15

Document worth reading: “An Information-Theoretic Analysis of Deep Latent-Variable Models”

08-23

If you did not already know

08-14

Superresolution with semantic guide

08-09

Work in progress: Portraits of Imaginary People

06-06

Deep and Hierarchical Implicit Models

02-28

Deep Learning Research Review Week 1: Generative Adversarial Nets

09-30

Generating Large Images from Latent Vectors - Part Two

06-02

Deep Learning for Chatbots, Part 1 – Introduction

04-07

Generating Abstract Patterns with TensorFlow

03-25

Obtaining the number of components from cross validation of principal components regression

10-15

The Probability Monad and Why it's Important for Data Science

09-05

Akka Stream

03-25

Amazon Transcribe now supports real-time transcriptions

11-20

Analyze live video at scale in real time using Amazon Kinesis Video Streams and Amazon SageMaker

11-19

Top 5 Trends in Data Science

11-09

Use AWS DeepLens to give Amazon Alexa the power to detect objects via Alexa skills

10-17

Gensim Survey 2018

04-30

Akka Stream

05-06

Akka Stream

03-25

Whats new on arXiv

12-25

Statistics in Glaucoma: Part III

12-18

How we use emojis

10-15

Timings of a Grouped Rank Filter Task

08-23

Azure Functions for Data Science

08-06

Akka Stream

03-25

Crowdsourcing Fantasy Baseball Leagues

03-25

Crowdsourcing Fantasy Baseball Leagues

03-25

Do AIs dream of pwning FF leagues?

12-10

How easy is it to moneyball a fantasy football league draft?

10-28

Crowdsourcing Fantasy Baseball Leagues

03-25

Do AIs dream of pwning FF leagues?

12-10

Crowdsourcing Fantasy Baseball Leagues

03-25

Do AIs dream of pwning FF leagues?

12-10

How easy is it to moneyball a fantasy football league draft?

10-28

Crowdsourcing Fantasy Baseball Leagues

03-25

Feather: it's about metadata

04-26

Feather: A Fast On-Disk Format for Data Frames for R and Python, powered by Apache Arrow

03-29

Objects types and some useful R functions for beginners

12-24

Feather and Apache Arrow: Grokking file formats vs. in-memory representations

04-21

Feather: A Fast On-Disk Format for Data Frames for R and Python, powered by Apache Arrow

03-29

Feather: A Fast On-Disk Format for Data Frames for R and Python, powered by Apache Arrow

03-29

Becoming a Data Scientist Podcast Episode 08: Sebastian Raschka

03-29

Implementing Batch Normalization in Tensorflow

03-29

Implementing Batch Normalization in Tensorflow

03-29

If you did not already know

11-01

Preprocessing for Deep Learning: From covariance matrix to image whitening

10-10

Pivoted document length normalisation

06-19

Normal Distributions

05-14

Deep Learning without Backpropagation

03-21

Implementing Batch Normalization in Tensorflow

03-29

Representational Power of Deeper Layers

03-30

Generating Large Images from Latent Vectors

04-01

Overlapping Disks

09-30

Solar Eclipses

04-03

Solar Eclipses

04-03

Lagrange Points

08-21

Solar Eclipses

04-03

Solar Eclipses

04-03

Google Calendar should prevent spam by default

02-22

Solar Eclipses

04-03

Because it's Friday: Street Orientation

07-27

Lumpers and Splitters: Tensions in Taxonomies

04-05

Text Segmentation using Word Embeddings

10-16

RSiteCatalyst Version 1.4.8 Release Notes

04-04

Roll Your Own Federal Government Shutdown-caused SSL Certificate Expiration Monitor in R

01-10

So you want to play a pRank in R…?

12-18

Easy time-series prediction with R: a tutorial with air traffic data from Lux Airport

11-14

7 Awesome Things You Can Do in Dataiku Without Coding

11-02

Data types

05-08

Sakura blossoms in Japan

04-11

RSiteCatalyst Version 1.4.8 Release Notes

04-04

Sheffield University Life

04-05

Multilevel models for multiple comparisons! Varying treatment effects!

11-28

Sheffield University Life

04-05

Sheffield University Life

04-05

Cross-over study design with a major constraint

10-23

Naive Bayes Classifier: A Geometric Analysis of the Naivete. Part 1

09-07

Sequence Modeling with CTC

11-27

Python Matplotlib (pyplot), a step-by-step Tutorial

11-15

TensorFlow in a Nutshell — Part Three: All the Models

10-03

First Convergence Bias

04-11

Inverting a Neural Net

04-05

ML beyond Curve Fitting: An Intro to Causal Inference and do-Calculus

05-24

mixup: Data-Dependent Data Augmentation

11-02

Deriving the Softmax from First Principles

04-19

Bayesian Inference via Simulated Annealing

02-07

Inverting a Neural Net

04-05

Travis CI: "You Have Too Many Tests LOLZ!"

04-05

Automated and continuous deployment of Amazon SageMaker models with AWS Step Functions

01-10

The Role of the Data Engineer is Changing

01-10

RQuantLib 0.4.6: Updated upstream, and calls for help

11-25

RcppRedis 0.1.9

10-27

How R gets built on Windows

10-11

How R gets built on Windows

10-11

Joining ASAPP

09-09

Is Data Scientist a useless job title?

08-04

conda-forge and PyData's CentOS moment

04-20

Travis CI: "You Have Too Many Tests LOLZ!"

04-05

Build a serverless Twitter reader using AWS Fargate

12-06

Introducing cricpy:A python package to analyze performances of cricketers

10-28

shinytest – Automated testing for Shiny apps

10-18

Access Amazon S3 data managed by AWS Glue Data Catalog from Amazon SageMaker notebooks

08-29

Getting Started with Cloudera Data Science Workbench

05-08

Einstein's Spacetime

10-11

Travis CI: "You Have Too Many Tests LOLZ!"

04-05

Scalable multi-node training with TensorFlow

12-17

Build a serverless Twitter reader using AWS Fargate

12-06

shinytest – Automated testing for Shiny apps

10-18

What Does it Take to Train Deep Learning Models On-Device?

10-04

R Packages worth a look

09-03

World Models Experiments

06-09

Reduce GPU costs with startup scripts on the Google Cloud Engine

02-21

Setting up Jupyter for Deep Learning on EC2

02-15

Static Blog: Jekyll, Hyde and GitHub Pages

02-01

TensorFlow in a Nutshell — Part One: Basics

08-22

Travis CI: "You Have Too Many Tests LOLZ!"

04-05

Travis CI: "You Have Too Many Tests LOLZ!"

04-05

Zak David expresses critical views of some published research in empirical quantitative finance

12-24

Ronin: Data Engineer [San Mateo, CA]

12-03

The Future of AI is the Enterprise

11-30

Document worth reading: “A Tutorial on Modeling and Inference in Undirected Graphical Models for Hyperspectral Image Analysis”

11-09

The 3Ds of Machine Learning Systems Design

11-05

I’m an Analyst and the software engineers made fun of my code!

10-19

This New [AI] Software Constantly Improves – and that Makes all the Difference

09-21

Bad headlines distract from real AI problems

08-20

Distilled News

07-31

Software as an academic publication

05-03

JMP Publishes Exercises to Accompany Data Mining Techniques (3rd Edition)

05-31

Facts and Fallacies of Software Engineering - Book Review

02-11

GitHub's one-dimensional view of open source contributions

11-07

Adobe: Give Credit. You DID NOT Write RSiteCatalyst.

05-09

On Software Demos and Potemkin Villages

04-06

On Software Demos and Potemkin Villages

04-06

On Software Demos and Potemkin Villages

04-06

On Software Demos and Potemkin Villages

04-06

Practical Apache Spark in 10 Minutes

01-11

Top KDnuggets tweets, Oct 17-23: Machine Learning Cheat Sheets

10-24

Apache Spark Introduction for Beginners

10-18

Big Data Day Camp: Big Data Tools & Techniques (October 25-26)

10-04

Data Center Scale Computing and Artificial Intelligence with Matei Zaharia, Inventor of Apache Spark

09-12

Data Science in 30 Minutes: Holden Karau – A Quick Introduction to PySpark

06-13

How-to: Do Scalable Graph Analytics with Apache Spark

10-03

Solving Real-Life Mysteries with Big Data and Apache Spark

09-13

Genome Analysis Toolkit: Now Using Apache Spark for Data Processing

04-06

Genome Analysis Toolkit: Now Using Apache Spark for Data Processing

04-06

Do something for yourself in 2019

01-08

Top 5 Data Visualization Tools for 2019

01-03

Intuit: Staff Experimentation Data Scientist [Mountain View, CA]

12-12

Machine Learning. In conversation with Jelena Ilic, Senior Data Scientist at Mango Solutions

11-21

Technoslavia 2.5: Open Source Topography

11-07

EARL Houston: Interview with Hadley Wickham

11-05

Communicating results with R Markdown

11-01

R Packages worth a look

10-22

Limitations of “Limitations of Bayesian Leave-One-Out Cross-Validation for Model Selection”

10-12

R Packages worth a look

10-10

Things you should know when traveling via the Big Data Engineering hype-train

10-08

Distilled News

09-15

Distilled News

08-31

Distilled News

08-28

Magister Dixit

08-16

Introducing Python for data scientists - Pt1

03-15

Installing Python Packages from a Jupyter Notebook

12-05

Genome Analysis Toolkit: Now Using Apache Spark for Data Processing

04-06

The Frog of CIFAR 10

04-06

R Packages worth a look

01-11

Synthetic Data Generation: A must-have skill for new data scientists

12-27

R Packages worth a look

11-15

Python Generators Tutorial

06-13

Superresolution with semantic guide

08-09

Teaching Machines to Draw

05-19

Deconstruction with Discrete Embeddings

02-15

Hyper Networks

09-29

The Frog of CIFAR 10

04-06

Using httr to Detect HTTP(s) Redirects

11-06

Deep Learning for Chatbots, Part 2 – Implementing a Retrieval-Based Model in Tensorflow

07-04

Deep Learning for Chatbots, Part 1 – Introduction

04-07

Sorry I didn’t get that! How to understand what your users want

11-16

Document worth reading: “Neural Approaches to Conversational AI”

10-29

How I Used Deep Learning To Train A Chatbot To Talk Like Me (Sorta)

07-25

Deep Learning for Chatbots, Part 1 – Introduction

04-07

Sorry I didn’t get that! How to understand what your users want

11-16

Deep Learning for Chatbots, Part 1 – Introduction

04-07

Deep Learning for Chatbots, Part 1 – Introduction

04-07

If you did not already know

11-06

Learning in Brains and Machines (2): The Dogma of Sparsity

04-07

Learning in Brains and Machines (2): The Dogma of Sparsity

04-07

Learning in Brains and Machines (2): The Dogma of Sparsity

04-07

Becoming More Efficient

04-07

Becoming More Efficient

04-07

Becoming More Efficient

04-07

$ vs. votes

11-27

R Packages worth a look

10-28

Optimizing Split Sizes for Hadoop’s CombineFileInputFormat

05-09

Becoming More Efficient

04-07

Becoming More Efficient

04-07

Step by step Kaggle competition tutorial

04-10

Reflections on remote data science work

11-03

Summer of Data Science Goal-Setting

06-06

My Experience as a Freelance Data Scientist

01-07

Aligned Clock Hands

11-04

Step by step Kaggle competition tutorial

04-10

Eiffel Tower

04-12

Eiffel Tower

04-12

RFishBC CRAN Release

11-22

R Packages worth a look

10-24

Differentiable Dynamic Programs and SparseMAP Inference

05-15

Eiffel Tower

04-12

Where Will Your Country Stand in World War III?

04-12

GARCH and a rudimentary application to Vol Trading

12-03

Introduction to Learning to Trade with Reinforcement Learning

02-11

Quantitative Finance Resources

06-04

Where Will Your Country Stand in World War III?

04-12

Where Will Your Country Stand in World War III?

04-12

Likes Out! Guerilla Dataset!

10-09

Becoming a Data Scientist Podcast Episode 09: Justin Kiggins

04-12

Becoming a Data Scientist Podcast Episode 09: Justin Kiggins

04-12

Distilled News

11-12

R Packages worth a look

10-20

Distilled News

09-20

“Redshift View Materializer” Now on Github

04-14

What does it mean to write “vectorized” code in R?

01-04

R Packages worth a look

11-06

“Redshift View Materializer” Now on Github

04-14

R Packages worth a look

11-10

“Redshift View Materializer” Now on Github

04-14

Stereograms

11-26

Top October Stories: 9 Must-have skills you need to become a Data Scientist, updated; 10 Best Mobile Apps for Data Scientist / Data Analysts

11-09

If you did not already know

09-27

“Redshift View Materializer” Now on Github

04-14

Hands-on: Creating Neural Networks using Chainer

02-15

Introducing sparklyr, an R Interface for Apache Spark

09-30

Create a Chrome extension to modify a website’s HTML or CSS

04-14

Using gganimate to illustrate the luminance illusion

08-22

Create a Chrome extension to modify a website’s HTML or CSS

04-14

Create a Chrome extension to modify a website’s HTML or CSS

04-14

Custom JavaScript, CSS and HTML in Shiny

12-23

Create a Chrome extension to modify a website’s HTML or CSS

04-14

Exploring convolutional neural networks with DL4J

04-14

Semantic Segmentation algorithm is now available in Amazon SageMaker

11-28

Naive Bayes from Scratch using Python only – No Fancy Frameworks

10-25

Distributed Deep Learning on AZTK and HDInsight Spark Clusters

08-02

AWS Deep Learning AMIs now with optimized TensorFlow 1.9 and Apache MXNet 1.2 with Keras 2 support to accelerate deep learning on Amazon EC2 instances

07-23

Scalable multi-node deep learning training using GPUs in the AWS Cloud

07-20

Weekly Review: 11/11/2017

11-11

mixup: Data-Dependent Data Augmentation

11-02

How I Used Deep Learning To Train A Chatbot To Talk Like Me (Sorta)

07-25

Artificial Neural Networks Introduction (Part II)

11-03

Deep reinforcement learning, battleship

10-15

A fastText-based hybrid recommender

09-27

Exploring convolutional neural networks with DL4J

04-14

Exploring convolutional neural networks with DL4J

04-14

First 3rd party notebook for Databricks Community Edition

04-14

First 3rd party notebook for Databricks Community Edition

04-14

First 3rd party notebook for Databricks Community Edition

04-14

Rethinking Academic Data Sharing

05-15

Animate NBA shot events with Paper.js

06-08

First 3rd party notebook for Databricks Community Edition

04-14

dotify: Recommending Spotify Music Through Country Arithmetic

04-15

Distilled News

10-12

dotify: Recommending Spotify Music Through Country Arithmetic

04-15

If you did not already know

11-16

dotify: Recommending Spotify Music Through Country Arithmetic

04-15

Wesley Crushes Ratings

04-19

Podcast Episodes 0 to 3

08-13

Wesley Crushes Ratings

04-19

Who's at the Center of the Star Trek Universe?

07-22

Wesley Crushes Ratings

04-19

About a Curious Feature and Interpretation of Linear Regressions

10-29

Wesley Crushes Ratings

04-19

How-to: Use Impala and Kudu Together for Analytic Workloads

04-20

Scanning Office 365 documents

07-16

How-to: Use Impala and Kudu Together for Analytic Workloads

04-20

Understanding the maths of Computed Tomography (CT) scans

01-09

Scanning Office 365 documents

07-16

How-to: Use Impala and Kudu Together for Analytic Workloads

04-20

How-to: Use Impala and Kudu Together for Analytic Workloads

04-20

How Can Autonomous Drones Help the Energy and Utilities Industry?

10-23

Where will Artificial Intelligence come from?

04-20

Anticipating the next move in data science – my interview with Thomson Reuters

11-17

Our Favorite Spooky AI & Data Articles

10-30

Holy Grail of AI for Enterprise — Explainable AI

10-19

Your Guide to AI and Machine Learning at re:Invent 2018

09-27

Import AI:

07-16

Import AI:

07-09

Import AI

06-18

Import AI:

05-29

Where will Artificial Intelligence come from?

04-20

Document worth reading: “Small Sample Learning in Big Data Era”

12-14

If you did not already know

12-12

Document worth reading: “An Introduction to Probabilistic Programming”

11-12

Document worth reading: “Cogniculture: Towards a Better Human-Machine Co-evolution”

08-18

Deep Learning Dead-End?

09-17

Where will Artificial Intelligence come from?

04-20

RProtoBuf 0.4.13 (and 0.4.12)

11-03

cransays - Follow your R Package Journey to CRANterbury with our Dashboard!

10-11

conda-forge and PyData's CentOS moment

04-20

conda-forge and PyData's CentOS moment

04-20

Predicting Churn

04-21

10 years of playback history on Last.FM: "Just sit back and listen"

01-12

Co-localization analysis of fluorescence microscopy images

11-27

I fell out with tapply and in love with dplyr

10-15

Building a Linear Regression Model for Real World Problems, in R

08-14

A Practical Guide to the "Open-Source Machine Learning Masters"

02-03

QuantConnect – the only Game in Town

09-10

German Temperature Data

05-12

Predicting Churn

04-21

On receiving the Community Leadership Award at the NumFOCUS Summit 2018

11-11

Building a Linear Regression Model for Real World Problems, in R

08-14

A Practical Guide to the "Open-Source Machine Learning Masters"

02-03

Predicting Churn

04-21

If you did not already know

12-25

Predicting Churn

04-21

Predicting Churn

04-21

Feather and Apache Arrow: Grokking file formats vs. in-memory representations

04-21

Feather and Apache Arrow: Grokking file formats vs. in-memory representations

04-21

More on sigr

11-06

R tip: Make Your Results Clear with sigr

11-04

R tip: Make Your Results Clear with sigr

11-04

Rejoinder: the problem with conda-forge right now

04-21

Rejoinder: the problem with conda-forge right now

04-21

Rejoinder: the problem with conda-forge right now

04-21

Graphs and tables, tables and graphs

11-18

Multithreaded in the Wild

04-09

Rejoinder: the problem with conda-forge right now

04-21

Use your favorite Python library on PySpark cluster with Cloudera Data Science Workbench

04-26

Conda: Myths and Misconceptions

08-25

Rejoinder: the problem with conda-forge right now

04-21

AI and ML Futures 4: The Future of AI Meeting

04-22

AI and ML Futures 4: The Future of AI Meeting

04-22

AI and ML Futures 4: The Future of AI Meeting

04-22

University of Virginia: Faculty, Open Rank Model and Simulation at the Human-Technology Frontier [Charlottesville, VA]

12-24

The Semantic Web: Where is it now?

12-23

Distilled News

08-21

Ethical AI for Data Scientists

08-15

Legal Tech: How Can Lawyers Benefit?

08-13

The Impact of Bitcoin on the Insurance Industry

06-21

Questions on Artificial Intelligence

01-16

AI and ML Futures 4: The Future of AI Meeting

04-22

Interactive Abstract Pattern Generation Javascript Demo

04-24

Satellite imagery generation with Generative Adversarial Networks (GANs)

01-11

Synthetic Data Generation: A must-have skill for new data scientists

12-27

If you did not already know

12-15

If you did not already know

11-30

The new pqR parser, and R’s “else” problem

11-28

Generating data to explore the myriad causal effects that can be estimated in observational data analysis

11-20

Machine Reading Comprehension: Learning to Ask & Answer

10-11

Document worth reading: “Generative Adversarial Nets for Information Retrieval: Fundamentals and Advances”

10-01

If you did not already know

08-14

Python Generators Tutorial

06-13

Transfer Your Font Style with GANs

03-13

Superresolution with semantic guide

08-09

Deconstruction with Discrete Embeddings

02-15

Hyper Networks

09-29

Interactive Abstract Pattern Generation Javascript Demo

04-24

Document worth reading: “Deep learning in agriculture: A survey”

01-12

Whats new on arXiv

01-03

Will Julia Replace Python and R for Data Science?

12-26

If you did not already know

12-21

Document worth reading: “A Study and Comparison of Human and Deep Learning Recognition Performance Under Visual Distortions”

12-18

Data Scientist’s Dilemma – The Cold Start Problem

12-15

Common mistakes when carrying out machine learning and data science

12-06

Distilled News

12-02

Document worth reading: “Big Data and Fog Computing”

11-29

If you did not already know

11-25

A more systematic look at suppressed data by @ellis2013nz

11-17

UnitedHealth Group: Clinical Data Statistical Analyst – SQL SAS (Clinician Required) [Telecommute]

11-16

Distilled News

11-14

Import AI 119: How to benefit AI research in Africa; German politician calls for billions in spending to prevent country being left behind; and using deep learning to spot thefts

11-05

Now use Pipe mode with CSV datasets for faster training on Amazon SageMaker built-in algorithms

11-01

Bank of Canada: Data Scientist [Ottawa, Canada]

10-29

Data Science With R Course Series – Week 6

10-22

Accelerating Your Algorithms in Production [Webinar Replay]

10-16

Is it time to stop using sentinel values for null / "NA" values?

10-12

Document worth reading: “A Comparative Study on using Principle Component Analysis with Different Text Classifiers”

08-29

R Packages worth a look

08-23

Boost Computation Power and Speed with Snowflake

07-02

“If you deprive the robot of your intuition about cause and effect, you’re never going to communicate meaningfully.” – Pearl ’18

06-08

Performance metrics aren't everything

02-09

Counting Efficiently with Bounter pt. 2: CountMinSketch

01-31

How easy is it to moneyball a fantasy football league draft?

10-28

Native Hadoop file system (HDFS) connectivity in Python

01-03

Feather: it's about metadata

04-26

Metadata Enrichment is Essential to Realize the Value of Open Datasets

11-14

R Packages worth a look

07-31

Feather: it's about metadata

04-26

Useful External Resources

04-27

4 Myths of Big Data and 4 Ways to Improve with Deep Data

01-09

Word Morphing – an original idea

11-20

AzureR: R packages to control Azure services

11-08

AzureR: R packages to control Azure services

11-08

Document worth reading: “Resource Management in Fog/Edge Computing: A Survey”

10-30

Distilled News

09-19

If you did not already know

09-11

The Role of Resources in Data Analysis

06-18

New in Cloudera Data Science Workbench 1.2: Usage Monitoring for Administrators

10-31

Useful External Resources

04-27

Monash University: Lecturer/Sr Lecturer – Digital Health [Melbourne, Australia]

11-22

Semantic Interoperability: Are you training your AI by mixing data sources that look the same but aren’t?

10-09

Shifting Causes of Death

10-02

Some clues that this study has big big problems

08-29

Data Analysis, NHS and Industrial Partners

04-28

Data Analysis, NHS and Industrial Partners

04-28

Rolling and Unrolling RNNs

04-28

Rolling and Unrolling RNNs

04-28

R Packages worth a look

10-26

Rolling and Unrolling RNNs

04-28

Purr yourself into a math genius

01-03

Baseball Card Collecting

04-29

One Recipe Step to Rule Them All

12-03

RTutor: Driving Electric or Gasoline Cars? Comparing the Pollution Damages

11-21

Google, Microsoft & Fraunhofer at the First European Edition of Deep Learning World – 12 Nov, 2018

10-23

Python List Comprehension + Set + Dict Comprehension

11-16

Understanding how Deep Learning learns to play SET®

10-12

Model AUC depends on test set difficulty

03-19

Topological Data Analysis - Persistent Homology

02-22

Similarity via Jaccard Index

02-07

Data Readiness Levels: Turning Data from Palid to Vivid

01-12

Baseball Card Collecting

04-29

Baseball Card Collecting

04-29

Becoming a Data Scientist Podcast Episode 10: Trey Causey

05-01

A wild dataset has appeared! Now what?

05-02

A wild dataset has appeared! Now what?

05-02

A wild dataset has appeared! Now what?

05-02

Magister Dixit

01-02

R Packages worth a look

12-23

A wild dataset has appeared! Now what?

05-02

Similar pages for Wikipedia

05-03

Similar pages for Wikipedia

05-03

Similar pages for Wikipedia

05-03

White House launches workshops to prepare for Artificial Intelligence

05-04

White House launches workshops to prepare for Artificial Intelligence

05-04

Document worth reading: “Artificial Intelligence and Robotics”

09-04

A crystal clear book draw

06-01

Podcast Special Episode 2 – The Future of AI with Dr. Ed Felten

12-29

White House launches workshops to prepare for Artificial Intelligence

05-04

The structure of Mafia syndacates

05-04

The structure of Mafia syndacates

05-04

The structure of Mafia syndacates

05-04

The structure of Mafia syndacates

05-04

Google's NHS deal does not bode well for the future of data-sharing

05-05

10 Companies to Work with After a Data Science Course

01-10

The cold start problem: how to build your machine learning portfolio

01-04

Advanced News API search: leveraging DBpedia entity types

12-11

Magister Dixit

12-01

How to Build a Machine Learning Team When You Are Not Google or Facebook

11-28

Telling Truth from Hype When Hunting for Data Science Work

11-05

Introduction to Deep Learning with Keras

10-29

Distilled News

10-01

How to Implement AI-First Business Models at Scale

09-21

Vulcan Post: This AI Startup Is Run By The World’s Top Data Scientists – Lets Anyone Build Predictive Models

09-04

Bad headlines distract from real AI problems

08-20

“Seeding trials”: medical marketing disguised as science

08-01

Model Updates: Entity-level Sentiment Analysis and Brand New Entity Extraction Models Now Live in the Text Analysis API

07-17

Similarity in the Wild

02-19

Your First Job

11-15

Google's NHS deal does not bode well for the future of data-sharing

05-05

Google's NHS deal does not bode well for the future of data-sharing

05-05

AdaSearch: A Successive Elimination Approach to Adaptive Search

11-14

Spooky! Gravedigger in R

10-31

Tidyverse 'Starts_with' in M/Power Query

10-08

Keynote at EuroPython 2018 on “Citizen Science”

07-27

Akka Stream

05-06

Akka Stream

05-06

Akka Stream

05-06

Single Neuron Gradient Descent

05-06

Recurrent Neural Network Gradients, and Lessons Learned Therein

10-18

Gradientes de Recurrent Neural Networks y Lo Que Aprendí Derivándolos

10-18

Single Neuron Gradient Descent

05-06

Single Neuron Gradient Descent

05-06

Single Neuron Gradient Descent

05-06

Single Neuron Gradient Descent

05-06

Document worth reading: “A Taxonomy for Neural Memory Networks”

09-13

Neural Network Evolution Playground with Backprop NEAT

05-07

Building Surveillance System Using USB Camera and Wireless-Connected Raspberry Pi

11-06

Neural Network Evolution Playground with Backprop NEAT

05-07

Neural Network Evolution Playground with Backprop NEAT

05-07

Neural Network Evolution Playground with Backprop NEAT

05-07

Math in Data Science

11-30

Additional Strategies for Confronting the Partition Function

10-30

A Thorough Introduction to Boltzmann Machines

10-20

Integration method to map model scores to conversion rates from example data

03-04

Martingales

10-20

Normal Distributions

05-14

Deriving the Softmax from First Principles

04-19

Random-Walk Bayesian Deep Networks: Dealing with Non-Stationary Data

03-14

Bayesian Linear Regression (in PyMC) - a different way to think about regression

02-09

Streaming Log-sum-exp Computation

05-08

Streaming Log-sum-exp Computation

05-08

Streaming Log-sum-exp Computation

05-08

Streaming Log-sum-exp Computation

05-08

R Packages worth a look

12-23

R Packages worth a look

11-16

Persistent Homology (Part 5)

02-26

Streaming Log-sum-exp Computation

05-08

Future of AI 5: The Singularians

05-09

Future of AI 5: The Singularians

05-09

Future of AI 5: The Singularians

05-09

Do something for yourself in 2019

01-08

Day 11 – little helper trim

12-11

Day 01 – little helper checkdir

12-01

Conway’s Game of Life in R: Or On the Importance of Vectorizing Your R Code

10-28

Conway’s Game of Life in R: Or On the Importance of Vectorizing Your R Code

10-28

What to think about this new study which says that you should limit your alcohol to 5 drinks a week?

10-23

Future of AI 5: The Singularians

05-09

Deep Learning Dead-End?

09-17

Future of AI 6. Discussion of 'Superintelligence: Paths, Dangers, Strategies'

05-09

Future of AI 6. Discussion of 'Superintelligence: Paths, Dangers, Strategies'

05-09

Future of AI 6. Discussion of 'Superintelligence: Paths, Dangers, Strategies'

05-09

A Guide to Decision Trees for Machine Learning and Data Science

12-24

Building Blocks of Decision Tree

11-26

R Packages worth a look

09-21

Optimizing Split Sizes for Hadoop’s CombineFileInputFormat

05-09

Le Monde puzzle [#1076]

12-26

Optimizing Split Sizes for Hadoop’s CombineFileInputFormat

05-09

Optimizing Split Sizes for Hadoop’s CombineFileInputFormat

05-09

Bisecting a triangular cake

05-09

Bisecting a triangular cake

05-09

Bisecting a triangular cake

05-09

Bisecting a triangular cake

05-09

Self-Service Adobe Analytics Data Feeds!

03-03

Adobe: Give Credit. You DID NOT Write RSiteCatalyst.

05-09

Adobe: Give Credit. You DID NOT Write RSiteCatalyst.

05-09

Biggest Deep Learning Summit – Special KDnuggets Offer

01-10

World’s Biggest Deep Learning Summit 3 weeks away

12-27

Free ebook: Exploring Data with python

11-29

Join AI experts from Google Brain, Open AI & Uber AI Labs in San Francisco

11-01

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: October and Beyond

10-03

Adobe: Give Credit. You DID NOT Write RSiteCatalyst.

05-09

Easier data analysis in Python with pandas (video series)

05-10

Introduction to Pandas, NumPy and RegEx in Python

12-17

Smartly select and mutate data frame columns, using dict

12-09

Using a Keras Long Short-Term Memory (LSTM) Model to Predict Stock Prices

11-21

Make Beautiful Tables with the Formattable Package

11-15

7 Awesome Things You Can Do in Dataiku Without Coding

11-02

Scatterplot matrices (pair plots) with cdata and ggplot2

10-28

Scatterplot matrices (pair plots) with cdata and ggplot2

10-28

Faceted Graphs with cdata and ggplot2

10-21

Faceted Graphs with cdata and ggplot2

10-21

Hitchhiker's guide to Exploratory Data Analysis

10-12

A Better Example of the Confused By The Environment Issue

09-25

A Subtle Flaw in Some Popular R NSE Interfaces

09-24

Python Pandas Tutorial: The Basics

11-23

TensorFlow in a Nutshell — Part Two: Hybrid Learning

09-13

Building a Data Science Portfolio: Storytelling with Data

06-30

Easier data analysis in Python with pandas (video series)

05-10

Easier data analysis in Python with pandas (video series)

05-10

Top Stories, Dec 3-9: Common mistakes when carrying out machine learning and data science; AI, Data Science, Analytics Main Developments in 2018 and Key Trends for 2019

12-10

R Packages worth a look

11-14

Learn the top things to look for in an AI Vendor

10-12

Distilled News

10-08

Easier data analysis in Python with pandas (video series)

05-10

Bisecting an arbitrary triangular cake

05-11

The Netflix Data War

12-19

Latour Sokal NYT

12-07

Cornell prof (but not the pizzagate guy!) has one quick trick to getting 1700 peer reviewed publications on your CV

11-04

R Packages worth a look

10-26

What to think about this new study which says that you should limit your alcohol to 5 drinks a week?

10-23

Beyond text: How Spokata uses Amazon Polly to make news and information universally accessible as real-time audio

10-04

Distill Update 2018

08-14

Data Makes Possible Many Things: Insights Discovery, Innovation, and Better Decisions

08-01

Why Lies Spread Faster than the Truth

05-24

Weekly Review: 11/18/2017

11-18

Cake cutting part 3

04-10

Bisecting an arbitrary triangular cake

05-11

Bisecting an arbitrary triangular cake

05-11

Bisecting an arbitrary triangular cake

05-11

German Temperature Data

05-12

German Temperature Data

05-12

Meanshift Algorithm for the Rest of Us (Python)

05-14

If you did not already know

10-25

Meanshift Algorithm for the Rest of Us (Python)

05-14

Meanshift Algorithm for the Rest of Us (Python)

05-14

If you did not already know

08-01

Hands-on: Creating Neural Networks using Chainer

02-15

Meanshift Algorithm for the Rest of Us (Python)

05-14

Meanshift Algorithm for the Rest of Us (Python)

05-14

Multilevel data collection and analysis for weight training (with R code)

09-22

Sequence prediction using recurrent neural networks(LSTM) with TensorFlow

05-14

Naive Bayes Classifier: A Geometric Analysis of the Naivete. Part 1

09-07

Collecting Expressions in R

08-05

My notes on (Liang et al., 2017): Generalization and the Fisher-Rao norm

01-25

mixup: Data-Dependent Data Augmentation

11-02

What is an Interaction Effect?

02-25

Summing the Fibonacci Sequence

07-24

Why I’m Not a Fan of R-Squared

07-24

Sequence prediction using recurrent neural networks(LSTM) with TensorFlow

05-14

Sequence prediction using recurrent neural networks(LSTM) with TensorFlow

05-14

Vanilla Neural Nets

05-16

R Packages worth a look

11-24

Vanilla Neural Nets

05-16

Nemirovski’s acceleration

01-09

R Packages worth a look

01-06

If you did not already know

01-06

Whats new on arXiv

12-30

Whats new on arXiv

12-04

Document worth reading: “A Tutorial on Bayesian Optimization”

12-01

Whats new on arXiv

11-29

R Packages worth a look

11-16

The Intuitions Behind Bayesian Optimization with Gaussian Processes

10-19

Machine learning — Is the emperor wearing clothes?

10-12

Whats new on arXiv

10-09

Document worth reading: “Physically optimizing inference”

09-29

If you did not already know

09-28

If you did not already know

09-04

Whats new on arXiv

08-10

Whats new on arXiv

08-04

Differentiable Image Parameterizations

07-25

Do Bayesians Overfit?

07-11

k-server, part 2: continuous time mirror descent

12-20

Algorithms, Machine Learning, and Optimization: we are hiring!

11-12

Feature Visualization

11-07

Bayesian Inference via Simulated Annealing

02-07

Learning to Rank Sketchfab Models with LightFM

11-07

Paper: A Differentiable Physics Engine for Deep Learning in Robotics

11-03

Vanilla Neural Nets

05-16

WPI: Research Scientist [Worcester, MA]

11-30

WPI: Post-Doctoral Fellow [Worcester, MA]

11-21

“On the Diagramatic Diagnosis of Data” at BudapestBI 2018

11-16

Exploring college major and income: a live data analysis in R

10-16

Choose Your Own Adventure – Analytics On-boarding

10-15

Import AI 114: Synthetic images take a big leap forward with BigGANs; US lawmakers call for national AI strategy; researchers probe language reasoning via HotspotQA

10-01

Vanilla Neural Nets

05-16

Top 8 resources for learning data analysis with pandas

05-16

5½ Reasons to Ditch Spreadsheets for Data Science: Code is Poetry

12-10

The evolution of pace in popular movies

11-24

Use R with Excel: Importing and Exporting Data

10-17

Top 8 resources for learning data analysis with pandas

05-16

Top 8 resources for learning data analysis with pandas

05-16

Q & A with Meta Brown

05-18

XmR Chart | Step-by-Step Guide by Hand and with R

01-13

R Packages worth a look

08-30

Because it's Friday: One Million Integers

08-24

Q & A with Meta Brown

05-18

Rcpp now used by 1500 CRAN packages

11-15

GitHub Streak: Round Five

10-13

IP string to integer conversion with Rcpp

05-19

Podcast Listens Analysis

10-02

IP string to integer conversion with Rcpp

05-19

IP string to integer conversion with Rcpp

05-19

IP string to integer conversion with Rcpp

05-19

IP string to integer conversion with Rcpp

05-19

Build your own Deep Learning Box

05-19

OpenCPU 2.1 Release: Scalable R Services

11-22

Build your own Deep Learning Box

05-19

.new_item for python lists

05-22

High-performance mathematical paradigms in Python

11-22

Python List Comprehension + Set + Dict Comprehension

11-16

From Python Hero to Java Rockstar

06-30

Scrape Tweets from Twitter using Python and Tweepy

02-24

.new_item for python lists

05-22

Introducing RcppDynProg

12-31

Introducing RcppDynProg

12-31

Anomaly detection on Amazon DynamoDB Streams using the Amazon SageMaker Random Cut Forest algorithm

12-05

How to Create a Simple Neural Network in Python

10-02

Create video subtitles with translation using machine learning

08-10

Handling Imbalanced Classes in the Dataset

08-03

Two cool features of Python NumPy: Mutating by slicing and Broadcasting

03-17

Python Pandas Tutorial: The Basics

11-23

.new_item for python lists

05-22

.new_item for python lists

05-22

.new_item for python lists

05-22

Piano Keyboards

05-22

Piano Keyboards

05-22

Unevenly Spaced Data

09-26

Piano Keyboards

05-22

My R Take in Advent of Code – Day 5

01-03

Recurrent Neural Networks for Beginners

08-13

Piano Keyboards

05-22

Tutorial: An app in R shiny visualizing biopsy data —  in a pharmaceutical company

01-07

Objects types and some useful R functions for beginners

12-24

Using the tidyverse for more than data manipulation: estimating pi with Monte Carlo methods

12-21

Whats new on arXiv

12-20

Classifying yin and yang using MRI

12-18

Learning R: A gentle introduction to higher-order functions

12-14

Sharing Modeling Pipelines in R

12-11

Sharing Modeling Pipelines in R

12-11

Tidyverse 'Starts_with' in M/Power Query

10-08

A Cookbook for Machine Learning: Vol 1

11-16

From Analytical to Numerical to Universal Solutions

03-20

Learning in Brains and Machines (4): Episodic and Interactive Memory

07-24

Why Scala?

07-17

A tour of Factor: 1

05-23

Rev Summit for Data Science Leaders featuring Daniel Kahneman

01-07

Math for Machine Learning

01-04

Seeing the wood for the trees

01-01

Distilled News

12-30

Manning Countdown to 2019 – Big Deals on AI, Data Science, Machine Learning books and videos

12-28

Using the tidyverse for more than data manipulation: estimating pi with Monte Carlo methods

12-21

Exploring the Data Jungle Free eBook

12-18

Easy CI/CD of GPU applications on Google Cloud including bare-metal using Gitlab and Kubernetes

12-14

Four Real-Life Machine Learning Use Cases

12-13

R community update: announcing sessions for useR Delhi December meetup

12-13

Math for Machine Learning

12-10

DATAx Presents: AI AND MACHINE LEARNING TRENDS IN 2019

12-06

The Quick Python Book

12-05

Introducing the First AI / Machine Learning Course With a Job Guarantee

11-30

Free ebook: Exploring Data with python

11-29

The Evolution of Build Engineering in Managing Open Source [Webinar Replay]

11-13

Healthcare Analytics Made Simple

11-12

Webinar: Transform Your Stagnant Data Swamp into a Pristine Data Lake, Nov 8

11-01

Tomorrow, Nov 8 Webinar: Transform Your Stagnant Data Swamp into a Pristine Data Lake

11-01

Webinar – Integrate AI Across Insurance Operations to Turbocharge Tech Transformation, Nov 14

10-31

How to Mitigate Open Source License Risks

10-30

The Definitive Guide to AI’s “Black Box” Problem

10-17

Proof that 1/7 is a repeated decimal

10-05

DevOps 2.0: Applying Machine Learning in the CI/CD Chain

10-02

Cool postdoc position in Arizona on forestry forecasting using tree ring models!

10-02

Connected Arms – Can AI Revolutionize Prosthetic Devices & Make them More Affordable?

09-07

Using gganimate to illustrate the luminance illusion

08-22

Distilled News

08-09

Data Science and Python

03-29

Movie Genre Ratings - Addendum

02-24

Once Again: Prefer Confidence Intervals to Point Estimates

10-30

WordPress to Jekyll: A 30x Speedup

10-10

Adobe Analytics Clickstream Data Feed: Calculations and Outlier Analysis

05-24

How to Meet Your New Years Resolutions in 2019 (Udemy Coupons $9.99)

01-01

Nimble tweak to use specific python version or virtual environment in RStudio

01-01

Scraping the Turkey Accordion

12-12

“A Guide to Working With Census Data in R” is now Complete!

11-05

2 Quick Announcements

07-25

Blog has migrated from Ghost to Jekyll

08-11

Adobe Analytics Clickstream Data Feed: Calculations and Outlier Analysis

05-24

Adobe Analytics Clickstream Data Feed: Calculations and Outlier Analysis

05-24

Adobe Analytics Clickstream Data Feed: Calculations and Outlier Analysis

05-24

R Packages worth a look

12-19

Amazon SageMaker Automatic Model Tuning now supports early stopping of training jobs

12-13

Amazon SageMaker Automatic Model Tuning becomes more efficient with warm start of hyperparameter tuning jobs

11-19

Amazon SageMaker automatic model tuning produces better models, faster

09-25

Deep Learning Vendor Update: Hyperparameter Tuning Systems

06-29

Automated machine learning is coming... and it won't matter

04-04

Hyperparameter optimization with approximate gradient

05-24

Hyperparameter optimization with approximate gradient

05-24

Document worth reading: “Searching Toward Pareto-Optimal Device-Aware Neural Architectures”

01-05

Document worth reading: “Physically optimizing inference”

09-29

If you did not already know

09-29

Whats new on arXiv

09-17

Whats new on arXiv

08-10

Whats new on arXiv

08-04

Bayesian Inference via Simulated Annealing

02-07

Paper: A Differentiable Physics Engine for Deep Learning in Robotics

11-03

Learning Reinforcement Learning (with Code, Exercises and Solutions)

10-02

Hyperparameter optimization with approximate gradient

05-24

Should we be concerned about MRP estimates being used in later analyses? Maybe. I recommend checking using fake-data simulation.

12-09

Whats new on arXiv

11-30

Horses for courses, or to each model its own (causal effect)

11-28

How to de-Bias Standard Deviation Estimates

11-12

How to de-Bias Standard Deviation Estimates

11-12

R Packages worth a look

10-12

Quick Significance Calculations for A/B Tests in R

10-06

Don’t calculate post-hoc power using observed estimate of effect size

09-24

Crazy Progress Bars

01-31

Maximum Likelihood estimates follow a normal distribution

05-24

Maximum Likelihood estimates follow a normal distribution

05-24

How to make a good data-driven web app

05-25

How to make a good data-driven web app

05-25

What to Consider When Choosing Colors for Data Visualization

08-22

Using Xcode with Github

05-25

Using Xcode with Github

05-25

Top KDnuggets tweets, Dec 19 – Jan 1: Deep Learning Cheat Sheets

01-02

An R Shiny app to recognize flower species

12-17

RTutor: Better Incentive Contracts For Road Construction

12-13

Deep Learning Cheat Sheets

11-28

GitHub Streak: Round Five

10-13

Microsoft R Open 3.5.1 now available

08-14

Announcement

04-27

GitHub's one-dimensional view of open source contributions

11-07

Using Xcode with Github

05-25

Using Xcode with Github

05-25

Should we be concerned about MRP estimates being used in later analyses? Maybe. I recommend checking using fake-data simulation.

12-09

R Packages worth a look

12-08

Whats new on arXiv

11-30

R Packages worth a look

11-30

Horses for courses, or to each model its own (causal effect)

11-28

R Packages worth a look

11-24

More on Bias Corrected Standard Deviation Estimates

11-14

How to de-Bias Standard Deviation Estimates

11-12

How to de-Bias Standard Deviation Estimates

11-12

If you did not already know

10-23

R Packages worth a look

10-20

Whats new on arXiv

10-19

R Packages worth a look

10-12

R Packages worth a look

09-06

Response to Rafa: Why I don’t think ROC [receiver operating characteristic] works as a model for science

08-05

R Packages worth a look

08-05

Sensor Fusion Tutorial

10-05

Binary Stochastic Neurons in Tensorflow

09-24

Talk: Building Machines that Imagine and Reason

07-28

Blending independent estimates

05-25

R Packages worth a look

11-30

Horses for courses, or to each model its own (causal effect)

11-28

Matching (and discarding non-matches) to deal with lack of complete overlap, then regression to adjust for imbalance between treatment and control groups

11-10

Whats new on arXiv

10-19

Quick Significance Calculations for A/B Tests in R

10-06

R Packages worth a look

09-06

Integration method to map model scores to conversion rates from example data

03-04

Intercausal Reasoning in Bayesian Networks

03-13

Crazy Progress Bars

01-31

Binary Stochastic Neurons in Tensorflow

09-24

Blending independent estimates

05-25

R Packages worth a look

11-26

Blending independent estimates

05-25

Finding Similar Sounding Names – Some Basics

05-26

Data Notes: Impact of Game of Thrones on US Baby Names

11-15

NPR Sunday Puzzle Solving, And Other Baby Name Questions

10-02

Finding Similar Sounding Names – Some Basics

05-26

Timing the Same Algorithm in R, Python, and C++

01-06

Timing the Same Algorithm in R, Python, and C++

01-06

Day 09 – little helper object_size_in_env

12-09

Book Review – Sound Analysis and Synthesis with R

11-03

Finding Similar Sounding Names – Some Basics

05-26

Finding Similar Sounding Names – Some Basics

05-26

Using RSiteCatalyst With Microsoft PowerBI Desktop

03-13

A tour of Factor: 2

05-27

A tour of Factor: 2

05-27

Data Trusts

05-29

Data Trusts

05-29

Does Sharing Goals Help or Hurt Your Chances of Success?

10-22

Data Trusts

05-29

How to Find an Entry-Level Job in Data Science

11-13

A Gentle Introduction to Recommender Systems with Implicit Feedback

05-30

An Overview of Recommendation Systems

05-23

A Gentle Introduction to Recommender Systems with Implicit Feedback

05-30

A Gentle Introduction to Recommender Systems with Implicit Feedback

05-30

Concurrent bloom filters

05-30

Concurrent bloom filters

05-30

Concurrent bloom filters

05-30

‘data:’ Scraping & Chart Reproduction : Arrows of Environmental Destruction

01-03

Does imputing model labels using the model predictions can improve it’s performance?

12-21

Automated Dashboard Visualizations with Ranking in R

12-07

R Packages worth a look

12-01

On “Competition” in the R Ecosystem

09-15

Forecast double seasonal time series with multiple linear regression in R

12-03

Concurrent bloom filters

05-30

Visualize the Business Value of your Predictive Models with modelplotr

11-03

ITWire: VIDEO Interview with a DataRobot: Greg Michaelson talks AI, banking, machine learning and more

10-24

Evaluating the Business Value of Predictive Models in Python and R

10-11

Here’s How to Survive the Rise of A.I. – Become a Data Facilitator

07-03

The Impact of Bitcoin on the Insurance Industry

06-21

Big Data Technology Trends in Banking

06-24

Assorted links

05-30

ITWire: VIDEO Interview with a DataRobot: Greg Michaelson talks AI, banking, machine learning and more

10-24

Here’s How to Survive the Rise of A.I. – Become a Data Facilitator

07-03

The Impact of Bitcoin on the Insurance Industry

06-21

Big Data Technology Trends in Banking

06-24

Assorted links

05-30

Think Twice Before You Accept That Fancy Data Science Job

12-19

InformationAge: Will 2019 See the Automation of Automation and Push Up Salaries of Data Scientists?

12-11

Should you become a data scientist?

12-10

Document worth reading: “Advice from the Oracle: Really Intelligent Information Retrieval”

11-10

What does a data scientist REALLY look like?

11-09

Top 10 Mistakes to Avoid to Master Data Science

10-10

Here is How You Can build a data science team from scratch in 2018 - The Definitive Guide

10-05

Top 10 Mistakes to Avoid to Master Data Science

10-04

Some thoughts after reading “Bad Blood: Secrets and Lies in a Silicon Valley Startup”

08-09

How to think about an accelerating string of research successes?

07-26

Defining data science in 2018

07-22

Data Science in 30 Minutes: Holden Karau – A Quick Introduction to PySpark

06-13

Data Science in 30 Minutes: Deep Learning to Detect Fake News with Uber ATG Head of Data Science, Mike Tamir

05-30

Ten Ways Your Data Project is Going to Fail

11-01

“Becoming a Data Scientist” Survey Results 1: Jobs & Education

08-22

Assorted links

05-30

Document worth reading: “Are screening methods useful in feature selection? An empirical study”

12-18

Classifying yin and yang using MRI

12-18

If you did not already know

12-15

“She also observed that results from smaller studies conducted by NGOs – often pilot studies – would often look promising. But when governments tried to implement scaled-up versions of those programs, their performance would drop considerably.”

11-22

Facial feedback: “These findings suggest that minute differences in the experimental protocol might lead to theoretically meaningful changes in the outcomes.”

11-01

What to think about this new study which says that you should limit your alcohol to 5 drinks a week?

10-23

How many deaths were caused by the hurricane in Puerto Rico?

09-14

What if a big study is done and nobody reports it?

09-10

Document worth reading: “PMLB: A Large Benchmark Suite for Machine Learning Evaluation and Comparison”

08-31

Some clues that this study has big big problems

08-29

When anyone claims 80% power, I’m skeptical.

08-24

Let’s be open about the evidence for the benefits of open science

08-06

Amelia, it was just a false alarm

07-31

Millions of social bots invaded Twitter!

03-14

Twitter, Social Bots, and the US Presidential Elections!

11-07

Assorted links

05-30

“Tweeking”: The big problem is not where you think it is.

09-23

Becoming a Data Scientist Podcast Episode 11: Stephanie Rivera

05-31

Deep Reinforcement Learning: Pong from Pixels

05-31

How Different are Conventional Programming and Machine Learning?

12-10

Quantum Machine Learning: A look at myths, realities, and future projections

11-05

AI Masterpieces: But is it Art?

10-27

The Trillion Dollar Question

08-09

Moravec's Paradox

01-31

Deep Reinforcement Learning: Pong from Pixels

05-31

TensorFlow Implementation of "A Neural Algorithm of Artistic Style"

05-31

TensorFlow Implementation of "A Neural Algorithm of Artistic Style"

05-31

The SIAM Book Series on Data Science

01-11

NYU Stern Fubon Center for Technology, Business and Innovation: Fubon Center Faculty Fellow [New York, NY]

01-08

Document worth reading: “Recent Advances in Deep Learning: An Overview”

01-08

On deck for the first half of 2019

01-07

Document worth reading: “Neural Style Transfer: A Review”

01-02

Document worth reading: “A Survey: Non-Orthogonal Multiple Access with Compressed Sensing Multiuser Detection for mMTC”

12-31

Center for Ultrasound Research and Translation, Massachusetts General Hospital: Post-Doctoral Scholar / Research Scientist [Boston, MA]

12-31

Import AI 127: Why language AI advancements may make Google more competitive; COCO image captioning systems don’t live up to the hype, and Amazon sees 3X growth in voice shopping via Alexa

12-31

University of Virginia: Faculty, Open Rank Model and Simulation at the Human-Technology Frontier [Charlottesville, VA]

12-24

Zak David expresses critical views of some published research in empirical quantitative finance

12-24

Highlights of 2018

12-18

Distilled News

12-09

Document worth reading: “Marketing Analytics: Methods, Practice, Implementation, and Links to Other Fields”

12-07

Why Primary Research?

12-04

Import AI: 123: Facebook sees demands for deep learning services in its data centers grow by 3.5X; why advanced AI might require a global policeforce; and diagnosing natural disasters with deep learning

12-03

Monash University: Research Fellow (Bioinformatics) [Melbourne, Australia]

12-03

Introducing medical language processing with Amazon Comprehend Medical

11-27

Import AI: 122: Google obtains new ImageNet state-of-the-art with GPipe; drone learns to land more effectively than PD controller policy; and Facebook releases its ‘CherryPi’ StarCraft bot

11-26

Import AI 121: Sony researchers make ultra-fast ImageNet training breakthrough; Berkeley researchers tackle StarCraft II with modular RL system; and Germany adds €3bn for AI research

11-19

The State of the Art

11-15

Visualization research for non-researchers

11-12

Distilled News

11-11

My two talks in Austria next week, on two of your favorite topics!

11-02

American Association of Colleges of Osteopathic Medicine: Data Analyst [Bethesda, Maryland]

10-29

U. of Zurich: Professorship in Big Data Science (Open Rank) [Zurich, Switzerland]

10-24

U. of Zurich: Assistant Professorship in AI and Machine Learning (Non-tenure Track) [Zurich, Switzerland]

10-24

Document worth reading: “A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress”

10-15

All About Open Source

10-09

Import AI 114: Synthetic images take a big leap forward with BigGANs; US lawmakers call for national AI strategy; researchers probe language reasoning via HotspotQA

10-01

If you did not already know

09-30

(People are missing the point on Wansink, so) what’s the lesson we should be drawing from this story?

09-27

“Tweeking”: The big problem is not where you think it is.

09-23

Distilled News

09-12

Distilled News

08-31

Some clues that this study has big big problems

08-29

The scandal isn’t what’s retracted, the scandal is what’s not retracted.

08-21

The scandal isn’t what’s retracted, the scandal is what’s not retracted.

08-21

Document worth reading: “A Survey on Resilient Machine Learning”

08-19

Let’s get hysterical

08-19

Let’s get hysterical

08-19

Discussion of the value of a mathematical model for the dissemination of propaganda

08-11

Discussion of the value of a mathematical model for the dissemination of propaganda

08-11

Document worth reading: “Model-free, Model-based, and General Intelligence”

08-10

Download 3 million Russian troll tweets

08-02

Import AI:

07-23

Import AI:

07-16

Import AI:

07-09

A Real World Reinforcement Learning Research Program

07-06

Open Source Datasets with Kaggle

06-21

Import AI

06-18

Import AI

06-05

When the bubble bursts…

06-04

AI and Deep Learning in 2017 – A Year in Review

12-31

Announcing Elemetric

06-23

Research Debt

03-22

Engineering is the bottleneck in (Deep Learning) Research

01-17

NIPS 2016 Workshop on Approximate Inference

09-30

A Survival Guide to a PhD

09-07

Deep Learning Trends @ ICLR 2016

06-01

Whats new on arXiv

12-14

If you did not already know

12-13

Join AI experts from Google Brain, Open AI & Uber AI Labs in San Francisco

11-01

Whats new on arXiv

10-30

Whats new on arXiv

10-22

Whats new on arXiv

10-22

Whats new on arXiv

09-19

Whats new on arXiv

08-21

Whats new on arXiv

08-13

An Updated Review of The Data Incubator Data Science Bootcamp

05-29

Deep Learning Trends @ ICLR 2016

06-01

Data trusts could allay our privacy fears

06-03

Data trusts could allay our privacy fears

06-03

Data trusts could allay our privacy fears

06-03

Data trusts could allay our privacy fears

06-03

Using the Economics Value Curve to Drive Digital Transformation

12-27

A couple more papers on genetic diversity as an explanation for why Africa and remote Andean countries are so poor while Europe and North America are so wealthy

09-19

Is it really true that babies should sleep on their backs?

07-31

The persistence of bad reporting and the reluctance of people to criticize it

07-12

Assorted links

08-12

Quantitative Finance Resources

06-04

How to work with strings in base R – An overview of 20+ methods for daily use.

11-24

Quantitative Finance Resources

06-04

Quantitative Finance Resources

06-04

Vanderbilt University: Sr Lecturer in Data and Analytics [Nashville, TN]

11-05

Vanderbilt University’s Peabody College: Sr. Lecturer in Data and Analytics [Nashville, TN]

11-05

Quantitative Finance Resources

06-04

Top Python Libraries in 2018 in Data Science, Deep Learning, Machine Learning

12-19

A parable regarding changing standards on the presentation of statistical evidence

12-06

Top 10 Python Data Science Libraries

11-16

Distilled News

11-04

R Packages worth a look

08-03

Matter and Neutron Stars

06-04

Matter and Neutron Stars

06-04

Matter and Neutron Stars

06-04

Matter and Neutron Stars

06-04

Matter and Neutron Stars

06-04

LSTMs

06-04

The Backpropagation Algorithm Demystified

01-02

If you did not already know

12-24

Serial and Parallel bulb puzzle

10-18

Text to Speech Deep Learning Architectures

02-20

Java Handwritten Digit Recognition with Neural Networks

11-29

Sequence Modeling with CTC

11-27

LSTMs

06-04

Written Memories: Understanding, Deriving and Extending the LSTM

07-26

LSTMs

06-04

Statistics Sunday: Some Psychometric Tricks in R

10-14

Challenges & Solutions for Production Recommendation Systems

10-05

R Packages worth a look

09-06

Learning to Rank Sketchfab Models with LightFM

11-07

Collaborative Filtering using Alternating Least Squares

09-17

How to scrape a website using Python + Scrapy in 5 simple steps

08-18

Principal Component Analysis Tutorial

06-14

A Gentle Introduction to Bloom Filter

06-05

If you did not already know

09-08

Online Hard Example Mining on PyTorch

10-22

Recurrent Neural Networks for Churn Prediction

02-22

A Gentle Introduction to Bloom Filter

06-05

The Riddler: Santa Needs Some Help With Math

12-22

Monash University: Research Fellow (Bioinformatics) [Melbourne, Australia]

12-03

NG "roll returns" – inflection point?

11-05

Verlet Simulations

07-16

Machine Learning Madden NFL: The best player position switches for Madden 17

01-20

A Gentle Introduction to Bloom Filter

06-05

A Gentle Introduction to Bloom Filter

06-05

Top KDnuggets tweets, Nov 14-20: 10 Free Must-See Courses for Machine Learning and Data Science; Great list of

11-21

How I got in the top 1 % on Kaggle.

08-28

Kaggle’s Mercedes-Benz Greener Manufacturing

07-01

A Guide to Gradient Boosted Trees with XGBoost in Python

06-05

A Guide to Gradient Boosted Trees with XGBoost in Python

06-05

Distilled News

12-12

8 Data Science Projects to Build your Portfolio

12-11

Visual Model-Based Reinforcement Learning as a Path towards Generalist Robots

11-30

If you did not already know

11-29

A deep dive into glmnet: penalty.factor

11-13

A Guide to Gradient Boosted Trees with XGBoost in Python

06-05

Introduction to Amazon SageMaker Object2Vec

11-08

Document worth reading: “An Analysis of Hierarchical Text Classification Using Word Embeddings”

10-06

Document worth reading: “A Tutorial on Network Embeddings”

08-26

Understanding Latent Style

06-28

Document Similarity With Word Movers Distance

06-13

Translating W2v Embedding From One Space To Another

06-06

University of Rhode Island: Data Scientist, DataSpark (2 Positions) [Kingston, RI]

12-18

Making a Profit with Henry Wan in Arkham Horror: The Card Game

12-03

Evolving Stable Strategies

11-12

Deep Learning Research Review Week 2: Reinforcement Learning

11-16

Model-Free Prediction and Control

06-07

Sutton’s Temporal-Difference Learning

02-19

Model-Free Prediction and Control

06-07

Model-Free Prediction and Control

06-07

R Packages worth a look

12-31

Day 09 – little helper object_size_in_env

12-09

Build a serverless Twitter reader using AWS Fargate

12-06

Simulating the iSight Camera in the iOS Simulator

10-09

Animate NBA shot events with Paper.js

06-08

Animate NBA shot events with Paper.js

06-08

Animate NBA shot events with Paper.js

06-08

The Power of IPython Notebook + Pandas + and Scikit-learn

06-11

About that claim in the NYT that the immigration issue helped Hillary Clinton? The numbers don’t seem to add up.

07-03

Document Similarity With Word Movers Distance

06-13

Obtaining the number of components from cross validation of principal components regression

10-15

Guide to a high-performance, powerful R installation

08-31

Principal Component Analysis Tutorial

06-14

Humana: Principal Data Scientist/Informatics Principal [Chicago, IL, Dallas, TX and Louisville, KY]

11-27

Blockchain applications in the Federal Government sector

10-17

Principal Component Analysis Tutorial

06-14

NLP Overview: Modern Deep Learning Techniques Applied to Natural Language Processing

01-08

Principal Component Analysis Tutorial

06-14

Use AWS DeepLens to give Amazon Alexa the power to detect objects via Alexa skills

10-17

Kinesis Savant Elite 2 Foot pedals

06-14

Kinesis Savant Elite 2 Foot pedals

06-14

Kinesis Savant Elite 2 Foot pedals

06-14

Kinesis Savant Elite 2 Foot pedals

06-14

Beyond Binary: Ternary and One-hot Neurons

02-08

Visualizing Features from a Convolutional Neural Network

06-15

Why Can't Gay Men Donate Blood? A Bayesian Analysis

06-16

Why Can't Gay Men Donate Blood? A Bayesian Analysis

06-16

Why Can't Gay Men Donate Blood? A Bayesian Analysis

06-16

Will Models Rule the World? Data Science Salon Miami, Nov 6-7

10-19

The Policy Gradient

06-16

Data Science With R Course Series – Week 8

11-05

If you did not already know

09-15

Dexterous Manipulation with Reinforcement Learning: Efficient, General, and Low-Cost

08-31

Evolving Stable Strategies

11-12

Speeding up TRPO through parallelization and parameter adaptation

12-09

Deep Learning Research Review Week 2: Reinforcement Learning

11-16

The Policy Gradient

06-16

The Policy Gradient

06-16

Announcing hs2client, A Fast New C++ / Python Thrift Client for Impala and Hive

06-16

LoyaltyOne: Consultant Category Manager / Analyst, Client Services [Westborough, MA]

12-17

LoyaltyOne: Associate Director, CPG [Westborough, MA]

12-17

LoyaltyOne: Consultant Category Manager / Analyst, Client Services [Westborough, MA]

12-14

LoyaltyOne: Manager, CPG [Westborough, MA]

12-14

If you did not already know

12-03

Add Constrained Optimization To Your Toolbelt

06-21

Announcing hs2client, A Fast New C++ / Python Thrift Client for Impala and Hive

06-16

R Packages worth a look

09-26

Announcing hs2client, A Fast New C++ / Python Thrift Client for Impala and Hive

06-16

Cathy O’Neil discusses the current lack of fairness in artificial intelligence and much more.

11-26

Angela Bassa discusses managing data science teams and much more.

11-12

Peter Bull discusses the importance of human-centered design in data science.

11-05

Arnaub Chatterjee discusses artificial intelligence (AI) and machine learning (ML) in healthcare.

10-29

Cassie Kozyrkov discusses decision making and decision intelligence!

10-22

Making Deep Networks Probabilistic via Test-time Dropout

06-17

Making Deep Networks Probabilistic via Test-time Dropout

06-17

Did she really live 122 years?

01-08

R Packages worth a look

12-19

R Packages worth a look

12-11

A parable regarding changing standards on the presentation of statistical evidence

12-06

Document worth reading: “A Survey on Trust Modeling from a Bayesian Perspective”

11-22

Online Bayesian Deep Learning in Production at Tencent

11-15

R Packages worth a look

11-14

Why would I ever NEED Bayesian Statistics?

11-09

My two talks in Austria next week, on two of your favorite topics!

11-02

If you did not already know

09-17

Distilled News

09-04

Bayesian model comparison in ecology

08-26

If you did not already know

08-07

A Practical Guide to the Lomb-Scargle Periodogram

03-30

Making Bayesian A/B testing more accessible

06-19

Document worth reading: “Importance of the Mathematical Foundations of Machine Learning Methods for Scientific and Engineering Applications”

09-30

Making Bayesian A/B testing more accessible

06-19

Making Bayesian A/B testing more accessible

06-19

R Packages worth a look

01-01

Debiasing Approximate Inference

12-05

If you did not already know

08-24

Making Bayesian A/B testing more accessible

06-19

How things float

06-20

How things float

06-20

Document worth reading: “Advice from the Oracle: Really Intelligent Information Retrieval”

11-10

R Packages worth a look

10-24

R Packages worth a look

10-06

Gradient optimisation on the Poincaré disc

04-10

A tour of Factor: 3

06-20

Magister Dixit

07-31

A tour of Factor: 3

06-20

A tour of Factor: 3

06-20

Making use of the model

06-20

Niall Ferguson and the perils of playing to your audience

12-05

“Law professor Alan Dershowitz’s new book claims that political differences have lately been criminalized in the United States. He has it wrong. Instead, the orderly enforcement of the law has, ludicrously, been framed as political.”

11-12

David Weakliem points out that both economic and cultural issues can be more or less “moralized.”

10-03

The Real Story Behind Today's Referendum

06-23

The Real Story Behind Today's Referendum

06-23

Why do sociologists (and bloggers) focus on the negative? 5 possible explanations. (A post in the style of Fabio Rojas)

12-17

“Statistical insights into public opinion and politics” (my talk for the Columbia Data Science Society this Wed 9pm)

12-04

Curalate makes social sell with AI using Apache MXNet on AWS

08-13

How feminism has made me a better scientist

08-13

How feminism has made me a better scientist

08-13

IEEE Language Rankings 2018

08-07

When wife earns more than husband, they report a lesser gap

07-23

The Rise of Social Bots!

06-28

The Real Story Behind Today's Referendum

06-23

Gradient Boosting explained [demonstration]

06-24

My secret sauce to be in top 2% of a Kaggle competition

11-26

R Packages worth a look

11-20

Mastering The New Generation of Gradient Boosting

11-15

Document worth reading: “A Survey on Influence Maximization in a Social Network”

09-02

Gradient Boosting explained [demonstration]

06-24

From Instance Noise to Gradient Regularisation

06-01

Gradient Boosting explained [demonstration]

06-24

Distilled News

12-19

Intuit: Staff Data Scientist [Mountain View, CA]

12-11

Combating Customer Churn with AI

11-29

Linking Data Science Activities to Business Initiatives Using the Hypothesis Development Canvas

11-29

Data Science Strategy Safari: Aligning Data Science Strategy to Org Strategy

11-26

Top 5 domains Big Data analytics helps to transform

11-23

Amazon Rekognition announces updates to its face detection, analysis, and recognition capabilities

11-22

Driving Success through Business Insight, One Customer at a Time

11-21

Build Your Own Natural Language Models on AWS (no ML experience required)

11-19

Getting Started with Amazon Comprehend custom entities

11-17

Strategy: Customer Analytics: Are you Profiting from your Data?

11-14

Distilled News

10-22

Shopper Sentiment: Analyzing in-store customer experience

10-09

Distilled News

09-24

How I got in the top 1 % on Kaggle.

08-28

Amazon Translate now available in the Memsource translation management system

08-14

An Overview of Recommendation Systems

05-23

Why I'm bullish on Uber - the customer acquisition trough

04-20

Up and running with Apache Spark on Apache Kudu

02-01

Customer lifetime value and the proliferation of misinformation on the internet

01-08

Sales Automation Through a Deep Learning Platform

09-22

Big Data Technology Trends in Banking

06-24

The importance of Data Analytics skills in today’s MBA roles

12-19

Industry Predictions: AI, Machine Learning, Analytics & Data Science Main Developments in 2018 and Key Trends for 2019

12-18

Cummins: Reliability Analytics Leader [Columbus, IN]

12-13

Cummins: Advanced Analytics Systems Architect Principle [Columbus, IN]

12-12

Intuit: Staff Data Scientist – Business Analytics [Mountain View, CA]

12-12

InformationAge: Will 2019 See the Automation of Automation and Push Up Salaries of Data Scientists?

12-11

Should you become a data scientist?

12-10

Combating Customer Churn with AI

11-29

Intro to Data Science for Managers

11-23

Machine Learning. In conversation with Jelena Ilic, Senior Data Scientist at Mango Solutions

11-21

The Big Data Game Board™

11-19

The ultimate guide to starting AI

11-13

Top 5 Trends in Data Science

11-09

Distilled News

11-05

BI to AI: Getting Intelligent Insights to Everyone

10-18

Learn the top things to look for in an AI Vendor

10-12

Business Analysis (BA) Career Path

10-11

Here is How You Can build a data science team from scratch in 2018 - The Definitive Guide

10-05

Distilled News

09-25

Data Projects WILL Fail - Learn to Fail Quickly & Efficiently

09-21

2018 Data Sources for Cool Data Science Projects, provided by Thinknum

08-06

First Data Project? Go Tandem! (AVISIA at Play)

07-27

I Can’t Afford to Hire a Data Scientist. Now What?

07-11

Forbes: DataRobot Puts the Power of Machine Learning in the Hands of Business Analysts

06-04

How to Solve a Problem In 3 Steps -- Define It, Redefine It, Repeat

08-29

Big Data Technology Trends in Banking

06-24

If you did not already know

12-18

Whats new on arXiv

11-29

Whats new on arXiv

11-26

Whats new on arXiv

11-05

Whats new on arXiv

10-19

Modeling Airbnb prices

10-12

Whats new on arXiv

10-09

Against Arianism 2: Arianism Grande

09-12

Whats new on arXiv

08-21

Whats new on arXiv

08-04

Hierarchical Softmax

08-01

Recurrent Neural Network Tutorial for Artists

01-01

Factorization Machines A Theoretical Introduction

06-26

Factorization Machines A Theoretical Introduction

06-26

Factorization Machines A Theoretical Introduction

06-26

Factorization Machines A Theoretical Introduction

06-26

PyConUK 2018

09-19

Managing your expenses with Amazon Lex

08-21

How I built a receipt chatbot over a weekend

06-23

Twitter bots for good, and information contagion!

09-27

Hype or Not? Some Perspective on OpenAI’s DotA 2 Bot

08-12

6279e808ef0c35488ea3a81e9b6d302a

07-06

Millions of social bots invaded Twitter!

03-14

The Rise of Social Bots!

06-28

Complex System Society 2016 Junior Scientific Award!

01-16

The Rise of Social Bots!

06-28

The Rise of Social Bots!

06-28

Lethal Autonomous Weapon Systems are on the way

06-28

Lethal Autonomous Weapon Systems are on the way

06-28

Lethal Autonomous Weapon Systems are on the way

06-28

Lethal Autonomous Weapon Systems are on the way

06-28

Lethal Autonomous Weapon Systems are on the way

06-28

Long-awaited updates to htmlTable

01-07

3 Reasons Counting is the Hardest Thing in Data Science

06-29

Advent of Code: Most Popular Languages

12-15

Multi-Class Text Classification Model Comparison and Selection

11-01

3 Reasons Counting is the Hardest Thing in Data Science

06-29

Looking back on 2018, looking to 2019

01-07

Online Hard Example Mining on PyTorch

10-22

3 Reasons Counting is the Hardest Thing in Data Science

06-29

American Association of Colleges of Osteopathic Medicine: Data Analyst [Bethesda, Maryland]

10-29

Facts and Fallacies of Software Engineering - Book Review

02-11

3 Reasons Counting is the Hardest Thing in Data Science

06-29

Cummins: Data Engineering Apps and Solutions Architect [Columbus, IN]

12-13

Cummins: Advanced Analytics Platform Principle Engineer [Columbus, IN]

12-12

American Association of Colleges of Osteopathic Medicine: Data Analyst [Bethesda, Maryland]

10-29

3 Reasons Counting is the Hardest Thing in Data Science

06-29

The Christmas Eve Selloff was a Classic Capitulation

12-27

Generative Adversarial Networks Explained

06-29

Generative Adversarial Networks Explained

06-29

Generative Adversarial Networks Explained

06-29

Generative Adversarial Networks Explained

06-29

Seasonalities: The Near-Term Future for the Market

04-14

Which is more dangerous, guns or gay sex?

06-29

Document worth reading: “Machine Learning in Official Statistics”

01-11

“The Book of Why” by Pearl and Mackenzie

01-08

On deck for the first half of 2019

01-07

7 Reasons for Policy Professionals to Get Pumped About R Programming in 2019

01-07

Part 2, further comments on OfS grade-inflation report

01-07

Hackers beware: Bootstrap sampling may be harmful

01-07

Magister Dixit

01-03

Music listener statistics: last.fm’s last.year as an R package

01-02

Office for Students report on “grade inflation”

01-02

What to do when you read a paper and it’s full of errors and the author won’t share the data or be open about the analysis?

01-02

Statistical Assessments of AUC

12-26

Data Science & ML : A Complete Interview Guide

12-26

Data Science & ML : A Complete Interview Guide

12-19

R Packages worth a look

12-05

My talk tomorrow (Tues) noon at the Princeton University Psychology Department

12-03

October 2018: “Top 40” New Packages

11-29

“Using numbers to replace judgment”

11-17

The 5 Basic Statistics Concepts Data Scientists Need to Know

11-13

5 Critical Steps to Predictive Business Analytics

11-08

If you did not already know

11-03

My two talks in Austria next week, on two of your favorite topics!

11-02

Is the answer to everything Gaussian?

10-29

High school statistics class builds election prediction model

10-23

Document worth reading: “Declarative Statistics”

10-22

Ethics in statistical practice and communication: Five recommendations.

10-18

In Memoriam: Manfred te Grotenhuis

10-15

Guest Post: Galin Jones on criteria for promotion and tenture in (bio)statistics departments

10-11

Document worth reading: “An Introduction to Inductive Statistical Inference — from Parameter Estimation to Decision-Making”

10-09

Distilled News

09-28

(People are missing the point on Wansink, so) what’s the lesson we should be drawing from this story?

09-27

Job opening at CDC: “The Statistician will play a central role in guiding the statistical methods of all major projects of the Epidemiology and Prevention Branch of the CDC Influenza Division, and aid in designing, analyzing, and interpreting research intended to understand the burden of influenza in the US and internationally and identify the best influenza vaccines and vaccine strategies.”

09-26

Don’t calculate post-hoc power using observed estimate of effect size

09-24

Three Mighty Good Reasons to Learn R for Data Science

09-19

Document worth reading: “Interpreting Deep Learning: The Machine Learning Rorschach Test”

09-04

If you did not already know

09-01

John Hattie’s “Visible Learning”: How much should we trust this influential review of education research?

09-01

Distilled News

08-22

What data scientists really do

08-21

The fallacy of the excluded middle — statistical philosophy edition

08-18

The fallacy of the excluded middle — statistical philosophy edition

08-18

What is a p-value

08-09

Distilled News

08-07

China air pollution regression discontinuity update

08-02

“The idea of replication is central not just to scientific practice but also to formal statistics . . . Frequentist statistics relies on the reference set of repeated experiments, and Bayesian statistics relies on the prior distribution which represents the population of effects.”

07-19

Basic Statistics in Python: Probability

07-18

Basic Statistics in Python: Descriptive Statistics

07-03

Data Readiness Levels: Turning Data from Palid to Vivid

01-12

Type Safety and Statistical Computing

12-12

Book Review: Computer Age Statistical Inference

11-23

Lies, Damned Lies and Big Data

11-19

Is Data Scientist a useless job title?

08-04

Which is more dangerous, guns or gay sex?

06-29

Which is more dangerous, guns or gay sex?

06-29

Top KDnuggets tweets, Dec 19 – Jan 1: Deep Learning Cheat Sheets

01-02

In case you missed it: October 2018 roundup

11-15

Document worth reading: “A Tutorial on Modeling and Inference in Undirected Graphical Models for Hyperspectral Image Analysis”

11-09

Which is more dangerous, guns or gay sex?

06-29

Covariate-Based Diagnostics for Randomized Experiments are Often Misleading

04-06

Which is more dangerous, guns or gay sex?

06-29

Building a Data Science Portfolio: Storytelling with Data

06-30

Debate about genetics and school performance

10-27

John Hattie’s “Visible Learning”: How much should we trust this influential review of education research?

09-01

Building a Data Science Portfolio: Storytelling with Data

06-30

NYC buses: company level predictors with R

11-28

Building a Data Science Portfolio: Storytelling with Data

06-30

Building a Data Science Portfolio: Storytelling with Data

06-30

The year in AI/Machine Learning advances: Xavier Amatriain 2018 Roundup

01-11

ABC intro for Astrophysics

10-15

“Fudged statistics on the Iraq War death toll are still circulating today”

10-06

Researchers.one: A souped-up Arxiv with pre- and post-publication review

09-10

Analysing NLP publication patterns

06-30

“Fudged statistics on the Iraq War death toll are still circulating today”

10-06

Analysing NLP publication patterns

06-30

What to do when you read a paper and it’s full of errors and the author won’t share the data or be open about the analysis?

01-02

Analysing NLP publication patterns

06-30

Researchers.one: A souped-up Arxiv with pre- and post-publication review

09-10

Being at the Center

09-07

Book review: SQL Server 2017 Machine Learning Services with R

09-04

This Website

01-18

Analysing NLP publication patterns

06-30

Exploring the Data Jungle Free eBook

12-18

Let’s get hysterical

08-19

Let’s get hysterical

08-19

How to Overcome That Awkward Silence in Interviews

08-08

ML/NLP Publications in 2017

01-02

NLP and ML Publications – Looking Back at 2016

01-04

Analysing NLP publication patterns

06-30

Certifiably Gone Phishing

12-23

Sorry I didn’t get that! How to understand what your users want

11-16

If you did not already know

10-19

Statistics Sunday: Some Psychometric Tricks in R

10-14

If you did not already know

09-25

The Real Problems with Neural Machine Translation

07-21

Data Science Challenges

07-01

Prior distributions for covariance matrices

12-10

Data Science Challenges

07-01

Simple reinforcement learning methods to learn CartPole

07-01

The Backpropagation Algorithm Demystified

01-02

If you did not already know

10-14

How to Create a Simple Neural Network in Python

10-02

Why Machine Learning Is A Metaphor For Life

08-16

Hyper Networks

09-29

Simple reinforcement learning methods to learn CartPole

07-01

A short proof for Nesterov’s momentum

11-21

First Order Optimization Methods

07-02

First Order Optimization Methods

07-02

Learning in Brains and Machines (3): Synergistic and Modular Action

07-03

Deep Learning for Media Content

12-28

Data, movies and ggplot2

12-19

Why do sociologists (and bloggers) focus on the negative? 5 possible explanations. (A post in the style of Fabio Rojas)

12-17

The evolution of pace in popular movies

11-24

3 recent movies from the 50s and the 70s

08-30

Quick and Dirty Serverless Integer Programming

08-06

The 2018 Best Picture Nominees Ranked, Reviewed, and Reflected Upon

03-03

Interacting with ML Models

10-26

IMDB Data Visualizations with D3 + Dimple.js

08-10

A tour of Factor: 4

07-04

A tour of Factor: 4

07-04

R or Python? Why not both? Using Anaconda Python within R with {reticulate}

12-30

AzureStor: an R package for working with Azure storage

12-18

AzureStor: an R package for working with Azure storage

12-18

If you did not already know

12-17

Distilled News

12-12

Faster garbage collection in pqR

11-30

Visual Model-Based Reinforcement Learning as a Path towards Generalist Robots

11-30

AzureRMR: an R interface to Azure Resource Manager

11-26

AzureRMR: an R interface to Azure Resource Manager

11-26

Understanding object detection in deep learning

11-19

Document worth reading: “Visions of a generalized probability theory”

11-14

Latest Trends in Computer Vision Technology and Applications

11-07

R Packages worth a look

11-07

Import AI 119: How to benefit AI research in Africa; German politician calls for billions in spending to prevent country being left behind; and using deep learning to spot thefts

11-05

Semantic Segmentation: Wiki, Applications and Resources

10-04

BDD100K: A Large-scale Diverse Driving Video Database

05-30

Java Autonomous driving – Car detection

01-18

Analyzing The Papers Behind Facebook's Computer Vision Approach

09-01

A tour of Factor: 4

07-04

Deep Learning for Chatbots, Part 2 – Implementing a Retrieval-Based Model in Tensorflow

07-04

Recreating the NBA lead tracker graphic

12-13

Probability and Tennis

08-13

Cribbage Scores

02-25

Deep Learning for Chatbots, Part 2 – Implementing a Retrieval-Based Model in Tensorflow

07-04

Bayesian Deep Learning Part II: Bridging PyMC3 and Lasagne to build a Hierarchical Neural Network

07-05

Gradient Boosting Interactive Playground

07-05

Gradient Boosting Interactive Playground

07-05

Gradient Boosting Interactive Playground

07-05

Le Monde puzzle [#1072]

10-31

Grazing in a circular field

11-23

Gradient Boosting Interactive Playground

07-05

Build your own offshore company

07-06

If you did not already know

12-16

Advanced News API search: leveraging DBpedia entity types

12-11

Introduction to Named Entity Recognition

12-11

Getting Started with Amazon Comprehend custom entities

11-17

Model Updates: Entity-level Sentiment Analysis and Brand New Entity Extraction Models Now Live in the Text Analysis API

07-17

Build your own offshore company

07-06

Build your own offshore company

07-06

Recurrent Neural Networks in Tensorflow I

07-11

Is the answer to everything Gaussian?

10-29

Recurrent Neural Networks in Tensorflow I

07-11

Occam razor vs. machine learning

07-12

Occam razor vs. machine learning

07-12

Multilevel Modeling Solves the Multiple Comparison Problem: An Example with R

10-31

Markets Performance after Election: One Year Update

11-12

MLHEP 2016 lectures slides

07-12

R Packages worth a look

10-14

He had a sudden cardiac arrest. How does this change the probability that he has a particular genetic condition?

10-14

R Packages worth a look

09-15

MLHEP 2016 lectures slides

07-12

Whats new on arXiv

01-09

Distilled News

01-06

Document worth reading: “A Study and Comparison of Human and Deep Learning Recognition Performance Under Visual Distortions”

12-18

KNNs (K-Nearest-Neighbours) in Python

11-22

KDnuggets™ News 18:n42, Nov 7: The Most in Demand Skills for Data Scientists; How Machines Understand Our Language: Intro to NLP

11-07

Distilled News

11-05

Document worth reading: “An Analysis of Hierarchical Text Classification Using Word Embeddings”

10-06

Document worth reading: “Importance of the Mathematical Foundations of Machine Learning Methods for Scientific and Engineering Applications”

09-30

Classifying high-resolution chest x-ray medical images with Amazon SageMaker

09-13

Import AI 111: Hacking computers with Generative Adversarial Networks, Facebook trains world-class speech translation in 85 minutes via 128 GPUs, and Europeans use AI to classify 1,000-year-old graffiti.

09-10

Distilled News

08-10

Understanding deep Convolutional Neural Networks with a practical use-case in Tensorflow and Keras

11-13

MLHEP 2016 lectures slides

07-12

How Data Science Is Improving Higher Education

11-01

Building a Data Science Portfolio: Storytelling with Data (Part 2: Data Exploration)

07-14

John Hattie’s “Visible Learning”: How much should we trust this influential review of education research?

09-01

Building a Data Science Portfolio: Storytelling with Data (Part 2: Data Exploration)

07-14

If you did not already know

11-04

If you did not already know

11-04

You’ve got data on 35 countries, but it’s really just N=3 groups.

09-25

Don’t get fooled by observational correlations

09-16

“Usefully skeptical science journalism”

08-12

“Usefully skeptical science journalism”

08-12

An intuitive, visual guide to copulas

05-03

Building a Data Science Portfolio: Storytelling with Data (Part 2: Data Exploration)

07-14

If you did not already know

11-04

If you did not already know

11-04

“It’s Always Sunny in Correlationville: Stories in Science,” or, Science should not be a game of Botticelli

09-08

Amelia, it was just a false alarm

07-31

On Tensor Networks and the Nature of Non-Linearity

06-20

Trustworthy Data Analysis

06-04

An intuitive, visual guide to copulas

05-03

Building a Data Science Portfolio: Storytelling with Data (Part 2: Data Exploration)

07-14

12 Ways To Cultivate A Data-Savvy Workforce

07-15

Magister Dixit

12-24

12 Ways To Cultivate A Data-Savvy Workforce

07-15

The Big Data Game Board™

11-19

Insights on the role data can play in your organization

11-19

Hey! Here’s what to do when you have two or more surveys on the same population!

11-11

12 Ways To Cultivate A Data-Savvy Workforce

07-15

Becoming a Data Scientist Podcast Episode 13: Debbie Berebichez

07-15

Why Scala?

07-17

Monitoring your cluster in just a few minutes using ISA

07-18

Monitoring your cluster in just a few minutes using ISA

07-18

Monitoring your cluster in just a few minutes using ISA

07-18

If you did not already know

12-21

Coding Gradient boosted machines in 100 lines of code

11-05

If you did not already know

11-01

Sequence Modeling with Neural Networks – Part I

10-03

What to Consider When Choosing Colors for Data Visualization

08-22

Thanksgiving Special 🦃: GANs are Being Fixed in More than One Way

11-23

A Visual Guide to Evolution Strategies

10-29

Why Momentum Really Works

04-04

Deep Learning without Backpropagation

03-21

A intuitive explanation of natural gradient descent

08-07

Written Memories: Understanding, Deriving and Extending the LSTM

07-26

Styles of Truncated Backpropagation

07-19

Styles of Truncated Backpropagation

07-19

Styles of Truncated Backpropagation

07-19

Instagram’s Blind Spot

07-19

Introduction to Statistics for Data Science

12-17

Number of births in the twentieth century by @ellis2013nz

11-30

Ask the Question, Visualize the Answer

10-17

Instagram’s Blind Spot

07-19

Instagram’s Blind Spot

07-19

Dreaming of a white Christmas – with ggmap in R

12-24

collateral

11-02

The Probability Monad and Why it's Important for Data Science

09-05

Instagram’s Blind Spot

07-19

Project Euler using Scala: Problem

07-19

If you did not already know

12-14

R Packages worth a look

12-08

AzureVM: managing virtual machines in Azure

12-03

AzureVM: managing virtual machines in Azure

12-03

If you did not already know

09-29

Collecting Expressions in R

08-05

Project Euler using Scala: Problem

07-19

I'm all about ML, but let's talk about OR

07-20

I'm all about ML, but let's talk about OR

07-20

I'm all about ML, but let's talk about OR

07-20

I'm all about ML, but let's talk about OR

07-20

Linear Regression in the Wild

10-03

Linear regression can be understood in many ways (optimization, probabilistic, bayesian)

07-20

Bulk Downloading Adobe Analytics Data

07-21

Model Server for Apache MXNet v1.0 released

10-31

How to scrape data from a website using Python

09-07

Simple Stock Ticker App

02-04

Bulk Downloading Adobe Analytics Data

07-21

Bulk Downloading Adobe Analytics Data

07-21

Simulating Twitch chat with a Recurrent Neural Network

07-21

Simulating Twitch chat with a Recurrent Neural Network

07-21

Simulating Twitch chat with a Recurrent Neural Network

07-21

Who's at the Center of the Star Trek Universe?

07-22

“Principles of posterior visualization”

01-01

Statistics in Glaucoma: Part I

12-03

The Building Blocks of Interpretability

03-06

Who's at the Center of the Star Trek Universe?

07-22

Overlapping Disks

09-30

Who's at the Center of the Star Trek Universe?

07-22

“Do you have any recommendations for useful priors when datasets are small?”

12-11

R Packages worth a look

10-03

Why I’m Not a Fan of R-Squared

07-24

Learning in Brains and Machines (4): Episodic and Interactive Memory

07-24

Summing the Fibonacci Sequence

07-24

Summing the Fibonacci Sequence

07-24

Is it time to stop using sentinel values for null / "NA" values?

10-12

Incremental means and variances

11-28

Summing the Fibonacci Sequence

07-24

Recurrent Neural Networks in Tensorflow II

07-25

Statistics in Glaucoma: Part III

12-18

Online Bayesian Deep Learning in Production at Tencent

11-15

GANs are Broken in More than One Way: The Numerics of GANs

10-05

Python Deep Learning tutorial: Create a GRU (RNN) in TensorFlow

08-27

Recurrent Neural Networks in Tensorflow II

07-25

If you did not already know

09-20

Recurrent Neural Networks in Tensorflow II

07-25

Re-work Interview Questions

07-26

Strata Data SF 2019 KDnuggets Offer

01-04

R community update: announcing useR Delhi December meetup and CFP

12-07

From the Sidewalk to the Saddle: Data and the Tour de France

07-06

Re-work Interview Questions

07-26

“The hype economy”

11-20

Fitting the Besag, York, and Mollie spatial autoregression model with discrete data

10-17

David Brooks discovers Red State Blue State Rich State Poor State!

10-16

Toward better measurement in K-12 education research

10-15

He had a sudden cardiac arrest. How does this change the probability that he has a particular genetic condition?

10-14

Post-publication peer review: who’s qualified?

09-20

The hot hand—in darts!

09-18

Cosmos DB for Data Science

09-07

How to set up a voting system for a Hall of Fame?

09-02

Hey—take this psychological science replication quiz!

09-02

In statistics, we talk about uncertainty without it being viewed as undesirable

08-25

“Usefully skeptical science journalism”

08-12

“Usefully skeptical science journalism”

08-12

The replication crisis and the political process

08-03

Data-based ways of getting a job

07-18

The statistical checklist: Could there be a list of guidelines to help analysts do better work?

07-17

Should the points in this scatterplot be binned?

07-11

He wants to know what to read and what software to learn, to increase his ability to think about quantitative methods in social science

07-07

All of Life is 6 to 5 Against

07-06

Written Memories: Understanding, Deriving and Extending the LSTM

07-26

Why Almost Everything You’ve Learned About Cheap Custom Essay Is Wrong and What You Should Know

10-04

Written Memories: Understanding, Deriving and Extending the LSTM

07-26

Import AI

06-05

New Year's Resolutions 2018

01-05

Complex System Society 2016 Junior Scientific Award!

01-16

Google F1 Server Reading Summary

11-26

DynamoDB Learnings

10-23

Sensor Fusion Tutorial

10-05

Written Memories: Understanding, Deriving and Extending the LSTM

07-26

Decision Trees Tutorial

07-27

If you did not already know

09-02

Talk: Building Machines that Imagine and Reason

07-28

Talk: Building Machines that Imagine and Reason

07-28

MS in Applied Data Science Online – which track is right for you?

01-10

6 Step Plan to Starting Your Data Science Career

12-05

A Complete Guide to Choosing the Best Machine Learning Course

11-30

Extracting data from news articles: Australian pollution by postcode

11-28

Introducing Drexel new online MS in Data Science

11-15

Help us understand your Data Science goals!

11-13

5 Steps to Prepare for a Data Science Job

10-23

5 Steps to Prepare for a Data Science Job

10-22

Online Master’s in Applied Data Science From Syracuse

10-05

My Open-Source Machine Learning Masters (in Casablanca, Morocco)

07-29

Yeshiva University: Data Science Program Director [New York, NY]

11-30

University of San Francisco: Assistant Professor, Tenure Track, Mathematics and Statistics [San Francisco, CA]

10-17

University of San Francisco: Postdoctoral Fellowship, Data Institute [San Francisco, CA]

10-16

Rising test scores . . . reported as stagnant test scores

10-08

John Hattie’s “Visible Learning”: How much should we trust this influential review of education research?

09-01

“Becoming a Data Scientist” Survey Results 1: Jobs & Education

08-22

My Open-Source Machine Learning Masters (in Casablanca, Morocco)

07-29

My Open-Source Machine Learning Masters (in Casablanca, Morocco)

07-29

Cummins: Data Engineering Technical Specialist [Columbus, IN]

12-13

MINDBODY: Business Intelligence Analyst II [San Luis Obispo, CA]

12-13

Intuit: Staff Data Scientist [Mountain View, CA]

12-12

Free Machine Learning Textbook

12-01

Join us for DataTech19, Scotland’s first technical data science conference as part of DataFest

10-12

UnitedHealth Group: Sr .Net Web Developer, UHC E&I [Indianapolis, IN or Green Bay, WI]

10-04

Data Science at Northwestern

10-03

The Data Science Roadshow is ON!

09-03

Retrospective on leaving academia for industry data science

04-09

My Open-Source Machine Learning Masters (in Casablanca, Morocco)

07-29

Efficient Guttering

07-29

Efficient Guttering

07-29

Efficient Guttering

07-29

Text classification with tidy data principles

12-24

An Overview of the Singapore Hiring Landscape

11-21

Defining data science in 2018

07-22

Where that title came from

07-20

Is Data Scientist a useless job title?

08-04

Human Fuel Consumption

09-02

Balanced Field Length

08-04

Kinesis Advantage2: Impressions

12-05

Balanced Field Length

08-04

Traveling salesman portrait in Python

04-12

Turning Water into Wine

03-13

Weekly Review: 10/28/2017

10-28

Intro to graph optimization: solving the Chinese Postman Problem

10-07

Clustering applied to showers in the OPERA

07-10

Balanced Field Length

08-04

Balanced Field Length

08-04

Trust in ML models. Slides from TWiML & AI EMEA Meetup + iX Articles

12-05

December Reading for Econometricians

12-02

Balanced Field Length

08-04

Counting Efficiently with Bounter pt. 2: CountMinSketch

01-31

Variational Autoencoders Explained

08-06

Variational Autoencoders Explained

08-06

Day 20 – little helper char_replace

12-20

R Packages worth a look

12-16

Quoting Concatenate

12-16

Quoting Concatenate

12-16

The statistical checklist: Could there be a list of guidelines to help analysts do better work?

07-17

Variational Autoencoders Explained

08-06

Variational Autoencoders Explained

08-06

Similarity in the Wild

02-19

A intuitive explanation of natural gradient descent

08-07

Silent Duels and an Old Paper of Restrepo

12-31

All the (NBA) box scores you ever wanted

12-18

A couple of thoughts regarding the hot hand fallacy fallacy

12-14

von Neumann Poker Analysis

12-13

Data Science in Esports

11-21

Data Science in Esports

11-12

Analyzing English Team of the Year Data Since 1973

10-18

R Packages worth a look

09-03

Counting baseball cliches

08-31

First Data Project? Go Tandem! (AVISIA at Play)

07-27

Do AIs dream of pwning FF leagues?

12-10

How easy is it to moneyball a fantasy football league draft?

10-28

Voronoi Soccer

05-31

Machine Learning Madden NFL: The best player position switches for Madden 17

01-20

Machine Learning Madden NFL: How Madden player ratings are actually calculated

01-10

Moscow Math Olympiad Puzzle

08-07

Moscow Math Olympiad Puzzle

08-07

Moscow Math Olympiad Puzzle

08-07

Moscow Math Olympiad Puzzle

08-07

Boosting (in Machine Learning) as a Metaphor for Diverse Teams

08-07

R Packages worth a look

11-26

Boosting (in Machine Learning) as a Metaphor for Diverse Teams

08-07

The Convexity of Improbability: How Rare are K-Sigma Effects?

08-08

Playing with convolutions in TensorFlow

08-09

Convolve all the things

05-31

Playing with convolutions in TensorFlow

08-09

IMDB Data Visualizations with D3 + Dimple.js

08-10

IMDB Data Visualizations with D3 + Dimple.js

08-10

Against Arianism

08-21

Against Arianism

08-21

Neural reinterpretations of movie trailers

07-31

IMDB Data Visualizations with D3 + Dimple.js

08-10

From Microservices to Service Blocks using Spring Cloud Function and AWS Lambda

07-07

How to score 0.8134 in Titanic Kaggle Challenge

08-10

Blog has migrated from Ghost to Jekyll

08-11

Blog has migrated from Ghost to Jekyll

08-11

4 ways to be more efficient using RStudio’s Code Snippets, with 11 ready to use examples

11-10

Blog has migrated from Ghost to Jekyll

08-11

Measuring Bernoulli Probabilities in the Presence of Delayed Reactions

08-11

“Richard Jarecki, Doctor Who Conquered Roulette, Dies at 86”

08-09

Measuring Bernoulli Probabilities in the Presence of Delayed Reactions

08-11

Deriving Expectation-Maximization

11-11

data.table is Really Good at Sorting

08-14

✚ Wrong Tool, Right Tool, More Tools for Visualization

08-02

Measuring Bernoulli Probabilities in the Presence of Delayed Reactions

08-11

Assorted links

08-12

5 Ways in which Data Science is Revolutionizing Web Development

01-03

“And when you did you weren’t much use, you didn’t even know what a peptide was”

11-29

High-profile statistical errors occur in the physical sciences too, it’s not just a problem in social science.

09-15

What makes Robin Pemantle’s bag of tricks for teaching math so great?

07-27

Assorted links

08-12

Cummins: Advanced Analytics Platform Principle Engineer [Columbus, IN]

12-12

Distilled News

08-09

What Data Scientists should focus on in 2018?

06-27

Fontstellations

11-16

Assorted links

08-12

Document worth reading: “Causal inference and the data-fusion problem”

10-26

Assorted links

08-12

Shared items

08-11

Verlet Simulations

07-16

Can a Machine Be Racist or Sexist?

04-16

How to scrape a website using Python + Scrapy in 5 simple steps

08-18

Podcast Episodes 0 to 3

08-13

Japanese Kids Shows, Movies, Games, and Videos for Immersion

07-14

Podcast Episodes 0 to 3

08-13

Recurrent Neural Networks for Beginners

08-13

Recurrent Neural Networks for Beginners

08-13

Evolution of active categorical image classification via saccadic eye movement

08-13

Evolution of active categorical image classification via saccadic eye movement

08-13

Preliminary Note on the Complexity of a Neural Network

08-16

Day 06 – little helper statusbar

12-06

“Economic predictions with big data” using partial pooling

11-26

If you did not already know

11-07

Document worth reading: “Lectures on Statistics in Theory: Prelude to Statistics in Practice”

11-06

Learning to learn in a model-agnostic way

10-29

automl package: part 2/2 first steps how to

10-24

Document worth reading: “An Introduction to Inductive Statistical Inference — from Parameter Estimation to Decision-Making”

10-09

Divisibility in statistics: Where is it needed?

07-08

Sutton’s Temporal-Difference Learning

02-19

Pruning Neural Networks: Two Recent Papers

02-06

Java Image Cat&Dog Recognition with Deep Neural Networks

01-03

A Visual Guide to Evolution Strategies

10-29

Logistic Regression

07-30

Preliminary Note on the Complexity of a Neural Network

08-16

Preliminary Note on the Complexity of a Neural Network

08-16

RcppArmadillo 0.9.200.5.0

11-28

Grokking Deep Learning

08-17

Start your journey into data science today

10-19

Mounting multiple data and outputs volumes

07-15

Grokking Deep Learning

08-17

Opinion mining on Dutch news articles

06-20

How to scrape a website using Python + Scrapy in 5 simple steps

08-18

How to scrape a website using Python + Scrapy in 5 simple steps

08-18

How to scrape a website using Python + Scrapy in 5 simple steps

08-18

Stock Price prediction using ML and DL

01-07

If you did not already know

01-03

Marginal Effects for (mixed effects) regression models

11-28

Prophets of gloom: Using NLP to analyze Radiohead lyrics

10-13

Bayesian inference and religious belief

10-07

Time Series for scikit-learn People (Part II): Autoregressive Forecasting Pipelines

03-22

Creating a Search Engine

08-19

Building a neighbour matrix with python

11-04

Discovering and indexing podcast episodes using Amazon Transcribe and Amazon Comprehend

09-20

Google Dataset Search now in public beta

09-06

Weekly Review: 12/23/2017

12-23

Creating a Search Engine

08-19

Einstein's Spacetime

10-11

Lagrange Points

08-21

Einstein's Spacetime

10-11

Lagrange Points

08-21

Lagrange Points

08-21

Lagrange Points

08-21

AI in Healthcare (With a case study)

01-10

Conway’s Game of Life in R: Or On the Importance of Vectorizing Your R Code

10-28

Keynote at EuroPython 2018 on “Citizen Science”

07-27

Building a Tic-Tac-Toe web-app in this Webpack tutorial and Babel tutorial

04-07

RNNs in Tensorflow, a Practical Guide and Undocumented Features

08-21

If you don’t pay attention, data can drive you off a cliff

08-21

If you don’t pay attention, data can drive you off a cliff

08-21

If you don’t pay attention, data can drive you off a cliff

08-21

If you don’t pay attention, data can drive you off a cliff

08-21

R community update: announcing useR Delhi December meetup and CFP

12-07

Your Guide to AI and Machine Learning at re:Invent 2018

09-27

TensorFlow in a Nutshell — Part One: Basics

08-22

Example of Overfitting

11-16

“Becoming a Data Scientist” Survey Results 1: Jobs & Education

08-22

“Becoming a Data Scientist” Survey Results 1: Jobs & Education

08-22

5 Ways in which Data Science is Revolutionizing Web Development

01-03

Entering and Exiting 2018

01-02

Miami University: Director of the Center for Analytics & Data Science (CADS) [Oxford, OH]

12-20

If you did not already know

12-20

Cummins: Data Engineering Technical Specialist [Columbus, IN]

12-13

CBH Group: Data Scientist [Perth, Australia]

12-11

CBH Group: Sr Data Scientist [Perth, Australia]

12-11

Community Call – Governance strategies for open source research software projects

12-05

URI: Director, Data Analytics/DataSpark [Kingston, RI]

11-15

What does a data scientist REALLY look like?

11-09

The One reason you should learn Python

10-11

Announcing Ursa Labs's partnership with NVIDIA

10-10

Life in Madrid seen through BiciMAD

10-10

Things you should know when traveling via the Big Data Engineering hype-train

10-08

Distilled News

10-05

UnitedHealth Group: Sr .Net Web Developer, UHC E&I [Indianapolis, IN or Green Bay, WI]

10-04

Document worth reading: “Data Innovation for International Development: An overview of natural language processing for qualitative data analysis”

09-25

Data Science Portfolio Project: Where to Advertise an E-learning Product

09-05

Distilled News

08-30

2017 Outlook: pandas, Arrow, Feather, Parquet, Spark, Ibis

12-27

In Praise Of Reinventing The Wheel

08-23

Core Principles of Sustainable Data Science, Machine Learning and AI Product Development: Research as a core driver

01-09

5 Ways in which Data Science is Revolutionizing Web Development

01-03

Announcing the Winners of the 2018 AWS AI Hackathon

12-05

URI: Director, Data Analytics/DataSpark [Kingston, RI]

11-15

All About Scikit-Learn, with Olivier Grisel

11-13

What does a data scientist REALLY look like?

11-09

Distilled News

10-14

Announcing Ursa Labs's partnership with NVIDIA

10-10

UnitedHealth Group: Sr .Net Web Developer, UHC E&I [Indianapolis, IN or Green Bay, WI]

10-04

AI, Machine Learning and Data Science Announcements from Microsoft Ignite

10-02

Distilled News

09-17

Data Science Portfolio Project: Where to Advertise an E-learning Product

09-05

Distilled News

08-30

A Certification for R Package Quality

07-30

Video: R for AI, and the Not Hotdog workshop

07-17

Free E-Book: A Developer’s Guide to Building AI Applications

06-04

Software as an academic publication

05-03

Announcing Ursa Labs: an innovation lab for open source data science

04-19

In Praise Of Reinventing The Wheel

08-23

In Praise Of Reinventing The Wheel

08-23

Supercharging Classification - The Value of Multi-task Learning

06-26

Is BackPropagation Necessary?

08-23

Is BackPropagation Necessary?

08-23

Conda: Myths and Misconceptions

08-25

Conda: Myths and Misconceptions

08-25

Document worth reading: “Graphical Models for Processing Missing Data”

11-18

Use GitHub Vulnerability Alerts to Keep Users of Your R Packages Safe

11-14

Conda: Myths and Misconceptions

08-25

Conda: Myths and Misconceptions

08-25

Ensemble Learning: 5 Main Approaches

01-03

4 Reasons Santa Needs Machine Learning & AI

12-24

Dr. Data Show Video: Five Reasons Computers Predict When You’ll Die

12-08

Top 5 domains Big Data analytics helps to transform

11-23

Monotonicity constraints in machine learning

09-16

GovernmentCIO: How AI Can Stop Doctors Likely to Overprescribe Opioids — and Stem the Crisis

09-04

Logistic Regression: Concept & Application

09-03

Getting Rich using Bitcoin stockprices and Twitter!

02-22

Random Forest Tutorial: Predicting Crime in San Francisco

08-25

Convert Data Frame to Dictionary List in R

11-17

Basic Math on How Bloom Filter Works

08-27

R Packages worth a look

01-13

R Packages worth a look

01-01

R Packages worth a look

12-28

If you did not already know

12-16

Day 13 – little helper read_files

12-13

Document worth reading: “Causal inference and the data-fusion problem”

10-26

Statistics Challenge Invites Students to Tackle Opioid Crisis Using Real-World Data

10-19

A Deep (But Jargon and Math Free) Dive Into Deep Learning

08-31

Weekly Review: 12/16/2017

12-16

Rec-a-Sketch: a Flask App for Interactive Sketchfab Recommendations

02-05

Multiple Raffle Strategy

10-09

Basic Math on How Bloom Filter Works

08-27

11 Design Tips for Data Visualization

10-25

Turn Whiteboard UX Sketches into Working HTML in Seconds – Introducing Sketch2Code

08-30

Constructing a Data Analysis

08-24

T-Shirt Design Contest!

11-19

Fontstellations

11-16

How to Solve a Problem In 3 Steps -- Define It, Redefine It, Repeat

08-29

Rising test scores . . . reported as stagnant test scores

10-08

How to Solve a Problem In 3 Steps -- Define It, Redefine It, Repeat

08-29

Students Combat MS with Data Science

11-29

If you did not already know

09-09

How to Solve a Problem In 3 Steps -- Define It, Redefine It, Repeat

08-29

Towards optimal personalization: synthesisizing machine learning and operations research

08-30

Towards optimal personalization: synthesisizing machine learning and operations research

08-30

Towards optimal personalization: synthesisizing machine learning and operations research

08-30

Document worth reading: “Declarative Statistics”

10-22

Verlet Simulations

07-16

Towards optimal personalization: synthesisizing machine learning and operations research

08-30

Blogdown – shortcode for radix-like Bibtex

12-21

Lifecycle configuration update for Amazon SageMaker notebook instances

11-06

Building Spring Cloud Microservices That Strangle Legacy Systems

08-30

Building Spring Cloud Microservices That Strangle Legacy Systems

08-30

R now supported in Azure SQL Database

11-28

How To Remotely Send R and Python Execution to SQL Server from Jupyter Notebooks

07-17

Building Spring Cloud Microservices That Strangle Legacy Systems

08-30

Building Spring Cloud Microservices That Strangle Legacy Systems

08-30

Analyzing The Papers Behind Facebook's Computer Vision Approach

09-01

Analyzing The Papers Behind Facebook's Computer Vision Approach

09-01

10 Best Mobile Apps for Data Scientist / Data Analysts

10-10

Python 2.7 still reigns supreme in pip installs

09-03

Python 2.7 still reigns supreme in pip installs

09-03

Republican-leaning states tend to have more traffic deaths

09-04

Republican-leaning states tend to have more traffic deaths

09-04

Republican-leaning states tend to have more traffic deaths

09-04

Republican-leaning states tend to have more traffic deaths

09-04

Canada Map

12-09

7 Awesome Things You Can Do in Dataiku Without Coding

11-02

collateral

11-02

Drawing beautiful maps programmatically with R, sf and ggplot2 — Part 1: Basics

10-25

Getting started Stamen maps with ggmap

10-25

Synesthesia: The Sound of Style

08-29

The Probability Monad and Why it's Important for Data Science

09-05

Approaching fairness in machine learning

09-06

If you did not already know

12-06

Delayed Impact of Fair Machine Learning

05-17

Assorted links

12-21

Approaching fairness in machine learning

09-06

RSiteCatalyst Version 1.4.13 Release Notes

07-23

Approaching fairness in machine learning

09-06

Highlights of 2018

12-18

StanCon 2018 Helsinki talk slides, notebooks and code online

12-03

SatRday talks recordings

10-17

PyImageConf 2018 Recap

10-01

StanCon 2018 Helsinki tutorial videos online

09-04

Videos from NYC R Conference

08-28

On the growth of our PyDataLondon community

08-16

Highlights from the useR! 2018 conference in Brisbane

07-18

SatRdays Cardiff

07-04

Highlights of NAACL-HLT 2018: Generalization, Test-of-time, and Dialogue Systems

06-12

“Creating correct and capable classifiers” at PyDataAmsterdam 2018

05-26

My eRum 2018 biggest highlights

05-19

NIPS 2017 Summary

12-11

A Survival Guide to a PhD

09-07

“Creating correct and capable classifiers” at PyDataAmsterdam 2018

05-26

NIPS 2017 Summary

12-11

A Survival Guide to a PhD

09-07

French Mortality Poster

12-27

“Should I get a PhD to be a data scientist/analytics professional?”

11-19

A Survival Guide to a PhD

09-07

Wire Gauges

09-07

Wire Gauges

09-07

Wire Gauges

09-07

Who is a Data Scientist?

12-27

“The hype economy”

11-20

5 Alternatives to the Default R Outputs for GLMs and Linear Models

10-17

Wire Gauges

09-07

Wire Gauges

09-07

Attention and Augmented Recurrent Neural Networks

09-08

Attention and Augmented Recurrent Neural Networks

09-08

NLP Overview: Modern Deep Learning Techniques Applied to Natural Language Processing

01-08

Document worth reading: “The Dynamics of Learning: A Random Matrix Approach”

12-04

How to Create a Simple Neural Network in Python

10-02

If you did not already know

08-30

Whats new on arXiv

08-17

Document worth reading: “Examining the Use of Neural Networks for Feature Extraction: A Comparative Analysis using Deep Learning, Support Vector Machines, and K-Nearest Neighbor Classifiers”

08-08

Attention and Augmented Recurrent Neural Networks

09-08

MS in Applied Data Science Online – which track is right for you?

01-10

Miami University: Director of the Center for Analytics & Data Science (CADS) [Oxford, OH]

12-20

Kent State University: Assistant/Associate Professor – Business Analytics/Information Systems [Kent, OH]

12-19

Yeshiva University: Data Science Program Director [New York, NY]

11-30

University of Tennessee Knoxville: Assistant or Associate Professor in Data Science [Knoxville, TN]

11-30

8 Reasons to Take Data Analytics Certification Courses

11-28

UnitedHealth Group: Clinical Data Statistical Analyst – SQL SAS (Clinician Required) [Telecommute]

11-16

Introducing Drexel new online MS in Data Science

11-15

Strategy: Customer Analytics: Are you Profiting from your Data?

11-14

Vanderbilt University: Sr Lecturer in Data and Analytics [Nashville, TN]

11-05

Vanderbilt University’s Peabody College: Sr. Lecturer in Data and Analytics [Nashville, TN]

11-05

The role of academia in data science education

11-01

Distilled News

10-22

Why are functional programming languages so popular in the programming languages community?

10-11

The economic consequences of MOOCs

10-08

Chromebook Data Science - a free online data science program for anyone with a web browser.

10-01

If you did not already know

09-22

Thanks, NVIDIA

08-01

Data Science in 30 Minutes: Using Data Science to Predict the Future with Kirk Borne

07-11

Announcement – The Data Incubator Partnership with MRI Network

06-28

An Updated Review of The Data Incubator Data Science Bootcamp

05-29

Kolmogorov and randomness

02-18

Attention and Augmented Recurrent Neural Networks

09-08

Document worth reading: “Analytics for the Internet of Things: A Survey”

09-12

Document worth reading: “Fog Computing: Survey of Trends, Architectures, Requirements, and Research Directions”

08-22

Outside a train rumbles by

09-09

Outside a train rumbles by

09-09

Did she really live 122 years?

01-08

“When Both Men and Women Drop Out of the Labor Force, Why Do Economists Only Ask About Men?”

12-23

Heatmaps of Mortality Rates

12-04

Compare population age structures of Europe NUTS-3 regions and the US counties using ternary color-coding

12-03

Ask the Question, Visualize the Answer

10-17

Python Pandas Tutorial: The Basics

11-23

TensorFlow in a Nutshell — Part Two: Hybrid Learning

09-13

If you did not already know

09-01

TensorFlow in a Nutshell — Part Two: Hybrid Learning

09-13

Solving Real-Life Mysteries with Big Data and Apache Spark

09-13

Solving Real-Life Mysteries with Big Data and Apache Spark

09-13

Solving Real-Life Mysteries with Big Data and Apache Spark

09-13

Basic Statistics in Python: Probability

07-18

Basic Statistics in Python: Descriptive Statistics

07-03

k-Nearest Neighbors & Anomaly Detection Tutorial

09-14

k-Nearest Neighbors & Anomaly Detection Tutorial

09-14

k-Nearest Neighbors & Anomaly Detection Tutorial

09-14

If you did not already know

12-30

R Packages worth a look

10-02

Thoughts On Machine Learning Accuracy

07-27

How to Overcome Imposter Syndrome For Good

05-30

Collaborative Filtering using Alternating Least Squares

09-17

Generating data to explore the myriad causal effects that can be estimated in observational data analysis

11-20

Bayesian Linear Regression (in PyMC) - a different way to think about regression

02-09

Collaborative Filtering using Alternating Least Squares

09-17

Document worth reading: “Neural Style Transfer: A Review”

01-02

Probability Calibration And Isotonic Regression

09-18

Probability Calibration And Isotonic Regression

09-18

Raghuveer Parthasarathy’s big idea for fixing science

11-01

Help improve lives through Machine Learning by joining the AWS DeepLens Challenge!

09-25

Document worth reading: “Putting Data Science In Production”

09-07

Distilled News

08-31

Document worth reading: “Cogniculture: Towards a Better Human-Machine Co-evolution”

08-18

Distilled News

07-31

The Data Incubator Unofficial Frequently Asked Questions

05-30

Smart Cities at the Nexus of Emerging Data Technologies and You

09-18

3 Stages of Creating Smart

10-04

Document worth reading: “Human-Machine Inference Networks For Smart Decision Making: Opportunities and Challenges”

09-26

Distilled News

08-28

Smart Cities at the Nexus of Emerging Data Technologies and You

09-18

Discussion of "Fast Approximate Inference for Arbitrarily Large Semiparametric Regression Models via Message Passing"

09-19

Ask Why! Finding motives, causes, and purpose in data science

09-19

Machine Learning Trick of the Day (7): Density Ratio Trick

01-14

Turning Distances into Distributions

09-19

Sales Automation Through a Deep Learning Platform

09-22

Sales Automation Through a Deep Learning Platform

09-22

Poker odds with wild cards

09-27

Poker Odds

09-22

Poker Odds

09-22

Poker Odds

09-22

Poker Odds

09-22

Assorted links

09-22

Assorted links

09-22

Why do sociologists (and bloggers) focus on the negative? 5 possible explanations. (A post in the style of Fabio Rojas)

12-17

NG "roll returns" – inflection point?

11-05

Verlet Simulations

07-16

Twitter bots for good, and information contagion!

09-27

Assorted links

09-22

A Programmer’s Introduction to Mathematics

12-01

While We Were Busy with Prosperity

11-10

Assorted links

09-22

A Programmer’s Introduction to Mathematics

12-01

Assorted links

09-22

No juice for you, CSV format. It just makes you more awful.

09-23

GPU-accelerated Theano & Keras with Windows 10

09-23

Very Non-Standard Calling in R

12-03

Very Non-Standard Calling in R

12-03

A Billion Words and The Limits of Language Modeling

09-23

R Packages worth a look

11-30

R Packages worth a look

08-11

Binary Stochastic Neurons in Tensorflow

09-24

MRP (multilevel regression and poststratification; Mister P): Clearing up misunderstandings about

01-10

Horses for courses, or to each model its own (causal effect)

11-28

More on Bias Corrected Standard Deviation Estimates

11-14

How to de-Bias Standard Deviation Estimates

11-12

How to de-Bias Standard Deviation Estimates

11-12

R Packages worth a look

11-01

Don’t calculate post-hoc power using observed estimate of effect size

09-24

R Packages worth a look

09-06

“Identification of and correction for publication bias,” and another discussion of how forking paths is not the same thing as file drawer

08-31

R Packages worth a look

08-27

Response to Rafa: Why I don’t think ROC [receiver operating characteristic] works as a model for science

08-05

Crazy Progress Bars

01-31

Sensor Fusion Tutorial

10-05

Binary Stochastic Neurons in Tensorflow

09-24

Unevenly Spaced Data

09-26

An even better rOpenSci website with Hugo

01-09

Unevenly Spaced Data

09-26

Unevenly Spaced Data

09-26

Poker odds with wild cards

09-27

Cribbage Scores

02-25

Aligned Clock Hands

11-04

Poker odds with wild cards

09-27

Sock Puzzle Revisited

03-07

Poker odds with wild cards

09-27

Attractive Mathematical Properties Of The Roc Curve

09-27

Attractive Mathematical Properties Of The Roc Curve

09-27

Attractive Mathematical Properties Of The Roc Curve

09-27

Attractive Mathematical Properties Of The Roc Curve

09-27

Your and my 2019 R goals

01-01

Timing Grouped Mean Calculation in R

12-08

Timing Grouped Mean Calculation in R

12-08

Quasiquotation in R via bquote()

10-16

A Subtle Flaw in Some Popular R NSE Interfaces

09-24

Parameterizing with bquote

09-16

A Quick Appreciation of the R transform Function

09-10

How many CRAN package maintainers have been pwned?

04-18

implyr: R Interface for Apache Impala

07-19

Introducing sparklyr, an R Interface for Apache Spark

09-30

Distilled News

12-15

Introducing sparklyr, an R Interface for Apache Spark

09-30

Gift ideas for the R lovers

12-14

Teaching and Learning Materials for Data Visualization

12-12

Escaping the macOS 10.14 (Mojave) Filesystem Sandbox with R / RStudio

11-09

Stan development in RStudio

10-17

Announcing RStudio Package Manager

10-17

RStudio 1.2 Preview: Stan

10-16

Stan on the web! (thanks to RStudio)

10-12

Introducing sparklyr, an R Interface for Apache Spark

09-30

Part 2, further comments on OfS grade-inflation report

01-07

Southern Illinois University Edwardsville: Director of the Center for Predictive Analytics/(Associate) Professor of Mathematics and Statistics [Edwardsville, IL]

01-04

Applying for a PhD program in visualization

01-03

Office for Students report on “grade inflation”

01-02

Kick Start Your Data Career! Tips From the Frontline

12-05

Hey, check this out: Columbia’s Data Science Institute is hiring research scientists and postdocs!

11-16

Lehigh University: Tenure Track Positions in Foundations of Data Science [Bethlehem, PA]

10-30

My talk tomorrow (Tues) 4pm in the Biomedical Informatics department (at 168th St)

10-01

Inside Higher Ed: Pushing the Boundaries of Learning With AI

09-26

“Seeding trials”: medical marketing disguised as science

08-01

COLT 2018 call for papers

10-24

Machine Learning in Science and Industry slides

04-20

Discarded Hard Drives: Data Science as Debugging

03-14

NLP and ML Publications – Looking Back at 2016

01-04

NIPS 2016 Workshop on Approximate Inference

09-30

FAQ on ICML 2019 Code Submission Policy

12-19

COLT 2018 call for papers

10-24

NIPS 2017 Workshop on Approximate Inference

09-25

Reddit science discussions as a dataset

06-22

NIPS 2016 Workshop on Approximate Inference

09-30

Magister Dixit

12-05

If you did not already know

10-04

Document worth reading: “What am I searching for?”

08-28

If you did not already know

08-07

Parsimonious principle vs integration over all uncertainties

07-26

Differentiable Dynamic Programs and SparseMAP Inference

05-15

On Pyro - Deep Probabilistic Programming on PyTorch

11-03

NIPS 2017 Workshop on Approximate Inference

09-25

What's new in PyMC3 3.1

07-05

NIPS 2016 Workshop on Approximate Inference

09-30

NIPS 2016 Workshop on Approximate Inference

09-30

Distilled News

01-06

Remembering Michael

10-08

Synthetic Gradients with Tensorflow

04-08

Colorizing the DRAW Model

12-06

Deep Learning Research Review Week 1: Generative Adversarial Nets

09-30

If you did not already know

01-06

Generative Adversarial Networks – Paper Reading Road Map

10-24

aRt with code

07-27

Deep Learning Research Review Week 1: Generative Adversarial Nets

09-30

Deep Learning Research Review Week 1: Generative Adversarial Nets

09-30

“A Guide to Working With Census Data in R” is now Complete!

11-05

What is DRAW (Deep Recurrent Attentive Writer)?

10-02

What is DRAW (Deep Recurrent Attentive Writer)?

10-02

What is DRAW (Deep Recurrent Attentive Writer)?

10-02

What is DRAW (Deep Recurrent Attentive Writer)?

10-02

5 things that happened in Data Science in 2018

01-08

Deep Reinforcement Learning in Action (Announcement)

06-20

Learning Reinforcement Learning (with Code, Exercises and Solutions)

10-02

Claims and Evidence: A Joke

10-03

Of Tennys players and moral Hazards

07-28

Claims and Evidence: A Joke

10-03

New Year's Resolutions 2019

01-01

Automated Web Scraping in R

12-11

Will Models Rule the World? Data Science Salon Miami, Nov 6-7

10-19

Review of The Data Incubator data science bootcamp

05-29

RSiteCatalyst Version 1.4.10 Release Notes

12-13

Claims and Evidence: A Joke

10-03

Claims and Evidence: A Joke

10-03

Claims and Evidence: A Joke

10-03

TensorFlow in a Nutshell — Part Three: All the Models

10-03

TensorFlow in a Nutshell — Part Three: All the Models

10-03

Practical Apache Spark in 10 Minutes

01-11

Whats new on arXiv

01-09

If you did not already know

01-04

“Principles of posterior visualization”

01-01

2018 Year-in-Review: Machine Learning Open Source Projects & Frameworks

12-17

Distilled News

12-15

Whats new on arXiv

12-13

Document worth reading: “A Short Introduction to Local Graph Clustering Methods and Software”

12-10

ROCK 'n' ROLL TRAFFIC ROUTING, WITH NEO4J, PART 2

12-05

Distilled News

12-02

Project planning with plotly

11-26

If you did not already know

11-16

If you did not already know

11-13

Whats new on arXiv

10-26

KDnuggets™ News 18:n40, Oct 24: Graphs Are The Next Frontier In Data Science; Apache Spark Intro for Beginners

10-24

If you did not already know

10-23

Graphs Are The Next Frontier In Data Science

10-18

If you did not already know

10-11

If you did not already know

10-10

Whats new on arXiv

10-04

If you did not already know

09-26

How to graph a function of 4 variables using a grid

09-20

If you did not already know

08-27

Whats new on arXiv

08-09

50 states Rural Postman Problem

11-19

Intro to graph optimization: solving the Chinese Postman Problem

10-07

How-to: Do Scalable Graph Analytics with Apache Spark

10-03

How-to: Do Scalable Graph Analytics with Apache Spark

10-03

Champagne Bottles

10-04

Champagne Bottles

10-04

Champagne Bottles

10-04

Searching for the optimal hyper-parameters of an ARIMA model in parallel: the tidy gridsearch approach

11-15

Discussion of effects of growth mindset: Let’s not demand unrealistic effect sizes.

09-13

Champagne Bottles

10-04

Champagne Bottles

10-04

Sensor Fusion Tutorial

10-05

Talk: How Do We Support Under-represented Groups To Put Themselves Forward?

11-01

Visualizing The Catholic Lectionary – Part 1

10-27

Sensor Fusion Tutorial

10-05

R Packages worth a look

11-28

Machine Learning Trick of the Day (7): Density Ratio Trick

01-14

ICML 2017 Workshop on Implicit Models

06-02

Nonparametric Density Estimation Parzen Windows And Beyond

10-06

Stan on the web! (thanks to RStudio)

10-12

Webcam based image processing in Jupyter notebooks

04-09

Nonparametric Density Estimation Parzen Windows And Beyond

10-06

A deep dive into glmnet: offset

01-09

A deep dive into glmnet: standardize

11-15

R Packages worth a look

10-28

Nonparametric Density Estimation Parzen Windows And Beyond

10-06

Cinderella science

08-05

Nonparametric Density Estimation Parzen Windows And Beyond

10-06

Document worth reading: “The importance of being dissimilar in Recommendation”

12-30

WNS Hackathon Solutions by Top Finishers

12-13

Document worth reading: “A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition”

12-05

If you did not already know

11-17

A Data Scientist’s Guide to an Efficient Project Lifecycle

10-25

The Main Approaches to Natural Language Processing Tasks

10-17

Robust Quality – Powerful Integration of Data Science and Process Engineering

10-01

Whats new on arXiv

08-21

Optimization of Scientific Code with Cython: Ising Model

12-11

Nonparametric Density Estimation Parzen Windows And Beyond

10-06

Top KDnuggets tweets, Oct 3–9: 5 Reasons Logistic Regression should be the first thing you learn when becoming a Data Scientist

10-10

AI Lab: Learn to Code with the Cutting-Edge Microsoft AI Platform

06-19

Microsoft Weekly Data Science News for May 18, 2018

05-18

Cognitive Machine Learning: Prologue

10-08

10 Data Science Skills to Land your Dream Job in 2019

12-12

R Packages worth a look

11-11

Introduction to PyTorch for Deep Learning

11-07

Why you need GPUs for your deep learning platform

10-16

Data Center Scale Computing and Artificial Intelligence with Matei Zaharia, Inventor of Apache Spark

09-12

Journal: PLXtrum - realtime machine learning for predicting note onset

01-28

Type Safety and Statistical Computing

12-12

Cognitive Machine Learning: Prologue

10-08

Likes Out! Guerilla Dataset!

10-09

Likes Out! Guerilla Dataset!

10-09

Document worth reading: “Recommendation System based on Semantic Scholar Mining and Topic modeling: A behavioral analysis of researchers from six conferences”

01-03

Likes Out! Guerilla Dataset!

10-09

BD reviews

07-11

Multiple Raffle Strategy

10-09

Multiple Raffle Strategy

10-09

WordPress to Jekyll: A 30x Speedup

10-10

PyData DC 2016 Talk

10-11

A fully asynchronous variant of the SAGA algorithm

10-11

Poor Customer Support?

08-28

Support Becoming a Data Scientist!

12-07

A fully asynchronous variant of the SAGA algorithm

10-11

A fully asynchronous variant of the SAGA algorithm

10-11

“When Both Men and Women Drop Out of the Labor Force, Why Do Economists Only Ask About Men?”

12-23

Mastering the Learning Rate to Speed Up Deep Learning

11-06

R Packages worth a look

10-08

Document worth reading: “An Information-Theoretic Analysis of Deep Latent-Variable Models”

08-23

Estimating mortality rates in Puerto Rico after hurricane María using newly released official death counts

06-08

Crazy Progress Bars

01-31

A fully asynchronous variant of the SAGA algorithm

10-11

A fully asynchronous variant of the SAGA algorithm

10-11

Einstein's Spacetime

10-11

Distilled News

10-01

Einstein's Spacetime

10-11

A Bayesian take on ballot order effects

11-21

Benford’s Law for Fraud Detection with an Application to all Brazilian Presidential Elections from 2002 to 2018

11-17

2018: How did people actually vote? (The real story, not the exit polls.)

11-16

2018: What really happened?

11-10

Simple Feed Ranking Algorithm

10-28

Narcolepsy Could Be ‘Sleeper Effect’ in Trump and Brexit Campaigns

09-12

He wants to model a proportion given some predictors that sum to 1

07-10

Retrospective review of my first deep learning competition

07-22

Simulating the Colombian Peace Vote: Did the "No" Really Win?

10-12

Simulating the Colombian Peace Vote: Did the "No" Really Win?

10-12

Simulating the Colombian Peace Vote: Did the "No" Really Win?

10-12

Simulating the Colombian Peace Vote: Did the "No" Really Win?

10-12

Simulación Estadística del Plebiscito Colombiano: ¿Realmente Ganaron Los del "No?"

10-12

Simulación Estadística del Plebiscito Colombiano: ¿Realmente Ganaron Los del "No?"

10-12

Simulación Estadística del Plebiscito Colombiano: ¿Realmente Ganaron Los del "No?"

10-12

Simulación Estadística del Plebiscito Colombiano: ¿Realmente Ganaron Los del "No?"

10-12

Simulación Estadística del Plebiscito Colombiano: ¿Realmente Ganaron Los del "No?"

10-12

Getting Started with Competitions - A Peer to Peer Guide

08-22

Webcam based image processing in Jupyter notebooks

04-09

Quick reference to Python in a single script (and notebook)

10-13

Document worth reading: “Data learning from big data”

09-06

Quick reference to Python in a single script (and notebook)

10-13

Distilled News

01-06

Meta-Learning For Better Machine Learning

12-17

If you did not already know

12-15

If you did not already know

12-11

Image segmentation based on Superpixels and Clustering

11-09

K-means clustering with Amazon SageMaker

11-08

Python vs R: Head to Head Data Analysis

11-01

If you did not already know

10-25

Variety is the Secret Sauce for Big Discoveries in Big Data

09-18

Distilled News

09-14

Distilled News

08-04

Using Clustering Algorithms to Analyze Golf Shots from the U.S. Open

06-19

Image Compression using K-means Clustering.

05-28

Lumpers and Splitters: Tensions in Taxonomies

04-05

TSrepr use case - Clustering time series representations in R

03-13

What Do Data Scientists Need to Know about Containerization? As Little as Possible.

02-22

Self-Organizing Maps Tutorial

11-02

Clustering applied to showers in the OPERA

07-10

How to mine newsfeed data and extract interactive insights in Python

03-15

MULTI-VARIATE ANALYSIS

03-01

Wine dataset demonstrates importance of feature scaling

01-17

Clustering Zeppelin on Zeppelin

10-23

How to Use t-SNE Effectively

10-13

How to Use t-SNE Effectively

10-13

How to Use t-SNE Effectively

10-13

R Packages worth a look

12-13

R Packages worth a look

11-11

R Packages worth a look

10-18

If you did not already know

10-15

If you did not already know

10-03

Document worth reading: “A practical tutorial on autoencoders for nonlinear feature fusion: Taxonomy, models, software and guidelines”

09-16

If you did not already know

08-15

On Tensor Networks and the Nature of Non-Linearity

06-20

Machine Learning in Science and Industry slides

04-20

Introduction to Support Vector Machine

02-20

Gradient descent learns linear dynamical systems

10-13

Bayesian Nonparametric Models in NIMBLE, Part 2: Nonparametric Random Effects

12-07

Deriving Expectation-Maximization

11-11

The Generalization Mystery: Sharp vs Flat Minima

01-18

Gradient descent learns linear dynamical systems

10-13

Deep reinforcement learning, battleship

10-15

Document worth reading: “A Study and Comparison of Human and Deep Learning Recognition Performance Under Visual Distortions”

12-18

How AI Can Help Cope with Data Scientists’ Boredom

10-24

Document worth reading: “Automatic Language Identification in Texts: A Survey”

09-20

Asynchronous Scraping with Python

10-16

Automated Web Scraping in R

12-11

R Packages worth a look

09-18

An ode to King James

06-10

Asynchronous Scraping with Python

10-16

Here’s why 2019 is a great year to start with R: A story of 10 year old R code then and now

01-05

If you did not already know

01-01

If you did not already know

12-30

If you did not already know

12-26

Soft Actor Critic—Deep Reinforcement Learning with Real-World Robots

12-14

Intro to Data Science for Managers

11-23

Introducing Octoparse New Version 7.1 – web scraping for dummies is official

11-20

If you did not already know

11-17

If you did not already know

11-13

Introducing a simple and intuitive Python API for UCI machine learning repository

11-12

Introducing Webhooks — Fastest Way to Collect Data

11-08

If you did not already know

10-19

The Main Approaches to Natural Language Processing Tasks

10-17

Distilled News

10-06

A psychology researcher uses Stan, multiverse, and open data exploration to explore human memory

09-21

Document worth reading: “A Taxonomy for Neural Memory Networks”

09-13

Visual Reinforcement Learning with Imagined Goals

09-06

If you did not already know

08-30

If you did not already know

08-30

Azure Functions for Data Science

08-06

Artificial Intelligence in the Workplace

08-03

New Research on Multi-Task Learning

07-24

John Mount speaking on rquery and rqdatatable

07-11

Supercharging Classification - The Value of Multi-task Learning

06-26

Goals and Principles of Representation Learning

04-12

Learning Robot Objectives from Physical Human Interaction

02-06

Evolving Stable Strategies

11-12

When (not) to use Deep Learning for NLP

09-04

Asynchronous Scraping with Python

10-16

Asynchronous Scraping with Python

10-16

R Packages worth a look

01-06

Document worth reading: “Searching Toward Pareto-Optimal Device-Aware Neural Architectures”

01-05

Document worth reading: “A Tutorial on Bayesian Optimization”

12-01

Whats new on arXiv

10-24

Machine learning — Is the emperor wearing clothes?

10-12

Differentiable Image Parameterizations

07-25

Feature Visualization

11-07

COLT 2017 accepted papers

06-03

Why Momentum Really Works

04-04

Learning to Rank Sketchfab Models with LightFM

11-07

Deconvolution and Checkerboard Artifacts

10-17

Deconvolution and Checkerboard Artifacts

10-17

Deconvolution and Checkerboard Artifacts

10-17

Deconvolution and Checkerboard Artifacts

10-17

Deconvolution and Checkerboard Artifacts

10-17

Gradientes de Recurrent Neural Networks y Lo Que Aprendí Derivándolos

10-18

Gradientes de Recurrent Neural Networks y Lo Que Aprendí Derivándolos

10-18

Deriving Expectation-Maximization

11-11

Time Series for scikit-learn People (Part I): Where's the X Matrix?

01-28

Deriving the Softmax from First Principles

04-19

Recurrent Neural Network Gradients, and Lessons Learned Therein

10-18

Gradientes de Recurrent Neural Networks y Lo Que Aprendí Derivándolos

10-18

Gradientes de Recurrent Neural Networks y Lo Que Aprendí Derivándolos

10-18

Recurrent Neural Network Gradients, and Lessons Learned Therein

10-18

Intro to Implicit Matrix Factorization: Classic ALS with Sketchfab Models

10-19

“discover feature relationships” – new EDA tool

01-10

UI Update — Datazar

11-07

Facial feedback: “These findings suggest that minute differences in the experimental protocol might lead to theoretically meaningful changes in the outcomes.”

11-01

How DataCamp Handles Course Quality

10-25

R Packages worth a look

08-01

Recommender System With Implicit Feedback

11-18

Intro to Implicit Matrix Factorization: Classic ALS with Sketchfab Models

10-19

Tutorial: An app in R shiny visualizing biopsy data —  in a pharmaceutical company

01-07

Distilled News

01-06

Meta-Learning For Better Machine Learning

12-17

If you did not already know

12-15

An 8-hour course on R and Data Mining

12-09

An 8-hour course on R and Data Mining

12-09

If you did not already know

11-23

Document worth reading: “To Cluster, or Not to Cluster: An Analysis of Clusterability Methods”

11-23

R Packages worth a look

11-20

Image segmentation based on Superpixels and Clustering

11-09

K-means clustering with Amazon SageMaker

11-08

Distilled News

10-28

Basic Image Data Analysis Using Python – Part 4

10-05

Distilled News

09-14

R Packages worth a look

09-06

If you did not already know

08-30

Distilled News

08-04

Using Clustering Algorithms to Analyze Golf Shots from the U.S. Open

06-19

U.S. Open Data — Gathering and Understanding the Data from 2018 Shinnecock

06-15

Lumpers and Splitters: Tensions in Taxonomies

04-05

What is Machine Learning?

07-17

Clustering applied to showers in the OPERA

07-10

MULTI-VARIATE ANALYSIS

03-01

Wine dataset demonstrates importance of feature scaling

01-17

Clustering Zeppelin on Zeppelin

10-23

Clustering Zeppelin on Zeppelin

10-23

DynamoDB Learnings

10-23

NLP Overview: Modern Deep Learning Techniques Applied to Natural Language Processing

01-08

P&G: Data Scientist – Machine Learning/NLP [Cincinnati, OH]

12-11

Data Representation for Natural Language Processing Tasks

11-02

Moody’s Analytics: Machine Learning / NLP – Research Scientist / Engineer [New York, NY]

10-30

The Main Approaches to Natural Language Processing Tasks

10-17

When (not) to use Deep Learning for NLP

09-04

AI ‘judge’ doesn’t explain why it reaches certain decisions

10-24

AI ‘judge’ doesn’t explain why it reaches certain decisions

10-24

Six Sigma DMAIC Series in R – Part4

12-15

Distilled News

12-06

Interacting with ML Models

10-26

Goals of Interpretability

11-17

Interacting with ML Models

10-26

Once Again: Prefer Confidence Intervals to Point Estimates

10-30

MRP (multilevel regression and poststratification; Mister P): Clearing up misunderstandings about

01-10

Document worth reading: “Which Knowledge Graph Is Best for Me?”

01-07

Document worth reading: “Neural Style Transfer: A Review”

01-02

Magister Dixit

12-24

Update on the R Consortium Census Working Group

10-22

Document worth reading: “Deep Learning for Generic Object Detection: A Survey”

10-10

The complex process of obtaining Puerto Rico mortality data: a timeline

09-28

Document worth reading: “Graph-based Ontology Summarization: A Survey”

09-23

Document worth reading: “Automatic Language Identification in Texts: A Survey”

09-20

N=1 survey tells me Cynthia Nixon will lose by a lot (no joke)

09-13

Data concerns when interpreting comparisons of gender equality between countries

08-21

Data concerns when interpreting comparisons of gender equality between countries

08-21

Data Notes: Predict the World Cup 2018 Winner

06-14

Announcing Elemetric

06-23

Once Again: Prefer Confidence Intervals to Point Estimates

10-30

Once Again: Prefer Confidence Intervals to Point Estimates

10-30

Once Again: Prefer Confidence Intervals to Point Estimates

10-30

Two papers released on arXiv, "Operator Variational Inference" and "Model Criticism for Bayesian Causal Inference"

10-30

Document worth reading: “Deep learning in agriculture: A survey”

01-12

Core Principles of Sustainable Data Science, Machine Learning and AI Product Development: Research as a core driver

01-09

3 Challenges for Companies Tackling Data Science

11-26

Machine Learning in Action: Going Beyond Decision Support Data Science

11-20

Applications of R presented at EARL London 2018

09-21

Curalate makes social sell with AI using Apache MXNet on AWS

08-13

Everything is a Model

12-13

Lies, Damned Lies and Big Data

11-19

Ten Ways Your Data Project is Going to Fail

11-01

Analyzing Housing Prices in Berkeley

11-02

Analyzing Housing Prices in Berkeley

11-02

How to build a data science project from scratch

12-05

Are you buying an apartment? How to hack competition in the real estate market

10-26

Analyzing Housing Prices in Berkeley

11-02

Analyzing Housing Prices in Berkeley

11-02

AzureR packages now on CRAN

01-08

Announcing the ultimate seminar speaker contest: 2019 edition!

01-06

Analyzing Housing Prices in Berkeley

11-02

Paper: A Differentiable Physics Engine for Deep Learning in Robotics

11-03

Paper: A Differentiable Physics Engine for Deep Learning in Robotics

11-03

Paper: A Differentiable Physics Engine for Deep Learning in Robotics

11-03

Artificial Neural Networks Introduction (Part II)

11-03

Aligned Clock Hands

11-04

Le Monde puzzle [#1075]

12-11

Aligned Clock Hands

11-04

Aligned Clock Hands

11-04

Learning to Rank Sketchfab Models with LightFM

11-07

Whats new on arXiv

10-24

If you did not already know

09-04

Differentiable Image Parameterizations

07-25

Feature Visualization

11-07

Learning to Rank Sketchfab Models with LightFM

11-07

Benford’s Law for Fraud Detection with an Application to all Brazilian Presidential Elections from 2002 to 2018

11-17

Maps of the issues mentioned most in election advertising

11-05

New Course: Analyzing Election and Polling Data in R

11-01

Vote suppression in corrupt NY State

10-24

High school statistics class builds election prediction model

10-23

Twitter, Social Bots, and the US Presidential Elections!

11-07

Twitter, Social Bots, and the US Presidential Elections!

11-07

Twitter, Social Bots, and the US Presidential Elections!

11-07

Software as an academic publication

05-03

GitHub's one-dimensional view of open source contributions

11-07

GitHub's one-dimensional view of open source contributions

11-07

Demystifying Data Science

11-10

Top Skills Needed to Work as Data Scientist in iGaming

01-10

How to Write a Great Data Science Resume

01-03

Who is a Data Scientist?

12-27

How to Land a Job As a Data Scientist in 2019

12-24

How to Find an Entry-Level Job in Data Science

11-13

The Most in Demand Skills for Data Scientists

11-02

NAIC: Analyst I (Capital Markets) [New York, NY]

10-29

RConsortium — Building an R Certification

10-26

From Project Manager to Data Champion — Conquer Your Data Projects

10-18

Top 10 Mistakes to Avoid to Master Data Science

10-10

Here is How You Can build a data science team from scratch in 2018 - The Definitive Guide

10-05

Top 10 Mistakes to Avoid to Master Data Science

10-04

Data Science at Northwestern

10-03

Advice on soft skills for academics

07-25

DIY AI for the Future

06-27

How to make the transition from academia to data science

04-23

Demystifying Data Science

11-10

InformationAge: Will 2019 See the Automation of Automation and Push Up Salaries of Data Scientists?

12-11

Jeremy Freese was ahead of the curve

08-10

Demystifying Data Science

11-10

A Complete Guide to Choosing the Best Machine Learning Course

11-30

Free ebook: Exploring Data with python

11-29

Drexel University: 2 Teaching Faculty Positions in Data Science [Philadelphia, PA]

11-27

Vanderbilt University: Lecturer in Data and Analytics [Nashville, TN]

11-05

Vanderbilt University: Sr Lecturer in Data and Analytics [Nashville, TN]

11-05

Vanderbilt University’s Peabody College: Lecturer in Data and Analytics [Online Teaching]

11-05

Vanderbilt University’s Peabody College: Sr. Lecturer in Data and Analytics [Nashville, TN]

11-05

While We Were Busy with Prosperity

11-10

While We Were Busy with Prosperity

11-10

Bob Erikson on the 2018 Midterms

10-01

Narcolepsy Could Be ‘Sleeper Effect’ in Trump and Brexit Campaigns

09-12

Using Natural Language Processing to Combat Filter Bubbles and Fake News – 360° Stance Detection

04-24

6279e808ef0c35488ea3a81e9b6d302a

07-06

While We Were Busy with Prosperity

11-10

R Packages worth a look

09-22

Ensemble learning for time series forecasting in R

10-19

Enernoc smart meter data - forecast electricity consumption with similar day approach in R

11-12

Enernoc smart meter data - forecast electricity consumption with similar day approach in R

11-12

Combining apparently contradictory evidence

12-30

Sales Forecasting Using Facebook’s Prophet

11-28

Forecasting financial time series with dynamic deep learning on AWS

08-20

Ensemble learning for time series forecasting in R

10-19

R<-Slovakia meetup started to build community in Bratislava

03-26

Enernoc smart meter data - forecast electricity consumption with similar day approach in R

11-12

TSstudio 0.1.3

12-02

Searching for the optimal hyper-parameters of an ARIMA model in parallel: the tidy gridsearch approach

11-15

“Statistical and Machine Learning forecasting methods: Concerns and ways forward”

11-06

Document worth reading: “Machine Learning for Spatiotemporal Sequence Forecasting: A Survey”

10-21

Forecasting financial time series with dynamic deep learning on AWS

08-20

Ensemble learning for time series forecasting in R

10-19

R<-Slovakia meetup started to build community in Bratislava

03-26

Enernoc smart meter data - forecast electricity consumption with similar day approach in R

11-12

Tangent Length Puzzle

11-14

Document worth reading: “Deep Learning for Image Denoising: A Survey”

11-04

Tangent Length Puzzle

11-14

Estimating Pi

10-16

Tangent Length Puzzle

11-14

R Packages worth a look

11-14

R Packages worth a look

08-22

Vestigial trigonometry functions

03-08

Tangent Length Puzzle

11-14

Tangent Length Puzzle

11-14

Dataiku Series C: New Year, New Chapter

12-19

A Deep (But Jargon and Math Free) Dive Into Deep Learning

08-31

Becoming a Data Scientist Podcast Special Episode

11-14

How to Write a Great Data Science Resume

01-03

How to Land a Job As a Data Scientist in 2019

12-24

How Miguel Got 3 Data Science Job Offers Fast With Dataquest

12-24

Analyzing contact center calls—Part 1: Use Amazon Transcribe and Amazon Comprehend to analyze customer sentiment

12-18

Amazon SageMaker Automatic Model Tuning now supports early stopping of training jobs

12-13

10 Data Science Skills to Land your Dream Job in 2019

12-12

6 Step Plan to Starting Your Data Science Career

12-05

Anomaly detection on Amazon DynamoDB Streams using the Amazon SageMaker Random Cut Forest algorithm

12-05

Kick Start Your Data Career! Tips From the Frontline

12-05

Amazon SageMaker Automatic Model Tuning becomes more efficient with warm start of hyperparameter tuning jobs

11-19

How to Find an Entry-Level Job in Data Science

11-13

To get hired as a data scientist, don’t follow the herd

11-12

How a meme grew into a campaign slogan

11-05

Telling Truth from Hype When Hunting for Data Science Work

11-05

How AI Can Help Cope with Data Scientists’ Boredom

10-24

“Demystifying Data Science” remote notes

10-24

How to get a Data Science Job in 6 Months

10-10

Journey from Non-Technical background to an expert in Data Science

10-05

Amazon SageMaker automatic model tuning produces better models, faster

09-25

If you did not already know

09-11

Ethical AI for Data Scientists

08-15

Take These 7 Small Steps To Make a Big Career Move

06-11

How Americans make a living based on their age

03-06

What Do Data Scientists Need to Know about Containerization? As Little as Possible.

02-22

My 10-step path to becoming a remote data scientist with Automattic

07-29

Review of The Data Incubator data science bootcamp

05-29

How to make the transition from academia to data science

04-23

Artificial Intelligence to replace staff at O2

02-28

Becoming a Data Scientist Podcast Special Episode

11-14

Practical Apache Spark in 10 Minutes

01-11

The Role of the Data Engineer is Changing

01-10

If you did not already know

01-04

How Machines Understand Our Language: An Introduction to Natural Language Processing

10-31

How to create useful features for Machine Learning

10-30

Lehigh University: Tenure Track Positions in Foundations of Data Science [Bethlehem, PA]

10-30

Top Obstacles to Overcome when Implementing Predictive Maintenance

10-29

Accelerating Your Algorithms in Production [Webinar Replay]

10-16

a4 Media: Manager, Machine Learning Data Engineer [Long Island City, NY]

10-10

Announcing Ursa Labs: an innovation lab for open source data science

04-19

Engineering Data Science at Automattic

03-20

Joining ASAPP

09-09

Moving On, Looking Back

07-28

Your First Job

11-15

Practical Apache Spark in 10 Minutes

01-11

Polished statistical analysis chapters in evidence-based software engineering

11-24

Data Science in 30 Minutes with Jake Porway of DataKind

11-06

Top Obstacles to Overcome when Implementing Predictive Maintenance

10-29

Engineering Data Science at Automattic

03-20

Moving On, Looking Back

07-28

Your First Job

11-15

Recurrent Neural Networks in Tensorflow III - Variable Length Sequences

11-15

Recurrent Neural Networks in Tensorflow III - Variable Length Sequences

11-15

Using Artificial Intelligence to Augment Human Intelligence

12-04

Fontstellations

11-16

Cummins: Advanced Analytics Platform Principle Engineer [Columbus, IN]

12-12

Physics-Based Learned Design: Teaching a Microscope How to Image

11-26

If you did not already know

10-08

Turn Whiteboard UX Sketches into Working HTML in Seconds – Introducing Sketch2Code

08-30

R Packages worth a look

08-15

R Packages worth a look

08-14

Distilled News

08-09

What Data Scientists should focus on in 2018?

06-27

Technology and Information: Data Science and UX

05-01

T-Shirt Design Contest!

11-19

Fontstellations

11-16

Purr yourself into a math genius

01-03

Fontstellations

11-16

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: December and Beyond

12-04

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: November and Beyond

11-01

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: October and Beyond

10-03

Being at the Center

09-07

Lost Car Key Puzzle

11-16

Lost Car Key Puzzle

11-16

Lost Car Key Puzzle

11-16

Simple python to LaTeX parser

11-18

Simple python to LaTeX parser

11-18

Mirrors

11-16

A transforming river seen from above

08-14

Shared items

08-11

Simple python to LaTeX parser

11-18

Simple python to LaTeX parser

11-18

Lies, Damned Lies and Big Data

11-19

Top 10 Books on NLP and Text Analysis

01-09

Updated Review: jamovi User Interface to R

01-09

R Packages worth a look

12-23

R Packages worth a look

12-22

R Packages worth a look

12-20

UnitedHealth Group: Director, Data Science [Minnetonka, MN]

12-19

R Packages worth a look

12-18

R Packages worth a look

12-11

An 8-hour course on R and Data Mining

12-09

An 8-hour course on R and Data Mining

12-09

R Packages worth a look

12-02

If you did not already know

12-02

If you did not already know

11-30

If you did not already know

11-23

Distilled News

11-16

Discourse Network Analysis: Undertaking Literature Reviews in R

11-15

Finding a house to buy, using statistics

11-14

Data Science With R Course Series – Week 9

11-12

Practical statistics books for software engineers

11-08

New R Cheatsheet: Data Science Workflow with R

11-04

R Packages worth a look

10-28

SQL, Python, & R in One Platform

10-26

R Packages worth a look

10-15

BIG, small or Right Data: Which is the proper focus?

10-08

Document worth reading: “Data Science vs. Statistics: Two Cultures”

09-27

Job opening at CDC: “The Statistician will play a central role in guiding the statistical methods of all major projects of the Epidemiology and Prevention Branch of the CDC Influenza Division, and aid in designing, analyzing, and interpreting research intended to understand the burden of influenza in the US and internationally and identify the best influenza vaccines and vaccine strategies.”

09-26

R Packages worth a look

09-13

Book review: SQL Server 2017 Machine Learning Services with R

09-04

R Packages worth a look

09-03

If you did not already know

08-29

Distilled News

08-22

R Packages worth a look

08-12

If you did not already know

08-12

Lana Del Rey’s Discography through the Lens of Text Analytics

08-09

R Packages worth a look

08-07

Response to Rafa: Why I don’t think ROC [receiver operating characteristic] works as a model for science

08-05

What Is Machine Learning and How Is It Making Our World a Better Place?

06-23

Hail: Scalable Genomics Analysis with Apache Spark

05-02

Wine dataset demonstrates importance of feature scaling

01-17

Data Readiness Levels: Turning Data from Palid to Vivid

01-12

Lies, Damned Lies and Big Data

11-19

Deep learning in Satellite imagery

12-26

Deep learning in Satellite imagery

12-04

The Role of Resources in Data Analysis

06-18

Lies, Damned Lies and Big Data

11-19

T-Shirt Design Contest!

11-19

The Two Tribes of Language Researchers

11-19

Data-Informed vs Data-Driven

11-20

Data-Informed vs Data-Driven

11-20

If you did not already know

10-14

ICML Board and Reviewer profiles

03-05

Artificial Intelligence to replace staff at O2

02-28

Data-Informed vs Data-Driven

11-20

Non-Zero Initial States for Recurrent Neural Networks

11-20

If you did not already know

09-11

Non-Zero Initial States for Recurrent Neural Networks

11-20

Non-Zero Initial States for Recurrent Neural Networks

11-20

Non-Zero Initial States for Recurrent Neural Networks

11-20

Docker and Kaggle with Ernie and Bert

11-22

Docker and Kaggle with Ernie and Bert

11-22

Introductory Machine Learning Terminology with Food

07-18

Docker and Kaggle with Ernie and Bert

11-22

R Packages worth a look

08-11

Grazing in a circular field

11-23

Data Science & ML : A Complete Interview Guide

12-26

Data Science & ML : A Complete Interview Guide

12-19

The Price of Transformation

09-26

Nine digits puzzle

03-02

Grazing in a circular field

11-23

Grazing in a circular field

11-23

Grazing in a circular field

11-23

The SIAM Book Series on Data Science

01-11

Top 10 Books on NLP and Text Analysis

01-09

Modern reproduction of 1847 geometry books

12-18

Must-Have Resources to Become a Data Scientist

12-06

A Programmer’s Introduction to Mathematics

12-01

Practical statistics books for software engineers

11-08

Visualising Networks in ASOIAF – Part II

10-14

Robert Heinlein vs. Lawrence Summers

09-04

Writing Effective Amazon Machine Learning

02-19

Book Review: Computer Age Statistical Inference

11-23

Document worth reading: “Fog Computing: Survey of Trends, Architectures, Requirements, and Research Directions”

08-22

Quantum Computing: Cats, Crushes, and Chemistry

07-30

Book Review: Computer Age Statistical Inference

11-23

Google F1 Server Reading Summary

11-26

Google F1 Server Reading Summary

11-26

R Packages worth a look

12-01

Online Master’s in Applied Data Science From Syracuse

10-05

The Data Incubator Unofficial Frequently Asked Questions

05-30

Review of The Data Incubator data science bootcamp

05-29

Django and Elastic Beanstalk, a perfect combination

11-28

Django and Elastic Beanstalk, a perfect combination

11-28

Django and Elastic Beanstalk, a perfect combination

11-28

Respecting Boundaries with Inhomogeneous Kernels

11-29

Bayesian Nonparametric Models in NIMBLE, Part 1: Density Estimation

12-04

Deriving Expectation-Maximization

11-11

Covariate-Based Diagnostics for Randomized Experiments are Often Misleading

04-06

Bayesian Linear Regression (in PyMC) - a different way to think about regression

02-09

Respecting Boundaries with Inhomogeneous Kernels

11-29

A deep dive into glmnet: penalty.factor

11-13

A quick look at GHCN version 4

11-03

Bayesian Linear Regression (in PyMC) - a different way to think about regression

02-09

Respecting Boundaries with Inhomogeneous Kernels

11-29

Intuition for principal component analysis (PCA)

12-06

Document worth reading: “Attribute-aware Collaborative Filtering: Survey and Classification”

10-23

Echo Chamber Incites Online Mob to Attack Math Profs

09-14

Don't Panic: Deep Learning will be Mostly Harmless

11-29

Salon des Refusés

12-02

“Optimized” floor plan with genetic algorithms

08-06

Salon des Refusés

12-02

Salon des Refusés

12-02

Salon des Refusés

12-02

ML and NLP Publications in 2018

01-09

FAQ on ICML 2019 Code Submission Policy

12-19

Distill Update 2018

08-14

ML/NLP Publications in 2017

01-02

Salon des Refusés

12-02

Forecast double seasonal time series with multiple linear regression in R

12-03

Forecast double seasonal time series with multiple linear regression in R

12-03

✚ Google Dataset Search Impressions, the Challenges of Looking for Data, and Other Places to Find Data

09-13

RescueTime Inference via the "Poor Man's Dirichlet"

02-03

Forecast double seasonal time series with multiple linear regression in R

12-03

Freudenstein’s Equation

12-07

Ackerman Steering

12-03

Ackerman Steering

12-03

Ackerman Steering

12-03

Ackerman Steering

12-03

Kinesis Advantage2: Impressions

12-05

Kinesis Advantage2: Impressions

12-05

Kinesis Advantage2: Impressions

12-05

Kinesis Advantage2: Impressions

12-05

Using Keras' Pretrained Neural Networks for Visual Similarity Recommendations

12-05

Using Keras' Pretrained Neural Networks for Visual Similarity Recommendations

12-05

Using Keras' Pretrained Neural Networks for Visual Similarity Recommendations

12-05

R Packages worth a look

12-12

Experiments in Handwriting with a Neural Network

12-06

Experiments in Handwriting with a Neural Network

12-06

Experiments in Handwriting with a Neural Network

12-06

Experiments in Handwriting with a Neural Network

12-06

Experiments in Handwriting with a Neural Network

12-06

Document worth reading: “Instance-Level Explanations for Fraud Detection: A Case Study”

01-01

Can Lessons from Data Science Help Journalism?

06-27

PyDataBudapest and “Machine Learning Libraries You’d Wish You’d Known About”

11-15

Cognitive Machine Learning (1): Learning to Explain

02-05

Properties of Interpretability

12-06

Properties of Interpretability

12-06

Chromebook Data Science - a free online data science program for anyone with a web browser.

10-01

Support Becoming a Data Scientist!

12-07

Turning Water into Wine

03-13

Support Becoming a Data Scientist!

12-07

Support Becoming a Data Scientist!

12-07

Freudenstein’s Equation

12-07

Understanding the maths of Computed Tomography (CT) scans

01-09

Document worth reading: “The Three Pillars of Machine-Based Programming”

09-18

Freudenstein’s Equation

12-07

Freudenstein’s Equation

12-07

Speeding up TRPO through parallelization and parameter adaptation

12-09

Speeding up TRPO through parallelization and parameter adaptation

12-09

NIPS 2016 Generative Adversarial Training workshop talk

12-10

NIPS 2016 Generative Adversarial Training workshop talk

12-10

NIPS 2016 Generative Adversarial Training workshop talk

12-10

If you did not already know

01-06

Generative Adversarial Networks – Paper Reading Road Map

10-24

If you did not already know

10-21

Document worth reading: “Generative Adversarial Nets for Information Retrieval: Fundamentals and Advances”

10-01

Alchemy, Rigour and Engineering

12-07

Thanksgiving Special 🦃: GANs are Being Fixed in More than One Way

11-23

GANs are Broken in More than One Way: The Numerics of GANs

10-05

Work in progress: Portraits of Imaginary People

06-06

From Instance Noise to Gradient Regularisation

06-01

NIPS 2016 Generative Adversarial Training workshop talk

12-10

NIPS 2016 Generative Adversarial Training workshop talk

12-10

3D printing glass and bronze: Lost-PLA casting

12-11

3D printing glass and bronze: Lost-PLA casting

12-11

3D printing glass and bronze: Lost-PLA casting

12-11

Think slow, think fast

12-12

Think slow, think fast

12-12

Think slow, think fast

12-12

How Different are Conventional Programming and Machine Learning?

12-10

Weekly Review: 11/11/2017

11-11

Type Safety and Statistical Computing

12-12

RSiteCatalyst Version 1.4.10 Release Notes

12-13

RSiteCatalyst Version 1.4.10 Release Notes

12-13

Miami University: Assistant Provost for Institutional Research and Effectiveness [Oxford, OH]

12-26

One-stop-learning-shop for data pros – get exclusive access for less than a cup of coffee

12-06

RSiteCatalyst Version 1.4.10 Release Notes

12-13

On Model Mismatch and Bayesian Analysis

12-13

On Model Mismatch and Bayesian Analysis

12-13

Join AI experts from Google Brain, Open AI & Uber AI Labs in San Francisco

11-01

PyImageConf 2018 Recap

10-01

My eRum 2018 biggest highlights

05-19

At NIPS 2017

12-04

Post NIPS Reflections

12-13

Generating World Flags with Sparse Auto-Encoders

12-14

Generating World Flags with Sparse Auto-Encoders

12-14

Extreme IO performance with parallel Apache Parquet in Python

02-10

Generating World Flags with Sparse Auto-Encoders

12-14

Generating World Flags with Sparse Auto-Encoders

12-14

Analyzing US flight data on Amazon S3 with sparklyr and Apache Spark 2.0

02-06

How-to: Automate Your sparklyr Environment with Cloudera Director

12-15

Power Bat – How Spektacom is Powering the Game of Cricket with Microsoft AI

10-11

Human Fuel Consumption

09-02

Four Ways to Harness Big Data in the Energy Sector

07-30

Wind Turbine Efficiency

06-19

Hamiltonian Monte Carlo explained

12-19

Hamiltonian Monte Carlo explained

12-19

Hamiltonian Monte Carlo explained

12-19

Assorted links

12-21

Assorted links

12-21

Peter Bull discusses the importance of human-centered design in data science.

11-05

R Generation: 25 Years of R

08-01

Assorted links

12-21

Assorted links

12-21

Dow Jones Stock Market Index (2/4): Trade Volume Exploratory Analysis

01-07

Mathematically, what is the optimal pitch for a roof?

12-23

Mathematically, what is the optimal pitch for a roof?

12-23

Whats new on arXiv

01-10

If you did not already know

01-08

Comparison of the Text Distance Metrics

01-07

Whats new on arXiv

01-03

Document worth reading: “A Review for Weighted MinHash Algorithms”

01-02

Whats new on arXiv

12-30

Document worth reading: “The importance of being dissimilar in Recommendation”

12-30

R Packages worth a look

12-28

If you did not already know

12-13

Whats new on arXiv

12-10

Whats new on arXiv

12-09

Distilled News

12-07

R Packages worth a look

12-07

R Packages worth a look

12-04

Document worth reading: “A Survey of Modern Object Detection Literature using Deep Learning”

12-03

Whats new on arXiv

11-30

Whats new on arXiv

11-28

Whats new on arXiv

11-28

Whats new on arXiv

11-27

More Robust Monotonic Binning Based on Isotonic Regression

11-24

“She also observed that results from smaller studies conducted by NGOs – often pilot studies – would often look promising. But when governments tried to implement scaled-up versions of those programs, their performance would drop considerably.”

11-22

R Packages worth a look

11-17

Whats new on arXiv

11-15

Whats new on arXiv

11-15

Whats new on arXiv

11-07

Whats new on arXiv

11-07

Whats new on arXiv

11-06

Document worth reading: “A Comprehensive Study of Deep Learning for Image Captioning”

10-31

Whats new on arXiv

10-30

R Packages worth a look

10-26

Distilled News

10-25

Whats new on arXiv

10-24

Whats new on arXiv

10-09

If you did not already know

10-08

Whats new on arXiv

10-04

Import AI 114: Synthetic images take a big leap forward with BigGANs; US lawmakers call for national AI strategy; researchers probe language reasoning via HotspotQA

10-01

Whats new on arXiv

09-28

Whats new on arXiv

09-21

If you did not already know

09-21

Whats new on arXiv

09-14

Whats new on arXiv

09-13

Whats new on arXiv

09-11

Whats new on arXiv

09-08

Whats new on arXiv

09-07

Whats new on arXiv

09-07

If you did not already know

09-06

Whats new on arXiv

08-31

Whats new on arXiv

08-29

Whats new on arXiv

08-28

Forbes: 25 Machine Learning Startups to Watch in 2018

08-26

R Packages worth a look

08-25

Whats new on arXiv

08-24

Whats new on arXiv

08-21

Whats new on arXiv

08-15

R Packages worth a look

08-14

Whats new on arXiv

08-08

Whats new on arXiv

08-07

Document worth reading: “Foundations of Complex Event Processing”

08-04

Whats new on arXiv

08-04

The Real Problems with Neural Machine Translation

07-21

Data Notes: How to Forecast the S&P 500 with Prophet

07-12

Mathematically, what is the optimal pitch for a roof?

12-23

Mathematically, what is the optimal pitch for a roof?

12-23

Mathematically, what is the optimal pitch for a roof?

12-23

Chuck-a-Luck

12-26

Chuck-a-Luck

12-26

2017 Outlook: pandas, Arrow, Feather, Parquet, Spark, Ibis

12-27

Some comments to Daniel Abadi's blog about Apache Arrow

11-01

Extreme IO performance with parallel Apache Parquet in Python

02-10

2017 Outlook: pandas, Arrow, Feather, Parquet, Spark, Ibis

12-27

Core Principles of Sustainable Data Science, Machine Learning and AI Product Development: Research as a core driver

01-09

Entering and Exiting 2018

01-02

Cummins: Data Engineering Technical Specialist [Columbus, IN]

12-13

CBH Group: Data Scientist [Perth, Australia]

12-11

CBH Group: Sr Data Scientist [Perth, Australia]

12-11

What does a data scientist REALLY look like?

11-09

Distilled News

10-18

Import AI:

07-09

Announcing Ursa Labs: an innovation lab for open source data science

04-19

2017 Outlook: pandas, Arrow, Feather, Parquet, Spark, Ibis

12-27

From Arrow to pandas at 10 Gigabytes Per Second

12-27

From Arrow to pandas at 10 Gigabytes Per Second

12-27

From Arrow to pandas at 10 Gigabytes Per Second

12-27

Painted Cube Puzzle

12-28

Painted Cube Puzzle

12-28

Whistler, British Columbia

07-26

Painted Cube Puzzle

12-28

Painted Cube Puzzle

12-28

Movie Genre Ratings - Addendum

02-24

Painted Cube Puzzle

12-28

Cornell prof (but not the pizzagate guy!) has one quick trick to getting 1700 peer reviewed publications on your CV

11-04

Building a Diabetic Retinopathy Prediction Application using Azure Machine Learning

06-25

Podcast Special Episode 2 – The Future of AI with Dr. Ed Felten

12-29

Discovering and indexing podcast episodes using Amazon Transcribe and Amazon Comprehend

09-20

Podcast Special Episode 2 – The Future of AI with Dr. Ed Felten

12-29

Podcast Special Episode 2 – The Future of AI with Dr. Ed Felten

12-29

What is the natural gradient, and how does it work?

12-30

Our R package roundup

12-31

Recurrent Neural Network Tutorial for Artists

01-01

New Course: Interactive Data Visualization with rbokeh

10-19

Recurrent Neural Network Tutorial for Artists

01-01

Native Hadoop file system (HDFS) connectivity in Python

01-03

If you did not already know

12-26

10 Data Science Skills to Land your Dream Job in 2019

12-12

Apache Spark Introduction for Beginners

10-18

Things you should know when traveling via the Big Data Engineering hype-train

10-08

The Benefits of Migrating HPC Workloads To Apache Spark

05-04

Native Hadoop file system (HDFS) connectivity in Python

01-03

A more systematic look at suppressed data by @ellis2013nz

11-17

Scalable multi-node deep learning training using GPUs in the AWS Cloud

07-20

Native Hadoop file system (HDFS) connectivity in Python

01-03

NLP and ML Publications – Looking Back at 2016

01-04

Part 2, further comments on OfS grade-inflation report

01-07

Southern Illinois University Edwardsville: Director of the Center for Predictive Analytics/(Associate) Professor of Mathematics and Statistics [Edwardsville, IL]

01-04

Office for Students report on “grade inflation”

01-02

Kick Start Your Data Career! Tips From the Frontline

12-05

Lehigh University: Tenure Track Positions in Foundations of Data Science [Bethlehem, PA]

10-30

Inside Higher Ed: Pushing the Boundaries of Learning With AI

09-26

Let’s get hysterical

08-19

Let’s get hysterical

08-19

NLP and ML Publications – Looking Back at 2016

01-04

Overview and benchmark of traditional and deep learning models in text classification

06-12

Attending to characters in neural sequence labeling models

01-06

If you did not already know

10-03

Attending to characters in neural sequence labeling models

01-06

How to work with strings in base R – An overview of 20+ methods for daily use.

11-24

epubr 0.5.0 CRAN release

11-18

The 2018 Best Picture Nominees Ranked, Reviewed, and Reflected Upon

03-03

Data Cleaning, Categorization and Normalization

01-30

Attending to characters in neural sequence labeling models

01-06

My Experience as a Freelance Data Scientist

01-07

My Experience as a Freelance Data Scientist

01-07

My Experience as a Freelance Data Scientist

01-07

My Experience as a Freelance Data Scientist

01-07

Customer lifetime value and the proliferation of misinformation on the internet

01-08

CES 2019

01-12

CES 2017

01-09

CES 2017

01-09

CES 2019

01-12

Document worth reading: “Searching Toward Pareto-Optimal Device-Aware Neural Architectures”

01-05

Document worth reading: “Internet of Things: An Overview”

11-25

Why Would Prosthetic Arms Need to See or Connect to Cloud AI?

09-10

CES 2017

01-09

CES 2017

01-09

Machine Learning Madden NFL: How Madden player ratings are actually calculated

01-10

Machine Learning Madden NFL: How Madden player ratings are actually calculated

01-10

Optimization inequalities cheatsheet

01-10

A Cookbook for Machine Learning: Vol 1

11-16

Optimization inequalities cheatsheet

01-10

Optimization inequalities cheatsheet

01-10

Optimization inequalities cheatsheet

01-10

Optimization inequalities cheatsheet

01-10

Self Driving Cars

01-10

Self Driving Cars

01-10

Self Driving Cars

01-10

Self Driving Cars

01-10

Becoming a Data Scientist Podcast Episode 14: Jasmine Dumas

01-11

Becoming a Data Scientist Podcast Episode 14: Jasmine Dumas

01-11

Creating an Azure VHD from Ubuntu Cloud Images on Mac OS X

01-13

Creating an Azure VHD from Ubuntu Cloud Images on Mac OS X

01-13

Creating an Azure VHD from Ubuntu Cloud Images on Mac OS X

01-13

Creating an Azure VHD from Ubuntu Cloud Images on Mac OS X

01-13

SQL, Python, & R in One Platform

10-26

SQL, Python, & R: All in One Platform

10-11

Creating an Azure VHD from Ubuntu Cloud Images on Mac OS X

01-13

5 things that happened in Data Science in 2018

01-08

Einops — a new style of deep learning code

12-06

Introduction to PyTorch for Deep Learning

11-07

PyTorch 1.0 preview now available in Amazon SageMaker and the AWS Deep Learning AMIs

10-03

Keras vs PyTorch:谁是「第一」深度学习框架?

06-30

Tutorial: Deep Learning in PyTorch

01-15

Centroids of semicircles and hemispheres

01-16

Centroids of semicircles and hemispheres

01-16

Centroids of semicircles and hemispheres

01-16

Centroids of semicircles and hemispheres

01-16

Centroids of semicircles and hemispheres

01-16

Complex System Society 2016 Junior Scientific Award!

01-16

Neural Ordinary Differential Equations

12-15

Complex System Society 2016 Junior Scientific Award!

01-16

“Using numbers to replace judgment”

11-17

Raghuveer Parthasarathy’s big idea for fixing science

11-01

If you did not already know

10-21

Document worth reading: “An Introduction to Inductive Statistical Inference — from Parameter Estimation to Decision-Making”

10-09

Let’s get hysterical

08-19

Let’s get hysterical

08-19

Optimization of Scientific Code with Cython: Ising Model

12-11

Complex System Society 2016 Junior Scientific Award!

01-16

Are you buying an apartment? How to hack competition in the real estate market

10-26

Document worth reading: “A Survey on Influence Maximization in a Social Network”

09-02

Journals and refereeing: toward a new equilibrium

07-25

Questions on Artificial Intelligence

01-16

Questions on Artificial Intelligence

01-16

Practical Data Science with R, 2nd Edition discount!

01-12

The Ultrarich's dirty secret: not paying taxes

01-07

Should I do a Data Science bootcamp?

01-03

Hello, world!

01-16

Hello, world!

01-16

Hello, world!

01-16

Being at the Center

09-07

T-Shirt Contest Finalists

01-17

T-Shirt Contest Finalists

01-17

T-Shirt Contest Finalists

01-17

“discover feature relationships” – new EDA tool

01-10

AHL Python Data Hackathon

04-22

T-Shirt Contest Finalists

01-17

Physics-Based Learned Design: Teaching a Microscope How to Image

11-26

Can we do better than using averaged measurements?

10-26

What to do when your measured outcome doesn’t quite line up with what you’re interested in?

09-17

Testing code with random output

08-06

Talking about clinical significance

06-01

Context Compatibility in Data Analysis

05-24

Layman’s Guide to A/B Testing

07-18

Wine dataset demonstrates importance of feature scaling

01-17

Wine dataset demonstrates importance of feature scaling

01-17

Engineering is the bottleneck in (Deep Learning) Research

01-17

This Website

01-18

This Website

01-18

This Website

01-18

Driving Success through Business Insight, One Customer at a Time

11-21

Analyze live video at scale in real time using Amazon Kinesis Video Streams and Amazon SageMaker

11-19

Customize your notebook volume size, up to 16 TB, with Amazon SageMaker

11-06

This Website

01-18

Building a Data Science Workstation (2017)

01-18

Building a Data Science Workstation (2017)

01-18

Building a Data Science Workstation (2017)

01-18

Building a Data Science Workstation (2017)

01-18

Machine Learning Madden NFL: The best player position switches for Madden 17

01-20

Machine Learning Madden NFL: The best player position switches for Madden 17

01-20

Whats new on arXiv

01-13

If you did not already know

12-17

Whats new on arXiv

12-10

Whats new on arXiv

12-07

Distilled News

12-03

Whats new on arXiv

12-03

Whats new on arXiv

11-29

Whats new on arXiv

11-28

Whats new on arXiv

11-26

Whats new on arXiv

11-25

Whats new on arXiv

11-25

Whats new on arXiv

11-12

Whats new on arXiv

11-06

Whats new on arXiv

11-05

Whats new on arXiv

11-04

Whats new on arXiv

11-04

Whats new on arXiv

11-02

Improving model interpretability with LIME

10-31

Whats new on arXiv

10-27

Whats new on arXiv

10-25

Whats new on arXiv

10-23

Whats new on arXiv

10-16

Whats new on arXiv

10-10

If you did not already know

10-10

Whats new on arXiv

10-05

Whats new on arXiv

10-02

Whats new on arXiv

09-28

Whats new on arXiv

09-21

Whats new on arXiv

09-21

Whats new on arXiv

09-17

Whats new on arXiv

09-14

Whats new on arXiv

09-13

Bothered by non-monotonicity? Here’s ONE QUICK TRICK to make you happy.

09-07

Whats new on arXiv

09-03

Whats new on arXiv

08-31

If you did not already know

08-29

Whats new on arXiv

08-21

Whats new on arXiv

08-21

Whats new on arXiv

08-17

Whats new on arXiv

08-14

Whats new on arXiv

08-13

Whats new on arXiv

08-02

Deep Learning with Intel’s BigDL and Apache Spark

09-06

Time Series Analysis with Generalized Additive Models

04-04

RescueTime Inference via the "Poor Man's Dirichlet"

02-03

Doing magic and analyzing seasonal time series with GAM (Generalized Additive Model) in R

01-24

Doing magic and analyzing seasonal time series with GAM (Generalized Additive Model) in R

01-24

Radiocarbon dating

01-24

Radiocarbon dating

01-24

Radiocarbon dating

01-24

Radiocarbon dating

01-24

Radiocarbon dating

01-24

Development update: High speed Apache Parquet in Python with Apache Arrow

01-25

Development update: High speed Apache Parquet in Python with Apache Arrow

01-25

Timing the Same Algorithm in R, Python, and C++

01-06

Timing the Same Algorithm in R, Python, and C++

01-06

Recreating the NBA lead tracker graphic

12-13

Timing Grouped Mean Calculation in R

12-08

Timing Grouped Mean Calculation in R

12-08

Timing Column Indexing in R

09-21

R tip: Use Radix Sort

08-21

data.table is Really Good at Sorting

08-14

Collecting Expressions in R

08-05

Speed up your R Work

07-08

Development update: High speed Apache Parquet in Python with Apache Arrow

01-25

Development update: High speed Apache Parquet in Python with Apache Arrow

01-25

Where Predictive Modeling Goes Astray

01-27

Where Predictive Modeling Goes Astray

01-27

Anomaly detection on Amazon DynamoDB Streams using the Amazon SageMaker Random Cut Forest algorithm

12-05

Streaming Columnar Data with Apache Arrow

01-27

Journal: PLXtrum - realtime machine learning for predicting note onset

01-28

Journal: PLXtrum - realtime machine learning for predicting note onset

01-28

Data Cleaning, Categorization and Normalization

01-30

Becoming a Data Scientist Podcast Episode 15: David Meza

01-30

Distilled News

12-15

6 Goals Every Wannabe Data Scientist Should Make for 2019

11-22

Report from the Enterprise Applications of the R Language conference

11-16

Report from the Enterprise Applications of the R Language conference

11-16

Decolonising Artificial Intelligence

10-11

Magister Dixit

10-05

Magister Dixit

09-02

Against Arianism

08-21

Against Arianism

08-21

Magister Dixit

08-11

From Analytical to Numerical to Universal Solutions

03-20

Becoming a Data Scientist Podcast Episode 15: David Meza

01-30

Crazy Progress Bars

01-31

If you did not already know

12-23

Analyzing contact center calls—Part 1: Use Amazon Transcribe and Amazon Comprehend to analyze customer sentiment

12-18

Building a conversational business intelligence bot with Amazon Lex

11-21

Why R? 2018 Conference – After Movie and Summary

11-07

Master R shiny: One trick to build maintainable and scalable event chains

11-02

What does it mean to talk about a “1 in 600 year drought”?

10-29

Join us for DataTech19, Scotland’s first technical data science conference as part of DataFest

10-12

Putting the Power of Kafka into the Hands of Data Scientists

09-05

Four Ways to Harness Big Data in the Energy Sector

07-30

PyConUK 2017, PyDataCardiff and “Machine Learning Libraries You’d Wish You’d Known About”

11-05

Inferring data loss (and correcting for it) from fundamental relationships

09-01

Building Event-driven Microservices Using CQRS and Serverless

02-01

Building Event-driven Microservices Using CQRS and Serverless

02-01

Building Event-driven Microservices Using CQRS and Serverless

02-01

NYU Stern Fubon Center for Technology, Business and Innovation: Fubon Center Faculty Fellow [New York, NY]

01-08

From Microservices to Service Blocks using Spring Cloud Function and AWS Lambda

07-07

Building Event-driven Microservices Using CQRS and Serverless

02-01

Up and running with Apache Spark on Apache Kudu

02-01

Up and running with Apache Spark on Apache Kudu

02-01

Up and running with Apache Spark on Apache Kudu

02-01

Using ggplot2 for functional time series

12-12

To get hired as a data scientist, don’t follow the herd

11-12

Up and running with Apache Spark on Apache Kudu

02-01

Create Inverted Music using Python

02-02

Create Inverted Music using Python

02-02

Create Inverted Music using Python

02-02

Create Inverted Music using Python

02-02

Music listener statistics: last.fm’s last.year as an R package

01-02

Spark + AI Summit: learn best practices in ML and DL, latest frameworks, and more – special KDnuggets offer

12-14

Winner Interview | Particle Tracking Challenge first runner-up, Pei-Lien Chou

09-14

Three flavors of data scientist

08-02

Create Inverted Music using Python

02-02

RescueTime Inference via the "Poor Man's Dirichlet"

02-03

Simple Stock Ticker App

02-04

Top KDnuggets tweets, Oct 24-30: Building a Question-Answering System from Scratch

10-31

Simple Stock Ticker App

02-04

Don’t use AI when BI will suffice!

11-05

How Americans make a living based on their age

03-06

Similarity via Jaccard Index

02-07

Topic Modeling for Keyword Extraction

02-05

How to Land a Job As a Data Scientist in 2019

12-24

“Do you have any recommendations for useful priors when datasets are small?”

12-11

Don’t use AI when BI will suffice!

11-05

5 Steps to Prepare for a Data Science Job

10-23

5 Steps to Prepare for a Data Science Job

10-22

Data Science at Northwestern

10-03

Take These 7 Small Steps To Make a Big Career Move

06-11

How Americans make a living based on their age

03-06

Top 10 oldest and youngest industries in the U.S.

03-05

Interactive Broker’s SNAP Orders for Delayed Trading

01-03

“Should I get a PhD to be a data scientist/analytics professional?”

11-19

Retrospective on leaving academia for industry data science

04-09

Similarity via Jaccard Index

02-07

Topic Modeling for Keyword Extraction

02-05

5 amazing free tools that can help with publishing R results and blogging

12-22

binb 0.0.3: Now with Monash

10-12

Topic Modeling for Keyword Extraction

02-05

Cognitive Machine Learning (1): Learning to Explain

02-05

Cognitive Machine Learning (1): Learning to Explain

02-05

Rec-a-Sketch: a Flask App for Interactive Sketchfab Recommendations

02-05

Rec-a-Sketch: a Flask App for Interactive Sketchfab Recommendations

02-05

Analyzing US flight data on Amazon S3 with sparklyr and Apache Spark 2.0

02-06

Analyzing US flight data on Amazon S3 with sparklyr and Apache Spark 2.0

02-06

Toward better measurement in K-12 education research

10-15

TEXATA Data Analytics Summit 2018 – Exclusive 30% KDnuggets Discount.

10-09

TEXATA Data Analytics Summit 2018 – Exclusive 30% KDnuggets Discount

10-01

Analyzing US flight data on Amazon S3 with sparklyr and Apache Spark 2.0

02-06

Similarity via Jaccard Index

02-07

Similarity via Jaccard Index

02-07

Why hierarchical models are awesome, tricky, and Bayesian

02-08

Why hierarchical models are awesome, tricky, and Bayesian

02-08

Accelerating Apache Spark MLlib with Intel® Math Kernel Library (Intel® MKL)

02-08

Accelerating Apache Spark MLlib with Intel® Math Kernel Library (Intel® MKL)

02-08

Nemirovski’s acceleration

01-09

Whats new on arXiv

12-30

The Intuitions Behind Bayesian Optimization with Gaussian Processes

10-19

If you did not already know

09-04

Differentiable Image Parameterizations

07-25

Feature Visualization

11-07

Accelerating Apache Spark MLlib with Intel® Math Kernel Library (Intel® MKL)

02-08

Whats new on arXiv

10-24

If you did not already know

09-29

Whats new on arXiv

09-17

Document worth reading: “Accelerating CNN inference on FPGAs: A Survey”

09-08

Accelerating Apache Spark MLlib with Intel® Math Kernel Library (Intel® MKL)

02-08

Beyond Binary: Ternary and One-hot Neurons

02-08

What's new in PyMC3 3.1

07-05

Bayesian Linear Regression (in PyMC) - a different way to think about regression

02-09

Pdftools 2.0: powerful pdf text extraction tools

12-14

epubr 0.5.0 CRAN release

11-18

Machine Reading Comprehension: Learning to Ask & Answer

10-11

“Optimized” floor plan with genetic algorithms

08-06

Extreme IO performance with parallel Apache Parquet in Python

02-10

The “Carl Sagan effect”

07-16

Motivation in Academia vs Industry

01-21

Retrospective on leaving academia for industry data science

04-09

Reasons I left academia

02-12

Reasons I left academia

02-12

WPI: Research Scientist [Worcester, MA]

11-30

WPI: Post-Doctoral Fellow [Worcester, MA]

11-21

Job: Postdoctoral Researcher in Small Data Deep Learning and Explainable Machine Learning, Livermore, CA

10-08

Algorithms, Machine Learning, and Optimization: we are hiring!

11-12

Reasons I left academia

02-12

10 famous TV shows related to Data science & AI (Artificial Intelligence)

02-14

Against Arianism

08-21

Against Arianism

08-21

10 famous TV shows related to Data science & AI (Artificial Intelligence)

02-14

Introduction to XGBoost

02-17

Document worth reading: “Computing the Unique Information”

12-12

R Packages worth a look

11-07

Multilevel models with group-level predictors

10-21

Introduction to XGBoost

02-17

Introduction to XGBoost

02-17

Document worth reading: “Can Machines Design An Artificial General Intelligence Approach”

12-10

Graph-Powered Machine Learning

12-03

On Pyro - Deep Probabilistic Programming on PyTorch

11-03

T-Shirts!!

02-18

T-Shirts!!

02-18

T-Shirts!!

02-18

T-Shirts!!

02-18

T-Shirts!!

02-18

The Price is Right

02-19

The Price is Right

02-19

The Price is Right

02-19

Slot Machines

10-15

Exploiting Daily Fantasy Football for Fun and Profit

09-22

NSA Easter Egg Puzzle

03-05

The Price is Right

02-19

The Price is Right

02-19

Similarity in the Wild

02-19

From a Night of Insomnia to Competition Winner | An Interview with Martin Barron

01-08

Hackathon Winner Interview: Penn State | Kaggle University Club

12-19

Document worth reading: “Gaussian Processes and Kernel Methods: A Review on Connections and Equivalences”

12-16

Pulse of the Competition: November Edition

11-02

Distilled News

10-28

Getting Started with Competitions - A Peer to Peer Guide

08-22

Profiling Top Kagglers: Martin Henze (AKA Heads or Tails), World's First Kernels Grandmaster

06-19

From Gaussian Algebra to Gaussian Processes, Part 2

06-12

Quarterly product update: Create your data science projects on Kaggle

04-04

Introduction to Support Vector Machine

02-20

Introduction to Support Vector Machine

02-20

Spam Detection with Natural Language Processing – Part 3

11-01

Python Vs R : The Eternal Question for Data Scientists

09-29

Python Vs R : The Eternal Question for Data Scientists

09-24

Introduction to Support Vector Machine

02-20

Self-Organizing Maps Tutorial

11-02

Introduction to Support Vector Machine

02-20

Persistent Homology (Part 2)

02-22

Persistent Homology (Part 2)

02-22

Persistent Homology (Part 2)

02-22

Persistent Homology (Part 2)

02-22

Persistent Homology (Part 3)

02-23

Persistent Homology (Part 2)

02-22

Topological Data Analysis - Persistent Homology

02-22

Topological Data Analysis - Persistent Homology

02-22

Topological Data Analysis - Persistent Homology

02-22

Topological Data Analysis - Persistent Homology

02-22

Bitcoin and Taxes: What You May Not Know

12-06

The Impact of Bitcoin on the Insurance Industry

06-21

Bitcoin and Cryptocurrency Litigation

06-08

Getting Rich using Bitcoin stockprices and Twitter!

02-22

Recurrent Neural Networks for Churn Prediction

02-22

Recurrent Neural Networks for Churn Prediction

02-22

Genres Where Audiences and Critics Disagree Most

02-23

Genres Where Audiences and Critics Disagree Most

02-23

Genres Where Audiences and Critics Disagree Most

02-23

Genres Where Audiences and Critics Disagree Most

02-23

Genres Where Audiences and Critics Disagree Most

02-23

Persistent Homology (Part 4)

02-23

Persistent Homology (Part 3)

02-23

Persistent Homology (Part 4)

02-23

Persistent Homology (Part 3)

02-23

Using the Economics Value Curve to Drive Digital Transformation

12-27

If you did not already know

10-14

3 Stages of Creating Smart

10-04

Facilitate Proactive Cybersecurity Operations with Big Data Analytics and Machine Intelligence

07-30

Learn to R blog series - Operators and Objects

07-19

What's New in Dataquest v1.85: Takeaways, Intermediate R, and More

05-25

Apache Arrow and the "10 Things I Hate About pandas"

09-21

Vestigial trigonometry functions

03-08

Persistent Homology (Part 3)

02-23

Persistent Homology (Part 4)

02-23

Persistent Homology (Part 4)

02-23

Movie Genre Ratings - Addendum

02-24

A Neural Network for predicting Restaurant Reservations

11-30

Movie Genre Ratings - Addendum

02-24

Scrape Tweets from Twitter using Python and Tweepy

02-24

Scrape Tweets from Twitter using Python and Tweepy

02-24

Scrape Tweets from Twitter using Python and Tweepy

02-24

Don’t reinvent the wheel: making use of shiny extension packages. Join MünsteR for our next meetup!

01-08

Learning Robot Objectives from Physical Human Interaction

02-06

What is an Interaction Effect?

02-25

University of Rhode Island: Assistant Professor of Data Science [Kingston, RI]

10-22

What is an Interaction Effect?

02-25

Alibaba acquires Data Artisans?

01-10

Document worth reading: “Saliency Prediction in the Deep Learning Era: An Empirical Investigation”

11-16

Self-Service Analytics or Operationalization: Which Should I Implement?

10-16

Persistent Homology (Part 5)

02-26

If you did not already know

12-21

Persistent Homology (Part 5)

02-26

Persistent Homology (Part 5)

02-26

Artificial Intelligence to replace staff at O2

02-28

Artificial Intelligence to replace staff at O2

02-28

Maryville University: Business Intelligence Analyst [St. Louis, MO]

01-04

If you did not already know

12-09

Document worth reading: “An Introduction to Inductive Statistical Inference — from Parameter Estimation to Decision-Making”

10-09

Statistical Modeling, Causal Inference, and Social Science Regrets Its Decision to Hire Cannibal P-hacker as Writer-at-Large

09-29

Document worth reading: “Human-Machine Inference Networks For Smart Decision Making: Opportunities and Challenges”

09-26

Document worth reading: “Decision-Making with Belief Functions: a Review”

09-19

Introduction to Random forest

02-28

Distilled News

10-14

Introduction to Random forest

02-28

A Guide to Decision Trees for Machine Learning and Data Science

12-24

Code for case study – Customer Churn with Keras/TensorFlow and H2O

12-12

Coding Regression trees in 150 lines of R code

11-09

Text Segmentation using Word Embeddings

10-16

Introduction to Random forest

02-28

Distilled News

11-07

How to be an Artificial Intelligence (AI) Expert?

10-25

Three informal case studies: (1) Monte Carlo EM, (2) a new approach to C++ matrix autodiff with closures, (3) C++ serialization via parameter packs

08-17

Three informal case studies: (1) Monte Carlo EM, (2) a new approach to C++ matrix autodiff with closures, (3) C++ serialization via parameter packs

08-17

Introduction to Random forest

02-28

What to Consider When Choosing Colors for Data Visualization

08-22

Deepcolor: automatic coloring and shading of manga-style lineart

03-01

Deepcolor: automatic coloring and shading of manga-style lineart

03-01

Deepcolor: automatic coloring and shading of manga-style lineart

03-01

MULTI-VARIATE ANALYSIS

03-01

MULTI-VARIATE ANALYSIS

03-01

MULTI-VARIATE ANALYSIS

03-01

Data Engineer vs Data Scientist (Infographic)

03-02

Self-Service Adobe Analytics Data Feeds!

03-03

Understanding object detection in deep learning

11-19

What is a Box Plot?

08-24

Java Autonomous driving – Car detection

01-18

NSA Easter Egg Puzzle

03-05

NSA Easter Egg Puzzle

03-05

NSA Easter Egg Puzzle

03-05

Monash University: Academic Opportunities in Dialogue Research [Melbourne, Australia]

10-25

Discarded Hard Drives: Data Science as Debugging

03-14

Topic Modeling Amazon Reviews

03-07

Automated and continuous deployment of Amazon SageMaker models with AWS Step Functions

01-10

Distilled News

01-05

Use AWS Machine Learning to Analyze Customer Calls from Contact Centers (Part 2): Automate, Deploy, and Visualize Analytics using Amazon Transcribe, Amazon Comprehend, AWS CloudFormation, and Amazon QuickSight

12-28

Power your website with on-demand translated reviews using Amazon Translate

12-20

Analyzing contact center calls—Part 1: Use Amazon Transcribe and Amazon Comprehend to analyze customer sentiment

12-18

Top KDnuggets tweets, Nov 28 – Dec 4: Deep Learning Cheat Sheets; Amazon opens its internal

12-05

Announcing the Winners of the 2018 AWS AI Hackathon

12-05

Semantic Segmentation algorithm is now available in Amazon SageMaker

11-28

Amazon Launches Machine Learning University

11-27

Analyze live video at scale in real time using Amazon Kinesis Video Streams and Amazon SageMaker

11-19

Because it's Friday: The physics of The Expanse

11-16

AWS expands HIPAA eligible machine learning services for healthcare customers

11-08

Customize your notebook volume size, up to 16 TB, with Amazon SageMaker

11-06

Amazon SageMaker Batch Transform now supports Amazon VPC and AWS KMS-based encryption

10-22

Beyond text: How Spokata uses Amazon Polly to make news and information universally accessible as real-time audio

10-04

PyTorch 1.0 preview now available in Amazon SageMaker and the AWS Deep Learning AMIs

10-03

Your Guide to AI and Machine Learning at re:Invent 2018

09-27

Discovering and indexing podcast episodes using Amazon Transcribe and Amazon Comprehend

09-20

Pixm takes on phishing attacks with deep learning using Apache MXNet on AWS

08-28

New speed record set for training deep learning models on AWS

08-22

Build an automatic alert system to easily moderate content at scale with Amazon Rekognition Video

08-15

Announcing the Artificial Intelligence (AI) Hackathon: Build Intelligent Applications using machine learning APIs and serverless

08-15

Build a document search bot using Amazon Lex and Amazon Elasticsearch Service

08-01

Topic Modeling Amazon Reviews

03-07

Topic Modeling Amazon Reviews

03-07

Topic Modeling Amazon Reviews

03-07

Topic Modeling Amazon Reviews

03-07

Tic-Tac-AI: A Strong Tic-Tac-Toe AI Opponent using Forward Sampling

03-07

Principle Component Analysis in Regression

03-08

Principle Component Analysis in Regression

03-08

How to Meet Your New Years Resolutions in 2019 (Udemy Coupons $9.99)

01-01

A tutorial on tidy cross-validation with R

11-25

Principle Component Analysis in Regression

03-08

Vestigial trigonometry functions

03-08

Long-awaited updates to htmlTable

01-07

Windows Clipboard Access with R

11-14

Modifying Excel Files using openxlsx

10-16

R Packages worth a look

10-03

Using a Column as a Column Index

09-21

R Packages worth a look

09-01

Large-Scale Health Data Analytics with OHDSI

12-21

Vestigial trigonometry functions

03-08

An introduction to Bayesian Belief Networks

03-10

An introduction to Bayesian Belief Networks

03-10

Does the Muslim ban make us safer?

03-10

Statistics Challenge Invites Students to Tackle Opioid Crisis Using Real-World Data

10-19

What is “party balancing” and how does it explain midterm elections?

08-08

Does the Muslim ban make us safer?

03-10

Does the Muslim ban make us safer?

03-10

Does the Muslim ban make us safer?

03-10

The Benefits of Migrating HPC Workloads To Apache Spark

05-04

Square to Hex

03-11

Square to Hex

03-11

Distilled News

09-30

Square to Hex

03-11

R Packages worth a look

11-18

M4 Forecasting Conference

10-24

Document worth reading: “Idealised Bayesian Neural Networks Cannot Have Adversarial Examples: Theoretical and Empirical Study”

08-29

Document worth reading: “The State of the Art in Developing Fuzzy Ontologies: A Survey”

08-26

Cognitive Machine Learning (2): Uncertain Thoughts

03-12

Cognitive Machine Learning (2): Uncertain Thoughts

03-12

Quidditch: is it all about the Snitch?

11-24

Data Science in Esports

11-21

Autonomy – Do we have the choice?

11-21

Data Science in Esports

11-12

Spooky! Gravedigger in R

10-31

Spooky! Gravedigger in R

10-31

Distilled News

10-27

Simulating simple dice games by @ellis2013nz

10-26

Probability and Tennis

08-13

RAIN Project: evolution of the game development dream

07-13

Import AI:

05-29

AlphaGo Zero Is Not A Sign of Imminent Human-Level AI

03-30

Exploiting Daily Fantasy Football for Fun and Profit

09-22

Japanese Kids Shows, Movies, Games, and Videos for Immersion

07-14

Applying Machine Learning To March Madness

03-12

Intercausal Reasoning in Bayesian Networks

03-13

Discarded Hard Drives: Data Science as Debugging

03-14

Document worth reading: “An Overview of Blockchain Integration with Robotics and Artificial Intelligence”

11-08

Turbocharge Tech Transformation: Integrate AI Across Insurance

11-06

Discarded Hard Drives: Data Science as Debugging

03-14

Turbocharge Tech Transformation: Integrate AI Across Insurance

11-06

Discarded Hard Drives: Data Science as Debugging

03-14

Millions of social bots invaded Twitter!

03-14

How to mine newsfeed data and extract interactive insights in Python

03-15

How to mine newsfeed data and extract interactive insights in Python

03-15

My Approach to Natas Level 11 (a Web Security Game)

02-23

Safe Crime Detection

06-05

Building Safe A.I.

03-17

Building Safe A.I.

03-17

Building Safe A.I.

03-17

Building Safe A.I.

03-17

Ordered Categorical GLMs for Product Feedback Scores

03-17

Paper review: EraseReLU

09-26

Ordered Categorical GLMs for Product Feedback Scores

03-17

Model AUC depends on test set difficulty

03-19

Cryptogram Puzzle

03-20

Cryptogram Puzzle

03-20

If you did not already know

12-02

If you did not already know

10-19

If you did not already know

09-15

Multithreaded in the Wild

09-07

How I got in the top 1 % on Kaggle.

08-28

Recommender System

10-30

Cryptogram Puzzle

03-20

Cryptogram Puzzle

03-20

Cryptogram Puzzle

03-20

“Dissolving the Fermi Paradox”

01-05

Top KDnuggets tweets, Dec 12-18: Deep Learning Cheat Sheets; The Nate Silver vs. Nassim Taleb Twitter War

12-19

If you did not already know

10-02

From Analytical to Numerical to Universal Solutions

03-20

R Packages worth a look

09-05

Where do I learn about log_sum_exp, log1p, lccdf, and other numerical analysis tricks?

07-12

From Analytical to Numerical to Universal Solutions

03-20

Research Debt

03-22

Research Debt

03-22

Research Debt

03-22

Research Debt

03-22

Becoming a Data Scientist Podcast Episode 16: Randy Olson

03-22

Practical Data Science with R, 2nd Edition discount!

01-12

Preview my new book: Introduction to Reproducible Science in R

11-12

Book Review – Sound Analysis and Synthesis with R

11-03

Deep Reinforcement Learning in Action (Announcement)

06-20

Free E-Book: A Developer’s Guide to Building AI Applications

06-04

Minsky & Papert’s “Perceptrons”

06-08

Becoming a Data Scientist Podcast Episode 16: Randy Olson

03-22

Docker y Kaggle con Enrique y Beto

03-22

Docker y Kaggle con Enrique y Beto

03-22

Docker y Kaggle con Enrique y Beto

03-22

Docker y Kaggle con Enrique y Beto

03-22

Docker y Kaggle con Enrique y Beto

03-22

Three Bag Logic Puzzle

03-23

Three Bag Logic Puzzle

03-23

Three Bag Logic Puzzle

03-23

What to do when you read a paper and it’s full of errors and the author won’t share the data or be open about the analysis?

01-02

Six Dice Betting Game

05-31

Superbowl Helmet Puzzle

02-04

RSiteCatalyst Version 1.4.13 Release Notes

07-23

Cake cutting part 2

04-01

Three Bag Logic Puzzle

03-23

Emojis Analysis in R

03-24

You did a sentiment analysis with tidytext but you forgot to do dependency parsing to answer WHY is something positive/negative

01-08

Using Entity-level Sentiment Analysis to understand News Content

07-30

Emojis Analysis in R

03-24

Emojis Analysis in R

03-24

Emojis Analysis in R

03-24

Bias in Machine Learning Flipboard Magazine

03-25

R<-Slovakia meetup started to build community in Bratislava

03-26

The Data Science Roadshow is ON!

09-03

Year 3 of Data, Beer, & Inspiration

07-23

R<-Slovakia meetup started to build community in Bratislava

03-26

R<-Slovakia meetup started to build community in Bratislava

03-26

Monotonic Binning with Equal-Sized Bads for Scorecard Development

10-14

Cake cutting part 3

04-10

Cake cutting part 2

04-01

Cake cutting

03-28

The Final Data Science Roadshow is Just the Beginning

10-26

Cake cutting part 3

04-10

Cake cutting

03-28

Cake cutting part 2

04-01

Cake cutting

03-28

Cake cutting

03-28

Cake cutting

03-28

A Practical Guide to the Lomb-Scargle Periodogram

03-30

A Practical Guide to the Lomb-Scargle Periodogram

03-30

Practical Data Science with R, 2nd Edition discount!

01-12

Notebooks from the Practical AI Workshop

01-03

My book ‘Practical Machine Learning in R and Python: Third edition’ on Amazon

01-02

If you did not already know

11-29

Data Science Interview Questions with Answers

10-28

Spotlight on Julia Silge, Keynote Speaker EARL Seattle 7th November

10-26

Welcome to Dataiku University!

09-07

More Practical Data Science with R Book News

08-19

Announcing Practical Data Science with R, 2nd Edition

08-15

A Practical Guide to the Lomb-Scargle Periodogram

03-30

Christmas elves puzzle

12-27

Cake cutting part 2

04-01

Cake cutting part 2

04-01

KDnuggets™ News 18:n46, Dec 5: AI, Data Science, Analytics 2018 Main Developments, 2019 Key Trends; Deep Learning Cheat Sheets

12-05

My secret sauce to be in top 2% of a Kaggle competition

11-26

Top 3 Trends in Deep Learning

10-03

Skills that Employers look in a Data Scientist

08-02

Time Series Analysis with Generalized Additive Models

04-04

Why Momentum Really Works

04-04

The Intuitions Behind Bayesian Optimization with Gaussian Processes

10-19

Whats new on arXiv

10-09

Why Momentum Really Works

04-04

Covariate-Based Diagnostics for Randomized Experiments are Often Misleading

04-06

Covariate-Based Diagnostics for Randomized Experiments are Often Misleading

04-06

Covariate-Based Diagnostics for Randomized Experiments are Often Misleading

04-06

Building a Tic-Tac-Toe web-app in this Webpack tutorial and Babel tutorial

04-07

Building a Tic-Tac-Toe web-app in this Webpack tutorial and Babel tutorial

04-07

Approximating Implicit Matrix Factorization with Shallow Neural Networks

04-07

If you did not already know

01-06

What is “party balancing” and how does it explain midterm elections?

08-08

Recommender System With Implicit Feedback

11-18

Approximating Implicit Matrix Factorization with Shallow Neural Networks

04-07

Approximating Implicit Matrix Factorization with Shallow Neural Networks

04-07

Christmas elves puzzle

12-27

RSiteCatalyst Version 1.4.12 (and 1.4.11) Release Notes

04-10

RSiteCatalyst Version 1.4.12 (and 1.4.11) Release Notes

04-10

BH 1.69.0-1 on CRAN

01-07

AI, Machine Learning and Data Science Roundup: July 2018

07-23

RSiteCatalyst Version 1.4.12 (and 1.4.11) Release Notes

04-10

BH 1.69.0-1 on CRAN

01-07

Rcpp 1.0.0: The Tenth Birthday Release

11-08

Introducing the New Zealand Trade Intelligence Dashboard

10-14

RSiteCatalyst Version 1.4.12 (and 1.4.11) Release Notes

04-10

11 Design Tips for Data Visualization

10-25

Turn Whiteboard UX Sketches into Working HTML in Seconds – Introducing Sketch2Code

08-30

Learn D3.js in 5 minutes

05-16

Linked Lists

12-28

RSiteCatalyst Version 1.4.12 (and 1.4.11) Release Notes

04-10

Getting Started with Sonnet, Deep Mind’s Deep Learning Library

04-10

Cake cutting part 3

04-10

Sakura blossoms in Japan

04-11

ML and NLP Publications in 2018

01-09

Estimating mortality rates in Puerto Rico after hurricane María using newly released official death counts

06-08

Poor Customer Support?

08-28

Sakura blossoms in Japan

04-11

How to combine Multiple ggplot Plots to make Publication-ready Plots

01-12

Does imputing model labels using the model predictions can improve it’s performance?

12-21

Introduction to Active Learning

10-23

Deep Learning Without Labels

10-03

Harmonizing and emojifying our GitHub issue trackers

07-12

Sakura blossoms in Japan

04-11

Ensure consistency in data processing code between training and inference in Amazon SageMaker

01-11

‘data:’ Scraping & Chart Reproduction : Arrows of Environmental Destruction

01-03

If you did not already know

12-29

Does imputing model labels using the model predictions can improve it’s performance?

12-21

Easily train models using datasets labeled by Amazon SageMaker Ground Truth

12-20

Yet another visualization of the Bayesian Beta-Binomial model

12-13

Introduction to Amazon SageMaker Object2Vec

11-08

Document worth reading: “Transfer Metric Learning: Algorithms, Applications and Outlooks”

11-02

Introduction to Active Learning

10-23

Deep learning, hydroponics, and medical marijuana

10-15

If you did not already know

10-05

Classifying high-resolution chest x-ray medical images with Amazon SageMaker

09-13

Harmonizing and emojifying our GitHub issue trackers

07-12

Sequence labeling with semi-supervised multi-task learning

06-29

Automatically Tag Trello Cards with Zapier and Natural Language Processing

06-07

Logistic Regression

07-30

Sakura blossoms in Japan

04-11

If you did not already know

11-23

UnitedHealth Group: Director, Omni-Channel Analytics [Minnetonka, MN]

11-19

Amazon Transcribe now supports multi-channel transcriptions

08-27

Image Recognition and Object Detection

02-28

Audio Signals in Python

04-17

Audio Signals in Python

04-17

AI-Based Virtual Tutors – The Future of Education?

09-21

Audio Signals in Python

04-17

My R take on Advent of Code – Day 1

12-17

R Packages worth a look

11-06

R Packages worth a look

10-15

Audio Signals in Python

04-17

Whats new on arXiv

12-22

Deriving the Softmax from First Principles

04-19

Delayed Impact of Fair Machine Learning

05-17

Why I'm bullish on Uber - the customer acquisition trough

04-20

Why I'm bullish on Uber - the customer acquisition trough

04-20

Why I'm bullish on Uber - the customer acquisition trough

04-20

Keras Conv2D and Convolutional Layers

12-31

R or Python? Why not both? Using Anaconda Python within R with {reticulate}

12-30

Implementing ResNet with MXNET Gluon and Comet.ml for Image Classification

12-14

Scikit-learn Tutorial: Machine Learning in Python

11-15

Beginner Data Visualization & Exploration Using Pandas

10-22

Deep learning, hydroponics, and medical marijuana

10-15

Keras vs. TensorFlow – Which one is better and which one should I learn?

10-08

Segmenting brain tissue using Apache MXNet with Amazon SageMaker and AWS Greengrass ML Inference – Part 1

09-26

AWS Deep Learning AMIs now include ONNX, enabling model portability across deep learning frameworks

07-26

Hitchhiker’s guide to Used Car Prices Estimation

12-04

Sentiment analysis on Twitter using word2vec and keras

04-20

Sentiment analysis on Twitter using word2vec and keras

04-20

Your and my 2019 R goals

01-01

NLP for Log Analysis – Tokenization

11-13

How Machines Understand Our Language: An Introduction to Natural Language Processing

10-31

Sentiment analysis on Twitter using word2vec and keras

04-20

Customizing Docker Images in Cloudera Data Science Workbench

09-14

Deep Learning Frameworks on CDH and Cloudera Data Science Workbench

04-20

Scaling Multi-Agent Reinforcement Learning

12-12

Running R scripts within in-database SQL Server Machine Learning

10-14

World Models Experiments

06-09

Deep Learning Frameworks on CDH and Cloudera Data Science Workbench

04-20

Document worth reading: “Deep learning in agriculture: A survey”

01-12

Four Techniques for Outlier Detection

12-06

Document worth reading: “A Tutorial on Bayesian Optimization”

12-01

Top KDnuggets tweets, Nov 07-13: 10 Free Must-See Courses for Machine Learning and Data Science

11-14

AI Meets Mail Processing (Automation for Admin Tasks)

08-09

Machine Learning in Science and Industry slides

04-20

Document worth reading: “A second-quantised Shannon theory”

12-20

Distilled News

11-30

Machine Learning in Science and Industry slides

04-20

Alphabear Solver

04-21

Alphabear Solver

04-21

Alphabear Solver

04-21

Alphabear Solver

04-21

Alphabear Solver

04-21

6279e808ef0c35488ea3a81e9b6d302a

07-06

Fact over Fiction

04-22

Fact over Fiction

04-22

Fact over Fiction

04-22

F beta score for Keras

04-23

F beta score for Keras

04-23

How Miguel Got 3 Data Science Job Offers Fast With Dataquest

12-24

Self Avoiding Walks

12-08

The Future of AI is the Enterprise

11-30

How Data Science (+ Friends) Helped Me Learn French

11-01

Habits and Tools, Old and New

01-26

Talk like a pirate day 2017

09-19

Summer of Data Science 2017

05-29

How to make the transition from academia to data science

04-23

Re-parameterising for non-negativity yields multiplicative updates

04-24

Re-parameterising for non-negativity yields multiplicative updates

04-24

Residential Property Investment Visualization and Analysis Shiny App

10-22

Shifting Causes of Death

10-02

Re-parameterising for non-negativity yields multiplicative updates

04-24

XOR Revisited: Keras and TensorFlow

04-24

Use your favorite Python library on PySpark cluster with Cloudera Data Science Workbench

04-26

Getting Started with Cloudera Data Science Workbench

05-08

Use your favorite Python library on PySpark cluster with Cloudera Data Science Workbench

04-26

Announcement

04-27

Tutorial: Sentiment Analysis of Airlines Using the syuzhet Package and Twitter

04-30

My

12-28

Tutorial: Sentiment Analysis of Airlines Using the syuzhet Package and Twitter

04-30

Tutorial: Sentiment Analysis of Airlines Using the syuzhet Package and Twitter

04-30

Tutorial: Sentiment Analysis of Airlines Using the syuzhet Package and Twitter

04-30

Use AWS Machine Learning to Analyze Customer Calls from Contact Centers (Part 2): Automate, Deploy, and Visualize Analytics using Amazon Transcribe, Amazon Comprehend, AWS CloudFormation, and Amazon QuickSight

12-28

Using Entity-level Sentiment Analysis to understand News Content

07-30

Tutorial: Sentiment Analysis of Airlines Using the syuzhet Package and Twitter

04-30

Hacking A Hackaton

04-30

Hacking A Hackaton

04-30

How to deploy a predictive service to Kubernetes with R and the AzureContainers package

12-12

Amazon Polly adds Italian and Castilian Spanish voices, and Mexican Spanish language support

11-12

R Packages worth a look

09-12

R Packages worth a look

08-23

Philippine Senate Bills: NLP Word Cloud Analysis for the 13th to 17th Congress

06-09

AWS Machine Learning Big Data NYC

10-24

Hacking A Hackaton

04-30

Hacking A Hackaton

04-30

What is “party balancing” and how does it explain midterm elections?

08-08

Flipping a Coin on a Crazy Plane

05-01

Easy time-series prediction with R: a tutorial with air traffic data from Lux Airport

11-14

Flipping a Coin on a Crazy Plane

05-01

Help! I can’t reproduce a machine learning project!

09-19

Kolmogorov and randomness

02-18

Is the Universe Random?

06-19

Flipping a Coin on a Crazy Plane

05-01

The jet plane that shot itself down

08-27

Flipping a Coin on a Crazy Plane

05-01

Hail: Scalable Genomics Analysis with Apache Spark

05-02

Hail: Scalable Genomics Analysis with Apache Spark

05-02

Distilled News

12-02

Transfer Learning for Flight Delay Prediction via Variational Autoencoders

05-08

The AAA tranche of subprime science, revisited

10-16

The rise and plummet of the name Heather

09-21

Nextgov: DHS Funds Machine Learning Tool to Boost Other Countries’ Airport Security

08-20

Document worth reading: “Learning to Succeed while Teaching to Fail: Privacy in Closed Machine Learning Systems”

08-10

Getting Started with Cloudera Data Science Workbench

05-08

Voronoi diagram with ggvoronoi package with Train Station data

11-10

Voronoi Soccer

05-31

Voronoi Diagrams

05-12

Voronoi Diagrams

05-12

Voronoi Diagrams

05-12

Create conda recipe to use C extended Python library on PySpark cluster with Cloudera Data Science Workbench

05-15

Create conda recipe to use C extended Python library on PySpark cluster with Cloudera Data Science Workbench

05-15

New package in CRAN: PkgsFromFiles

10-13

Create conda recipe to use C extended Python library on PySpark cluster with Cloudera Data Science Workbench

05-15

Python Deep Learning tutorial: Elman RNN implementation in Tensorflow

05-17

Python Deep Learning tutorial: Elman RNN implementation in Tensorflow

05-17

Workshop sur le Topic Modeling

05-17

Workshop sur le Topic Modeling

05-17

Workshop sur le Topic Modeling

05-17

Workshop sur le Topic Modeling

05-17

Workshop sur le Topic Modeling

05-17

Parallel computation with two lines of code

05-18

Reading List Faster With parallel, doParallel, and pbapply

12-12

Parallel computation with two lines of code

05-18

AlphaGo Zero: Minimal Policy Improvement, Expectation Propagation and other Connections

10-26

Minimizing the Negative Log-Likelihood, in English

05-18

My notes on (Liang et al., 2017): Generalization and the Fisher-Rao norm

01-25

AlphaGo Zero: Minimal Policy Improvement, Expectation Propagation and other Connections

10-26

Further Exploring Common Probabilistic Models

06-06

Minimizing the Negative Log-Likelihood, in English

05-18

Teaching Machines to Draw

05-19

Teaching Machines to Draw

05-19

Exposing Python 3.6's Private Dict Version

05-26

Exposing Python 3.6's Private Dict Version

05-26

Exposing Python 3.6's Private Dict Version

05-26

Exposing Python 3.6's Private Dict Version

05-26

Exposing Python 3.6's Private Dict Version

05-26

Summer of Data Science 2018

05-28

Summer of Data Science 2017

05-29

Self Avoiding Walks

12-08

The Future of AI is the Enterprise

11-30

How Data Science (+ Friends) Helped Me Learn French

11-01

Cannibus Curve with ggplot2

10-17

Summer of Data Science 2017

05-29

Self Avoiding Walks

12-08

Compound interest and retirement

03-05

Talk like a pirate day 2017

09-19

Summer of Data Science 2017

05-29

Summer of Data Science 2017

05-29

NYU Stern Fubon Center for Technology, Business and Innovation: Fubon Center Faculty Fellow [New York, NY]

01-08

Southern Illinois University Edwardsville: Director of the Center for Predictive Analytics/(Associate) Professor of Mathematics and Statistics [Edwardsville, IL]

01-04

DePaul University: Professor of Practice position in Data Science [Chicago, IL]

11-07

DePaul University: Two tenure-track/tenured positions in Data Science/Computer Science [Chicago, IL]

11-07

Vanderbilt University: Lecturer in Data and Analytics [Nashville, TN]

11-05

Vanderbilt University’s Peabody College: Lecturer in Data and Analytics [Online Teaching]

11-05

University of Rhode Island: Assistant Professor of Data Science [Kingston, RI]

10-22

University of Nebraska at Omaha: Faculty Position in Computer Science [Omaha, NE]

10-05

Online Master’s in Applied Data Science From Syracuse

10-05

Review of The Data Incubator data science bootcamp

05-29

confint3: 2-Sided Confidence Interval (Extended Moodle Version)

12-08

Modularize your Shiny Apps: Exercises

10-15

JMP Publishes Exercises to Accompany Data Mining Techniques (3rd Edition)

05-31

Add a static pdf vignette to an R package

01-11

R Packages worth a look

01-04

R Packages worth a look

12-31

Bubble Packed Chart with R using packcircles package

12-22

Why R for data science – and not Python?

12-02

Growing List vs Growing Queue

11-18

RProtoBuf 0.4.13 (and 0.4.12)

11-03

Introducing cricpy:A python package to analyze performances of cricketers

10-28

Loops and Pizzas

10-19

Distilled News

10-06

Applications of R presented at EARL London 2018

09-21

R Packages worth a look

09-10

R Packages worth a look

09-03

Tips for analyzing Excel data in R

08-30

R Packages worth a look

08-23

Learn to R blog series - Operators and Objects

07-19

JMP Publishes Exercises to Accompany Data Mining Techniques (3rd Edition)

05-31

Voronoi Soccer

05-31

Voronoi Soccer

05-31

Voronoi Soccer

05-31

From Instance Noise to Gradient Regularisation

06-01

ICML 2017 Workshop on Implicit Models

06-02

The JapanR Conference 2018 Round-Up!

12-06

R Packages worth a look

10-18

Trustworthy Data Analysis

06-04

ICML 2017 Workshop on Implicit Models

06-02

ICML 2017 Workshop on Implicit Models

06-02

Murmuration: Data Scientist [New York, NY]

01-10

If you did not already know

12-18

Vanguard: Senior AI Engineer [Malvern, PA]

12-17

Six Sigma DMAIC Series in R – Part4

12-15

UnitedHealth Group: Sr Manager, Data Science [Telecommute, Central or Eastern Time Zones]

11-19

Young Investigator Special Competition for Time-Sharing Experiment for the Social Sciences

10-19

If you did not already know

09-21

Mounting multiple data and outputs volumes

07-15

Saving, resuming, and restarting experiments with Polyaxon

05-03

Simple Architectures Outperform Complex Ones in Language Modeling

04-25

A Research to Engineering Workflow

06-03

Pear Therapeutics: Data Scientist [San Francisco, CA]

01-11

Murmuration: Data Scientist [New York, NY]

01-10

Maryville University: Business Intelligence Analyst [St. Louis, MO]

01-04

UnitedHealth Group: Director, Data Science [Minnetonka, MN]

12-19

Vanguard: Senior AI Architect [Malvern, PA]

12-17

Vanguard: Senior AI Engineer [Malvern, PA]

12-17

Intuit: Staff Data Scientist – Business Analytics [Mountain View, CA]

12-12

Ronin: Sr Machine Learning and AI Data Scientist [San Mateo, CA]

12-03

Humana: Principal Data Scientist/Informatics Principal [Chicago, IL, Dallas, TX and Louisville, KY]

11-27

UnitedHealth Group: Sr Manager, Data Science [Telecommute, Central or Eastern Time Zones]

11-19

UnitedHealth Group: Sr Director, Decision Analytics [Minnetonka, MN]

11-19

UnitedHealth Group: Director, Omni-Channel Analytics [Minnetonka, MN]

11-19

UnitedHealth Group: Clinical Data Statistical Analyst – SQL SAS (Clinician Required) [Telecommute]

11-16

UnitedHealth Group: Data Analytics and Reporting Lead [Minnetonka, MN or Telecommute]

11-16

UnitedHealth Group: Senior Principal Data Scientist [Telecommute, Central or Eastern Time Zones]

11-16

Moody’s Analytics: Machine Learning / NLP – Research Scientist / Engineer [New York, NY]

10-30

a4 Media: Manager, Machine Learning Data Engineer [Long Island City, NY]

10-10

UnitedHealth Group: UHC Digital Director of Project Management [Minnetonka, MN]

10-04

UnitedHealth Group: Sr .Net Web Developer, UHC E&I [Indianapolis, IN or Green Bay, WI]

10-04

Job opening at CDC: “The Statistician will play a central role in guiding the statistical methods of all major projects of the Epidemiology and Prevention Branch of the CDC Influenza Division, and aid in designing, analyzing, and interpreting research intended to understand the burden of influenza in the US and internationally and identify the best influenza vaccines and vaccine strategies.”

09-26

Meta-packages, nails in CRAN’s coffin

08-07

A Research to Engineering Workflow

06-03

A Research to Engineering Workflow

06-03

Vanguard: Senior AI Architect [Malvern, PA]

12-17

A Research to Engineering Workflow

06-03

From a Night of Insomnia to Competition Winner | An Interview with Martin Barron

01-08

How to Gather Your Own Data by Conducting a Great Survey

11-27

Working with US Census Data in R

11-07

Working with US Census Data in R

11-07

Data + Art STEAM Project: Final Results

10-30

Distilled News

08-16

“The most important aspect of a statistical analysis is not what you do with the data, it’s what data you use” (survey adjustment edition)

08-07

Exploring and visualising reef life survey data

06-03

How to Gather Your Own Data by Conducting a Great Survey

11-27

Exploring and visualising reef life survey data

06-03

Exploring and visualising reef life survey data

06-03

Leaf Plant Classification: An Exploratory Analysis – Part 1

12-29

Smartly select and mutate data frame columns, using dict

12-09

Checklist Recipe – How we created a template to standardize species data

11-20

Scatterplot matrices (pair plots) with cdata and ggplot2

10-28

Scatterplot matrices (pair plots) with cdata and ggplot2

10-28

Faceted Graphs with cdata and ggplot2

10-21

Faceted Graphs with cdata and ggplot2

10-21

Exploring and visualising reef life survey data

06-03

2018 Traffic Data

01-03

RStudio Pandoc – HTML To Markdown

12-15

Poor Customer Support?

08-28

Exploring and visualising reef life survey data

06-03

More Sandwiches, Anyone?

11-14

R Packages worth a look

08-05

Rules to Learn By

05-31

COLT 2017 accepted papers

06-03

Document worth reading: “A Tutorial on Bayesian Optimization”

12-01

Document worth reading: “Multi-Agent Reinforcement Learning: A Report on Challenges and Approaches”

11-17

R Packages worth a look

09-24

Don’t call it a bandit

08-04

New Research on Multi-Task Learning

07-24

COLT 2017 accepted papers

06-03

“Economic predictions with big data” using partial pooling

11-26

If you did not already know

11-06

R Packages worth a look

09-07

COLT 2017 accepted papers

06-03

My Approach to Natas Level 11 (a Web Security Game)

02-23

Safe Crime Detection

06-05

If you did not already know

10-17

Distilled News

09-15

aRt with code

07-27

Work in progress: Portraits of Imaginary People

06-06

Work in progress: Portraits of Imaginary People

06-06

Work in progress: Portraits of Imaginary People

06-06

Further Exploring Common Probabilistic Models

06-06

Whats new on arXiv

12-29

Maximized Monte Carlo Testing with MCHT

10-22

Further Exploring Common Probabilistic Models

06-06

My Video Game Playlists in Japanese for Immersion

06-07

Generating data to explore the myriad causal effects that can be estimated in observational data analysis

11-20

My Video Game Playlists in Japanese for Immersion

06-07

My Video Game Playlists in Japanese for Immersion

06-07

My Video Game Playlists in Japanese for Immersion

06-07

Notebooks from the Practical AI Workshop

01-03

3 More Google Colab Environment Management Tips

01-02

Advanced Jupyter Notebooks: A Tutorial

01-02

AI for Good: slides and notebooks from the ODSC workshop

11-13

Best Practices for Using Notebooks for Data Science

11-08

UI Update — Datazar

11-07

Direct access to Amazon SageMaker notebooks from Amazon VPC by using an AWS PrivateLink endpoint

11-06

If not Notebooks, then what? Look to Literate Programming

09-12

Run SQL queries from your SageMaker notebooks using Amazon Athena

09-12

Access Amazon S3 data managed by AWS Glue Data Catalog from Amazon SageMaker notebooks

08-29

How to update your scikit-learn code for 2018

07-04

Run some cool GitHubs on Azure (Python)

09-26

Docker for AWS

06-27

Kaggle’s Quora Question Pairs Competition

06-07

Kaggle’s Quora Question Pairs Competition

06-07

Reflections on the 10th anniversary of the Revolutions blog

12-10

Reflections on the 10th anniversary of the Revolutions blog

12-10

EARL conference recap: Seattle 2018

11-24

My talk tomorrow (Tues) 4pm in the Biomedical Informatics department (at 168th St)

10-01

Minsky & Papert’s “Perceptrons”

06-08

“Principles of posterior visualization”

01-01

“Thus, a loss aversion principle is rendered superfluous to an account of the phenomena it was introduced to explain.”

12-25

Goals and Principles of Representation Learning

04-12

Minsky & Papert’s “Perceptrons”

06-08

Minsky & Papert’s “Perceptrons”

06-08

Minsky & Papert’s “Perceptrons”

06-08

Using a Keras Long Short-Term Memory (LSTM) Model to Predict Stock Prices

11-21

Make a Profitable Portfolio using Python

06-08

Make a Profitable Portfolio using Python

06-08

R Packages worth a look

12-10

R Packages worth a look

11-17

R Packages worth a look

10-30

ITWire: VIDEO Interview with a DataRobot: Greg Michaelson talks AI, banking, machine learning and more

10-24

Document worth reading: “The Risk of Machine Learning”

10-11

Make a Profitable Portfolio using Python

06-08

Part 3: Two more implementations of optimism corrected bootstrapping show shocking bias

12-27

Part 2: Optimism corrected bootstrapping is definitely bias, further evidence

12-26

If you did not already know

12-12

Distilled News

11-27

Mega-PAW Las Vegas Registration is Live & Super Early Bird Pricing is Now Available!

11-20

Example of Overfitting

11-16

Top Obstacles to Overcome when Implementing Predictive Maintenance

10-29

R Packages worth a look

10-23

R Packages worth a look

10-13

Fewer Headaches (Thanks to Data Science)

10-08

Magister Dixit

09-02

If you did not already know

09-01

Magister Dixit

08-04

From the Sidewalk to the Saddle: Data and the Tour de France

07-06

Lessons learned in my first year as a data scientist

01-25

Machine Learning the Future Class

06-12

If you did not already know

11-16

Differentiable Dynamic Programs and SparseMAP Inference

05-15

Machine Learning the Future Class

06-12

Machine Learning the Future Class

06-12

R Packages worth a look

12-28

R Packages worth a look

11-11

Some updates

05-29

Machine Learning the Future Class

06-12

R Packages worth a look

01-01

Amazon Rekognition announces updates to its face detection, analysis, and recognition capabilities

11-22

YOLO object detection with OpenCV

11-12

K-means clustering with Amazon SageMaker

11-08

Document worth reading: “Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks”

10-25

Mapillary uses Amazon Rekognition to work towards building parking solutions for US cities

09-13

If you did not already know

08-24

If you did not already know

08-17

Image Recognition and Object Detection

02-28

Machine Learning Fraud Detection: A Simple Machine Learning Approach

06-15

Random Effects Neural Networks in Edward and Keras

06-15

If you did not already know

10-04

A Right to Reasonable Inferences

10-01

If you did not already know

08-07

Random Effects Neural Networks in Edward and Keras

06-15

If you did not already know

12-26

How to Engineer Your Way Out of Slow Models

11-27

Neurally Embedded Emojis

06-19

Neurally Embedded Emojis

06-19

Neurally Embedded Emojis

06-19

rnoaa: new data sources and NCDC units

12-04

Wind Turbine Efficiency

06-19

Wind Turbine Efficiency

06-19

Distilled News

11-27

Is the Universe Random?

06-19

Document worth reading: “Fractal AI: A fragile theory of intelligence”

10-22

Matrix Factorization in PyTorch

06-20

Matrix Factorization in PyTorch

06-20

Matrix Factorization in PyTorch

06-20

Growth of Subreddits

10-30

A Study Of Reddit Politics

06-20

Reddit science discussions as a dataset

06-22

Automated Web Scraping in R

12-11

Growth of Subreddits

10-30

Reddit science discussions as a dataset

06-22

Reddit science discussions as a dataset

06-22

Announcing Elemetric

06-23

R Packages worth a look

01-11

2018.

12-31

Historic Wildfire Data: Exploratory Visualization in R

12-11

Computer Vision for Model Assessment

10-23

Computer Vision for Model Assessment

10-23

The Chartmaker Directory: Data visualizations in every tool

08-24

Life-cycle of a Data Science Project

05-18

The Building Blocks of Interpretability

03-06

Announcing Elemetric

06-23

Machine learning applied to showers in the OPERA

06-24

Machine learning applied to showers in the OPERA

06-24

Machine learning applied to showers in the OPERA

06-24

Hexagon Geometry Puzzle

06-27

Hexagon Geometry Puzzle

06-27

Hexagon Geometry Puzzle

06-27

Docker for AWS

06-27

From Python Hero to Java Rockstar

06-30

Convert Data Frame to Dictionary List in R

11-17

Searching for the optimal hyper-parameters of an ARIMA model in parallel: the tidy gridsearch approach

11-15

RcppTOML 0.1.4: Now with TOML v0.5.0

10-23

R Packages worth a look

09-30

How to update your scikit-learn code for 2018

07-04

Linked Lists

12-28

Python List Comprehension + Set + Dict Comprehension

11-16

From Python Hero to Java Rockstar

06-30

Kaggle’s Mercedes-Benz Greener Manufacturing

07-01

Smooth distributed convex optimization

07-06

Smooth distributed convex optimization

07-06

Smooth distributed convex optimization

07-06

gganimate has transitioned to a state of release

01-03

Columbia Data Science Institute art contest

09-16

What makes the Python Cool.

07-31

Smooth distributed convex optimization

07-06

Recently in the sister blog

07-24

Feature-wise transformations

07-09

Smooth distributed convex optimization

07-06

6279e808ef0c35488ea3a81e9b6d302a

07-06

Voice Control your Shiny Apps

10-15

From Microservices to Service Blocks using Spring Cloud Function and AWS Lambda

07-07

From Microservices to Service Blocks using Spring Cloud Function and AWS Lambda

07-07

From Microservices to Service Blocks using Spring Cloud Function and AWS Lambda

07-07

Using Clustering Algorithms to Analyze Golf Shots from the U.S. Open

06-19

Image Compression using K-means Clustering.

05-28

TSrepr use case - Clustering time series representations in R

03-13

Clustering applied to showers in the OPERA

07-10

Clustering applied to showers in the OPERA

07-10

Guest Post – Learning R as an MBA Student

07-12

RcppTOML 0.1.5: Small extensions

11-01

RcppTOML 0.1.4: Now with TOML v0.5.0

10-23

R Consortium grant applications due October 31

10-09

R Consortium grant applications due October 31

10-09

AI, Machine Learning and Data Science Roundup: September 2018

09-20

Cloudera Enterprise 5.12 is Now Available

07-13

Cloudera Enterprise 5.12 is Now Available

07-13

Cloudera Enterprise 5.12 is Now Available

07-13

The JapanR Conference 2018 Round-Up!

12-06

Japanese Kids Shows, Movies, Games, and Videos for Immersion

07-14

Recent top-selling books in AI and Machine Learning

07-31

Japanese Kids Shows, Movies, Games, and Videos for Immersion

07-14

Japanese Kids Shows, Movies, Games, and Videos for Immersion

07-14

The importance of Data Analytics skills in today’s MBA roles

12-19

Azure ML Studio now supports R 3.4

11-01

Introducing a tensorflow library for deep learning and reinforcement learning

07-17

Introducing a tensorflow library for deep learning and reinforcement learning

07-17

Introducing a tensorflow library for deep learning and reinforcement learning

07-17

Distributed Deep Learning with Polyaxon

03-18

Introducing a tensorflow library for deep learning and reinforcement learning

07-17

What is Machine Learning?

07-17

Introductory Machine Learning Terminology with Food

07-18

Introductory Machine Learning Terminology with Food

07-18

Layman’s Guide to A/B Testing

07-18

Document worth reading: “The importance of being dissimilar in Recommendation”

12-30

Mindstrong Health: Sr Data Scientist / Machine Learning, Statistics, Coding [Palo Alto, CA]

10-17

Talking about clinical significance

06-01

Layman’s Guide to A/B Testing

07-18

Can we do better than using averaged measurements?

10-26

A Concise Explanation of Learning Algorithms with the Mitchell Paradigm

10-05

What to do when your measured outcome doesn’t quite line up with what you’re interested in?

09-17

Testing code with random output

08-06

Context Compatibility in Data Analysis

05-24

Layman’s Guide to A/B Testing

07-18

Layman’s Guide to A/B Testing

07-18

implyr: R Interface for Apache Impala

07-19

implyr: R Interface for Apache Impala

07-19

Retrospective review of my first deep learning competition

07-22

Time series of Democratic/Republican vote share in House elections

12-12

$ vs. votes

11-27

A Bayesian take on ballot order effects

11-21

Benford’s Law for Fraud Detection with an Application to all Brazilian Presidential Elections from 2002 to 2018

11-17

2018: What really happened?

11-10

Simple Feed Ranking Algorithm

10-28

What’s gonna happen in the 2018 midterm elections?

08-09

He wants to model a proportion given some predictors that sum to 1

07-10

About that claim in the NYT that the immigration issue helped Hillary Clinton? The numbers don’t seem to add up.

07-03

Retrospective review of my first deep learning competition

07-22

Benford’s Law for Fraud Detection with an Application to all Brazilian Presidential Elections from 2002 to 2018

11-17

2018: How did people actually vote? (The real story, not the exit polls.)

11-16

2018: What really happened?

11-10

Narcolepsy Could Be ‘Sleeper Effect’ in Trump and Brexit Campaigns

09-12

Retrospective review of my first deep learning competition

07-22

Cognitive Services in Containers

11-19

Cognitive Services in Containers

11-19

I fell out with tapply and in love with dplyr

10-15

Visual search on AWS—Part 2: Deployment with AWS DeepLens

09-05

RSiteCatalyst Version 1.4.14 Release Notes

02-16

RSiteCatalyst Version 1.4.13 Release Notes

07-23

RSiteCatalyst Version 1.4.13 Release Notes

07-23

RSiteCatalyst Version 1.4.13 Release Notes

07-23

2 Quick Announcements

07-25

2 Quick Announcements

07-25

Streamlining Production with Predictive Maintenance and Essilor

09-04

2 Quick Announcements

07-25

How I Used Deep Learning To Train A Chatbot To Talk Like Me (Sorta)

07-25

10 years of playback history on Last.FM: "Just sit back and listen"

01-12

Custom JavaScript, CSS and HTML in Shiny

12-23

crfsuite for natural language processing

10-29

Named Entity Recognition and Classification with Scikit-Learn

10-25

Shared items

08-11

On the "we have naughty videos of you" scam

08-03

How to maraaverickfy a blog post without even reading it

02-12

Should I do a Data Science bootcamp?

01-03

Data Science in Healthcare

11-14

Web scraping the President's lies in 16 lines of Python

07-27

Named Entity Recognition and Classification with Scikit-Learn

10-25

Package support offer

10-15

Web scraping the President's lies in 16 lines of Python

07-27

Web scraping the President's lies in 16 lines of Python

07-27

Web scraping the President's lies in 16 lines of Python

07-27

Some Thoughts on Meditation

10-22

Moving On, Looking Back

07-28

Advice on soft skills for academics

07-25

Moving On, Looking Back

07-28

Twitter bots for good, and information contagion!

09-27

Diffusion of ISIS propaganda on Twitter

07-28

Diffusion of ISIS propaganda on Twitter

07-28

Diffusion of ISIS propaganda on Twitter

07-28

Diffusion of ISIS propaganda on Twitter

07-28

My 10-step path to becoming a remote data scientist with Automattic

07-29

Reflections on remote data science work

11-03

My 10-step path to becoming a remote data scientist with Automattic

07-29

From a Night of Insomnia to Competition Winner | An Interview with Martin Barron

01-08

My 10-step path to becoming a remote data scientist with Automattic

07-29

More silliness

07-29

More silliness

07-29

More silliness

07-29

More silliness

07-29

Random Dilation Networks for Action Recognition in Videos

07-29

Random Dilation Networks for Action Recognition in Videos

07-29

Java Object Tracking for Cars

11-30

Instance segmentation with OpenCV

11-26

Multi-object tracking with dlib

10-29

Object tracking with dlib

10-22

Random Dilation Networks for Action Recognition in Videos

07-29

If you did not already know

11-23

If you did not already know

10-14

Random Dilation Networks for Action Recognition in Videos

07-29

If you did not already know

11-19

R Packages worth a look

11-16

R Packages worth a look

09-13

Against Arianism

08-21

Against Arianism

08-21

Logistic Regression

07-30

Logistic Regression

07-30

Keras vs PyTorch:谁是「第一」深度学习框架?

06-30

从决策树到随机森林:树型算法的原理与实现

07-31

从决策树到随机森林:树型算法的原理与实现

07-31

Python数据分析之pandas

07-18

从决策树到随机森林:树型算法的原理与实现

07-31

从决策树到随机森林:树型算法的原理与实现

07-31

从决策树到随机森林:树型算法的原理与实现

07-31

Cinderella science

08-05

Cinderella science

08-05

Cinderella science

08-05

My Qualifying Exam (Oral)

08-07

My Qualifying Exam (Oral)

08-07

My Qualifying Exam (Oral)

08-07

Request for Proposal: Topical Projects for January 2019

11-29

Superresolution with semantic guide

08-09

Hype or Not? Some Perspective on OpenAI’s DotA 2 Bot

08-12

Document worth reading: “Restricted Boltzmann Machines: Introduction and Review”

10-27

Hype or Not? Some Perspective on OpenAI’s DotA 2 Bot

08-12

Hype or Not? Some Perspective on OpenAI’s DotA 2 Bot

08-12

If you did not already know

11-12

Hype or Not? Some Perspective on OpenAI’s DotA 2 Bot

08-12

Van der Waerden Numbers

08-15

How to Remove Unfair Bias From Your AI

01-11

Van der Waerden Numbers

08-15

Practical Data Science with R, 2nd Edition discount!

01-12

R Packages worth a look

12-02

Van der Waerden Numbers

08-15

Why Machine Learning Is A Metaphor For Life

08-16

ROCK 'n' ROLL TRAFFIC ROUTING, WITH NEO4J, PART 2

12-05

ROCK 'n' ROLL TRAFFIC ROUTING, WITH NEO4J

12-05

Understanding Amazon SageMaker notebook instance networking configurations and advanced routing options

10-24

Why Machine Learning Is A Metaphor For Life

08-16

He grins like a Cheshire cat; said of anyone who shows his teeth and gums in laughing

08-22

He grins like a Cheshire cat; said of anyone who shows his teeth and gums in laughing

08-22

Estimating Pi

10-16

Even more images as x-axis labels

10-16

He grins like a Cheshire cat; said of anyone who shows his teeth and gums in laughing

08-22

He grins like a Cheshire cat; said of anyone who shows his teeth and gums in laughing

08-22

He grins like a Cheshire cat; said of anyone who shows his teeth and gums in laughing

08-22

Using regression trees for forecasting double-seasonal time series with trend in R

08-22

Using regression trees for forecasting double-seasonal time series with trend in R

08-22

Using regression trees for forecasting double-seasonal time series with trend in R

08-22

Designing a Deep Learning Project

08-23

Designing a Deep Learning Project

08-23

Top 5 Data Visualization Tools for 2019

01-03

Document worth reading: “Bayesian model reduction”

10-03

Document worth reading: “A Survey on Influence Maximization in a Social Network”

09-02

Six Dice Betting Game

05-31

Designing a Deep Learning Project

08-23

Designing a Deep Learning Project

08-23

Python Deep Learning tutorial: Create a GRU (RNN) in TensorFlow

08-27

The jet plane that shot itself down

08-27

The jet plane that shot itself down

08-27

The jet plane that shot itself down

08-27

The jet plane that shot itself down

08-27

Poor Customer Support?

08-28

Poor Customer Support?

08-28

A.I. 'Bias' Doesn't Mean What Journalists Say It Means

08-30

A.I. 'Bias' Doesn't Mean What Journalists Say It Means

08-30

Last academic results

06-23

How much compute do we need to train generative models?

08-31

How much compute do we need to train generative models?

08-31

Icon making with ggplot2 and magick

01-03

A Three Month Data Analysis in Excel Could Have Taken Me One Day

10-01

Crosslingual document comparison

08-31

Crosslingual document comparison

08-31

The Advent of Analytics Engineering

09-01

The Advent of Analytics Engineering

09-01

Inferring data loss (and correcting for it) from fundamental relationships

09-01

Inferring data loss (and correcting for it) from fundamental relationships

09-01

Cassie Kozyrkov discusses decision making and decision intelligence!

10-22

3 recent movies from the 50s and the 70s

08-30

Inferring data loss (and correcting for it) from fundamental relationships

09-01

When (not) to use Deep Learning for NLP

09-04

Software patents are evil, but BSD+Patents is probably not the solution

09-05

Data Notes: The Secret to Getting to a Second Date

09-06

Software patents are evil, but BSD+Patents is probably not the solution

09-05

Software patents are evil, but BSD+Patents is probably not the solution

09-05

Software patents are evil, but BSD+Patents is probably not the solution

09-05

Software patents are evil, but BSD+Patents is probably not the solution

09-05

If you did not already know

12-06

Distilled News

11-24

Tom Wolfe

11-19

If you did not already know

11-17

If you did not already know

11-07

Document worth reading: “A practical tutorial on autoencoders for nonlinear feature fusion: Taxonomy, models, software and guidelines”

09-16

If you did not already know

08-21

What Killed the Curse of Dimensionality?

09-06

What Killed the Curse of Dimensionality?

09-06

Making Smart Phones Dumb Again

09-07

Making Smart Phones Dumb Again

09-07

Semantic trees for training word embeddings with hierarchical softmax

09-07

Semantic trees for training word embeddings with hierarchical softmax

09-07

QuantConnect – the only Game in Town

09-10

QuantConnect – the only Game in Town

09-10

Customizing Docker Images in Cloudera Data Science Workbench

09-14

Build Your Own Natural Language Models on AWS (no ML experience required)

11-19

Customizing Docker Images in Cloudera Data Science Workbench

09-14

Customizing Docker Images in Cloudera Data Science Workbench

09-14

Deep Learning Dead-End?

09-17

Document worth reading: “Deep learning in agriculture: A survey”

01-12

Document worth reading: “Universality of Deep Convolutional Neural Networks”

01-10

Distilled News

12-21

An Intro to Deep Learning in Python

12-06

Distilled News

11-30

Introducing Dynamic Training for deep learning with Amazon EC2

11-27

Distilled News

11-24

Document worth reading: “Opening the black box of deep learning”

10-28

Distilled News

10-24

Document worth reading: “Review of Deep Learning”

10-19

Document worth reading: “Deep Facial Expression Recognition: A Survey”

10-17

Why you need GPUs for your deep learning platform

10-16

Document worth reading: “Data Curation with Deep Learning [Vision]: Towards Self Driving Data Curation”

10-13

Document worth reading: “Deep Learning for Generic Object Detection: A Survey”

10-10

Top 3 Trends in Deep Learning

10-03

Document worth reading: “Do Deep Learning Models Have Too Many Parameters An Information Theory Viewpoint”

09-22

Deep learning made easier with transfer learning

09-17

Data Science Glossary

09-12

Whats new on arXiv

09-07

A Deep (But Jargon and Math Free) Dive Into Deep Learning

08-31

If you did not already know

08-30

If you did not already know

08-21

Document worth reading: “Are Efficient Deep Representations Learnable”

07-31

Distributed Deep Learning with Polyaxon

03-18

NIPS 2017 Summary

12-11

How To Predict ICU Mortality with Digital Health Data, DL4J, Apache Spark and Cloudera

09-18

If you did not already know

11-01

How To Predict ICU Mortality with Digital Health Data, DL4J, Apache Spark and Cloudera

09-18

Face Similarity searching ~ landmark detecting

09-18

Face Similarity searching ~ landmark detecting

09-18

Face Similarity searching ~ landmark detecting

09-18

Face Similarity searching ~ landmark detecting

09-18

Talk like a pirate day 2017

09-19

Talk like a pirate day 2017

09-19

Apache Arrow and the "10 Things I Hate About pandas"

09-21

Exploiting Daily Fantasy Football for Fun and Profit

09-22

Exploiting Daily Fantasy Football for Fun and Profit

09-22

Paper review: EraseReLU

09-26

Paper review: EraseReLU

09-26

Paper review: EraseReLU

09-26

If you did not already know

01-11

Pivot Billions and Deep Learning enhanced trading models achieve 100% net profit

12-24

If you did not already know

10-28

If you did not already know

08-27

Paper review: EraseReLU

09-26

R now supported in Azure SQL Database

11-28

AI, Machine Learning and Data Science Roundup: October 2018

10-25

Distilled News

10-04

AI, Machine Learning and Data Science Announcements from Microsoft Ignite

10-02

AI, Machine Learning and Data Science Roundup: September 2018

09-20

Videos from NYC R Conference

08-28

AI, Machine Learning and Data Science Roundup: July 2018

07-23

Enterprise Deployment Tips for Azure Data Science Virtual Machine (DSVM)

05-21

Microsoft Weekly Data Science News for May 18, 2018

05-18

Run some cool GitHubs on Azure (Python)

09-26

Run some cool GitHubs on Azure (Python)

09-26

How to generalize (algorithmically)

09-18

Run some cool GitHubs on Azure (Python)

09-26

Hi

09-27

Hi

09-27

Hi

09-27

Principles of Database Management: The Practical Guide to Storing, Managing and Analyzing Big and Small Data

01-10

Open Workshop: Data Visualization in R and ggplot2, January 25th in Munich

11-26

Don’t miss Big Data LDN 2018

10-22

Top KDnuggets tweets, Oct 10-16: 7 Books to Grasp Mathematical Foundations of Data Science and Machine Learning; 6 Books Every Data Scientist Should Keep Nearby

10-17

Welcome to Dataiku University!

09-07

I Can’t Afford to Hire a Data Scientist. Now What?

07-11

Hi

09-27

Twitter bots for good, and information contagion!

09-27

Defining visualization literacy

11-30

Against Winner-Take-All Attribution

09-05

Data Science in 30 Minutes: Using Data Science to Predict the Future with Kirk Borne

07-11

Data Science in 30 Minutes: Holden Karau – A Quick Introduction to PySpark

06-13

Data Science in 30 Minutes: Deep Learning to Detect Fake News with Uber ATG Head of Data Science, Mike Tamir

05-30

Michael B. Cohen

09-28

Michael B. Cohen

09-28

Michael B. Cohen

09-28

Top 5 Data Visualization Tools for 2019

01-03

Podcast Listens Analysis

10-02

Proof that 1/7 is a repeated decimal

10-05

Year 3 of Data, Beer, & Inspiration

07-23

Advice for aspiring data scientists and other FAQs

10-15

Podcast Listens Analysis

10-02

Podcast Listens Analysis

10-02

NPR Sunday Puzzle Solving, And Other Baby Name Questions

10-02

NPR Sunday Puzzle Solving, And Other Baby Name Questions

10-02

NPR Sunday Puzzle Solving, And Other Baby Name Questions

10-02

The replication crisis and the political process

08-03

GANs are Broken in More than One Way: The Numerics of GANs

10-05

GANs are Broken in More than One Way: The Numerics of GANs

10-05

Intro to graph optimization: solving the Chinese Postman Problem

10-07

Beautiful Chaos: The Double Pendulum

11-22

Aosta Valley, Italy

09-05

Whistler, British Columbia

07-26

Intro to graph optimization: solving the Chinese Postman Problem

10-07

LoyaltyOne: Associate Director, Client Services [Westborough, MA]

12-17

Tribes.ai: Sr Data Scientist [Remote, India / Eastern Europe]

12-01

Amazon SageMaker notebooks now support Git integration for increased persistence, collaboration, and reproducibility

11-29

New Features For Amazon SageMaker: Workflows, Algorithms, and Accreditation

11-21

An Overview of the Singapore Hiring Landscape

11-21

R Packages worth a look

11-01

Business Analysis (BA) Career Path

10-11

UnitedHealth Group: UHC Digital Director of Project Management [Minnetonka, MN]

10-04

UnitedHealth Group: UHC Digital Project Manager [Minnetonka, MN]

10-04

Distilled News

09-26

Distilled News

09-23

No code chatbots: TIBCO uses Amazon Lex to put chat interfaces into the hands of business users

09-05

Nextgov: DHS Funds Machine Learning Tool to Boost Other Countries’ Airport Security

08-20

Distilled News

08-16

Announcing the Amazon SageMaker MXNet 1.2 container

08-06

Data Science for Managers and Directors (DS4MAD)

10-10

Think Twice Before You Accept That Fancy Data Science Job

12-19

LoyaltyOne: Associate Director, Client Services [Westborough, MA]

12-17

KDnuggets™ News 18:n45, Nov 28: Your Favorite Python IDE/editor? Intro to Data Science for Managers

11-28

UnitedHealth Group: UHC Digital Project Manager [Minnetonka, MN]

10-04

Forbes: 25 Machine Learning Startups to Watch in 2018

08-26

Data Science for Managers and Directors (DS4MAD)

10-10

Robin Pemantle’s updated bag of tricks for math teaching!

01-04

Thoughts On Machine Learning Accuracy

07-27

Understanding how Deep Learning learns to play SET®

10-12

Understanding how Deep Learning learns to play SET®

10-12

Pear Therapeutics: Data Scientist [San Francisco, CA]

01-11

Core Principles of Sustainable Data Science, Machine Learning and AI Product Development: Research as a core driver

01-09

3 Challenges for Companies Tackling Data Science

11-26

MVP for Data Projects

10-22

We Sized Washington’s Edible Marijuana Market Using AI

10-12

3 Stages of Creating Smart

10-04

Applications of R presented at EARL London 2018

09-21

Curalate makes social sell with AI using Apache MXNet on AWS

08-13

The AWS DeepLens Inclusivity Challenge submission period extended to 8/19

07-24

An Overview of Recommendation Systems

05-23

Crossing Your Data Science Chasm

03-22

Everything is a Model

12-13

Building a Visual Search Algorithm

10-13

Building a Visual Search Algorithm

10-13

Best Machine Learning languages, Data Visualization Tools, DL Frameworks, and Big Data Tools

12-03

Cosmos DB for Data Science

09-07

Distilled News

09-07

Vulcan Post: This AI Startup Is Run By The World’s Top Data Scientists – Lets Anyone Build Predictive Models

09-04

Engineering Data Science at Automattic

03-20

Advice for aspiring data scientists and other FAQs

10-15

Journals and refereeing: toward a new equilibrium

07-25

An Updated Review of The Data Incubator Data Science Bootcamp

05-29

CES 2018

01-12

Advice for aspiring data scientists and other FAQs

10-15

Scikit-learn Tutorial: Machine Learning in Python

11-15

Markets Performance after Election

10-16

“2010: What happened?” in light of 2018

10-31

Markets Performance after Election

10-16

Markets Performance after Election

10-16

The Bear is Here

12-22

The Bear is Here

12-22

NAIC: Analyst I (Capital Markets) [New York, NY]

10-29

If you did not already know

09-24

Aella Credit empowers underbanked individuals by using Amazon Rekognition for identity verification

08-15

Introduction to Learning to Trade with Reinforcement Learning

02-11

Markets Performance after Election

10-16

The Evolution of Build Engineering in Managing Open Source [Webinar Replay]

11-13

Markets Performance after Election

10-16

Feather format update: Whence and Whither?

10-16

How to use Tensorboard with PyTorch

10-16

How to use Tensorboard with PyTorch

10-16

How to use Tensorboard with PyTorch

10-16

Lumpers and Splitters: Tensions in Taxonomies

04-05

Text Segmentation using Word Embeddings

10-16

Text Segmentation using Word Embeddings

10-16

How to Build Your Own Blockchain Part 1 — Creating, Storing, Syncing, Displaying, Mining, and Proving Work

10-17

R Packages worth a look

01-09

Document worth reading: “An Overview of Blockchain Integration with Robotics and Artificial Intelligence”

11-08

How to Build Your Own Blockchain Part 1 — Creating, Storing, Syncing, Displaying, Mining, and Proving Work

10-17

How to Build Your Own Blockchain Part 1 — Creating, Storing, Syncing, Displaying, Mining, and Proving Work

10-17

How to Build Your Own Blockchain Part 1 — Creating, Storing, Syncing, Displaying, Mining, and Proving Work

10-17

DeformNet, Or A Tale of Broken Chairs

10-18

DeformNet, Or A Tale of Broken Chairs

10-18

DeformNet, Or A Tale of Broken Chairs

10-18

Top November Stories: The Most in Demand Skills for Data Scientists; What is the Best Python IDE for Data Science?

12-11

Does Sharing Goals Help or Hurt Your Chances of Success?

10-22

Top September Stories: Essential Math for Data Science: Why and How; Machine Learning Cheat Sheets

10-10

Build this media monitoring Slack bot in 20 minutes without writing code

07-04

Open Source Datasets with Kaggle

06-21

DeformNet, Or A Tale of Broken Chairs

10-18

DeformNet, Or A Tale of Broken Chairs

10-18

Model Server for Apache MXNet v1.0 released

10-31

Segmenting brain tissue using Apache MXNet with Amazon SageMaker and AWS Greengrass ML Inference – Part 2

10-04

Pixm takes on phishing attacks with deep learning using Apache MXNet on AWS

08-28

Curalate makes social sell with AI using Apache MXNet on AWS

08-13

Bring your own pre-trained MXNet or TensorFlow models into Amazon SageMaker

08-03

AWS Deep Learning AMIs now include ONNX, enabling model portability across deep learning frameworks

07-26

AWS Deep Learning AMIs now with optimized TensorFlow 1.9 and Apache MXNet 1.2 with Keras 2 support to accelerate deep learning on Amazon EC2 instances

07-23

Deep learning with Apache MXNet on Cloudera Data Science Workbench

10-19

Deep learning with Apache MXNet on Cloudera Data Science Workbench

10-19

Linguistic Signals of Album Quality: A Predictive Analysis of Pitchfork Review Scores Using Quanteda

01-10

Deep learning with Apache MXNet on Cloudera Data Science Workbench

10-19

Forecasting financial time series with dynamic deep learning on AWS

08-20

Ensemble learning for time series forecasting in R

10-19

Ensemble learning for time series forecasting in R

10-19

Martingales

10-20

Martingales

10-20

Martingales

10-20

Ed Sullivan (3) vs. Sid Caesar; DJ Jazzy Jeff advances

01-09

Markets Performance after Election: Day 239

10-21

If you did not already know

10-19

Distilled News

08-31

What is Data Science?

08-20

The Bull Survived on Friday, but Barely

03-25

Markets Performance after Election: One Year Update

11-12

Markets Performance after Election: Day 239

10-21

Markets Performance after Election: One Year Update

11-12

Markets Performance after Election: Day 239

10-21

Markets Performance after Election: Day 239

10-21

If you did not already know

01-07

Day 22 – little helper get_files

12-22

An Introduction to AI

11-21

If you did not already know

10-02

If you did not already know

09-17

✚ Google Dataset Search Impressions, the Challenges of Looking for Data, and Other Places to Find Data

09-13

Google Dataset Search now in public beta

09-06

Markets Performance after Election: Day 239

10-21

Weekly Review: 10/21/2017

10-21

Document worth reading: “Marketing Analytics: Methods, Practice, Implementation, and Links to Other Fields”

12-07

If you did not already know

11-29

Data science books - theory and practice

06-29

Some Thoughts on Meditation

10-22

Some Thoughts on Meditation

10-22

Some Thoughts on Meditation

10-22

JUnit,Integration,End to End Tests

10-22

Hard Examples Mining in Keras

10-22

Hard Examples Mining in Keras

10-22

Online Hard Example Mining on PyTorch

10-22

R Packages worth a look

01-04

R Packages worth a look

01-01

R Packages worth a look

12-29

8 Data Science Projects to Build your Portfolio

12-11

8 Data Science Projects to Build your Portfolio

12-03

Document worth reading: “A Survey of Modern Object Detection Literature using Deep Learning”

12-03

Latest Trends in Computer Vision Technology and Applications

11-07

Document worth reading: “Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks”

10-25

Distilled News

10-12

Document worth reading: “Deep Learning for Generic Object Detection: A Survey”

10-10

If you did not already know

08-24

Online Hard Example Mining on PyTorch

10-22

I Can’t Afford to Hire a Data Scientist. Now What?

07-11

When the bubble bursts…

06-04

AWS Machine Learning Big Data NYC

10-24

COLT 2018 call for papers

10-24

COLT 2018 call for papers

10-24

AlphaGo Zero: Minimal Policy Improvement, Expectation Propagation and other Connections

10-26

AlphaGo Zero: Minimal Policy Improvement, Expectation Propagation and other Connections

10-26

Reinforcement Learning: Super Mario, AlphaGo and beyond

10-01

AlphaGo Zero: Minimal Policy Improvement, Expectation Propagation and other Connections

10-26

Sock Puzzle Revisited

03-07

Multi Armed Bandit

10-26

Multi Armed Bandit

10-26

Multi Armed Bandit

10-26

Multi Armed Bandit

10-26

If you did not already know

11-04

If you did not already know

11-04

Announcing wrapr 1.6.2

09-13

How to Build Your Own Blockchain Part 4.2 — Ethereum Proof of Work Difficulty Explained

11-21

How to Build Your Own Blockchain Part 4.1 — Bitcoin Proof of Work Difficulty Explained

11-13

How to Build Your Own Blockchain Part 3 — Writing Nodes that Mine and Talk

11-02

How to Build Your Own Blockchain Part 2 — Syncing Chains From Different Nodes

10-27

How to Build Your Own Blockchain Part 2 — Syncing Chains From Different Nodes

10-27

How to Build Your Own Blockchain Part 2 — Syncing Chains From Different Nodes

10-27

How to Build Your Own Blockchain Part 2 — Syncing Chains From Different Nodes

10-27

Do AIs dream of pwning FF leagues?

12-10

How easy is it to moneyball a fantasy football league draft?

10-28

Pruning Neural Networks: Two Recent Papers

02-06

My notes on (Liang et al., 2017): Generalization and the Fisher-Rao norm

01-25

The Generalization Mystery: Sharp vs Flat Minima

01-18

Weekly Review: 10/28/2017

10-28

Weekly Review: 10/28/2017

10-28

Document worth reading: “A Survey of Knowledge Representation and Retrieval for Learning in Service Robotics”

12-26

Soft Actor Critic—Deep Reinforcement Learning with Real-World Robots

12-14

Visual Model-Based Reinforcement Learning as a Path towards Generalist Robots

11-30

Document worth reading: “Artificial Intelligence for Long-Term Robot Autonomy: A Survey”

11-04

Drilling Down on Depth Sensing and Deep Learning

10-23

Distilled News

08-29

Import AI: 108: Learning language with fake sentences, Chinese researchers use RL to train prototype warehouse robots; and what the implications are of scaled-up Neural Architecture Search

08-20

Artificial Intelligence in the Workplace

08-03

Import AI:

07-23

Shared Autonomy via Deep Reinforcement Learning

04-18

Learning Robot Objectives from Physical Human Interaction

02-06

Weekly Review: 11/18/2017

11-18

Weekly Review: 10/28/2017

10-28

Document worth reading: “The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers”

12-27

A Visual Guide to Evolution Strategies

10-29

Recommender System

10-30

New in Cloudera Data Science Workbench 1.2: Usage Monitoring for Administrators

10-31

The rise and plummet of the name Heather

09-21

New in Cloudera Data Science Workbench 1.2: Usage Monitoring for Administrators

10-31

New in Cloudera Data Science Workbench 1.2: Usage Monitoring for Administrators

10-31

Book Review – Sound Analysis and Synthesis with R

11-03

Two Recent Results in Transfer Learning for Music and Speech

11-01

Two Recent Results in Transfer Learning for Music and Speech

11-01

Will Compression Be Machine Learning’s Killer App?

10-16

Some comments to Daniel Abadi's blog about Apache Arrow

11-01

Linear compression in python: PCA vs unsupervised feature selection

08-11

Some comments to Daniel Abadi's blog about Apache Arrow

11-01

Some comments to Daniel Abadi's blog about Apache Arrow

11-01

The ‘knight on an infinite chessboard’ puzzle: efficient simulation in R

12-10

R Packages worth a look

11-24

Multilevel data collection and analysis for weight training (with R code)

09-22

Towards a Virtual Stuntman

04-10

PokerBot: Create your poker AI bot in Python

11-01

The ‘knight on an infinite chessboard’ puzzle: efficient simulation in R

12-10

Designing Turbofan Tycoon

12-06

R Packages worth a look

09-29

Multilevel data collection and analysis for weight training (with R code)

09-22

A Real World Reinforcement Learning Research Program

07-06

Does batting order matter in Major League Baseball? A simulation approach

07-04

PokerBot: Create your poker AI bot in Python

11-01

The ‘knight on an infinite chessboard’ puzzle: efficient simulation in R

12-10

“Simulations are not scalable but theory is scalable”

11-02

Multilevel data collection and analysis for weight training (with R code)

09-22

Verlet Simulations

07-16

Does batting order matter in Major League Baseball? A simulation approach

07-04

PokerBot: Create your poker AI bot in Python

11-01

Self-Organizing Maps Tutorial

11-02

Understanding object detection in deep learning

11-19

Machine Learning Classification: A Dataset-based Pictorial

11-05

Multi-Class Text Classification Model Comparison and Selection

11-01

Deep Learning Without Labels

10-03

Distributed Deep Learning on AZTK and HDInsight Spark Clusters

08-02

Design Patterns for Production NLP Systems

07-09

How to Do Distributed Deep Learning for Object Detection Using Horovod on Azure

06-20

Self-Organizing Maps Tutorial

11-02

Top gsutil command lines to get started on Google Cloud Storage

01-01

Gsutil cheatsheet

11-02

Shopper Sentiment: Analyzing in-store customer experience

10-09

Top gsutil command lines to get started on Google Cloud Storage

01-01

Gsutil cheatsheet

11-02

Stan development in RStudio

10-17

Stan on the web! (thanks to RStudio)

10-12

Boost Computation Power and Speed with Snowflake

07-02

Gsutil cheatsheet

11-02

mixup: Data-Dependent Data Augmentation

11-02

How to Build Your Own Blockchain Part 3 — Writing Nodes that Mine and Talk

11-02

How to Build Your Own Blockchain Part 3 — Writing Nodes that Mine and Talk

11-02

How to Build Your Own Blockchain Part 3 — Writing Nodes that Mine and Talk

11-02

On Pyro - Deep Probabilistic Programming on PyTorch

11-03

On Pyro - Deep Probabilistic Programming on PyTorch

11-03

LightOn: Forward We Go !

12-20

Project planning with plotly

11-26

Creating List with Iterator

11-23

Growing List vs Growing Queue

11-18

Those “other” apply functions…

11-13

Paris Machine Learning

10-10

Fast Company's 2018 World's Most Innovative Companies List

02-20

PyConUK 2017, PyDataCardiff and “Machine Learning Libraries You’d Wish You’d Known About”

11-05

When Traditional Programming Meets Machine Learning

11-05

When Traditional Programming Meets Machine Learning

11-05

When Traditional Programming Meets Machine Learning

11-05

Weekly Review: 11/04/2017

11-06

R Packages worth a look

11-10

Weekly Review: 11/04/2017

11-06

Weekly Review: 11/04/2017

11-06

Pulse of the Competition: November Edition

11-02

The Microsoft AI Idea Challenge – Breakthrough Ideas Wanted!

08-14

Weekly Review: 11/04/2017

11-06

R 3.5.2 now available

12-20

Running an R script on heroku

12-06

Azure ML Studio now supports R 3.4

11-01

Data Notes: The Secret of Academic Success

10-17

Reduce GPU costs with startup scripts on the Google Cloud Engine

02-21

Weekly Review: 11/04/2017

11-06

Why Indian companies should take on different projects than competing Valley companies - an application of Cobb-Douglas

11-07

Why Indian companies should take on different projects than competing Valley companies - an application of Cobb-Douglas

11-07

From Gaussian Algebra to Gaussian Processes, Part 1

03-31

Gaussian Distributions are Soap Bubbles

11-09

Gaussian Distributions are Soap Bubbles

11-09

How digital cameras work

05-25

Gaussian Distributions are Soap Bubbles

11-09

Gaussian Distributions are Soap Bubbles

11-09

Gaussian Distributions are Soap Bubbles

11-09

R Packages worth a look

11-23

Online Bayesian Deep Learning in Production at Tencent

11-15

If you did not already know

10-25

R Packages worth a look

10-13

R Packages worth a look

09-30

If you did not already know

09-18

If you did not already know

09-12

Exploring Line Lengths in Python Packages

11-09

Exploring Line Lengths in Python Packages

11-09

Weekly Review: 11/11/2017

11-11

Markets Performance after Election: One Year Update

11-12

Markets Performance after Election: One Year Update

11-12

Awesome postdoc opportunities in computational genomics at JHU

05-17

Algorithms, Machine Learning, and Optimization: we are hiring!

11-12

Understanding deep Convolutional Neural Networks with a practical use-case in Tensorflow and Keras

11-13

Understanding deep Convolutional Neural Networks with a practical use-case in Tensorflow and Keras

11-13

How to Build Your Own Blockchain Part 4.1 — Bitcoin Proof of Work Difficulty Explained

11-13

How to Build Your Own Blockchain Part 4.1 — Bitcoin Proof of Work Difficulty Explained

11-13

How to Build Your Own Blockchain Part 4.1 — Bitcoin Proof of Work Difficulty Explained

11-13

AWS expands HIPAA eligible machine learning services for healthcare customers

11-08

How AI Will Change Healthcare

10-15

Data Science in Healthcare

11-14

New public course on Successfully Delivering Data Science Projects for Feb 1st

12-18

Talking on “High Performance Python” at Linuxing In London last week

11-26

“On the Diagramatic Diagnosis of Data” at BudapestBI 2018

11-16

Speed up your R Work

07-08

PyDataLondon 2018 and “Creating Correct and Capable Classifiers”

04-30

PyData Conference & AHL Hackathon

02-16

Python Data Science jobs list into 2018

12-31

PyDataBudapest and “Machine Learning Libraries You’d Wish You’d Known About”

11-15

New public course on Successfully Delivering Data Science Projects for Feb 1st

12-18

Talking on “High Performance Python” at Linuxing In London last week

11-26

“On the Diagramatic Diagnosis of Data” at BudapestBI 2018

11-16

PyDataLondon 2018 and “Creating Correct and Capable Classifiers”

04-30

PyData Conference & AHL Hackathon

02-16

Python Data Science jobs list into 2018

12-31

PyDataBudapest and “Machine Learning Libraries You’d Wish You’d Known About”

11-15

A Non-Compromising Approach to Privacy-Preserving Personalized Services

01-08

“Thus, a loss aversion principle is rendered superfluous to an account of the phenomena it was introduced to explain.”

12-25

R Packages worth a look

11-05

Decision Making and Diversity

11-15

Decision Making and Diversity

11-15

Visualize the Business Value of your Predictive Models with modelplotr

11-03

Drawing beautiful maps programmatically with R, sf and ggplot2 — Part 3: Layouts

10-25

Evaluating the Business Value of Predictive Models in Python and R

10-11

If you did not already know

08-08

Generating Climate Temperature Spirals in Python

05-21

Python Matplotlib (pyplot), a step-by-step Tutorial

11-15

Python Matplotlib (pyplot), a step-by-step Tutorial

11-15

Silent Duels and an Old Paper of Restrepo

12-31

confint3: 2-Sided Confidence Interval (Extended Moodle Version)

12-08

Document worth reading: “Restricted Boltzmann Machines: Introduction and Review”

10-27

Top KDnuggets tweets, Oct 10-16: 7 Books to Grasp Mathematical Foundations of Data Science and Machine Learning; 6 Books Every Data Scientist Should Keep Nearby

10-17

Document worth reading: “Vector and Matrix Optimal Mass Transport: Theory, Algorithm, and Applications”

10-14

5 Tips To Learn Machine Learning

06-17

Engineering a New Career in Data Science: Alumni Spotlight on Abhishek Mishra

06-06

Python List Comprehension + Set + Dict Comprehension

11-16

A Cookbook for Machine Learning: Vol 1

11-16

A Cookbook for Machine Learning: Vol 1

11-16

Document worth reading: “Declarative Statistics”

10-22

A Cookbook for Machine Learning: Vol 1

11-16

8 Important Python Interview Questions and Answers

11-17

Ed Sullivan (3) vs. Sid Caesar; DJ Jazzy Jeff advances

01-09

Philip Roth (4) vs. DJ Jazzy Jeff; Jim Thorpe advances

01-08

The seminar speaker contest begins: Jim Thorpe (1) vs. John Oliver

01-07

AzureVM: managing virtual machines in Azure

12-03

AzureVM: managing virtual machines in Azure

12-03

R Tip: Be Wary of “…”

06-15

8 Important Python Interview Questions and Answers

11-17

Recommender System With Implicit Feedback

11-18

Weekly Review: 11/18/2017

11-18

Document worth reading: “PMLB: A Large Benchmark Suite for Machine Learning Evaluation and Comparison”

08-31

Weekly Review: 11/18/2017

11-18

Understanding Chicago’s homicide spike; comparisons to other cities

10-13

Top KDnuggets tweets, Oct 3–9: 5 Reasons Logistic Regression should be the first thing you learn when becoming a Data Scientist

10-10

“Should I get a PhD to be a data scientist/analytics professional?”

11-19

Sleeping Giant Rural Postman Problem

12-01

50 states Rural Postman Problem

11-19

50 states Rural Postman Problem

11-19

50 states Rural Postman Problem

11-19

Linear Feedback Shift Registers

11-19

Linear Feedback Shift Registers

11-19

Linear Feedback Shift Registers

11-19

How to Build Your Own Blockchain Part 4.2 — Ethereum Proof of Work Difficulty Explained

11-21

How to Build Your Own Blockchain Part 4.2 — Ethereum Proof of Work Difficulty Explained

11-21

How to Build Your Own Blockchain Part 4.2 — Ethereum Proof of Work Difficulty Explained

11-21

Thanksgiving Special 🦃: GANs are Being Fixed in More than One Way

11-23

Python Pandas Tutorial: The Basics

11-23

Communicating results with R Markdown

11-01

Markdown Language Reference

11-24

Markdown Language Reference

11-24

Markdown Language Reference

11-24

Gaussian Processes

11-25

New download API for pretrained NLP models and datasets in Gensim

11-27

New download API for pretrained NLP models and datasets in Gensim

11-27

Gensim Survey 2018

04-30

New download API for pretrained NLP models and datasets in Gensim

11-27

Personal Data Analytics

12-10

R Packages worth a look

11-20

R plus Magento 2 REST API revisited: part 1- authentication and universal search

11-06

R Packages worth a look

11-04

R Packages worth a look

10-14

R Packages worth a look

10-08

R Packages worth a look

10-04

R Packages worth a look

09-10

R Packages worth a look

08-15

New download API for pretrained NLP models and datasets in Gensim

11-27

Sequence Modeling with CTC

11-27

Understanding rolling calculations in R

03-07

Sequence Modeling with CTC

11-27

Incremental means and variances

11-28

Grosse's challenge: duality and exponential families

11-29

Gradient optimisation on the Poincaré disc

04-10

Grosse's challenge: duality and exponential families

11-29

Grosse's challenge: duality and exponential families

11-29

Grosse's challenge: duality and exponential families

11-29

Grosse's challenge: duality and exponential families

11-29

Java Handwritten Digit Recognition with Neural Networks

11-29

My R take on Advent of Code – Day 1

12-17

Java Handwritten Digit Recognition with Neural Networks

11-29

House Price Prediction using a Random Forest Classifier

11-29

A Neural Network for predicting Restaurant Reservations

11-30

Tutorial: Time Series Analysis with Pandas

01-10

Marketing analytics with greybox

01-07

Dreaming of a white Christmas – with ggmap in R

12-24

Day 15 – little helper sci_palette

12-15

Simulating dinosaur populations, with R

11-30

Simulating dinosaur populations, with R

11-30

Drawing beautiful maps programmatically with R, sf and ggplot2 — Part 3: Layouts

10-25

How we use emojis

10-15

If you did not already know

10-02

R Packages worth a look

09-22

How to graph a function of 4 variables using a grid

09-20

Python and Tidyverse

06-01

Generating Climate Temperature Spirals in Python

05-21

A Neural Network for predicting Restaurant Reservations

11-30

A Neural Network for predicting Restaurant Reservations

11-30

Sleeping Giant Rural Postman Problem

12-01

Weekly Review: 12/03/2017

12-03

At NIPS 2017

12-04

When the bubble bursts…

06-04

Graph embeddings in Hyperbolic Space

04-10

NIPS 2017 Summary

12-11

At NIPS 2017

12-04

At NIPS 2017

12-04

Hitchhiker’s guide to Used Car Prices Estimation

12-04

Hitchhiker’s guide to Used Car Prices Estimation

12-04

Bringing Machine Learning Research to Product Commercialization

11-27

The Long Tail of Medical Data

11-12

Document worth reading: “Deep Learning for Image Denoising: A Survey”

11-04

Why AI will not replace radiologists

11-01

How to Highlight 3D Brain Regions

10-31

Basic Image Data Analysis Using Python – Part 4

10-05

Segmenting brain tissue using Apache MXNet with Amazon SageMaker and AWS Greengrass ML Inference – Part 2

10-04

If you did not already know

09-30

Segmenting brain tissue using Apache MXNet with Amazon SageMaker and AWS Greengrass ML Inference – Part 1

09-26

Whats new on arXiv

08-28

Building a Diabetic Retinopathy Prediction Application using Azure Machine Learning

06-25

The Last 5 Years In Deep Learning

12-04

Using Artificial Intelligence to Augment Human Intelligence

12-04

Installing Python Packages from a Jupyter Notebook

12-05

Installing Python Packages from a Jupyter Notebook

12-05

Alchemy, Rigour and Engineering

12-07

Alchemy, Rigour and Engineering

12-07

Alchemy, Rigour and Engineering

12-07

Implementing Poincaré Embeddings

12-09

Implementing Poincaré Embeddings

12-09

Part 3: Two more implementations of optimism corrected bootstrapping show shocking bias

12-27

State of Deep Learning and Major Advances: H2 2018 Review

12-13

Implementing Poincaré Embeddings

12-09

Document worth reading: “Marketing Analytics: Methods, Practice, Implementation, and Links to Other Fields”

12-07

Making Machine Learning Accessible [Webinar Replay]

11-27

New version of pqR, with major speed improvements

11-25

Top KDnuggets tweets, Oct 24-30: Building a Question-Answering System from Scratch

10-31

Implementing Poincaré Embeddings

12-09

RcppMsgPack 0.2.3

11-18

Optimization of Scientific Code with Cython: Ising Model

12-11

Optimization of Scientific Code with Cython: Ising Model

12-11

Everything is a Model

12-13

Java Handwritten Digit Recognition with Convolutional Neural Networks

12-13

Distilled News

11-29

Data professional definitions: Data analyst vs data scientist vs data engineer

12-14

Data professional definitions: Data analyst vs data scientist vs data engineer

12-14

Data professional definitions: Data analyst vs data scientist vs data engineer

12-14

How To Write, Deploy, and Interact with Ethereum Smart Contracts on a Private Blockchain

12-15

How To Write, Deploy, and Interact with Ethereum Smart Contracts on a Private Blockchain

12-15

How To Write, Deploy, and Interact with Ethereum Smart Contracts on a Private Blockchain

12-15

Because it's Friday: Street Orientation

07-27

Weekly Review: 12/16/2017

12-16

Weekly Review: 12/16/2017

12-16

k-server, part 1: online learning and online algorithms

12-17

k-server, part 1: online learning and online algorithms

12-17

k-server, part 1: online learning and online algorithms

12-17

k-server, part 1: online learning and online algorithms

12-17

k-server, part 1: online learning and online algorithms

12-17

Why mere Machine Learning cannot predict Bitcoin price

12-18

Gift ideas for the R lovers

12-14

Why mere Machine Learning cannot predict Bitcoin price

12-18

Why mere Machine Learning cannot predict Bitcoin price

12-18

Simulating Chutes & Ladders in Python

12-18

Designing Turbofan Tycoon

12-06

One-arm Bayesian Adaptive Trial Simulation Code

11-10

simmer 4.1.0

11-09

A Real World Reinforcement Learning Research Program

07-06

Towards a Virtual Stuntman

04-10

Simulating Chutes & Ladders in Python

12-18

Simulating Chutes & Ladders in Python

12-18

Document worth reading: “Searching Toward Pareto-Optimal Device-Aware Neural Architectures”

01-05

Whats new on arXiv

10-24

If you did not already know

09-28

k-server, part 2: continuous time mirror descent

12-20

k-server, part 2: continuous time mirror descent

12-20

k-server, part 2: continuous time mirror descent

12-20

Large-Scale Health Data Analytics with OHDSI

12-21

restez: Query GenBank locally

12-03

Building a conversational business intelligence bot with Amazon Lex

11-21

Cosmos DB for Data Science

09-07

R Packages worth a look

08-21

Large-Scale Health Data Analytics with OHDSI

12-21

Large-Scale Health Data Analytics with OHDSI

12-21

Setting Up Selenium on RaspberryPi 2/3

12-22

Setting Up Selenium on RaspberryPi 2/3

12-22

Setting Up Selenium on RaspberryPi 2/3

12-22

Setting Up Selenium on RaspberryPi 2/3

12-22

Distilled News

09-18

2017 Winners and Losers

12-27

2018 Winners and Losers

01-06

2017 Winners and Losers

12-27

2018 Winners and Losers

01-06

2017 Winners and Losers

12-27

2017 Winners and Losers

12-27

2017 Winners and Losers

12-27

Magister Dixit

07-31

Can a Machine Be Racist or Sexist?

04-16

Linked Lists

12-28

Python Data Science jobs list into 2018

12-31

“discover feature relationships” – new EDA tool

01-10

Python Data Science jobs list into 2018

12-31

Top gsutil command lines to get started on Google Cloud Storage

01-01

Deploy a TensorFlow trained image classification model to AWS DeepLens

08-15

Package Paths in R

03-31

Top gsutil command lines to get started on Google Cloud Storage

01-01

ML/NLP Publications in 2017

01-02

Java Image Cat&Dog Recognition with Deep Neural Networks

01-03

Alternative approaches to scaling Shiny with RStudio Shiny Server, ShinyProxy or custom architecture.

12-18

Document worth reading: “Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks”

10-25

A Neural Architecture for Bayesian CompressiveSensing over the Simplex via Laplace Techniques

10-08

Java Image Cat&Dog Recognition with Deep Neural Networks

01-03

Should I do a Data Science bootcamp?

01-03

Engineering a New Career in Data Science: Alumni Spotlight on Abhishek Mishra

06-06

Should I do a Data Science bootcamp?

01-03

Cryptocurrency: Your Current Options

08-10

Interactive Broker’s SNAP Orders for Delayed Trading

01-03

Interactive Broker’s SNAP Orders for Delayed Trading

01-03

Interactive Broker’s SNAP Orders for Delayed Trading

01-03

Spam Detection with Natural Language Processing – Part 3

11-01

Deep learning, hydroponics, and medical marijuana

10-15

Keras vs. TensorFlow – Which one is better and which one should I learn?

10-08

Interactive Broker’s SNAP Orders for Delayed Trading

01-03

Help us understand your Data Science goals!

11-13

A Data Scientist’s Guide to an Efficient Project Lifecycle

10-25

Data Projects WILL Fail - Learn to Fail Quickly & Efficiently

09-21

Visual Reinforcement Learning with Imagined Goals

09-06

AHL Python Data Hackathon

04-22

New Year's Resolutions 2018

01-05

Two things about power

05-14

CES 2018

01-12

Leading the Charge 🔌 🚘: 10 Charts on Electric Vehicles in Plotly

10-09

CES 2018

01-12

CES 2018

01-12

Shortest Crease Problem

01-14

Machine Learning Trick of the Day (7): Density Ratio Trick

01-14

Shortest Crease Problem

01-14

Shortest Crease Problem

01-14

Shortest Crease Problem

01-14

Data Retrieval and Cleaning: Tracking Migratory Patterns

05-23

Machine Learning Trick of the Day (7): Density Ratio Trick

01-14

The Generalization Mystery: Sharp vs Flat Minima

01-18

The Generalization Mystery: Sharp vs Flat Minima

01-18

Java Object Tracking for Cars

11-30

Java Autonomous driving – Car detection

01-18

Java Autonomous driving – Car detection

01-18

Forget Motivation and Double Your Chances of Learning Success

11-20

Motivation in Academia vs Industry

01-21

Motivation in Academia vs Industry

01-21

Summer of Data Science Goal-Setting

06-06

Motivation in Academia vs Industry

01-21

Motivation in Academia vs Industry

01-21

Kernel Feature Selection via Conditional Covariance Minimization

01-23

Updated Review: jamovi User Interface to R

01-09

Distilled News

12-21

Kernel Feature Selection via Conditional Covariance Minimization

01-23

Choose Your Own Adventure – Analytics On-boarding

10-15

A Subtle Flaw in Some Popular R NSE Interfaces

09-24

“Identification of and correction for publication bias,” and another discussion of how forking paths is not the same thing as file drawer

08-31

Kernel Feature Selection via Conditional Covariance Minimization

01-23

Learn D3.js in 5 minutes

05-16

Kernel Feature Selection via Conditional Covariance Minimization

01-23

9 new pandas updates that will save you time

01-25

9 new pandas updates that will save you time

01-25

My notes on (Liang et al., 2017): Generalization and the Fisher-Rao norm

01-25

Habits and Tools, Old and New

01-26

Habits and Tools, Old and New

01-26

Habits and Tools, Old and New

01-26

TSrepr - Time Series Representations in R

01-26

TSrepr use case - Clustering time series representations in R

03-13

TSrepr - Time Series Representations in R

01-26

TSrepr - Time Series Representations in R

01-26

Time Series and MCHT

11-12

“Statistical and Machine Learning forecasting methods: Concerns and ways forward”

11-06

Time Series for scikit-learn People (Part I): Where's the X Matrix?

01-28

Time Series for scikit-learn People (Part I): Where's the X Matrix?

01-28

Time Series for scikit-learn People (Part I): Where's the X Matrix?

01-28

Neural Networks and the generalisation problem

01-28

Document worth reading: “A Review of Theoretical and Practical Challenges of Trusted Autonomy in Big Data”

12-04

Present each others’ posters

10-06

Neural Networks and the generalisation problem

01-28

Neural Networks and the generalisation problem

01-28

Word Morphing – an original idea

11-20

Document worth reading: “Opening the black box of deep learning”

10-28

automl package: part 2/2 first steps how to

10-24

Document worth reading: “Data Curation with Deep Learning [Vision]: Towards Self Driving Data Curation”

10-13

k-server, part 3: entropy regularization for weighted k-paging

01-29

k-server, part 3: entropy regularization for weighted k-paging

01-29

R Packages worth a look

01-13

Heavy Tailed Self Regularization in Deep Neural Nets: 1 year of research

12-18

If you did not already know

10-29

k-server, part 3: entropy regularization for weighted k-paging

01-29

k-server, part 3: entropy regularization for weighted k-paging

01-29

Counting Efficiently with Bounter pt. 2: CountMinSketch

01-31

Counting Efficiently with Bounter pt. 2: CountMinSketch

01-31

GARCH and a rudimentary application to Vol Trading

12-03

Document worth reading: “An Information-Theoretic Analysis of Deep Latent-Variable Models”

08-23

Counting Efficiently with Bounter pt. 2: CountMinSketch

01-31

How to use common workflows on Amazon SageMaker notebook instances

10-03

Static Blog: Jekyll, Hyde and GitHub Pages

02-01

Static Blog: Jekyll, Hyde and GitHub Pages

02-01

A Practical Guide to the "Open-Source Machine Learning Masters"

02-03

Hiring Data Scientists

02-04

Hiring Data Scientists

02-04

Hiring Data Scientists

02-04

Superbowl Helmet Puzzle

02-04

linl 0.0.3: Micro release

12-15

Superbowl Helmet Puzzle

02-04

Superbowl Helmet Puzzle

02-04

iPhone addiction? Get a grip!

02-06

Text Preprocessing in Python: Steps, Tools, and Examples

11-06

iPhone addiction? Get a grip!

02-06

Text Preprocessing in Python: Steps, Tools, and Examples

11-06

Why, oh why, do so many people embrace the Pacific Garbage Cleanup nonsense? (I have a theory).

09-18

iPhone addiction? Get a grip!

02-06

If you did not already know

12-30

Learning Robot Objectives from Physical Human Interaction

02-06

Learning Robot Objectives from Physical Human Interaction

02-06

Linus Sequence

02-06

Linus Sequence

02-06

Linus Sequence

02-06

epubr 0.6.0 CRAN release

01-11

My R Take in Advent of Code – Day 5

01-03

Linus Sequence

02-06

Natural and Artificial Intelligence

02-06

Natural and Artificial Intelligence

02-06

Pruning Neural Networks: Two Recent Papers

02-06

Pruning Neural Networks: Two Recent Papers

02-06

Training models with unequal economic error costs using Amazon SageMaker

09-18

Machine learning mega-benchmark: GPU providers (part 2)

02-08

Scaling H2O analytics with AWS and p(f)urrr (Part 1)

01-06

Scalable multi-node training with TensorFlow

12-17

R now supported in Azure SQL Database

11-28

Introducing Dynamic Training for deep learning with Amazon EC2

11-27

Lifecycle configuration update for Amazon SageMaker notebook instances

11-06

New speed record set for training deep learning models on AWS

08-22

Machine learning mega-benchmark: GPU providers (part 2)

02-08

Distilled News

10-08

Machine learning mega-benchmark: GPU providers (part 2)

02-08

Machine learning mega-benchmark: GPU providers (part 2)

02-08

Re-creating a Voronoi-Style Map with R

12-22

Common mistakes when carrying out machine learning and data science

12-06

Using a Keras Long Short-Term Memory (LSTM) Model to Predict Stock Prices

11-21

Data Notes: How Do Autoencoders Work?

09-20

Introduction to Learning to Trade with Reinforcement Learning

02-11

Introduction to Learning to Trade with Reinforcement Learning

02-11

Pancake Numbers

02-12

Pancake Numbers

02-12

Pancake Numbers

02-12

Pancake Numbers

02-12

Pancake Numbers

02-12

How to maraaverickfy a blog post without even reading it

02-12

How to maraaverickfy a blog post without even reading it

02-12

How to maraaverickfy a blog post without even reading it

02-12

Pervasive Simulator Misuse with Reinforcement Learning

02-14

Pervasive Simulator Misuse with Reinforcement Learning

02-14

Pervasive Simulator Misuse with Reinforcement Learning

02-14

Pervasive Simulator Misuse with Reinforcement Learning

02-14

Setting up Jupyter for Deep Learning on EC2

02-15

Setting up Jupyter for Deep Learning on EC2

02-15

Setting up Jupyter for Deep Learning on EC2

02-15

Creating List with Iterator

11-23

Hands-on: Creating Neural Networks using Chainer

02-15

Hands-on: Creating Neural Networks using Chainer

02-15

Hands-on: Creating Neural Networks using Chainer

02-15

RSiteCatalyst Version 1.4.14 Release Notes

02-16

RSiteCatalyst Version 1.4.14 Release Notes

02-16

R 3.5.2 now available

12-20

R 3.5.2 now available

12-20

Thanks, NVIDIA

08-01

PyData Conference & AHL Hackathon

02-16

Pivoted document length normalisation

06-19

Sutton’s Temporal-Difference Learning

02-19

Text to Speech Deep Learning Architectures

02-20

Text to Speech Deep Learning Architectures

02-20

Voice Control your Shiny Apps

10-15

DIY AI for the Future

06-27

Text to Speech Deep Learning Architectures

02-20

Most liked R-bloggers’ posts from last week (2018-10-07 till 2018-10-13 – based on twitter)

10-15

Markdown based web analytics? Rectangle your blog

02-21

3368a9b98a073e7ba296e1f5f41f6c4f

06-02

Markdown based web analytics? Rectangle your blog

02-21

Markdown based web analytics? Rectangle your blog

02-21

Markdown based web analytics? Rectangle your blog

02-21

Markdown based web analytics? Rectangle your blog

02-21

Reduce GPU costs with startup scripts on the Google Cloud Engine

02-21

Overlapping Disks

09-30

Reduce GPU costs with startup scripts on the Google Cloud Engine

02-21

Reduce GPU costs with startup scripts on the Google Cloud Engine

02-21

What Do Data Scientists Need to Know about Containerization? As Little as Possible.

02-22

What Do Data Scientists Need to Know about Containerization? As Little as Possible.

02-22

Google Calendar should prevent spam by default

02-22

Google Calendar should prevent spam by default

02-22

My steps into Data Science

05-21

Google Calendar should prevent spam by default

02-22

Getting Started With MapD, Part 2: Electricity Dataset

02-23

Getting Started With MapD, Part 2: Electricity Dataset

02-23

Getting Started With MapD, Part 2: Electricity Dataset

02-23

Getting Started With MapD, Part 2: Electricity Dataset

02-23

Getting Started With MapD, Part 2: Electricity Dataset

02-23

My Approach to Natas Level 11 (a Web Security Game)

02-23

Python Patterns: max Instead of if

01-10

AzureStor: an R package for working with Azure storage

12-18

AzureStor: an R package for working with Azure storage

12-18

Let Automation Carry You from BI to AI in 2019

12-11

New R Cheatsheet: Data Science Workflow with R

11-04

“Snip Insights” – An Open Source Cross-Platform AI Tool for Intelligent Screen Capture

10-03

Python Dictionary Tutorial

10-03

How to use an R interface with Airtable API

05-23

My Approach to Natas Level 11 (a Web Security Game)

02-23

“My advisor and I disagree on how we should carry out repeated cross-validation. We would love to have a third expert opinion…”

12-15

Automated Web Scraping in R

12-11

R plus Magento 2 REST API revisited: part 3 – more complex samples of use

12-02

Integrating R and Telegram

11-07

Get started with automated metadata extraction using the AWS Media Analysis Solution

09-07

✚ Detailed Intentions of a Map, When Everything Leads to Nothing, Designing for Misinterpretations

08-09

Deep Learning for Emojis with VS Code Tools for AI – Part 2

06-05

Connect to Google Sheets in Power BI using R

03-06

My Approach to Natas Level 11 (a Web Security Game)

02-23

Java Art Generation with Neural Style Transfer

02-24

Java Art Generation with Neural Style Transfer

02-24

Document worth reading: “Neural Style Transfer: A Review”

01-02

styler 1.1.0

11-27

Synesthesia: The Sound of Style

08-29

Understanding Latent Style

06-28

Java Art Generation with Neural Style Transfer

02-24

Java Art Generation with Neural Style Transfer

02-24

Mathematics of Tape Recorders

02-28

Mathematics of Tape Recorders

02-28

Mathematics of Tape Recorders

02-28

Mathematics of Tape Recorders

02-28

Mathematics of Tape Recorders

02-28

The Sickness That Is Depression

02-28

The Sickness That Is Depression

02-28

Distilled News

11-24

The Sickness That Is Depression

02-28

The Sickness That Is Depression

02-28

The Sickness That Is Depression

02-28

Image Recognition and Object Detection

02-28

Image Recognition and Object Detection

02-28

Image Recognition and Object Detection

02-28

Nine digits puzzle

03-02

Nine digits puzzle

03-02

Lucy`s Secret Number puzzle

06-03

Nine digits puzzle

03-02

Nine digits puzzle

03-02

Multithreaded in the Wild

12-03

Multithreaded in the Wild

11-01

Multithreaded in the Wild

10-05

Multithreaded in the Wild

09-07

Multithreaded in the Wild

06-11

Multithreaded in the Wild

05-07

Multithreaded in the Wild

03-02

Multithreaded in the Wild

03-02

Multithreaded in the Wild

03-02

The 2018 Best Picture Nominees Ranked, Reviewed, and Reflected Upon

03-03

The 2018 Best Picture Nominees Ranked, Reviewed, and Reflected Upon

03-03

An Utility Function For Monotonic Binning

12-03

Integration method to map model scores to conversion rates from example data

03-04

R Packages worth a look

09-10

Integration method to map model scores to conversion rates from example data

03-04

Mounting multiple data and outputs volumes

07-15

Saving, resuming, and restarting experiments with Polyaxon

05-03

Jupyter notebooks and tensorboard on Polyaxon

03-04

Jupyter notebooks and tensorboard on Polyaxon

03-04

Jupyter notebooks and tensorboard on Polyaxon

03-04

Jupyter notebooks and tensorboard on Polyaxon

03-04

Linguistic Signals of Album Quality: A Predictive Analysis of Pitchfork Review Scores Using Quanteda

01-10

Mouse Among the Cats

09-11

Researchers.one: A souped-up Arxiv with pre- and post-publication review

09-10

Distill Update 2018

08-14

ICML Board and Reviewer profiles

03-05

FIFA WC 2018: Quarter Final Stage Preditions

07-06

ICML Board and Reviewer profiles

03-05

Compound interest and retirement

03-05

Compound interest and retirement

03-05

Compound interest and retirement

03-05

Compound interest and retirement

03-05

Top 10 oldest and youngest industries in the U.S.

03-05

Top 10 oldest and youngest industries in the U.S.

03-05

Top 10 oldest and youngest industries in the U.S.

03-05

How Americans make a living based on their age

03-06

Connect to Google Sheets in Power BI using R

03-06

Connect to Google Sheets in Power BI using R

03-06

Connect to Google Sheets in Power BI using R

03-06

The Building Blocks of Interpretability

03-06

Teaching kids data visualization

11-29

11 Design Tips for Data Visualization

10-25

Obtaining the number of components from cross validation of principal components regression

10-15

Document worth reading: “A Survey on Visual Query Systems in the Web Era (extended version)”

08-20

The Building Blocks of Interpretability

03-06

Understanding rolling calculations in R

03-07

Sock Puzzle Revisited

03-07

Sock Puzzle Revisited

03-07

When Men and Women talk to Siri

03-09

If you did not already know

10-17

AI, Machine Learning and Data Science Announcements from Microsoft Ignite

10-02

Make R speak

08-16

Twilio offers greater voice selection to customers with Amazon Polly integration

08-06

When Men and Women talk to Siri

03-09

How simpleshow uses Amazon Polly to voice stories in their explainer videos

01-11

Meet Zhiyu—the first Mandarin Chinese voice for Amazon Polly

09-14

Make R speak

08-16

Twilio offers greater voice selection to customers with Amazon Polly integration

08-06

Amazon Polly adds bilingual Indian English/Hindi language support

08-02

When Men and Women talk to Siri

03-09

Transfer Your Font Style with GANs

03-13

TSrepr use case - Clustering time series representations in R

03-13

Quasiquotation in R via bquote()

10-16

Turning Water into Wine

03-13

Document worth reading: “Instance-Level Explanations for Fraud Detection: A Case Study”

01-01

Announcing Kaggle integration with Google Data Studio

12-05

Using RSiteCatalyst With Microsoft PowerBI Desktop

03-13

Running an R script on heroku

12-06

Using RSiteCatalyst With Microsoft PowerBI Desktop

03-13

Technoslavia 2.5: Open Source Topography

11-07

Two cool features of Python NumPy: Mutating by slicing and Broadcasting

03-17

Two cool features of Python NumPy: Mutating by slicing and Broadcasting

03-17

Distributed Deep Learning with Polyaxon

03-18

Why you should start using .npy file more often…

03-20

Deterministic A/B tests via the hashing trick

03-20

Deterministic A/B tests via the hashing trick

03-20

Deterministic A/B tests via the hashing trick

03-20

Engineering Data Science at Automattic

03-20

Notes on the Frank-Wolfe Algorithm, Part II: A Primal-dual Analysis

11-14

Three Operator Splitting

09-04

Notes on the Frank-Wolfe algorithm, Part I

03-20

Notes on the Frank-Wolfe Algorithm, Part II: A Primal-dual Analysis

11-14

Three Operator Splitting

09-04

Notes on the Frank-Wolfe algorithm, Part I

03-20

How many college football teams can you watch in-person in one football season?

03-21

TDWI In-Person and Virtual Data and Analytics Training

10-10

How many college football teams can you watch in-person in one football season?

03-21

How many college football teams can you watch in-person in one football season?

03-21

Good Feature Building Techniques and Tricks for Kaggle

12-31

Reflections on remote data science work

11-03

How many college football teams can you watch in-person in one football season?

03-21

Time Series for scikit-learn People (Part II): Autoregressive Forecasting Pipelines

03-22

Crossing Your Data Science Chasm

03-22

The Bull Survived on Friday, but Barely

03-25

Mérida, Yucatán

03-25

Mérida, Yucatán

03-25

Mérida, Yucatán

03-25

Mérida, Yucatán

03-25

Escaping the macOS 10.14 (Mojave) Filesystem Sandbox with R / RStudio

11-09

Using a Column as a Column Index

09-21

Data Science and Python

03-29

Data Science and Python

03-29

Data Science and Python

03-29

Webinar – Integrate AI Across Insurance Operations to Turbocharge Tech Transformation, Nov 14

10-31

From Project Manager to Data Champion — Conquer Your Data Projects

10-18

Data Science and Python

03-29

RcppStreams 0.1.2

01-07

R 101

12-24

BH 1.69.0-0 pre-releases and three required changes

12-20

RcppEigen 0.3.3.5.0

11-24

Quick overview on the new Bioconductor 3.8 release

11-02

RcppRedis 0.1.9

10-27

A small logical change with big impact

10-16

A small logical change with big impact

10-16

A small logical change with big impact

10-16

Package support offer

10-15

Microsoft R Open 3.5.1 now available

08-14

Learn to R blog series - R and RStudio

03-29

Learn to R blog series - R and RStudio

03-29

If you did not already know

11-27

Google Dataset Search : Google’s New Data Search Engine

09-10

Visual search on AWS—Part 1: Engine implementation with Amazon SageMaker

08-30

Package Paths in R

03-31

Automated machine learning is coming... and it won't matter

04-04

Quarterly product update: Create your data science projects on Kaggle

04-04

Quarterly product update: Create your data science projects on Kaggle

04-04

Use Kaggle to start (and guide) your ML/ Data Science journey — Why and How

08-29

Profiling Top Kagglers: Martin Henze (AKA Heads or Tails), World's First Kernels Grandmaster

06-19

Mother's Day Interview: How Nicole Finnie Became a Competitive Kaggler on Maternity Leave

05-10

Profiling Top Kagglers: Bestfitting, Currently

05-07

Quarterly product update: Create your data science projects on Kaggle

04-04

NYU Stern Fubon Center for Technology, Business and Innovation: Fubon Center Faculty Fellow [New York, NY]

01-08

Magister Dixit

01-06

Maryville University: Business Intelligence Analyst [St. Louis, MO]

01-04

Strata Data SF 2019 KDnuggets Offer

01-04

New Year's Resolution: Help Data Scientists Help You

12-31

Using the Economics Value Curve to Drive Digital Transformation

12-27

Six Steps to Master Machine Learning with Data Preparation

12-21

How will automation tools change data science?

12-18

The 2019 PAW Business Agenda is Live – Super Early Bird expires this Friday

12-17

Vanguard: Senior AI Engineer [Malvern, PA]

12-17

Top Insights from 50 Chief Data Officers

12-14

Why You Shouldn’t be a Data Science Generalist

12-14

Are you ready to tackle the data-driven revolution?

12-13

MINDBODY: Business Intelligence Analyst II [San Luis Obispo, CA]

12-13

AI, Data Science, Analytics Main Developments in 2018 and Key Trends for 2019

12-03

Insights on the role data can play in your organization

11-19

UnitedHealth Group: Data Analytics and Reporting Lead [Minnetonka, MN or Telecommute]

11-16

(Webinar) Farmers and Chubb on Humanizing Claims with AI

11-15

Bright Lights, Bright Future. TDWI Is Back in Vegas

11-14

Metadata Enrichment is Essential to Realize the Value of Open Datasets

11-14

Top Data Science Hacks

11-05

Don’t use AI when BI will suffice!

11-05

Top Data Science Hacks

11-05

New R Cheatsheet: Data Science Workflow with R

11-04

Data Science With R Course Series – Week 7

10-29

If you did not already know

10-28

M4 Forecasting Conference

10-24

Holy Grail of AI for Enterprise — Explainable AI

10-19

Distilled News

10-09

Journey from Non-Technical background to an expert in Data Science

10-05

UnitedHealth Group: UHC Digital Project Manager [Minnetonka, MN]

10-04

Magister Dixit

10-02

Distilled News

09-23

Understanding Different Components & Roles in Data Science

09-18

Magister Dixit

09-07

No code chatbots: TIBCO uses Amazon Lex to put chat interfaces into the hands of business users

09-05

Distilled News

09-04

Understanding Different Components & Roles in Data Science

08-30

ACL 2018 Highlights: Understanding Representations and Evaluation in More Challenging Settings

07-26

Top-Down vs. Bottom-Up Approaches to Data Science

07-10

The Ponzi threshold and the Armstrong principle

07-02

Lumpers and Splitters: Tensions in Taxonomies

04-05

R Spatial Resources

04-06

R Spatial Resources

04-06

R Spatial Resources

04-06

Multithreaded in the Wild

11-01

Multithreaded in the Wild

10-05

Multithreaded in the Wild

09-07

Defining data science in 2018

07-22

Multithreaded in the Wild

04-09

Circle circumference in the hyperbolic plane is exponential in the radius: proof by computer game

04-10

Circle circumference in the hyperbolic plane is exponential in the radius: proof by computer game

04-10

Circle circumference in the hyperbolic plane is exponential in the radius: proof by computer game

04-10

Graph embeddings in Hyperbolic Space

04-10

Circle circumference in the hyperbolic plane is exponential in the radius: proof by computer game

04-10

Circle circumference in the hyperbolic plane is exponential in the radius: proof by computer game

04-10

Towards a Virtual Stuntman

04-10

Towards a Virtual Stuntman

04-10

Towards a Virtual Stuntman

04-10

Gradient optimisation on the Poincaré disc

04-10

ROCK 'n' ROLL TRAFFIC ROUTING, WITH NEO4J

12-05

Co-localization analysis of fluorescence microscopy images

11-27

Solving the chinese postman problem

10-19

“Fudged statistics on the Iraq War death toll are still circulating today”

10-06

If you did not already know

08-20

Graph embeddings in Hyperbolic Space

04-10

Graph embeddings in Hyperbolic Space

04-10

Goals and Principles of Representation Learning

04-12

Goals and Principles of Representation Learning

04-12

Traveling salesman portrait in Python

04-12

Traveling salesman portrait in Python

04-12

Traveling salesman portrait in Python

04-12

Seasonalities: The Near-Term Future for the Market

04-14

Seasonalities: The Near-Term Future for the Market

04-14

Seasonalities: The Near-Term Future for the Market

04-14

Can a Machine Be Racist or Sexist?

04-16

DataCamp: Part-time Contract Instructors [Remote]

10-11

Data science books - theory and practice

06-29

Why Start a Data Science Project?

04-18

Why Start a Data Science Project?

04-18

Why Start a Data Science Project?

04-18

The seminar speaker contest begins: Jim Thorpe (1) vs. John Oliver

01-07

Update on the R Consortium Census Working Group

10-22

Why Start a Data Science Project?

04-18

R Packages worth a look

12-28

Speak at Mega-PAW Vegas 2019 – on Machine Learning Deployment (Apply by Nov 15)

10-22

Why Start a Data Science Project?

04-18

How many CRAN package maintainers have been pwned?

04-18

How many CRAN package maintainers have been pwned?

04-18

Shared Autonomy via Deep Reinforcement Learning

04-18

Shared Autonomy via Deep Reinforcement Learning

04-18

Document worth reading: “A Survey of Knowledge Representation and Retrieval for Learning in Service Robotics”

12-26

Soft Actor Critic—Deep Reinforcement Learning with Real-World Robots

12-14

Visual Model-Based Reinforcement Learning as a Path towards Generalist Robots

11-30

Document worth reading: “Artificial Intelligence for Long-Term Robot Autonomy: A Survey”

11-04

Dexterous Manipulation with Reinforcement Learning: Efficient, General, and Low-Cost

08-31

Import AI: 108: Learning language with fake sentences, Chinese researchers use RL to train prototype warehouse robots; and what the implications are of scaled-up Neural Architecture Search

08-20

One-Shot Imitation from Watching Videos

06-28

Shared Autonomy via Deep Reinforcement Learning

04-18

Announcing Ursa Labs: an innovation lab for open source data science

04-19

Some web API package development lessons from HIBPwned

04-19

Some web API package development lessons from HIBPwned

04-19

AHL Python Data Hackathon

04-22

AHL Python Data Hackathon

04-22

KDnuggets™ News 19:n02, Jan 9: The cold start problem: how to build your machine learning portfolio; 5 Best Data Visualization Libraries

01-09

KDnuggets™ News 19:n01, Jan 3: The Essence of Machine Learning; A Guide to Decision Trees for Machine Learning and Data Science

01-03

KDnuggets™ News 18:n48, Dec 19: Why You Shouldn’t be a Data Science Generalist; Industry Data Science & Machine Learning 2019 Predictions

12-19

KDnuggets™ News 18:n47, Dec 12: Common mistakes when doing machine learning; Here are the most popular Python IDEs / Editors

12-12

KDnuggets™ News 18:n46, Dec 5: AI, Data Science, Analytics 2018 Main Developments, 2019 Key Trends; Deep Learning Cheat Sheets

12-05

KDnuggets™ News 18:n45, Nov 28: Your Favorite Python IDE/editor? Intro to Data Science for Managers

11-28

Filter Clickbait from News Content with our custom Natural Language Processing Model

11-28

KDnuggets™ News 18:n44, Nov 21: What is the Best Python IDE for Data Science?; Anticipating the next move in data science

11-21

KDnuggets™ News 18:n43, Nov 14: To get hired as a data scientist, don’t follow the herd; LinkedIn Top Voices in Data Science & Analytics

11-14

KDnuggets™ News 18:n42, Nov 7: The Most in Demand Skills for Data Scientists; How Machines Understand Our Language: Intro to NLP

11-07

KDnuggets™ News 18:n41, Oct 31: Introduction to Deep Learning with Keras; Easy Named Entity Recognition with Scikit-Learn

10-31

The Axios Turing test and the heat death of the journalistic universe

10-25

Building statues of hope in augmented reality

10-22

KDnuggets™ News 18:n39, Oct 17: 10 Best Mobile Apps for Data Scientist; Vote in new poll: Largest dataset you analyzed?

10-17

KDnuggets™ News 18:n38, Oct 10: Concise Explanation of Learning Algorithms; Why I Call Myself a Data Scientist; Linear Regression in the Wild

10-10

KDnuggets™ News 18:n37, Oct 3: Mathematics of Machine Learning; Effective Transfer Learning for NLP; Path Analysis with R

10-03

Fake News and Filter Bubbles

08-21

Build this media monitoring Slack bot in 20 minutes without writing code

07-04

Opinion mining on Dutch news articles

06-20

Cambridge Analytica, Facebook, and user data – Monthly Media Review with the AYLIEN News API, April

05-03

Using 360° Stance Detection to Analyze coverage of Donald Trump by CNN

04-30

Using Natural Language Processing to Combat Filter Bubbles and Fake News – 360° Stance Detection

04-24

Using Natural Language Processing to Combat Filter Bubbles and Fake News – 360° Stance Detection

04-24

A Three Month Data Analysis in Excel Could Have Taken Me One Day

10-01

BD reviews

07-11

What's New in Dataquest v1.85: Takeaways, Intermediate R, and More

05-25

Cambridge Analytica, Facebook, and user data – Monthly Media Review with the AYLIEN News API, April

05-03

Using Natural Language Processing to Combat Filter Bubbles and Fake News – 360° Stance Detection

04-24

How a meme grew into a campaign slogan

11-05

Maryland's Bridge Safety, reported using R

10-19

A Three Month Data Analysis in Excel Could Have Taken Me One Day

10-01

Monitoring the media reaction to Facebook’s disastrous earnings call – News API Monthly Media Review

08-16

BD reviews

07-11

What's New in Dataquest v1.85: Takeaways, Intermediate R, and More

05-25

Cambridge Analytica, Facebook, and user data – Monthly Media Review with the AYLIEN News API, April

05-03

Using Natural Language Processing to Combat Filter Bubbles and Fake News – 360° Stance Detection

04-24

Role of Computer Science in Data Science World

01-07

Top Data Science Hacks

11-05

Top Data Science Hacks

11-05

Simple Architectures Outperform Complex Ones in Language Modeling

04-25

Simple Architectures Outperform Complex Ones in Language Modeling

04-25

Simple Architectures Outperform Complex Ones in Language Modeling

04-25

Simple Architectures Outperform Complex Ones in Language Modeling

04-25

TDM: From Model-Free to Model-Based Deep Reinforcement Learning

04-26

Gensim Survey 2018

04-30

PyDataLondon 2018 and “Creating Correct and Capable Classifiers”

04-30

If you did not already know

09-20

Document worth reading: “Accelerating CNN inference on FPGAs: A Survey”

09-08

Using 360° Stance Detection to Analyze coverage of Donald Trump by CNN

04-30

Using 360° Stance Detection to Analyze coverage of Donald Trump by CNN

04-30

Using 360° Stance Detection to Analyze coverage of Donald Trump by CNN

04-30

Book review: SQL Server 2017 Machine Learning Services with R

09-04

Using 360° Stance Detection to Analyze coverage of Donald Trump by CNN

04-30

Technology and Information: Data Science and UX

05-01

Technology and Information: Data Science and UX

05-01

How analog TV worked

05-01

How analog TV worked

05-01

A Study Of Reddit Politics

06-20

A particles-arly fun book draw

05-02

My

12-28

A crystal clear book draw

06-01

A particles-arly fun book draw

05-02

An intuitive, visual guide to copulas

05-03

An intuitive, visual guide to copulas

05-03

5 Ways in which Data Science is Revolutionizing Web Development

01-03

Monitoring the media reaction to Facebook’s disastrous earnings call – News API Monthly Media Review

08-16

Shared items

08-11

Cambridge Analytica, Facebook, and user data – Monthly Media Review with the AYLIEN News API, April

05-03

Saving, resuming, and restarting experiments with Polyaxon

05-03

Software as an academic publication

05-03

Quick DB result caching in R

05-05

Quick DB result caching in R

05-05

Kung Fury Review (2015) : Don’t Hassle the Hoff

05-05

Kung Fury Review (2015) : Don’t Hassle the Hoff

05-05

Kung Fury Review (2015) : Don’t Hassle the Hoff

05-05

Kung Fury Review (2015) : Don’t Hassle the Hoff

05-05

Data Links

05-06

Data Links

05-06

Data Links

05-06

Data Links

05-06

Data Links

05-06

Ffa1ea00fdab31b3b44b87839c503629

05-06

Ffa1ea00fdab31b3b44b87839c503629

05-06

Multithreaded in the Wild

10-05

Multithreaded in the Wild

05-07

Multithreaded in the Wild

05-07

R Packages worth a look

10-15

Data types

05-08

Day 01 – little helper checkdir

12-01

Data types

05-08

Mother's Day Interview: How Nicole Finnie Became a Competitive Kaggler on Maternity Leave

05-10

The Lottery Ticket Hypothesis - Paper Recommendation

05-10

The Lottery Ticket Hypothesis - Paper Recommendation

05-10

The Lottery Ticket Hypothesis - Paper Recommendation

05-10

The Lottery Ticket Hypothesis - Paper Recommendation

05-10

The Lottery Ticket Hypothesis - Paper Recommendation

05-10

Two things about power

05-14

Differentiable Dynamic Programs and SparseMAP Inference

05-15

Teaching R to New Users - From tapply to the Tidyverse

07-12

Cultural Differences in Map Data Visualization

06-30

Rethinking Academic Data Sharing

05-15

Teaching R to New Users - From tapply to the Tidyverse

07-12

Rethinking Academic Data Sharing

05-15

Rethinking Academic Data Sharing

05-15

Rethinking Academic Data Sharing

05-15

Learn D3.js in 5 minutes

05-16

R Packages worth a look

12-17

A Subtle Flaw in Some Popular R NSE Interfaces

09-24

“Identification of and correction for publication bias,” and another discussion of how forking paths is not the same thing as file drawer

08-31

R Packages worth a look

08-15

Learn D3.js in 5 minutes

05-16

Machine Learning Trick of the Day (8): Instrumental Thinking

10-15

Multiple Linear Regression & Assumptions of Linear Regression: A-Z

08-17

Using Linear Regression for Predictive Modeling in R

05-16

Awesome postdoc opportunities in computational genomics at JHU

05-17

Monash University: Research Fellow (Digital Civics) [Melbourne, Australia]

11-22

URI: Director, Data Analytics/DataSpark [Kingston, RI]

11-15

Awesome postdoc opportunities in computational genomics at JHU

05-17

Awesome postdoc opportunities in computational genomics at JHU

05-17

Delayed Impact of Fair Machine Learning

05-17

WNS Hackathon Solutions by Top Finishers

12-13

Data Science With R Course Series – Week 9

11-12

Delayed Impact of Fair Machine Learning

05-17

Microsoft Weekly Data Science News for May 18, 2018

05-18

“Bayesian Meta-Analysis with Weakly Informative Prior Distributions”

07-13

3 Things We Can Do About Fake News

05-18

lmer vs INLA for variance components

11-24

3 Things We Can Do About Fake News

05-18

3 Things We Can Do About Fake News

05-18

Life-cycle of a Data Science Project

05-18

Life-cycle of a Data Science Project

05-18

My steps into Data Science

05-21

Enterprise Deployment Tips for Azure Data Science Virtual Machine (DSVM)

05-21

How to use an R interface with Airtable API

05-23

How to use an R interface with Airtable API

05-23

Best practices with pandas (video series)

05-23

Best practices with pandas (video series)

05-23

Best practices with pandas (video series)

05-23

Best practices with pandas (video series)

05-23

SQLite vs Pandas: Performance Benchmarks

05-23

SQLite vs Pandas: Performance Benchmarks

05-23

Data Retrieval and Cleaning: Tracking Migratory Patterns

05-23

Data Retrieval and Cleaning: Tracking Migratory Patterns

05-23

Data Retrieval and Cleaning: Tracking Migratory Patterns

05-23

Data Retrieval and Cleaning: Tracking Migratory Patterns

05-23

The Blessings of Multiple Causes: Causal Inference when you Can't Measure Confounders

09-07

ML beyond Curve Fitting: An Intro to Causal Inference and do-Calculus

05-24

The most practical causal inference book I’ve read (is still a draft)

12-24

ML beyond Curve Fitting: An Intro to Causal Inference and do-Calculus

05-24

Light FM Recommendation System Explained

05-24

Light FM Recommendation System Explained

05-24

Light FM Recommendation System Explained

05-24

Why AI Isn’t A Black Box (And Its Business Value)

07-17

Light FM Recommendation System Explained

05-24

Why Lies Spread Faster than the Truth

05-24

Why Lies Spread Faster than the Truth

05-24

Why Lies Spread Faster than the Truth

05-24

Why Lies Spread Faster than the Truth

05-24

If you did not already know

10-05

Context Compatibility in Data Analysis

05-24

Context Compatibility in Data Analysis

05-24

Context Compatibility in Data Analysis

05-24

How digital cameras work

05-25

How digital cameras work

05-25

What's New in Dataquest v1.85: Takeaways, Intermediate R, and More

05-25

“Creating correct and capable classifiers” at PyDataAmsterdam 2018

05-26

Image Compression using K-means Clustering.

05-28

Image Compression using K-means Clustering.

05-28

Summer of Data Science 2018

05-28

Summer of Data Science 2018

05-28

How to Use FPGAs for Deep Learning Inference to Perform Land Cover Mapping on Terabytes of Aerial Images

05-29

How to Use FPGAs for Deep Learning Inference to Perform Land Cover Mapping on Terabytes of Aerial Images

05-29

How to Use FPGAs for Deep Learning Inference to Perform Land Cover Mapping on Terabytes of Aerial Images

05-29

How to Use FPGAs for Deep Learning Inference to Perform Land Cover Mapping on Terabytes of Aerial Images

05-29

EARL conference recap: Seattle 2018

11-24

The Golden Rule of Nudge

10-10

It was the weeds that bothered him.

08-14

It was the weeds that bothered him.

08-14

Import AI:

05-29

Some updates

05-29

Some updates

05-29

Some updates

05-29

Some updates

05-29

Document worth reading: “Does putting your emotions into words make you feel better? Measuring the minute-scale dynamics of emotions from online data”

08-14

How to Overcome Imposter Syndrome For Good

05-30

How to Overcome Imposter Syndrome For Good

05-30

“When Both Men and Women Drop Out of the Labor Force, Why Do Economists Only Ask About Men?”

12-23

How to Overcome Imposter Syndrome For Good

05-30

UnitedHealth Group: Director, Omni-Channel Analytics [Minnetonka, MN]

11-19

BDD100K: A Large-scale Diverse Driving Video Database

05-30

BDD100K: A Large-scale Diverse Driving Video Database

05-30

BDD100K: A Large-scale Diverse Driving Video Database

05-30

BDD100K: A Large-scale Diverse Driving Video Database

05-30

Document worth reading: “Universality of Deep Convolutional Neural Networks”

01-10

If you did not already know

08-05

Convolve all the things

05-31

Convolve all the things

05-31

Rules to Learn By

05-31

Rules to Learn By

05-31

Rules to Learn By

05-31

Six Dice Betting Game

05-31

R Packages worth a look

11-08

Six Dice Betting Game

05-31

Six Dice Betting Game

05-31

Document worth reading: “I can see clearly now: reinterpreting statistical significance”

01-08

He’s a history teacher and he has a statistics question

10-20

“Tweeking”: The big problem is not where you think it is.

09-23

R Generation: 25 Years of R

08-01

Talking about clinical significance

06-01

Gazing into the Abyss of P-Hacking: HARKing vs. Optional Stopping

11-14

The purported CSI effect and the retroactive precision fallacy

11-05

Talking about clinical significance

06-01

A crystal clear book draw

06-01

A crystal clear book draw

06-01

A crystal clear book draw

06-01

Parallel, Disk-Efficient .zip to .gz Conversion

06-01

Parallel, Disk-Efficient .zip to .gz Conversion

06-01

Parallel, Disk-Efficient .zip to .gz Conversion

06-01

Parallel, Disk-Efficient .zip to .gz Conversion

06-01

Parallel, Disk-Efficient .zip to .gz Conversion

06-01

Python and Tidyverse

06-01

Biggest Deep Learning Summit – Special KDnuggets Offer

01-10

Bulk Loading Shapefiles Into Postgres/Postgis

06-01

Bulk Loading Shapefiles Into Postgres/Postgis

06-01

Bulk Loading Shapefiles Into Postgres/Postgis

06-01

Bulk Loading Shapefiles Into Postgres/Postgis

06-01

3368a9b98a073e7ba296e1f5f41f6c4f

06-02

Document worth reading: “A Survey of Knowledge Representation and Retrieval for Learning in Service Robotics”

12-26

3368a9b98a073e7ba296e1f5f41f6c4f

06-02

CRN: The 10 Coolest Machine-Learning And AI Startups Of 2018 (So Far)

07-16

Lucy`s Secret Number puzzle

06-03

Lucy`s Secret Number puzzle

06-03

Lucy`s Secret Number puzzle

06-03

Data Links

06-03

“Ivy League Football Saw Large Reduction in Concussions After New Kickoff Rules”

10-05

Data Links

06-03

Data Links

06-03

Data Links

06-03

rqdatatable: rquery Powered by data.table

06-03

rqdatatable: rquery Powered by data.table

06-03

Trustworthy Data Analysis

06-04

Trustworthy Data Analysis

06-04

Trustworthy Data Analysis

06-04

When the bubble bursts…

06-04

When the bubble bursts…

06-04

SiliconANGLE: Machine learning automation startup DataRobot lands $100M round

10-24

ITWire: VIDEO Interview with a DataRobot: Greg Michaelson talks AI, banking, machine learning and more

10-24

Vulcan Post: This AI Startup Is Run By The World’s Top Data Scientists – Lets Anyone Build Predictive Models

09-04

Forbes: DataRobot Puts the Power of Machine Learning in the Hands of Business Analysts

06-04

KNNs (K-Nearest-Neighbours) in Python

11-22

To get hired as a data scientist, don’t follow the herd

11-12

New Poll: What was the largest dataset you analyzed / data mined?

10-12

✚ Google Dataset Search Impressions, the Challenges of Looking for Data, and Other Places to Find Data

09-13

Forbes: DataRobot Puts the Power of Machine Learning in the Hands of Business Analysts

06-04

Import AI

06-05

Deep Learning for Emojis with VS Code Tools for AI – Part 2

06-05

Summer of Data Science Goal-Setting

06-06

Forget Motivation and Double Your Chances of Learning Success

11-20

Summer of Data Science Goal-Setting

06-06

Debiasing Approximate Inference

12-05

Summer of Data Science Goal-Setting

06-06

If you did not already know

01-13

How to combine Multiple ggplot Plots to make Publication-ready Plots

01-12

Easily train models using datasets labeled by Amazon SageMaker Ground Truth

12-20

Labeling Unstructured Text for Meaning to Achieve Predictive Lift

10-31

Introduction to Active Learning

10-23

Document worth reading: “Does putting your emotions into words make you feel better? Measuring the minute-scale dynamics of emotions from online data”

08-14

Automatically Tag Trello Cards with Zapier and Natural Language Processing

06-07

Harmonizing and emojifying our GitHub issue trackers

07-12

Automatically Tag Trello Cards with Zapier and Natural Language Processing

06-07

Build this media monitoring Slack bot in 20 minutes without writing code

07-04

Automatically Tag Trello Cards with Zapier and Natural Language Processing

06-07

Automatically Tag Trello Cards with Zapier and Natural Language Processing

06-07

Data Scientist’s Dilemma – The Cold Start Problem

12-15

Bringing Machine Learning Research to Product Commercialization

11-27

A more systematic look at suppressed data by @ellis2013nz

11-17

Distilled News

11-14

Data Science With R Course Series – Week 6

10-22

R Packages worth a look

10-02

AWS Deep Learning AMIs now with optimized TensorFlow 1.9 and Apache MXNet 1.2 with Keras 2 support to accelerate deep learning on Amazon EC2 instances

07-23

“If you deprive the robot of your intuition about cause and effect, you’re never going to communicate meaningfully.” – Pearl ’18

06-08

Why Would Prosthetic Arms Need to See or Connect to Cloud AI?

09-10

Connected Arms – Can AI Revolutionize Prosthetic Devices & Make them More Affordable?

09-07

“If you deprive the robot of your intuition about cause and effect, you’re never going to communicate meaningfully.” – Pearl ’18

06-08

Programming Best Practices For Data Science

06-08

Monitoring the media reaction to Facebook’s disastrous earnings call – News API Monthly Media Review

08-16

Programming Best Practices For Data Science

06-08

Cryptocurrency: Your Current Options

08-10

Is it Time to Regulate Bitcoin?

06-19

Bitcoin and Cryptocurrency Litigation

06-08

Cryptocurrency: Your Current Options

08-10

Is it Time to Regulate Bitcoin?

06-19

Bitcoin and Cryptocurrency Litigation

06-08

Document worth reading: “Computational Power and the Social Impact of Artificial Intelligence”

12-27

If you did not already know

12-11

Key Takeaways from AI Conference SF, Day 2: AI and Security, Adversarial Examples, Innovation

10-30

Document worth reading: “Accelerating CNN inference on FPGAs: A Survey”

09-08

Bitcoin and Cryptocurrency Litigation

06-08

Bitcoin and Cryptocurrency Litigation

06-08

Parallelize a For-Loop by Rewriting it as an Lapply Call

01-11

Change over time is not “treatment response”

11-19

Estimating mortality rates in Puerto Rico after hurricane María using newly released official death counts

06-08

Estimating mortality rates in Puerto Rico after hurricane María using newly released official death counts

06-08

Estimating mortality rates in Puerto Rico after hurricane María using newly released official death counts

06-08

Philippine Senate Bills: NLP Word Cloud Analysis for the 13th to 17th Congress

06-09

Philippine Senate Bills: NLP Word Cloud Analysis for the 13th to 17th Congress

06-09

Philippine Senate Bills: NLP Word Cloud Analysis for the 13th to 17th Congress

06-09

Top KDnuggets tweets, Oct 10-16: 7 Books to Grasp Mathematical Foundations of Data Science and Machine Learning; 6 Books Every Data Scientist Should Keep Nearby

10-17

Philippine Senate Bills: NLP Word Cloud Analysis for the 13th to 17th Congress

06-09

The Dynamics of Philippine Senate Bills: Gensim, Topic Modeling and All That Good NLP Stuff

06-09

World Models Experiments

06-09

An ode to King James

06-10

An ode to King James

06-10

An ode to King James

06-10

An ode to King James

06-10

Annihilation Review (2018) : The Descent + Arrival

06-10

Annihilation Review (2018) : The Descent + Arrival

06-10

Annihilation Review (2018) : The Descent + Arrival

06-10

Annihilation Review (2018) : The Descent + Arrival

06-10

Annihilation Review (2018) : The Descent + Arrival

06-10

R Tip: use isTRUE()

06-11

R Tip: use isTRUE()

06-11

Take These 7 Small Steps To Make a Big Career Move

06-11

Apache Spark Introduction for Beginners

10-18

If you did not already know

08-15

Take These 7 Small Steps To Make a Big Career Move

06-11

Take These 7 Small Steps To Make a Big Career Move

06-11

Multithreaded in the Wild

12-03

Multithreaded in the Wild

06-11

Multithreaded in the Wild

12-03

Free Machine Learning Textbook

12-01

Multithreaded in the Wild

06-11

Multithreaded in the Wild

12-03

Multithreaded in the Wild

06-11

Highlights of NAACL-HLT 2018: Generalization, Test-of-time, and Dialogue Systems

06-12

Highlights of NAACL-HLT 2018: Generalization, Test-of-time, and Dialogue Systems

06-12

Highlights of NAACL-HLT 2018: Generalization, Test-of-time, and Dialogue Systems

06-12

Overview and benchmark of traditional and deep learning models in text classification

06-12

From Gaussian Algebra to Gaussian Processes, Part 2

06-12

From Gaussian Algebra to Gaussian Processes, Part 2

06-12

wrapr 1.5.0 available on CRAN

06-13

wrapr 1.5.0 available on CRAN

06-13

wrapr 1.5.0 available on CRAN

06-13

Python Generators Tutorial

06-13

Python Generators Tutorial

06-13

Data Science in 30 Minutes: Holden Karau – A Quick Introduction to PySpark

06-13

Data Notes: Malaria Detection with FastAI

01-03

Designing a Self-Learning Tic-Tac-Toe Player

11-29

Data Notes: Impact of Game of Thrones on US Baby Names

11-15

Data Notes: Chinese Tourism's Impact on Taiwan

11-01

New Course: Visualization Best Practices in R

10-19

R Packages worth a look

10-18

Data Notes: The Secret of Academic Success

10-17

Data Notes: Are Those Honey Bees Healthy?

10-04

Advantages of Online Data Science Courses

09-26

A psychology researcher uses Stan, multiverse, and open data exploration to explore human memory

09-21

Data Notes: How Do Autoencoders Work?

09-20

Data Notes: The Secret to Getting to a Second Date

09-06

Data Notes: Drought and the War in Syria

08-23

Data Notes: From Hate Speech to Russian Troll Tweets

08-09

Data Notes: Winning Solutions of Kaggle Competitions

07-26

Data Notes: How to Forecast the S&P 500 with Prophet

07-12

Data Notes: Your smartphone knows *what*?

06-28

Data Notes: Predict the World Cup 2018 Winner

06-14

R Packages worth a look

12-16

Apps gather your location and then sell the data

12-13

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: December and Beyond

12-04

David Brooks discovers Red State Blue State Rich State Poor State!

10-16

✚ Detailed Intentions of a Map, When Everything Leads to Nothing, Designing for Misinterpretations

08-09

Data Notes: Predict the World Cup 2018 Winner

06-14

Network Centrality in R: New ways of measuring Centrality

12-12

Redmonk Language Rankings, June 2018

08-10

Predicting World Cup dark horses from press coverage using the AYLIEN News API – Monthly Media Roundup

06-15

Predicting World Cup dark horses from press coverage using the AYLIEN News API – Monthly Media Roundup

06-15

PNAS forgets basic principles of game theory, thus dooming thousands of Bothans to the fate of Alderaan

07-04

Predicting World Cup dark horses from press coverage using the AYLIEN News API – Monthly Media Roundup

06-15

Predicting World Cup dark horses from press coverage using the AYLIEN News API – Monthly Media Roundup

06-15

R Tip: Be Wary of “…”

06-15

R Tip: Be Wary of “…”

06-15

U.S. Open Data — Gathering and Understanding the Data from 2018 Shinnecock

06-15

U.S. Open Data — Gathering and Understanding the Data from 2018 Shinnecock

06-15

U.S. Open Data — Gathering and Understanding the Data from 2018 Shinnecock

06-15

Top KDnuggets tweets, Nov 14-20: 10 Free Must-See Courses for Machine Learning and Data Science; Great list of

11-21

Echo Chamber Incites Online Mob to Attack Math Profs

09-14

5 Tips To Learn Machine Learning

06-17

Docstrings in open source Python

06-18

Docstrings in open source Python

06-18

Docstrings in open source Python

06-18

BDD100K Blog Update

06-18

BDD100K Blog Update

06-18

BDD100K Blog Update

06-18

Interactive panel EDA with 3 lines of code

12-09

A Three Month Data Analysis in Excel Could Have Taken Me One Day

10-01

Profiling Top Kagglers: Martin Henze (AKA Heads or Tails), World's First Kernels Grandmaster

06-19

The Role of Resources in Data Analysis

06-18

Import AI

06-18

Entering and Exiting 2018

01-02

Miami University: Director of the Center for Analytics & Data Science (CADS) [Oxford, OH]

12-20

Distilled News

08-30

A Certification for R Package Quality

07-30

Import AI:

07-16

Import AI:

07-09

AI Lab: Learn to Code with the Cutting-Edge Microsoft AI Platform

06-19

Pivoted document length normalisation

06-19

Pivoted document length normalisation

06-19

Is it Time to Regulate Bitcoin?

06-19

Why Machine Learning Interpretability Matters

12-04

Is it Time to Regulate Bitcoin?

06-19

Why Machine Learning Interpretability Matters

12-04

Is it Time to Regulate Bitcoin?

06-19

Document worth reading: “Model-free, Model-based, and General Intelligence”

08-10

Sent2Vec: An unsupervised approach towards learning sentence embeddings

06-19

Document worth reading: “Deep Learning for Generic Object Detection: A Survey”

10-10

If you did not already know

10-05

Sent2Vec: An unsupervised approach towards learning sentence embeddings

06-19

Sent2Vec: An unsupervised approach towards learning sentence embeddings

06-19

Sent2Vec: An unsupervised approach towards learning sentence embeddings

06-19

Best Data Visualization Projects of 2018

12-27

Morph, an open-source tool for data-driven art without code

09-26

Profiling Top Kagglers: Martin Henze (AKA Heads or Tails), World's First Kernels Grandmaster

06-19

Profiling Top Kagglers: Martin Henze (AKA Heads or Tails), World's First Kernels Grandmaster

06-19

Deep Reinforcement Learning in Action (Announcement)

06-20

Deep Reinforcement Learning in Action (Announcement)

06-20

Automated Web Scraping in R

12-11

Niall Ferguson and the perils of playing to your audience

12-05

“Statistical insights into public opinion and politics” (my talk for the Columbia Data Science Society this Wed 9pm)

12-04

“Law professor Alan Dershowitz’s new book claims that political differences have lately been criminalized in the United States. He has it wrong. Instead, the orderly enforcement of the law has, ludicrously, been framed as political.”

11-12

David Brooks discovers Red State Blue State Rich State Poor State!

10-16

David Weakliem points out that both economic and cultural issues can be more or less “moralized.”

10-03

What do you do when someone says, “The quote is, this is the exact quote”—and then misquotes you?

09-30

A Study Of Reddit Politics

06-20

Echo Chamber Incites Online Mob to Attack Math Profs

09-14

A Study Of Reddit Politics

06-20

On Tensor Networks and the Nature of Non-Linearity

06-20

On Tensor Networks and the Nature of Non-Linearity

06-20

Opinion mining on Dutch news articles

06-20

Opinion mining on Dutch news articles

06-20

Scalable multi-node training with TensorFlow

12-17

How to Do Distributed Deep Learning for Object Detection Using Horovod on Azure

06-20

Optimism corrected bootstrapping: a problematic method

12-25

Learning R: A gentle introduction to higher-order functions

12-14

Interactive Graphics with R Shiny

11-23

Those “other” apply functions…

11-13

More on sigr

11-06

automl package: part 2/2 first steps how to

10-24

How To Remotely Send R and Python Execution to SQL Server from Jupyter Notebooks

07-17

Top 12 Essential Command Line Tools for Data Scientists

06-20

Top 12 Essential Command Line Tools for Data Scientists

06-20

John Mount speaking on rquery and rqdatatable

07-11

Big News: vtreat 1.2.0 is Available on CRAN, and it is now Big Data Capable

06-20

John Mount speaking on rquery and rqdatatable

07-11

Big News: vtreat 1.2.0 is Available on CRAN, and it is now Big Data Capable

06-20

Published in 2018

01-03

5 Reasons Why You Should Use Cross-Validation in Your Data Science Projects

10-02

Using Stacking to Average Bayesian Predictive Distributions (with Discussion)

09-26

Big News: vtreat 1.2.0 is Available on CRAN, and it is now Big Data Capable

06-20

vtreat Variable Importance

12-18

vtreat Variable Importance

12-18

More Practical Data Science with R Book News

08-19

Big News: vtreat 1.2.0 is Available on CRAN, and it is now Big Data Capable

06-20

Big News: vtreat 1.2.0 is Available on CRAN, and it is now Big Data Capable

06-20

The Impact of Bitcoin on the Insurance Industry

06-21

Open Source Datasets with Kaggle

06-21

Add Constrained Optimization To Your Toolbelt

06-21

Use Amazon Mechanical Turk with Amazon SageMaker for supervised learning

08-02

Amelia, it was just a false alarm

07-31

Add Constrained Optimization To Your Toolbelt

06-21

Top-Down vs. Bottom-Up Approaches to Data Science

07-10

How I built a receipt chatbot over a weekend

06-23

How I built a receipt chatbot over a weekend

06-23

How I built a receipt chatbot over a weekend

06-23

Last academic results

06-23

Last academic results

06-23

What Is Machine Learning and How Is It Making Our World a Better Place?

06-23

What Is Machine Learning and How Is It Making Our World a Better Place?

06-23

Classify your own images using Amazon SageMaker

07-20

Building a Diabetic Retinopathy Prediction Application using Azure Machine Learning

06-25

Import AI:

06-25

Import AI:

06-25

Supercharging Classification - The Value of Multi-task Learning

06-26

DIY AI for the Future

06-27

Physics-Based Learned Design: Teaching a Microscope How to Image

11-26

If you did not already know

10-08

Being at the Center

09-07

What Data Scientists should focus on in 2018?

06-27

Can Lessons from Data Science Help Journalism?

06-27

Data Notes: Malaria Detection with FastAI

01-03

Designing a Self-Learning Tic-Tac-Toe Player

11-29

Data Notes: Impact of Game of Thrones on US Baby Names

11-15

Pulse of the Competition: November Edition

11-02

Data Notes: Chinese Tourism's Impact on Taiwan

11-01

Data Notes: The Secret of Academic Success

10-17

Data Notes: Are Those Honey Bees Healthy?

10-04

Data Notes: How Do Autoencoders Work?

09-20

Winner Interview | Particle Tracking Challenge first runner-up, Pei-Lien Chou

09-14

Data Notes: The Secret to Getting to a Second Date

09-06

Data Notes: Drought and the War in Syria

08-23

Data Notes: Winning Solutions of Kaggle Competitions

07-26

Data Notes: Your smartphone knows *what*?

06-28

Announcement – The Data Incubator Partnership with MRI Network

06-28

Announcement – The Data Incubator Partnership with MRI Network

06-28

R Packages worth a look

01-04

Machine Learning Explainability vs Interpretability: Two concepts that could help restore trust in AI

12-20

Document worth reading: “A Theory of Diagnostic Interpretation in Supervised Classification”

12-08

Explainable Artificial Intelligence (Part 2) – Model Interpretation Strategies

12-06

Interpretability is crucial for trusting AI and machine learning

12-01

Improving model interpretability with LIME

10-31

Explainable ML versus Interpretable ML

10-30

Slides from my talk at the R-Ladies Meetup about Interpretable Deep Learning with R, Keras and LIME

10-17

Understanding Latent Style

06-28

Document worth reading: “A Theory of Diagnostic Interpretation in Supervised Classification”

12-08

Understanding Latent Style

06-28

Dexterous Manipulation with Reinforcement Learning: Efficient, General, and Low-Cost

08-31

One-Shot Imitation from Watching Videos

06-28

Sequence labeling with semi-supervised multi-task learning

06-29

crfsuite for natural language processing

10-29

Named Entity Recognition and Classification with Scikit-Learn

10-25

Sequence labeling with semi-supervised multi-task learning

06-29

Sequence labeling with semi-supervised multi-task learning

06-29

Computability, Complexity, & Algorithms Part 1

06-29

Computability, Complexity, & Algorithms Part 1

06-29

Computability, Complexity, & Algorithms Part 1

06-29

Timing Grouped Mean Calculation in R

12-08

Chocolate milk! Another stunning discovery from an experiment on 24 people!

11-13

Computability, Complexity, & Algorithms Part 1

06-29

Deep Learning Vendor Update: Hyperparameter Tuning Systems

06-29

Deep Learning Vendor Update: Hyperparameter Tuning Systems

06-29

If you did not already know

09-22

If you did not already know

08-17

Data science books - theory and practice

06-29

Keras vs PyTorch:谁是「第一」深度学习框架?

06-30

Keras vs PyTorch:谁是「第一」深度学习框架?

06-30

Cultural Differences in Map Data Visualization

06-30

Cultural Differences in Map Data Visualization

06-30

Cultural Differences in Map Data Visualization

06-30

Cultural Differences in Map Data Visualization

06-30

Boost Computation Power and Speed with Snowflake

07-02

Dataiku Series C: New Year, New Chapter

12-19

Data Tools We're Thankful For

11-22

Driving Success through Business Insight, One Customer at a Time

11-21

Egg-Not-Egg Deep Learning Model

11-08

7 Awesome Things You Can Do in Dataiku Without Coding

11-02

Dataiku: “Multimodal Force Majeure” Among Predictive Analytics & ML Platforms

09-18

Dataiku 5.0: Enterprise AI Within Reach

09-12

Data Science at Scale: Six Major Trends

07-05

Boost Computation Power and Speed with Snowflake

07-02

R Packages worth a look

12-04

Graph-Powered Machine Learning

12-03

Boost Computation Power and Speed with Snowflake

07-02

seplyr 0.5.8 Now Available on CRAN

07-02

seplyr 0.5.8 Now Available on CRAN

07-02

Machine Learning Toronto SummitNov 20-21 – Special KDnuggets discount

11-12

seplyr 0.5.8 Now Available on CRAN

07-02

The Ponzi threshold and the Armstrong principle

07-02

The Ponzi threshold and the Armstrong principle

07-02

The Ponzi threshold and the Armstrong principle

07-02

Automated Email Reports with R

11-01

R Packages worth a look

10-04

Poor Customer Service

07-12

Reply-all loop

07-03

Reply-all loop

07-03

Four Techniques for Outlier Detection

12-06

Reply-all loop

07-03

Reply-all loop

07-03

Flaws in stupid horrible algorithm revealed because it made numerical predictions

07-03

Flaws in stupid horrible algorithm revealed because it made numerical predictions

07-03

Flaws in stupid horrible algorithm revealed because it made numerical predictions

07-03

Flaws in stupid horrible algorithm revealed because it made numerical predictions

07-03

The Golden Rule of Nudge

10-10

About that claim in the NYT that the immigration issue helped Hillary Clinton? The numbers don’t seem to add up.

07-03

About that claim in the NYT that the immigration issue helped Hillary Clinton? The numbers don’t seem to add up.

07-03

2018: How did people actually vote? (The real story, not the exit polls.)

11-16

About that claim in the NYT that the immigration issue helped Hillary Clinton? The numbers don’t seem to add up.

07-03

Here’s How to Survive the Rise of A.I. – Become a Data Facilitator

07-03

InformationAge: Will 2019 See the Automation of Automation and Push Up Salaries of Data Scientists?

12-11

You Can’t Do AI Without Augmented Analytics and AutoML

11-26

If you did not already know

11-15

Here’s How to Survive the Rise of A.I. – Become a Data Facilitator

07-03

Basic Statistics in Python: Descriptive Statistics

07-03

A few upcoming R conferences

10-05

A few upcoming R conferences

10-05

SatRdays Cardiff

07-04

SatRdays Cardiff

07-04

SatRdays Cardiff

07-04

SatRdays Cardiff

07-04

Does batting order matter in Major League Baseball? A simulation approach

07-04

Does batting order matter in Major League Baseball? A simulation approach

07-04

Does batting order matter in Major League Baseball? A simulation approach

07-04

Using WSL Linux on Windows 10 for Deep Learning Development.

07-04

Search labels and IDs from IAB-QAG and IPTC Subject Codes taxonomies

11-01

High-profile statistical errors occur in the physical sciences too, it’s not just a problem in social science.

09-15

PNAS forgets basic principles of game theory, thus dooming thousands of Bothans to the fate of Alderaan

07-04

PNAS forgets basic principles of game theory, thus dooming thousands of Bothans to the fate of Alderaan

07-04

PNAS forgets basic principles of game theory, thus dooming thousands of Bothans to the fate of Alderaan

07-04

PNAS forgets basic principles of game theory, thus dooming thousands of Bothans to the fate of Alderaan

07-04

Build this media monitoring Slack bot in 20 minutes without writing code

07-04

Tutorial: The practical application of complicated statistical methods to fill up the scientific literature with confusing and irrelevant analyses

07-05

Tutorial: The practical application of complicated statistical methods to fill up the scientific literature with confusing and irrelevant analyses

07-05

Tutorial: The practical application of complicated statistical methods to fill up the scientific literature with confusing and irrelevant analyses

07-05

Tutorial: The practical application of complicated statistical methods to fill up the scientific literature with confusing and irrelevant analyses

07-05

An Intro to Deep Learning in Python

12-06

Tutorial: The practical application of complicated statistical methods to fill up the scientific literature with confusing and irrelevant analyses

07-05

Data Science at Scale: Six Major Trends

07-05

See How AI is Inspiring the Next Generation of Developers

09-05

Data Science at Scale: Six Major Trends

07-05

On this 4th of July, let’s declare independence from “95%”

07-05

Awesome MCMC animation site by Chi Feng! On Github!

07-26

On this 4th of July, let’s declare independence from “95%”

07-05

5 Alternatives to the Default R Outputs for GLMs and Linear Models

10-17

A Real World Reinforcement Learning Research Program

07-06

The Evolution of Build Engineering in Managing Open Source [Webinar Replay]

11-13

SQL, Python, & R in One Platform

10-26

What I’ve learned from competing in machine learning contests on Kaggle

07-06

R Packages worth a look

01-11

SQL, Python, & R in One Platform

10-26

If you did not already know

09-28

What I’ve learned from competing in machine learning contests on Kaggle

07-06

High-profile statistical errors occur in the physical sciences too, it’s not just a problem in social science.

09-15

Problems in a published article on food security in the Lower Mekong Basin

08-23

“Bayesian Meta-Analysis with Weakly Informative Prior Distributions”

07-13

All of Life is 6 to 5 Against

07-06

All of Life is 6 to 5 Against

07-06

All of Life is 6 to 5 Against

07-06

All of Life is 6 to 5 Against

07-06

FIFA WC 2018: Quarter Final Stage Preditions

07-06

FIFA WC 2018: Quarter Final Stage Preditions

07-06

How quickly do stock market valuations revert back to their means?

10-28

FIFA WC 2018: Quarter Final Stage Preditions

07-06

The Economist’s Big Mac Index is calculated with R

10-12

In case you missed it: July 2018 roundup

08-08

FIFA WC 2018: Quarter Final Stage Preditions

07-06

8 Data Science Projects to Build your Portfolio

12-11

8 Data Science Projects to Build your Portfolio

12-03

Growth of Subreddits

10-30

From the Sidewalk to the Saddle: Data and the Tour de France

07-06

From the Sidewalk to the Saddle: Data and the Tour de France

07-06

Hey—take this psychological science replication quiz!

09-02

From the Sidewalk to the Saddle: Data and the Tour de France

07-06

He wants to know what to read and what software to learn, to increase his ability to think about quantitative methods in social science

07-07

The most practical causal inference book I’ve read (is still a draft)

12-24

The causal hype ratchet

12-21

The Blessings of Multiple Causes: Causal Inference when you Can't Measure Confounders

09-07

Distilled News

08-11

Distilled News

08-02

He wants to know what to read and what software to learn, to increase his ability to think about quantitative methods in social science

07-07

He wants to know what to read and what software to learn, to increase his ability to think about quantitative methods in social science

07-07

He wants to know what to read and what software to learn, to increase his ability to think about quantitative methods in social science

07-07

Lenny Dykstra, His Strike Zone, & Bayesian Stats

07-08

Lenny Dykstra, His Strike Zone, & Bayesian Stats

07-08

Lenny Dykstra, His Strike Zone, & Bayesian Stats

07-08

Lenny Dykstra, His Strike Zone, & Bayesian Stats

07-08

Lenny Dykstra, His Strike Zone, & Bayesian Stats

07-08

2018: Who actually voted? (The real story, not the exit polls.)

11-10

I think they use witchcraft

07-08

I think they use witchcraft

07-08

I think they use witchcraft

07-08

I think they use witchcraft

07-08

R Packages worth a look

09-20

Design Patterns for Production NLP Systems

07-09

Data Science Project Style Guide

07-09

If you did not already know

10-15

Feature-wise transformations

07-09

If you did not already know

10-15

If you did not already know

09-02

Feature-wise transformations

07-09

If you did not already know

10-15

R Packages worth a look

10-06

If you did not already know

09-02

Feature-wise transformations

07-09

Machine Learning & AI Main Developments in 2018 and Key Trends for 2019

12-11

AI, Machine Learning and Data Science Announcements from Microsoft Ignite

10-02

Import AI:

07-16

Import AI:

07-09

Joint inference or modular inference? Pierre Jacob, Lawrence Murray, Chris Holmes, Christian Robert discuss conditions on the strength and weaknesses of these choices

07-09

DATAx Presents: AI AND MACHINE LEARNING TRENDS IN 2019

12-06

Turn data into revenue. Wharton can show you how.

11-06

Machine Learning Making Big Moves in Marketing

07-30

Top-Down vs. Bottom-Up Approaches to Data Science

07-10

He wants to model a proportion given some predictors that sum to 1

07-10

Sudoku Solver

12-30

U. of Zurich: Professorship in Big Data Science (Open Rank) [Zurich, Switzerland]

10-24

U. of Zurich: Assistant Professorship in AI and Machine Learning (Non-tenure Track) [Zurich, Switzerland]

10-24

Temple University: Faculty Positions (Assistant/Associate/Full Professor) [Philadelphia, PA]

10-12

Colorado State University: Assistant Professor in Industrial and Organizational (IO) Psychology [Fort Collins, CO]

10-05

It should be ok to just publish the data.

08-15

It should be ok to just publish the data.

08-15

He wants to model a proportion given some predictors that sum to 1

07-10

Exercise and weight loss: long-term follow-up

07-10

Exercise and weight loss: long-term follow-up

07-10

Exercise and weight loss: long-term follow-up

07-10

Remembering Michael

10-08

3-D-Printed Time Series Plates

08-23

BD reviews

07-11

Do Bayesians Overfit?

07-11

Do Bayesians Overfit?

07-11

John Mount speaking on rquery and rqdatatable

07-11

Preparing for the Data Science Job Hunt

07-11

Predictive Analytics in 2018: Salaries & Industry Shifts

11-19

Preparing for the Data Science Job Hunt

07-11

8 Reasons to Take Data Analytics Certification Courses

11-28

Preparing for the Data Science Job Hunt

07-11

Should the points in this scatterplot be binned?

07-11

Should the points in this scatterplot be binned?

07-11

Should the points in this scatterplot be binned?

07-11

Poor Customer Service

07-12

Poor Customer Service

07-12

Automated Email Reports with R

11-01

Poor Customer Service

07-12

Poor Customer Service

07-12

Harmonizing and emojifying our GitHub issue trackers

07-12

Announcing Kaggle integration with Google Data Studio

12-05

What is the Best Python IDE for Data Science?

11-14

Where do I learn about log_sum_exp, log1p, lccdf, and other numerical analysis tricks?

07-12

Personal Data Analytics

12-10

Announcing Kaggle integration with Google Data Studio

12-05

What is the Best Python IDE for Data Science?

11-14

Where do I learn about log_sum_exp, log1p, lccdf, and other numerical analysis tricks?

07-12

Where do I learn about log_sum_exp, log1p, lccdf, and other numerical analysis tricks?

07-12

R Packages worth a look

09-20

FIFA WC 2018: Semi-Finals and the Final

07-12

FIFA WC 2018: Semi-Finals and the Final

07-12

FIFA WC 2018: Semi-Finals and the Final

07-12

The persistence of bad reporting and the reluctance of people to criticize it

07-12

The persistence of bad reporting and the reluctance of people to criticize it

07-12

The persistence of bad reporting and the reluctance of people to criticize it

07-12

Teaching R to New Users - From tapply to the Tidyverse

07-12

If you did not already know

01-05

Document worth reading: “Fractal AI: A fragile theory of intelligence”

10-22

R Packages worth a look

09-06

Teaching R to New Users - From tapply to the Tidyverse

07-12

Teaching R to New Users - From tapply to the Tidyverse

07-12

lmer vs INLA for variance components

11-24

“Bayesian Meta-Analysis with Weakly Informative Prior Distributions”

07-13

“Do you have any recommendations for useful priors when datasets are small?”

12-11

Against Arianism 2: Arianism Grande

09-12

“Bayesian Meta-Analysis with Weakly Informative Prior Distributions”

07-13

Simulating simple dice games by @ellis2013nz

10-26

Slot Machines

10-15

Help improve lives through Machine Learning by joining the AWS DeepLens Challenge!

09-25

RAIN Project: evolution of the game development dream

07-13

RAIN Project: evolution of the game development dream

07-13

RAIN Project: evolution of the game development dream

07-13

Co-localization analysis of fluorescence microscopy images

11-27

Fringe FM conversation on AI Ethics

10-30

SatRday talks recordings

10-17

Magister Dixit

08-09

Join "Data School Insiders" on Patreon

07-13

Join "Data School Insiders" on Patreon

07-13

Join "Data School Insiders" on Patreon

07-13

Join "Data School Insiders" on Patreon

07-13

Join "Data School Insiders" on Patreon

07-13

What happens to your career when you have to retract a paper?

07-14

What happens to your career when you have to retract a paper?

07-14

What happens to your career when you have to retract a paper?

07-14

Mounting multiple data and outputs volumes

07-15

CRN: The 10 Coolest Machine-Learning And AI Startups Of 2018 (So Far)

07-16

The “Carl Sagan effect”

07-16

The “Carl Sagan effect”

07-16

R Packages worth a look

12-23

R Packages worth a look

12-18

R Packages worth a look

12-05

R Packages worth a look

11-11

R Packages worth a look

09-30

R Packages worth a look

09-19

R Packages worth a look

08-30

R Packages worth a look

08-03

The “Carl Sagan effect”

07-16

Import AI:

07-16

Mister P wins again

07-16

Mister P wins again

07-16

Mister P wins again

07-16

Mister P wins again

07-16

Mister P wins again

07-16

Scanning Office 365 documents

07-16

Video: R for AI, and the Not Hotdog workshop

07-17

Video: R for AI, and the Not Hotdog workshop

07-17

The statistical checklist: Could there be a list of guidelines to help analysts do better work?

07-17

The statistical checklist: Could there be a list of guidelines to help analysts do better work?

07-17

Announcing the ultimate seminar speaker contest: 2019 edition!

01-06

Transcribe speech in three new languages: French, Italian, and Brazilian Portuguese

12-21

2018-13 Rendering HTML Content in R Graphics

12-16

Amazon Transcribe now supports real-time transcriptions

11-20

Where that title came from

07-20

The statistical checklist: Could there be a list of guidelines to help analysts do better work?

07-17

Advanced News API search: leveraging DBpedia entity types

12-11

Introduction to Named Entity Recognition

12-11

What is “party balancing” and how does it explain midterm elections?

08-08

Using Entity-level Sentiment Analysis to understand News Content

07-30

Model Updates: Entity-level Sentiment Analysis and Brand New Entity Extraction Models Now Live in the Text Analysis API

07-17

Model Updates: Entity-level Sentiment Analysis and Brand New Entity Extraction Models Now Live in the Text Analysis API

07-17

Why AI Isn’t A Black Box (And Its Business Value)

07-17

Data-based ways of getting a job

07-18

Data-based ways of getting a job

07-18

Data-based ways of getting a job

07-18

Data-based ways of getting a job

07-18

Highlights from the useR! 2018 conference in Brisbane

07-18

Highlights from the useR! 2018 conference in Brisbane

07-18

The file drawer’s on fire!

07-30

“For professional baseball players, faster hand-eye coordination linked to batting performance”

07-18

How a meme grew into a campaign slogan

11-05

The file drawer’s on fire!

07-30

“For professional baseball players, faster hand-eye coordination linked to batting performance”

07-18

Python数据分析之pandas

07-18

Basic Statistics in Python: Probability

07-18

If you have a measure, it will be gamed (politics edition).

07-18

If you have a measure, it will be gamed (politics edition).

07-18

If you have a measure, it will be gamed (politics edition).

07-18

If you have a measure, it will be gamed (politics edition).

07-18

If you have a measure, it will be gamed (politics edition).

07-18

Authority figures in psychology spread more happy talk, still don’t get the point that much of the published, celebrated, and publicized work in their field is no good (Part 2)

12-31

“The idea of replication is central not just to scientific practice but also to formal statistics . . . Frequentist statistics relies on the reference set of repeated experiments, and Bayesian statistics relies on the prior distribution which represents the population of effects.”

07-19

“The idea of replication is central not just to scientific practice but also to formal statistics . . . Frequentist statistics relies on the reference set of repeated experiments, and Bayesian statistics relies on the prior distribution which represents the population of effects.”

07-19

“The idea of replication is central not just to scientific practice but also to formal statistics . . . Frequentist statistics relies on the reference set of repeated experiments, and Bayesian statistics relies on the prior distribution which represents the population of effects.”

07-19

Learn to R blog series - Operators and Objects

07-19

Learn to R blog series - Operators and Objects

07-19

Where that title came from

07-20

Where that title came from

07-20

Where that title came from

07-20

Scalable multi-node deep learning training using GPUs in the AWS Cloud

07-20

A hex sticker wall, created with R

07-20

A hex sticker wall, created with R

07-20

2018-13 Rendering HTML Content in R Graphics

12-16

A hex sticker wall, created with R

07-20

图像特征提取(纹理特征)

07-20

图像特征提取(纹理特征)

07-20

图像特征提取(纹理特征)

07-20

图像特征提取(纹理特征)

07-20

“A Headline That Will Make Global-Warming Activists Apoplectic”

07-21

“A Headline That Will Make Global-Warming Activists Apoplectic”

07-21

“A Headline That Will Make Global-Warming Activists Apoplectic”

07-21

“A Headline That Will Make Global-Warming Activists Apoplectic”

07-21

“A Headline That Will Make Global-Warming Activists Apoplectic”

07-21

Power your website with on-demand translated reviews using Amazon Translate

12-20

Introducing Amazon Translate Custom Terminology

11-28

Amazon Translate now offers 113 new language pairs

10-29

Create a translator chatbot using Amazon Translate and Amazon Lex

08-22

Create video subtitles with translation using machine learning

08-10

The Real Problems with Neural Machine Translation

07-21

Of statistics class and judo class: Beyond the paradigm of sequential education

07-22

Of statistics class and judo class: Beyond the paradigm of sequential education

07-22

Of statistics class and judo class: Beyond the paradigm of sequential education

07-22

Defining data science in 2018

07-22

LSTM的神奇之处

08-10

GBM

08-10

DeepLearning-Github排行

07-22

LSTM的神奇之处

08-10

GBM

08-10

DeepLearning-Github排行

07-22

DeepLearning-Github排行

07-22

DeepLearning-Github排行

07-22

DeepLearning-Github排行

07-22

AWS Deep Learning AMIs now with optimized TensorFlow 1.9 and Apache MXNet 1.2 with Keras 2 support to accelerate deep learning on Amazon EC2 instances

07-23

4 Reasons Santa Needs Machine Learning & AI

12-24

Help us understand your Data Science goals!

11-13

Journals and refereeing: toward a new equilibrium

07-25

Year 3 of Data, Beer, & Inspiration

07-23

Distilled News

12-09

WPI: Research Scientist [Worcester, MA]

11-30

WPI: Post-Doctoral Fellow [Worcester, MA]

11-21

Import AI 121: Sony researchers make ultra-fast ImageNet training breakthrough; Berkeley researchers tackle StarCraft II with modular RL system; and Germany adds €3bn for AI research

11-19

Import AI:

07-23

Top Python Libraries in 2018 in Data Science, Deep Learning, Machine Learning

12-19

Gender Diversity in the R and Python Communities

12-05

Gender Diversity in the R and Python Communities

12-05

Top 10 Python Data Science Libraries

11-16

Distilled News

11-04

Top 13 Python Deep Learning Libraries

11-02

Top 8 Python Machine Learning Libraries

10-09

Top 20 Python AI and Machine Learning Open Source Projects

07-23

BIG, small or Right Data: Which is the proper focus?

10-08

How to scrape data from a website using Python

09-07

Top 20 Python AI and Machine Learning Open Source Projects

07-23

When wife earns more than husband, they report a lesser gap

07-23

When wife earns more than husband, they report a lesser gap

07-23

When wife earns more than husband, they report a lesser gap

07-23

Rcpp now used by 1500 CRAN packages

11-15

When wife earns more than husband, they report a lesser gap

07-23

A quick tour of AI services in Azure

07-24

A quick tour of AI services in Azure

07-24

Most Common Jobs, By State

07-24

Most Common Jobs, By State

07-24

Most Common Jobs, By State

07-24

Most Common Jobs, By State

07-24

Most Common Jobs, By State

07-24

New Research on Multi-Task Learning

07-24

New Research on Multi-Task Learning

07-24

Deploy your own TensorFlow object detection model to AWS DeepLens

09-27

Deploy a TensorFlow trained image classification model to AWS DeepLens

08-15

The AWS DeepLens Inclusivity Challenge submission period extended to 8/19

07-24

The AWS DeepLens Inclusivity Challenge submission period extended to 8/19

07-24

The AWS DeepLens Inclusivity Challenge submission period extended to 8/19

07-24

The AWS DeepLens Inclusivity Challenge submission period extended to 8/19

07-24

Recently in the sister blog

07-24

Recently in the sister blog

07-24

Did she really live 122 years?

01-08

Journals and refereeing: toward a new equilibrium

07-25

Serve yourself. The Next-Generation of Data Analytics. Dec 6 Webinar

11-29

Journals and refereeing: toward a new equilibrium

07-25

Advice on soft skills for academics

07-25

Surprise-hacking: “the narrative of blindness and illusion sells, and therefore continues to be the central thesis of popular books written by psychologists and cognitive scientists”

12-16

A small logical change with big impact

10-16

A small logical change with big impact

10-16

A small logical change with big impact

10-16

Document worth reading: “A Temporal Difference Reinforcement Learning Theory of Emotion: unifying emotion, cognition and adaptive behavior”

08-07

Advice on soft skills for academics

07-25

Philip Roth (4) vs. DJ Jazzy Jeff; Jim Thorpe advances

01-08

Advice on soft skills for academics

07-25

AWS Deep Learning AMIs now include ONNX, enabling model portability across deep learning frameworks

07-26

AWS Deep Learning AMIs now include ONNX, enabling model portability across deep learning frameworks

07-26

New Year's Resolutions 2019

01-01

Whistler, British Columbia

07-26

Whistler, British Columbia

07-26

Whistler, British Columbia

07-26

Grazing and Calculus Revisited

07-26

Grazing and Calculus Revisited

07-26

Grazing and Calculus Revisited

07-26

Grazing and Calculus Revisited

07-26

How to think about an accelerating string of research successes?

07-26

How to think about an accelerating string of research successes?

07-26

How to think about an accelerating string of research successes?

07-26

Parsimonious principle vs integration over all uncertainties

07-26

R Packages worth a look

08-19

Parsimonious principle vs integration over all uncertainties

07-26

Awesome MCMC animation site by Chi Feng! On Github!

07-26

Awesome MCMC animation site by Chi Feng! On Github!

07-26

Data Notes: Winning Solutions of Kaggle Competitions

07-26

ACL 2018 Highlights: Understanding Representations and Evaluation in More Challenging Settings

07-26

ACL 2018 Highlights: Understanding Representations and Evaluation in More Challenging Settings

07-26

ACL 2018 Highlights: Understanding Representations and Evaluation in More Challenging Settings

07-26

Document worth reading: “Attribute-aware Collaborative Filtering: Survey and Classification”

10-23

Document worth reading: “A Reliability Theory of Truth”

08-02

ACL 2018 Highlights: Understanding Representations and Evaluation in More Challenging Settings

07-26

First Data Project? Go Tandem! (AVISIA at Play)

07-27

First Data Project? Go Tandem! (AVISIA at Play)

07-27

Document worth reading: “Machine Learning in Official Statistics”

01-11

Researchers.one: A souped-up Arxiv with pre- and post-publication review

09-10

Thoughts On Machine Learning Accuracy

07-27

Thoughts On Machine Learning Accuracy

07-27

Aella Credit empowers underbanked individuals by using Amazon Rekognition for identity verification

08-15

Amazon Rekognition is now available in the Asia Pacific (Seoul) and Asia Pacific (Mumbai) Regions

08-09

Thoughts On Machine Learning Accuracy

07-27

Why I Indent My Code 8 Spaces

07-27

Why I Indent My Code 8 Spaces

07-27

Why I Indent My Code 8 Spaces

07-27

Ensure consistency in data processing code between training and inference in Amazon SageMaker

01-11

Amazon SageMaker Automatic Model Tuning now supports early stopping of training jobs

12-13

New Features For Amazon SageMaker: Workflows, Algorithms, and Accreditation

11-21

Amazon SageMaker Automatic Model Tuning becomes more efficient with warm start of hyperparameter tuning jobs

11-19

Easily monitor and visualize metrics while training models on Amazon SageMaker

11-19

Direct access to Amazon SageMaker notebooks from Amazon VPC by using an AWS PrivateLink endpoint

11-06

Customize your notebook volume size, up to 16 TB, with Amazon SageMaker

11-06

Limit access to a Jupyter notebook instance by IP address

09-14

Amazon SageMaker runtime now supports the CustomAttributes header

08-31

Deploy a TensorFlow trained image classification model to AWS DeepLens

08-15

Securing all Amazon SageMaker API calls with AWS PrivateLink

08-14

Transfer learning for custom labels using a TensorFlow container and “bring your own algorithm” in Amazon SageMaker

07-27

Because it's Friday: Street Orientation

07-27

If you did not already know

12-30

Because it's Friday: Street Orientation

07-27

Because it's Friday: Street Orientation

07-27

Keynote at EuroPython 2018 on “Citizen Science”

07-27

Keynote at EuroPython 2018 on “Citizen Science”

07-27

The SIAM Book Series on Data Science

01-11

MS in Applied Data Science Online – which track is right for you?

01-10

French Baccalaureate Results

01-08

Niall Ferguson and the perils of playing to your audience

12-05

Yeshiva University: Data Science Program Director [New York, NY]

11-30

How Data Science Is Improving Higher Education

11-01

Data + Art STEAM Project: Final Results

10-30

How DataCamp Handles Course Quality

10-25

Present each others’ posters

10-06

Discussion of effects of growth mindset: Let’s not demand unrealistic effect sizes.

09-13

The Benefits of Active Learning for Data Science Skills

09-12

What makes Robin Pemantle’s bag of tricks for teaching math so great?

07-27

What makes Robin Pemantle’s bag of tricks for teaching math so great?

07-27

aRt with code

07-27

aRt with code

07-27

Very shiny holidays!

12-26

Highlights of 2018

12-18

aRt with code

07-27

Of Tennys players and moral Hazards

07-28

Of Tennys players and moral Hazards

07-28

Legal Tech: How Can Lawyers Benefit?

08-13

Of Tennys players and moral Hazards

07-28

Fringe FM conversation on AI Ethics

10-30

Of Tennys players and moral Hazards

07-28

Revisiting “Is the scientific paper a fraud?”

07-29

Revisiting “Is the scientific paper a fraud?”

07-29

Recreating the NBA lead tracker graphic

12-13

Seasonalities: Bad Period for Stocks?

07-29

Seasonalities: Bad Period for Stocks?

07-29

✚ Repetitions, Data Analysis as Brainstorm

01-10

✚ Avoiding D3, Using D3, and Why I Use D3

01-03

✚ Tufte Tweet Follow-up; Visualization Tools and Resources Roundup for December 2018

12-20

✚ Wrong Tool, Right Tool, More Tools for Visualization

08-02

Seasonalities: Bad Period for Stocks?

07-29

A Certification for R Package Quality

07-30

Quantum Computing: Cats, Crushes, and Chemistry

07-30

Please vote

10-29

Machine Learning Making Big Moves in Marketing

07-30

The file drawer’s on fire!

07-30

The file drawer’s on fire!

07-30

The file drawer’s on fire!

07-30

Four Ways to Harness Big Data in the Energy Sector

07-30

Four Ways to Harness Big Data in the Energy Sector

07-30

If you did not already know

12-16

Monash University: Research Fellow (Digital Civics) [Melbourne, Australia]

11-22

If you did not already know

10-04

Facilitate Proactive Cybersecurity Operations with Big Data Analytics and Machine Intelligence

07-30

Using Entity-level Sentiment Analysis to understand News Content

07-30

3 Steps to Build Your First Intelligent App – Conference Buddy

07-31

R Packages worth a look

07-31

R Packages worth a look

07-31

Distilled News

07-31

Distilled News

07-31

Amelia, it was just a false alarm

07-31

Exploring Models with lime

11-09

Improving model interpretability with LIME

10-31

Data Science With R Course Series – Week 7

10-29

Slides from my talk at the R-Ladies Meetup about Interpretable Deep Learning with R, Keras and LIME

10-17

Deep Learning Without Labels

10-03

Progress in machine learning interpretability

07-31

Progress in machine learning interpretability

07-31

Distilled News

11-11

Progress in machine learning interpretability

07-31

Progress in machine learning interpretability

07-31

Neural reinterpretations of movie trailers

07-31

Neural reinterpretations of movie trailers

07-31

Because it's Friday: Hey, it's Enrico Pallazzo!

10-12

Because it's Friday: Hey, it's Enrico Pallazzo!

10-12

Neural reinterpretations of movie trailers

07-31

If you did not already know

01-13

Document worth reading: “Recommendation System based on Semantic Scholar Mining and Topic modeling: A behavioral analysis of researchers from six conferences”

01-03

If you did not already know

10-03

Magister Dixit

07-31

Magister Dixit

07-31

Document worth reading: “Are Efficient Deep Representations Learnable”

07-31

Document worth reading: “Are Efficient Deep Representations Learnable”

07-31

Document worth reading: “Are Efficient Deep Representations Learnable”

07-31

If you did not already know

10-10

New Dynamics for Topic Models

07-31

New Dynamics for Topic Models

07-31

What makes the Python Cool.

07-31

Is it really true that babies should sleep on their backs?

07-31

Is it really true that babies should sleep on their backs?

07-31

Is it really true that babies should sleep on their backs?

07-31

Is it really true that babies should sleep on their backs?

07-31

If you did not already know

08-01

If you did not already know

08-01

If you did not already know

08-01

Thanks, NVIDIA

08-01

Thanks, NVIDIA

08-01

Sorry I didn’t get that! How to understand what your users want

11-16

Build a document search bot using Amazon Lex and Amazon Elasticsearch Service

08-01

Applying for a PhD program in visualization

01-03

R Packages worth a look

09-23

R Packages worth a look

08-01

R Packages worth a look

12-29

R Packages worth a look

12-27

Document worth reading: “A second-quantised Shannon theory”

12-20

R Packages worth a look

11-26

Document worth reading: “To Cluster, or Not to Cluster: An Analysis of Clusterability Methods”

11-23

Magister Dixit

11-13

R Packages worth a look

11-10

Document worth reading: “Machine Learning for Wireless Networks with Artificial Intelligence: A Tutorial on Neural Networks”

10-28

R Packages worth a look

10-06

Document worth reading: “Detecting Dead Weights and Units in Neural Networks”

10-05

R Packages worth a look

09-30

R Packages worth a look

09-08

Document worth reading: “A Survey on Influence Maximization in a Social Network”

09-02

Document worth reading: “How Important Is a Neuron”

08-15

R Packages worth a look

08-01

R Generation: 25 Years of R

08-01

R Generation: 25 Years of R

08-01

When cycling is faster than driving

12-11

Tips & Tricks for Starting Your First Data Project

08-01

Tips & Tricks for Starting Your First Data Project

08-01

Tips & Tricks for Starting Your First Data Project

08-01

TechTarget: Data science in healthcare demands dual focus, expert says

08-01

TechTarget: Data science in healthcare demands dual focus, expert says

08-01

TechTarget: Data science in healthcare demands dual focus, expert says

08-01

A glass shattering book draw with gganimate

08-01

gganimation for the nation

01-06

Using gganimate to illustrate the luminance illusion

08-22

A glass shattering book draw with gganimate

08-01

gganimate has transitioned to a state of release

01-03

Some fun with {gganimate}

12-27

Visualising Networks in ASOIAF – Part II

10-14

A glass shattering book draw with gganimate

08-01

A glass shattering book draw with gganimate

08-01

4 Myths of Big Data and 4 Ways to Improve with Deep Data

01-09

From Project Manager to Data Champion — Conquer Your Data Projects

10-18

Big Data : Meaning, Components, Collection & Analysis

09-10

Data Makes Possible Many Things: Insights Discovery, Innovation, and Better Decisions

08-01

Metadata Enrichment is Essential to Realize the Value of Open Datasets

11-14

Data Makes Possible Many Things: Insights Discovery, Innovation, and Better Decisions

08-01

Why Vegetarians Miss Fewer Flights – Five Bizarre Insights from Data

01-12

Data Makes Possible Many Things: Insights Discovery, Innovation, and Better Decisions

08-01

R Packages worth a look

12-27

These 3 problems destroy many clinical trials (in context of some papers on problems with non-inferiority trials, or problems with clinical trials in general)

11-25

R Packages worth a look

09-18

“Seeding trials”: medical marketing disguised as science

08-01

How America uses its land

08-01

How America uses its land

08-01

How America uses its land

08-01

R Packages worth a look

12-16

Using a Keras Long Short-Term Memory (LSTM) Model to Predict Stock Prices

11-21

R Packages worth a look

08-09

How America uses its land

08-01

How America uses its land

08-01

Continuous tempering through path sampling

08-02

Continuous tempering through path sampling

08-02

Continuous tempering through path sampling

08-02

If you did not already know

08-02

If you did not already know

08-02

Download 3 million Russian troll tweets

08-02

Whats new on arXiv

08-02

If you did not already know

10-21

Document worth reading: “A Reliability Theory of Truth”

08-02

If you did not already know

09-15

Document worth reading: “A Reliability Theory of Truth”

08-02

Your AI journey… and Happy Holidays!

12-20

KDnuggets™ News 18:n48, Dec 19: Why You Shouldn’t be a Data Science Generalist; Industry Data Science & Machine Learning 2019 Predictions

12-19

Trust The Process

08-02

Fine-tuning for Natural Language Processing

12-28

At Year's End: 2018

12-25

Amazon’s own ‘Machine Learning University’ now available to all developers

11-26

Trust The Process

08-02

Trust The Process

08-02

Skills that Employers look in a Data Scientist

08-02

Skills that Employers look in a Data Scientist

08-02

Skills that Employers look in a Data Scientist

08-02

IEEE Language Rankings 2018

08-07

Skills that Employers look in a Data Scientist

08-02

Three flavors of data scientist

08-02

Three flavors of data scientist

08-02

Three flavors of data scientist

08-02

Amazon Polly adds bilingual Indian English/Hindi language support

08-02

Amazon Polly adds bilingual Indian English/Hindi language support

08-02

My R Take on Advent of Code – Day 3

12-28

(Webinar) Farmers and Chubb on Humanizing Claims with AI

11-15

China air pollution regression discontinuity update

08-02

My R Take on Advent of Code – Day 3

12-28

China air pollution regression discontinuity update

08-02

Use Amazon Mechanical Turk with Amazon SageMaker for supervised learning

08-02

Use Amazon Mechanical Turk with Amazon SageMaker for supervised learning

08-02

Use Amazon Mechanical Turk with Amazon SageMaker for supervised learning

08-02

Use Amazon Mechanical Turk with Amazon SageMaker for supervised learning

08-02

Limitations of “Limitations of Bayesian Leave-One-Out Cross-Validation for Model Selection”

10-12

When LOO and other cross-validation approaches are valid

08-03

On the "we have naughty videos of you" scam

08-03

Handling Imbalanced Classes in the Dataset

08-03

R Packages worth a look

08-03

Document worth reading: “Transfer Metric Learning: Algorithms, Applications and Outlooks”

11-02

R Packages worth a look

08-03

R Packages worth a look

08-03

Let Automation Carry You from BI to AI in 2019

12-11

Document worth reading: “Attend Before you Act: Leveraging human visual attention for continual learning”

08-03

The replication crisis and the political process

08-03

The replication crisis and the political process

08-03

Chromebook Data Science - a free online data science program for anyone with a web browser.

10-01

Artificial Intelligence in the Workplace

08-03

Artificial Intelligence in the Workplace

08-03

Getting Started with Competitions - A Peer to Peer Guide

08-22

Video: How to run R and Python in SQL Server from a Jupyter notebook

08-03

OpenCPU 2.1 Release: Scalable R Services

11-22

Building a data warehouse

10-17

Announcing RStudio Package Manager

10-17

Modularize your Shiny Apps: Exercises

10-15

Running R scripts within in-database SQL Server Machine Learning

10-14

Video: How to run R and Python in SQL Server from a Jupyter notebook

08-03

Bring your own pre-trained MXNet or TensorFlow models into Amazon SageMaker

08-03

Bring your own pre-trained MXNet or TensorFlow models into Amazon SageMaker

08-03

Use R with Excel: Importing and Exporting Data

10-17

Bring your own pre-trained MXNet or TensorFlow models into Amazon SageMaker

08-03

Thorn collaborates with Amazon Rekognition to help fight child sexual abuse and trafficking

08-04

Thorn partners with Amazon Rekognition to help fight child sexual abuse and trafficking

08-04

If you did not already know

10-04

Thorn collaborates with Amazon Rekognition to help fight child sexual abuse and trafficking

08-04

Thorn partners with Amazon Rekognition to help fight child sexual abuse and trafficking

08-04

Thorn collaborates with Amazon Rekognition to help fight child sexual abuse and trafficking

08-04

Thorn partners with Amazon Rekognition to help fight child sexual abuse and trafficking

08-04

Thorn collaborates with Amazon Rekognition to help fight child sexual abuse and trafficking

08-04

Thorn partners with Amazon Rekognition to help fight child sexual abuse and trafficking

08-04

On Using Hyperopt: Advanced Machine Learning

08-04

R or Python? Why not both? Using Anaconda Python within R with {reticulate}

12-30

On Using Hyperopt: Advanced Machine Learning

08-04

On Using Hyperopt: Advanced Machine Learning

08-04

Don’t call it a bandit

08-04

Don’t call it a bandit

08-04

Thorn collaborates with Amazon Rekognition to help fight child sexual abuse and trafficking

08-04

Document worth reading: “Foundations of Complex Event Processing”

08-04

Magister Dixit

08-04

Bayes, statistics, and reproducibility: “Many serious problems with statistics in practice arise from Bayesian inference that is not Bayesian enough, or frequentist evaluation that is not frequentist enough, in both cases using replication distributions that do not make scientific sense or do not reflect the actual procedures being performed on the data.”

12-04

If you did not already know

08-05

If you did not already know

08-05

If you did not already know

09-28

R Packages worth a look

08-05

R Packages worth a look

08-05

Distilled News

12-17

Reusable Pipelines in R

12-13

Reusable Pipelines in R

12-13

Collecting Expressions in R

08-05

Distilled News

09-30

Collecting Expressions in R

08-05

Scale out your Pandas DataFrame operations using Dask

08-05

Announcing the Amazon SageMaker MXNet 1.2 container

08-06

Announcing the Amazon SageMaker MXNet 1.2 container

08-06

Announcing the Amazon SageMaker MXNet 1.2 container

08-06

Testing code with random output

08-06

Twilio offers greater voice selection to customers with Amazon Polly integration

08-06

Twilio offers greater voice selection to customers with Amazon Polly integration

08-06

Top 10 Advantages of a Data Science Certification

12-17

Facial feedback: “These findings suggest that minute differences in the experimental protocol might lead to theoretically meaningful changes in the outcomes.”

11-01

A Concise Explanation of Learning Algorithms with the Mitchell Paradigm

10-05

Twilio offers greater voice selection to customers with Amazon Polly integration

08-06

Quick and Dirty Serverless Integer Programming

08-06

Quick and Dirty Serverless Integer Programming

08-06

Quick and Dirty Serverless Integer Programming

08-06

2018 Data Sources for Cool Data Science Projects, provided by Thinknum

08-06

2018 Data Sources for Cool Data Science Projects, provided by Thinknum

08-06

2018 Data Sources for Cool Data Science Projects, provided by Thinknum

08-06

When Recurrent Models Don't Need to be Recurrent

08-06

When Recurrent Models Don't Need to be Recurrent

08-06

A study fails to replicate, but it continues to get referenced as if it had no problems. Communication channels are blocked.

10-24

Let’s be open about the evidence for the benefits of open science

08-06

Miami University: Assistant Provost for Institutional Research and Effectiveness [Oxford, OH]

12-26

When anyone claims 80% power, I’m skeptical.

08-24

Distilled News

08-23

Let’s be open about the evidence for the benefits of open science

08-06

Azure Functions for Data Science

08-06

“Optimized” floor plan with genetic algorithms

08-06

“Optimized” floor plan with genetic algorithms

08-06

“Optimized” floor plan with genetic algorithms

08-06

Meta-packages, nails in CRAN’s coffin

08-07

Meta-packages, nails in CRAN’s coffin

08-07

Meta-packages, nails in CRAN’s coffin

08-07

Meta-packages, nails in CRAN’s coffin

08-07

If you did not already know

08-07

“The most important aspect of a statistical analysis is not what you do with the data, it’s what data you use” (survey adjustment edition)

08-07

IEEE Language Rankings 2018

08-07

IEEE Language Rankings 2018

08-07

Trapped in the spam folder? Here’s what to do.

08-07

Trapped in the spam folder? Here’s what to do.

08-07

R Packages worth a look

08-10

Trapped in the spam folder? Here’s what to do.

08-07

R Packages worth a look

08-07

R Packages worth a look

08-07

R Packages worth a look

08-07

Maps, models, and analytic problem framing

11-05

Whats new on arXiv

08-28

Distilled News

08-07

Apache Drill 1.15.0 + sergeant 0.8.0 = pcapng Support, Proper Column Types & Mounds of New Metadata

01-02

Address Your Data Science Strategy at DSNY

11-20

The Hidden Costs of Data Silos

08-07

An actual quote from a paper published in a medical journal: “The data, analytic methods, and study materials will not be made available to other researchers for purposes of reproducing the results or replicating the procedure.”

10-19

Three informal case studies: (1) Monte Carlo EM, (2) a new approach to C++ matrix autodiff with closures, (3) C++ serialization via parameter packs

08-17

Three informal case studies: (1) Monte Carlo EM, (2) a new approach to C++ matrix autodiff with closures, (3) C++ serialization via parameter packs

08-17

The Hidden Costs of Data Silos

08-07

Document worth reading: “A Temporal Difference Reinforcement Learning Theory of Emotion: unifying emotion, cognition and adaptive behavior”

08-07

Gender Diversity in the R and Python Communities

12-05

Gender Diversity in the R and Python Communities

12-05

Document worth reading: “Visions of a generalized probability theory”

11-14

The purported CSI effect and the retroactive precision fallacy

11-05

Document worth reading: “Bayesian model reduction”

10-03

If you did not already know

09-28

The Law and Order of Data Science

08-15

Document worth reading: “A Temporal Difference Reinforcement Learning Theory of Emotion: unifying emotion, cognition and adaptive behavior”

08-07

Considering sensitivity to unmeasured confounding: part 2

01-10

Considering sensitivity to unmeasured confounding: part 1

01-02

If you did not already know

08-08

Document worth reading: “A Survey: Non-Orthogonal Multiple Access with Compressed Sensing Multiuser Detection for mMTC”

12-31

If you did not already know

08-08

What is “party balancing” and how does it explain midterm elections?

08-08

Document worth reading: “Mathematics of Deep Learning”

08-08

Document worth reading: “Mathematics of Deep Learning”

08-08

Distilled News

08-08

R Packages worth a look

12-03

Distilled News

08-08

Document worth reading: “Examining the Use of Neural Networks for Feature Extraction: A Comparative Analysis using Deep Learning, Support Vector Machines, and K-Nearest Neighbor Classifiers”

08-08

How to Overcome That Awkward Silence in Interviews

08-08

How to Overcome That Awkward Silence in Interviews

08-08

Magister Dixit

12-01

Dataiku: “Multimodal Force Majeure” Among Predictive Analytics & ML Platforms

09-18

How to Overcome That Awkward Silence in Interviews

08-08

How to Overcome That Awkward Silence in Interviews

08-08

In case you missed it: July 2018 roundup

08-08

In case you missed it: July 2018 roundup

08-08

In case you missed it: July 2018 roundup

08-08

In case you missed it: July 2018 roundup

08-08

“Richard Jarecki, Doctor Who Conquered Roulette, Dies at 86”

08-09

“Richard Jarecki, Doctor Who Conquered Roulette, Dies at 86”

08-09

“Richard Jarecki, Doctor Who Conquered Roulette, Dies at 86”

08-09

“Richard Jarecki, Doctor Who Conquered Roulette, Dies at 86”

08-09

Whats new on arXiv

08-09

The Trillion Dollar Question

08-09

Some thoughts after reading “Bad Blood: Secrets and Lies in a Silicon Valley Startup”

08-09

Some thoughts after reading “Bad Blood: Secrets and Lies in a Silicon Valley Startup”

08-09

Some thoughts after reading “Bad Blood: Secrets and Lies in a Silicon Valley Startup”

08-09

R Packages worth a look

08-09

R Packages worth a look

08-09

R Packages worth a look

11-04

R Packages worth a look

10-20

R Packages worth a look

10-07

R Packages worth a look

09-08

R Packages worth a look

08-09

Limit access to a Jupyter notebook instance by IP address

09-14

Securing all Amazon SageMaker API calls with AWS PrivateLink

08-14

R Packages worth a look

08-09

Distilled News

08-09

How to Define a Machine Learning Problem Like a Detective

10-22

What is a p-value

08-09

What is a p-value

08-09

Lana Del Rey’s Discography through the Lens of Text Analytics

08-09

Lana Del Rey’s Discography through the Lens of Text Analytics

08-09

AI Meets Mail Processing (Automation for Admin Tasks)

08-09

AI Meets Mail Processing (Automation for Admin Tasks)

08-09

AI Meets Mail Processing (Automation for Admin Tasks)

08-09

✚ Detailed Intentions of a Map, When Everything Leads to Nothing, Designing for Misinterpretations

08-09

Magister Dixit

08-09

Magister Dixit

08-09

Amazon Lex integration with Genesys PureCloud IVR now available

08-09

Amazon Lex integration with Genesys PureCloud IVR now available

08-09

Amazon Lex integration with Genesys PureCloud IVR now available

08-09

Amazon Lex integration with Genesys PureCloud IVR now available

08-09

Amazon Lex integration with Genesys PureCloud IVR now available

08-09

Can You Read My Mind? Analyzing The Killers’ Discography with NLP

08-09

Can You Read My Mind? Analyzing The Killers’ Discography with NLP

08-09

The brain as a neural network: this is why we can’t get along

12-19

Scrapping data about Australian politicians with RSelenium

11-21

“2010: What happened?” in light of 2018

10-31

What’s gonna happen in the 2018 midterm elections?

08-09

What’s gonna happen in the 2018 midterm elections?

08-09

2018: What really happened?

11-10

What’s gonna happen in the 2018 midterm elections?

08-09

2018: What really happened?

11-10

What’s gonna happen in the 2018 midterm elections?

08-09

Data Notes: From Hate Speech to Russian Troll Tweets

08-09

Data Notes: From Hate Speech to Russian Troll Tweets

08-09

Marinus Analytics fights human trafficking using Amazon Rekognition

08-09

Marinus Analytics fights human trafficking using Amazon Rekognition

08-09

Young Investigator Special Competition for Time-Sharing Experiment for the Social Sciences

10-19

Marinus Analytics fights human trafficking using Amazon Rekognition

08-09

Marinus Analytics fights human trafficking using Amazon Rekognition

08-09

Marinus Analytics fights human trafficking using Amazon Rekognition

08-09

If you did not already know

08-10

Introducing pipe, The Automattic Machine Learning Pipeline

11-20

If you did not already know

08-10

If you did not already know

08-10

Cryptocurrency: Your Current Options

08-10

The Axios Turing test and the heat death of the journalistic universe

10-25

Beyond text: How Spokata uses Amazon Polly to make news and information universally accessible as real-time audio

10-04

Build an automatic alert system to easily moderate content at scale with Amazon Rekognition Video

08-15

R Packages worth a look

08-10

R Packages worth a look

08-10

Redmonk Language Rankings, June 2018

08-10

Redmonk Language Rankings, June 2018

08-10

Community Call Summary – Code Review in the Lab

11-29

On “Competition” in the R Ecosystem

09-15

Document worth reading: “Model-free, Model-based, and General Intelligence”

08-10

Power your website with on-demand translated reviews using Amazon Translate

12-20

Amazon Translate now offers 113 new language pairs

10-29

Create a translator chatbot using Amazon Translate and Amazon Lex

08-22

Create video subtitles with translation using machine learning

08-10

Create video subtitles with translation using machine learning

08-10

Create video subtitles with translation using machine learning

08-10

Document worth reading: “Learning to Succeed while Teaching to Fail: Privacy in Closed Machine Learning Systems”

08-10

A Non-Compromising Approach to Privacy-Preserving Personalized Services

01-08

If you did not already know

01-03

If you did not already know

12-27

Federated learning: distributed machine learning with data locality and privacy

11-14

Escaping the macOS 10.14 (Mojave) Filesystem Sandbox with R / RStudio

11-09

Why you can't have privacy on the internet

08-22

Document worth reading: “Learning to Succeed while Teaching to Fail: Privacy in Closed Machine Learning Systems”

08-10

Document worth reading: “Learning to Succeed while Teaching to Fail: Privacy in Closed Machine Learning Systems”

08-10

Jeremy Freese was ahead of the curve

08-10

In Memoriam: Manfred te Grotenhuis

10-15

Jeremy Freese was ahead of the curve

08-10

Jeremy Freese was ahead of the curve

08-10

GBM

08-10

机器学习面试

08-10

机器学习面试

08-10

机器学习面试

08-10

机器学习面试

08-10

机器学习面试

08-10

LSTM的神奇之处

08-10

If you did not already know

08-11

If you did not already know

08-11

If you did not already know

08-11

Discussion of the value of a mathematical model for the dissemination of propaganda

08-11

Discussion of the value of a mathematical model for the dissemination of propaganda

08-11

Discussion of the value of a mathematical model for the dissemination of propaganda

08-11

Discussion of the value of a mathematical model for the dissemination of propaganda

08-11

Discussion of the value of a mathematical model for the dissemination of propaganda

08-11

Discussion of the value of a mathematical model for the dissemination of propaganda

08-11

R Packages worth a look

08-11

R Packages worth a look

08-11

Surprise-hacking: “the narrative of blindness and illusion sells, and therefore continues to be the central thesis of popular books written by psychologists and cognitive scientists”

12-16

Minimum CRPS vs. maximum likelihood

12-16

Document worth reading: “A Survey on Trust Modeling from a Bayesian Perspective”

11-22

Document worth reading: “Visions of a generalized probability theory”

11-14

If you did not already know

09-17

Document worth reading: “Theoretical Impediments to Machine Learning With Seven Sparks from the Causal Revolution”

08-11

Magister Dixit

08-11

Magister Dixit

08-11

Will Compression Be Machine Learning’s Killer App?

10-16

Linear compression in python: PCA vs unsupervised feature selection

08-11

Linear compression in python: PCA vs unsupervised feature selection

08-11

Linear compression in python: PCA vs unsupervised feature selection

08-11

Intuition for principal component analysis (PCA)

12-06

Linear compression in python: PCA vs unsupervised feature selection

08-11

R Packages worth a look

01-06

If you did not already know

08-12

If you did not already know

08-12

R Packages worth a look

08-29

R Packages worth a look

08-12

“Usefully skeptical science journalism”

08-12

“Usefully skeptical science journalism”

08-12

“Usefully skeptical science journalism”

08-12

“Usefully skeptical science journalism”

08-12

“Usefully skeptical science journalism”

08-12

“Usefully skeptical science journalism”

08-12

How to Build a Data Science Portfolio

08-13

How to Build a Data Science Portfolio

08-13

Hierarchical Bayesian Neural Networks with Informative Priors

08-13

Discovering and indexing podcast episodes using Amazon Transcribe and Amazon Comprehend

09-20

R Packages worth a look

08-13

The Riddler: Santa Needs Some Help With Math

12-22

Document worth reading: “Learning Tree Distributions by Hidden Markov Models”

10-07

R Packages worth a look

08-13

R Packages worth a look

08-13

R Packages worth a look

08-13

R Packages worth a look

08-13

Distilled News

08-13

Data Feminism

11-06

How feminism has made me a better scientist

08-13

How feminism has made me a better scientist

08-13

How feminism has made me a better scientist

08-13

How feminism has made me a better scientist

08-13

Legal Tech: How Can Lawyers Benefit?

08-13

Document worth reading: “Weighted Abstract Dialectical Frameworks: Extended and Revised Report”

08-13

The year in AI/Machine Learning advances: Xavier Amatriain 2018 Roundup

01-11

If you did not already know

12-13

If you did not already know

09-30

Distilled News

09-20

Document worth reading: “Weighted Abstract Dialectical Frameworks: Extended and Revised Report”

08-13

Document worth reading: “Weighted Abstract Dialectical Frameworks: Extended and Revised Report”

08-13

R Packages worth a look

09-07

Document worth reading: “Weighted Abstract Dialectical Frameworks: Extended and Revised Report”

08-13

Curalate makes social sell with AI using Apache MXNet on AWS

08-13

Create a translator chatbot using Amazon Translate and Amazon Lex

08-22

Amazon Translate now available in the Memsource translation management system

08-14

Amazon Translate now available in the Memsource translation management system

08-14

Introducing a simple and intuitive Python API for UCI machine learning repository

11-12

Mirroring an FTP Using lftp and cron

09-06

Microsoft R Open 3.5.1 now available

08-14

Microsoft R Open 3.5.1 now available

08-14

If you did not already know

08-14

Whats new on arXiv

08-14

R Packages worth a look

08-14

5 Ways in which Data Science is Revolutionizing Web Development

01-03

Tis the Season to Check your SSL/TLS Cipher List Thrice (RCurl/curl/openssl)

11-17

Google Dataset Search now in public beta

09-06

R Packages worth a look

08-14

R Packages worth a look

08-14

It was the weeds that bothered him.

08-14

It was the weeds that bothered him.

08-14

It was the weeds that bothered him.

08-14

It was the weeds that bothered him.

08-14

It was the weeds that bothered him.

08-14

It was the weeds that bothered him.

08-14

High-profile statistical errors occur in the physical sciences too, it’s not just a problem in social science.

09-15

It was the weeds that bothered him.

08-14

It was the weeds that bothered him.

08-14

Amazon SageMaker now comes with new capabilities for accelerating machine learning experimentation

11-29

Direct access to Amazon SageMaker notebooks from Amazon VPC by using an AWS PrivateLink endpoint

11-06

Amazon SageMaker runtime now supports the CustomAttributes header

08-31

Securing all Amazon SageMaker API calls with AWS PrivateLink

08-14

Direct access to Amazon SageMaker notebooks from Amazon VPC by using an AWS PrivateLink endpoint

11-06

Securing all Amazon SageMaker API calls with AWS PrivateLink

08-14

Limit access to a Jupyter notebook instance by IP address

09-14

Securing all Amazon SageMaker API calls with AWS PrivateLink

08-14

The Microsoft AI Idea Challenge – Breakthrough Ideas Wanted!

08-14

Problems in a published article on food security in the Lower Mekong Basin

08-23

A transforming river seen from above

08-14

A transforming river seen from above

08-14

A transforming river seen from above

08-14

A transforming river seen from above

08-14

Adversarial Examples, Explained

10-16

data.table is Really Good at Sorting

08-14

Document worth reading: “Does putting your emotions into words make you feel better? Measuring the minute-scale dynamics of emotions from online data”

08-14

Mindstrong Health: Sr Data Scientist / Machine Learning, Statistics, Coding [Palo Alto, CA]

10-17

What to do when your measured outcome doesn’t quite line up with what you’re interested in?

09-17

Document worth reading: “Does putting your emotions into words make you feel better? Measuring the minute-scale dynamics of emotions from online data”

08-14

Document worth reading: “Does putting your emotions into words make you feel better? Measuring the minute-scale dynamics of emotions from online data”

08-14

Distill Update 2018

08-14

Distill Update 2018

08-14

Building a Linear Regression Model for Real World Problems, in R

08-14

TINT uses Amazon Comprehend to find and aggregate the best social media content for customers

08-15

TINT uses Amazon Comprehend to find and aggregate the best social media content for customers

08-15

TINT uses Amazon Comprehend to find and aggregate the best social media content for customers

08-15

Extracting data from news articles: Australian pollution by postcode

11-28

R Packages worth a look

09-29

R Packages worth a look

08-15

R Packages worth a look

08-15

Linear Regression in Real Life

11-05

Tidyverse 'Starts_with' in M/Power Query

10-08

R Packages worth a look

10-02

R Packages worth a look

08-15

Cool tennis-tracking app

08-15

Cool tennis-tracking app

08-15

Cool tennis-tracking app

08-15

Cool tennis-tracking app

08-15

Cool tennis-tracking app

08-15

Cool tennis-tracking app

08-15

It should be ok to just publish the data.

08-15

It should be ok to just publish the data.

08-15

Distilled News

01-13

Document worth reading: “A Review for Weighted MinHash Algorithms”

01-02

Lessons from posting a fake map about pies

11-28

How to Define a Machine Learning Problem Like a Detective

10-22

The Enterprise AI Lab: Not Your Average AI Lab

10-02

It should be ok to just publish the data.

08-15

It should be ok to just publish the data.

08-15

Advanced Jupyter Notebooks: A Tutorial

01-02

It should be ok to just publish the data.

08-15

It should be ok to just publish the data.

08-15

Data Science Portfolio Project: Is Fandango Still Inflating Ratings?

08-15

If you did not already know

10-28

If you did not already know

08-15

If you did not already know

10-28

If you did not already know

08-15

If you did not already know

08-15

Practical Data Science with R, 2nd Edition discount!

01-12

Announcing Practical Data Science with R, 2nd Edition

08-15

Announcing the Artificial Intelligence (AI) Hackathon: Build Intelligent Applications using machine learning APIs and serverless

08-15

R Packages worth a look

08-15

“My advisor and I disagree on how we should carry out repeated cross-validation. We would love to have a third expert opinion…”

12-15

R Packages worth a look

12-07

Magister Dixit

11-16

R Packages worth a look

10-14

R Packages worth a look

09-26

R Packages worth a look

08-24

R Packages worth a look

08-15

R Packages worth a look

08-15

Document worth reading: “Radial Basis Function Approximations: Comparison and Applications”

08-15

Document worth reading: “Radial Basis Function Approximations: Comparison and Applications”

08-15

Document worth reading: “How Important Is a Neuron”

08-15

The Law and Order of Data Science

08-15

The Law and Order of Data Science

08-15

Site Redesign

12-02

✚ Visualization Away from the Computer, Developing Ideas, Bring in the Constraints

08-16

R Packages worth a look

08-16

R Packages worth a look

08-16

R Packages worth a look

08-16

Document worth reading: “Sequences, yet Functions: The Dual Nature of Data-Stream Processing”

08-16

Distilled News

08-16

On the growth of our PyDataLondon community

08-16

On the growth of our PyDataLondon community

08-16

On the growth of our PyDataLondon community

08-16

Distilled News

09-26

Distilled News

08-16

Mindstrong Health: Sr Data Scientist / Machine Learning, Statistics, Coding [Palo Alto, CA]

10-17

A visual analysis of jean pockets and their lack of practicality

08-16

A visual analysis of jean pockets and their lack of practicality

08-16

A visual analysis of jean pockets and their lack of practicality

08-16

A visual analysis of jean pockets and their lack of practicality

08-16

Build a model to predict the impact of weather on urban air quality using Amazon SageMaker

08-16

Keras Conv2D and Convolutional Layers

12-31

Scikit-learn Tutorial: Machine Learning in Python

11-15

Evaluating the Business Value of Predictive Models in Python and R

10-11

Keras vs. TensorFlow – Which one is better and which one should I learn?

10-08

Deploy your own TensorFlow object detection model to AWS DeepLens

09-27

Build a model to predict the impact of weather on urban air quality using Amazon SageMaker

08-16

Build a model to predict the impact of weather on urban air quality using Amazon SageMaker

08-16

Magister Dixit

08-16

Make R speak

08-16

Serial and Parallel bulb puzzle

10-18

Monitoring the media reaction to Facebook’s disastrous earnings call – News API Monthly Media Review

08-16

Monitoring the media reaction to Facebook’s disastrous earnings call – News API Monthly Media Review

08-16

No, I don’t think it’s the file drawer effect

08-16

No, I don’t think it’s the file drawer effect

08-16

No, I don’t think it’s the file drawer effect

08-16

No, I don’t think it’s the file drawer effect

08-16

Document worth reading: “Learning Tree Distributions by Hidden Markov Models”

10-07

Multiple Linear Regression & Assumptions of Linear Regression: A-Z

08-17

Multiple Linear Regression & Assumptions of Linear Regression: A-Z

08-17

Distilled News

08-17

Distilled News

08-17

Distilled News

08-17

Three informal case studies: (1) Monte Carlo EM, (2) a new approach to C++ matrix autodiff with closures, (3) C++ serialization via parameter packs

08-17

Three informal case studies: (1) Monte Carlo EM, (2) a new approach to C++ matrix autodiff with closures, (3) C++ serialization via parameter packs

08-17

Three informal case studies: (1) Monte Carlo EM, (2) a new approach to C++ matrix autodiff with closures, (3) C++ serialization via parameter packs

08-17

Three informal case studies: (1) Monte Carlo EM, (2) a new approach to C++ matrix autodiff with closures, (3) C++ serialization via parameter packs

08-17

If you did not already know

08-17

FAQ on ICML 2019 Code Submission Policy

12-19

If you did not already know

08-17

The fallacy of the excluded middle — statistical philosophy edition

08-18

The fallacy of the excluded middle — statistical philosophy edition

08-18

R Packages worth a look

08-18

R Packages worth a look

09-24

R Packages worth a look

08-18

R Packages worth a look

08-18

If you did not already know

08-18

Document worth reading: “Cogniculture: Towards a Better Human-Machine Co-evolution”

08-18

Document worth reading: “Cogniculture: Towards a Better Human-Machine Co-evolution”

08-18

Let’s get hysterical

08-19

Let’s get hysterical

08-19

Practical Data Science with R, 2nd Edition discount!

01-12

More Practical Data Science with R Book News

08-19

More Practical Data Science with R Book News

08-19

R Packages worth a look

08-19

R Packages worth a look

08-19

R Packages worth a look

08-19

R Packages worth a look

08-19

R Packages worth a look

12-31

R Packages worth a look

12-08

R Packages worth a look

09-07

R Packages worth a look

08-19

If you did not already know

08-19

If you did not already know

08-19

If you did not already know

08-19

If you did not already know

08-19

If you did not already know

08-20

The competing narratives of scientific revolution

08-20

The competing narratives of scientific revolution

08-20

The competing narratives of scientific revolution

08-20

The competing narratives of scientific revolution

08-20

The competing narratives of scientific revolution

08-20

The competing narratives of scientific revolution

08-20

The competing narratives of scientific revolution

08-20

The competing narratives of scientific revolution

08-20

The competing narratives of scientific revolution

08-20

The competing narratives of scientific revolution

08-20

Document worth reading: “A rational analysis of curiosity”

08-20

Nextgov: DHS Funds Machine Learning Tool to Boost Other Countries’ Airport Security

08-20

Magister Dixit

08-20

Magister Dixit

08-20

Bad headlines distract from real AI problems

08-20

Linguistic Signals of Album Quality: A Predictive Analysis of Pitchfork Review Scores Using Quanteda

01-10

Distilled News

10-06

Problems in a published article on food security in the Lower Mekong Basin

08-23

Document worth reading: “A Survey on Visual Query Systems in the Web Era (extended version)”

08-20

R Packages worth a look

08-21

R Packages worth a look

08-21

R Packages worth a look

08-21

R Packages worth a look

08-21

How quickly do stock market valuations revert back to their means?

10-28

Fake News and Filter Bubbles

08-21

Maps with pie charts on top of each administrative division: an example with Luxembourg’s elections data

10-27

Fake News and Filter Bubbles

08-21

Fake News and Filter Bubbles

08-21

If you did not already know

11-01

If you did not already know

08-21

If you did not already know

08-21

Against Arianism

08-21

The scandal isn’t what’s retracted, the scandal is what’s not retracted.

08-21

The scandal isn’t what’s retracted, the scandal is what’s not retracted.

08-21

The scandal isn’t what’s retracted, the scandal is what’s not retracted.

08-21

The scandal isn’t what’s retracted, the scandal is what’s not retracted.

08-21

Document worth reading: “Instance-Level Explanations for Fraud Detection: A Case Study”

01-01

Why you can't have privacy on the internet

08-22

Distilled News

08-21

Against Arianism

08-21

Building a conversational business intelligence bot with Amazon Lex

11-21

Managing your expenses with Amazon Lex

08-21

Managing your expenses with Amazon Lex

08-21

Creating a MapD ODBC Connection in RStudio Server

08-21

Creating a MapD ODBC Connection in RStudio Server

08-21

Document worth reading: “A Tutorial on Network Embeddings”

08-26

R tip: Use Radix Sort

08-21

R tip: Use Radix Sort

08-21

R tip: Use Radix Sort

08-21

Data concerns when interpreting comparisons of gender equality between countries

08-21

Data concerns when interpreting comparisons of gender equality between countries

08-21

What data scientists really do

08-21

What data scientists really do

08-21

What to Consider When Choosing Colors for Data Visualization

08-22

If you did not already know

12-24

If you did not already know

10-29

If you did not already know

08-22

If you did not already know

12-24

If you did not already know

08-22

If you did not already know

08-22

If you did not already know

12-25

Document worth reading: “A Learning Approach to Secure Learning”

11-19

If you did not already know

08-22

If you did not already know

08-22

Using gganimate to illustrate the luminance illusion

08-22

Create a translator chatbot using Amazon Translate and Amazon Lex

08-22

Create a translator chatbot using Amazon Translate and Amazon Lex

08-22

Who spends how much, and on what?

08-22

Who spends how much, and on what?

08-22

Who spends how much, and on what?

08-22

Who spends how much, and on what?

08-22

Office for Students report on “grade inflation”

01-02

One-stop-learning-shop for data pros – get exclusive access for less than a cup of coffee

12-06

RcppTOML 0.1.5: Small extensions

11-01

namer, Automatic Labelling of R Markdown Chunks

10-31

Who spends how much, and on what?

08-22

Who spends how much, and on what?

08-22

Who spends how much, and on what?

08-22

Who spends how much, and on what?

08-22

Who spends how much, and on what?

08-22

Who spends how much, and on what?

08-22

Key Takeaways from AI Conference SF, Day 2: AI and Security, Adversarial Examples, Innovation

10-30

Document worth reading: “Applications of Artificial Intelligence to Network Security”

08-22

Document worth reading: “Applications of Artificial Intelligence to Network Security”

08-22

Document worth reading: “Applications of Artificial Intelligence to Network Security”

08-22

If you did not already know

01-01

Distilled News

12-06

Amazon Rekognition announces updates to its face detection, analysis, and recognition capabilities

11-22

YOLO object detection with OpenCV

11-12

Document worth reading: “Applications of Artificial Intelligence to Network Security”

08-22

R Packages worth a look

08-22

R Packages worth a look

08-22

R Packages worth a look

08-22

Document worth reading: “Fog Computing: Survey of Trends, Architectures, Requirements, and Research Directions”

08-22

R Packages worth a look

08-23

3-D-Printed Time Series Plates

08-23

3-D-Printed Time Series Plates

08-23

3-D-Printed Time Series Plates

08-23

3-D-Printed Time Series Plates

08-23

Distilled News

08-23

What is Cloud Computing & Which is Better, AWS or GCP

11-15

Video: Azure Machine Learning in plain English

08-23

Video: Azure Machine Learning in plain English

08-23

If you did not already know

12-26

R Packages worth a look

11-03

Document worth reading: “Causal inference and the data-fusion problem”

10-26

If you did not already know

09-20

Document worth reading: “An Information-Theoretic Analysis of Deep Latent-Variable Models”

08-23

Document worth reading: “An Information-Theoretic Analysis of Deep Latent-Variable Models”

08-23

Timings of a Grouped Rank Filter Task

08-23

Timings of a Grouped Rank Filter Task

08-23

Problems in a published article on food security in the Lower Mekong Basin

08-23

Problems in a published article on food security in the Lower Mekong Basin

08-23

What is a Box Plot?

08-24

Constructing a Data Analysis

08-24

I Spy with my Graphing Eye 📊 👁️

12-12

Constructing a Data Analysis

08-24

World map shows aerosol billowing in the wind

08-24

World map shows aerosol billowing in the wind

08-24

Distilled News

12-11

Matching (and discarding non-matches) to deal with lack of complete overlap, then regression to adjust for imbalance between treatment and control groups

11-10

R Packages worth a look

08-24

R Packages worth a look

08-24

R Packages worth a look

08-24

R Objects

08-24

New Course: Interactive Data Visualization with rbokeh

10-19

The Chartmaker Directory: Data visualizations in every tool

08-24

The Chartmaker Directory: Data visualizations in every tool

08-24

High-performance mathematical paradigms in Python

11-22

Microsoft Weekly Data Science News for August 24, 2018

08-24

Because it's Friday: One Million Integers

08-24

Weighing the risk of moderate alcohol consumption

08-24

Weighing the risk of moderate alcohol consumption

08-24

Weighing the risk of moderate alcohol consumption

08-24

Weighing the risk of moderate alcohol consumption

08-24

R Packages worth a look

08-24

R Packages worth a look

08-24

When anyone claims 80% power, I’m skeptical.

08-24

When anyone claims 80% power, I’m skeptical.

08-24

Document worth reading: “Nonnegative Matrix Factorization for Signal and Data Analytics: Identifiability, Algorithms, and Applications”

08-25

Document worth reading: “Nonnegative Matrix Factorization for Signal and Data Analytics: Identifiability, Algorithms, and Applications”

08-25

Document worth reading: “Nonnegative Matrix Factorization for Signal and Data Analytics: Identifiability, Algorithms, and Applications”

08-25

Document worth reading: “Nonnegative Matrix Factorization for Signal and Data Analytics: Identifiability, Algorithms, and Applications”

08-25

Document worth reading: “Nonnegative Matrix Factorization for Signal and Data Analytics: Identifiability, Algorithms, and Applications”

08-25

If you did not already know

11-27

AI Masterpieces: But is it Art?

10-27

Document worth reading: “Data Science vs. Statistics: Two Cultures”

09-27

If you did not already know

08-25

Ensure consistency in data processing code between training and inference in Amazon SageMaker

01-11

Document worth reading: “On-Disk Data Processing: Issues and Future Directions”

09-18

If you did not already know

08-25

Distilled News

08-25

Purr yourself into a math genius

01-03

Distilled News

08-25

Magister Dixit

09-27

Distilled News

08-25

In statistics, we talk about uncertainty without it being viewed as undesirable

08-25

In statistics, we talk about uncertainty without it being viewed as undesirable

08-25

Whats new on arXiv

10-27

Get a 2–6x Speed-up on Your Data Pre-processing with Python

10-23

Big Data : Meaning, Components, Collection & Analysis

09-10

If you did not already know

08-26

If you did not already know

08-26

Document worth reading: “The State of the Art in Developing Fuzzy Ontologies: A Survey”

08-26

Forbes: 25 Machine Learning Startups to Watch in 2018

08-26

Document worth reading: “A Tutorial on Network Embeddings”

08-26

Document worth reading: “A Tutorial on Network Embeddings”

08-26

Additional Strategies for Confronting the Partition Function

10-30

If you did not already know

08-27

If you did not already know

08-27

Monotonicity constraints in machine learning

09-16

R Packages worth a look

08-27

R Packages worth a look

08-27

R Packages worth a look

08-27

R Packages worth a look

08-27

One-arm Bayesian Adaptive Trial Simulation Code

11-10

R Packages worth a look

08-27

R Packages worth a look

08-27

R Packages worth a look

08-27

Amazon Transcribe now supports multi-channel transcriptions

08-27

Amazon Transcribe now supports real-time transcriptions

11-20

Amazon Transcribe now supports multi-channel transcriptions

08-27

Amazon Transcribe now supports multi-channel transcriptions

08-27

Analyzing contact center calls—Part 1: Use Amazon Transcribe and Amazon Comprehend to analyze customer sentiment

12-18

Amazon SageMaker automatic model tuning produces better models, faster

09-25

Amazon Transcribe now supports multi-channel transcriptions

08-27

“To get started, I suggest coming up with a simple but reasonable model for missingness, then simulate fake complete data followed by a fake missingness pattern, and check that you can recover your missing-data model and your complete data model in that fake-data situation. You can then proceed from there. But if you can’t even do it with fake data, you’re sunk.”

08-27

“To get started, I suggest coming up with a simple but reasonable model for missingness, then simulate fake complete data followed by a fake missingness pattern, and check that you can recover your missing-data model and your complete data model in that fake-data situation. You can then proceed from there. But if you can’t even do it with fake data, you’re sunk.”

08-27

R Packages worth a look

10-07

“To get started, I suggest coming up with a simple but reasonable model for missingness, then simulate fake complete data followed by a fake missingness pattern, and check that you can recover your missing-data model and your complete data model in that fake-data situation. You can then proceed from there. But if you can’t even do it with fake data, you’re sunk.”

08-27

Pixm takes on phishing attacks with deep learning using Apache MXNet on AWS

08-28

Pixm takes on phishing attacks with deep learning using Apache MXNet on AWS

08-28

Old school

08-28

Visualization in the 1980s, just before the rise of computers

09-07

Old school

08-28

Short Article Reveals the Undeniable Facts About College Essay Writing Service and How It Can Affect You

10-04

Why Almost Everything You’ve Learned About Cheap Custom Essay Is Wrong and What You Should Know

10-04

Old school

08-28

Old school

08-28

Videos from NYC R Conference

08-28

Document worth reading: “What am I searching for?”

08-28

Oh, I hate it when work is criticized (or, in this case, fails in attempted replications) and then the original researchers don’t even consider the possibility that maybe in their original work they were inadvertently just finding patterns in noise.

12-13

CRAN’s New Missing Data Task View

10-26

Document worth reading: “Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks”

10-25

R Packages worth a look

10-22

If you did not already know

09-17

Document worth reading: “What am I searching for?”

08-28

Document worth reading: “What am I searching for?”

08-28

Document worth reading: “What am I searching for?”

08-28

Distilled News

08-28

Whats new on arXiv

08-28

Access Amazon S3 data managed by AWS Glue Data Catalog from Amazon SageMaker notebooks

08-29

R Packages worth a look

12-26

OneR – fascinating insights through simple rules

11-25

OneR – fascinating insights through simple rules

11-24

R Packages worth a look

08-29

R Packages worth a look

08-29

R Packages worth a look

08-29

Document worth reading: “A Comparative Study on using Principle Component Analysis with Different Text Classifiers”

08-29

Spam Detection with Natural Language Processing – Part 3

11-01

Document worth reading: “A Comparative Study on using Principle Component Analysis with Different Text Classifiers”

08-29

If you did not already know

08-29

styler 1.1.0

11-27

Synesthesia: The Sound of Style

08-29

simmer 4.1.0

11-09

Synesthesia: The Sound of Style

08-29

Distilled News

08-29

Distilled News

08-29

Explainable Artificial Intelligence (Part 2) – Model Interpretation Strategies

12-06

Distilled News

08-29

Some clues that this study has big big problems

08-29

Whats new on arXiv

12-10

Whats new on arXiv

11-15

Whats new on arXiv

11-03

Whats new on arXiv

10-26

Whats new on arXiv

10-09

Whats new on arXiv

10-04

Whats new on arXiv

10-04

Whats new on arXiv

09-25

Whats new on arXiv

09-21

Whats new on arXiv

09-08

Whats new on arXiv

09-07

Whats new on arXiv

08-29

Document worth reading: “Idealised Bayesian Neural Networks Cannot Have Adversarial Examples: Theoretical and Empirical Study”

08-29

9 Reasons Excel Users Should Consider Learning Programming

12-27

I’m an Analyst and the software engineers made fun of my code!

10-19

Modifying Excel Files using openxlsx

10-16

Review: Excel TV’s Data Science with Power BI and R

10-12

Journey from Non-Technical background to an expert in Data Science

10-05

Tips for analyzing Excel data in R

08-30

9 Reasons Excel Users Should Consider Learning Programming

12-27

How to Find an Entry-Level Job in Data Science

11-13

Tips for analyzing Excel data in R

08-30

Nimble tweak to use specific python version or virtual environment in RStudio

01-01

Webinar – Integrate AI Across Insurance Operations to Turbocharge Tech Transformation, Nov 14

10-31

R Tip: Put Your Values in Columns

08-30

R Tip: Put Your Values in Columns

08-30

R Tip: Put Your Values in Columns

08-30

R Packages worth a look

08-30

R Packages worth a look

08-30

Residential Property Investment Visualization and Analysis Shiny App

10-22

R Packages worth a look

08-30

Visual search on AWS—Part 1: Engine implementation with Amazon SageMaker

08-30

Distilled News

08-30

3 recent movies from the 50s and the 70s

08-30

3 recent movies from the 50s and the 70s

08-30

If you did not already know

08-30

If you did not already know

08-30

Guide to a high-performance, powerful R installation

08-31

Guide to a high-performance, powerful R installation

08-31

Counting baseball cliches

08-31

Counting baseball cliches

08-31

Document worth reading: “PMLB: A Large Benchmark Suite for Machine Learning Evaluation and Comparison”

08-31

RATest. A Randomization Tests package is available on CRAN

11-11

A Deep (But Jargon and Math Free) Dive Into Deep Learning

08-31

rnoaa: new data sources and NCDC units

12-04

Magister Dixit

08-31

Magister Dixit

08-31

Magister Dixit

08-31

Because it's Friday: The Curiosity Show

08-31

Because it's Friday: The Curiosity Show

08-31

Because it's Friday: The Curiosity Show

08-31

Because it's Friday: The Curiosity Show

08-31

Amazon SageMaker runtime now supports the CustomAttributes header

08-31

“Identification of and correction for publication bias,” and another discussion of how forking paths is not the same thing as file drawer

08-31

“Identification of and correction for publication bias,” and another discussion of how forking paths is not the same thing as file drawer

08-31

Dexterous Manipulation with Reinforcement Learning: Efficient, General, and Low-Cost

08-31

Distilled News

08-31

Authority figures in psychology spread more happy talk, still don’t get the point that much of the published, celebrated, and publicized work in their field is no good (Part 2)

12-31

R Packages worth a look

09-01

R Tip: How to Pass a formula to lm

09-01

R Tip: How to Pass a formula to lm

09-01

Magister Dixit

09-01

If you did not already know

09-01

Christmas elves puzzle

12-27

Magister Dixit

12-24

Magister Dixit

09-02

binb 0.0.3: Now with Monash

10-12

Hey—take this psychological science replication quiz!

09-02

R Packages worth a look

09-02

R Packages worth a look

09-02

R Packages worth a look

09-02

Document worth reading: “A Survey on Influence Maximization in a Social Network”

09-02

If you did not already know

09-02

How to set up a voting system for a Hall of Fame?

09-02

How to set up a voting system for a Hall of Fame?

09-02

How to set up a voting system for a Hall of Fame?

09-02

Human Fuel Consumption

09-02

Linear Regression in Real Life

11-05

Human Fuel Consumption

09-02

Human Fuel Consumption

09-02

R Packages worth a look

09-03

R Packages worth a look

09-03

If you did not already know

12-08

If you did not already know

11-27

R Packages worth a look

11-18

Document worth reading: “Psychological State in Text: A Limitation of Sentiment Analysis”

09-03

Document worth reading: “Psychological State in Text: A Limitation of Sentiment Analysis”

09-03

A.I. parity with the West in 2020

09-03

A.I. parity with the West in 2020

09-03

A.I. parity with the West in 2020

09-03

A.I. parity with the West in 2020

09-03

A.I. parity with the West in 2020

09-03

Top 10 Advantages of a Data Science Certification

12-17

Monash University: Academic Opportunities in Dialogue Research [Melbourne, Australia]

10-25

The Data Science Roadshow is ON!

09-03

Document worth reading: “Artificial Intelligence and Robotics”

09-04

Document worth reading: “Artificial Intelligence and Robotics”

09-04

Document worth reading: “Artificial Intelligence and Robotics”

09-04

Document worth reading: “Interpreting Deep Learning: The Machine Learning Rorschach Test”

09-04

Document worth reading: “Interpreting Deep Learning: The Machine Learning Rorschach Test”

09-04

Document worth reading: “Interpreting Deep Learning: The Machine Learning Rorschach Test”

09-04

Document worth reading: “Interpreting Deep Learning: The Machine Learning Rorschach Test”

09-04

My talk tomorrow (Tues) noon at the Princeton University Psychology Department

12-03

Introducing Webhooks — Fastest Way to Collect Data

11-08

R Packages worth a look

09-30

“We continuously increased the number of animals until statistical significance was reached to support our conclusions” . . . I think this is not so bad, actually!

09-04

Three Operator Splitting

09-04

Three Operator Splitting

09-04

Streamlining Production with Predictive Maintenance and Essilor

09-04

Streamlining Production with Predictive Maintenance and Essilor

09-04

Stan on the web! (thanks to RStudio)

10-12

Against Winner-Take-All Attribution

09-05

StanCon 2018 Helsinki tutorial videos online

09-04

StanCon 2018 Helsinki tutorial videos online

09-04

Against Winner-Take-All Attribution

09-05

StanCon 2018 Helsinki tutorial videos online

09-04

Robert Heinlein vs. Lawrence Summers

09-04

Robert Heinlein vs. Lawrence Summers

09-04

Robert Heinlein vs. Lawrence Summers

09-04

Robert Heinlein vs. Lawrence Summers

09-04

GovernmentCIO: How AI Can Stop Doctors Likely to Overprescribe Opioids — and Stem the Crisis

09-04

GovernmentCIO: How AI Can Stop Doctors Likely to Overprescribe Opioids — and Stem the Crisis

09-04

Document worth reading: “Searching Toward Pareto-Optimal Device-Aware Neural Architectures”

01-05

If you did not already know

09-04

If you did not already know

09-04

Data Science Portfolio Project: Where to Advertise an E-learning Product

09-05

Data Science Portfolio Project: Where to Advertise an E-learning Product

09-05

If you did not already know

09-05

If you did not already know

09-05

Putting the Power of Kafka into the Hands of Data Scientists

09-05

Putting the Power of Kafka into the Hands of Data Scientists

09-05

Aosta Valley, Italy

09-05

You did a sentiment analysis with tidytext but you forgot to do dependency parsing to answer WHY is something positive/negative

01-08

Transcribe speech in three new languages: French, Italian, and Brazilian Portuguese

12-21

How Data Science (+ Friends) Helped Me Learn French

11-01

RHL’19 St-Cergue, Switzerland, 25-27 January 2019

10-31

Aosta Valley, Italy

09-05

Aosta Valley, Italy

09-05

Aosta Valley, Italy

09-05

British journalists not running corrections and talking about putting people in the freezer

09-05

British journalists not running corrections and talking about putting people in the freezer

09-05

British journalists not running corrections and talking about putting people in the freezer

09-05

Master R shiny: One trick to build maintainable and scalable event chains

11-02

British journalists not running corrections and talking about putting people in the freezer

09-05

British journalists not running corrections and talking about putting people in the freezer

09-05

If you did not already know

09-05

If you did not already know

09-05

In which I demonstrate my ignorance of world literature

12-03

If you did not already know

09-05

No code chatbots: TIBCO uses Amazon Lex to put chat interfaces into the hands of business users

09-05

No code chatbots: TIBCO uses Amazon Lex to put chat interfaces into the hands of business users

09-05

Visual search on AWS—Part 2: Deployment with AWS DeepLens

09-05

Visual search on AWS—Part 2: Deployment with AWS DeepLens

09-05

See How AI is Inspiring the Next Generation of Developers

09-05

See How AI is Inspiring the Next Generation of Developers

09-05

See How AI is Inspiring the Next Generation of Developers

09-05

anytime – dates in R

11-08

How to Start Learning R for Data Science

10-31

Google Dataset Search now in public beta

09-06

If you did not already know

10-09

What is Neural Network?

09-06

What is Neural Network?

09-06

The gaps between 1, 2, and 3 are just too large.

09-06

The gaps between 1, 2, and 3 are just too large.

09-06

The gaps between 1, 2, and 3 are just too large.

09-06

The gaps between 1, 2, and 3 are just too large.

09-06

T-mobile uses R for Customer Service AI

11-09

T-mobile uses R for Customer Service AI

11-09

Distilled News

09-06

R Packages worth a look

09-06

Data Science With R Course Series – Week 8

11-05

R Packages worth a look

09-06

How Foundations Student Russell Martin got into The Data Incubator’s Fellowship

09-06

How Foundations Student Russell Martin got into The Data Incubator’s Fellowship

09-06

Document worth reading: “Data learning from big data”

09-06

Document worth reading: “Data learning from big data”

09-06

Document worth reading: “Data learning from big data”

09-06

“Dynamically Rescaled Hamiltonian Monte Carlo for Bayesian Hierarchical Models”

09-06

“Dynamically Rescaled Hamiltonian Monte Carlo for Bayesian Hierarchical Models”

09-06

“Dynamically Rescaled Hamiltonian Monte Carlo for Bayesian Hierarchical Models”

09-06

“Dynamically Rescaled Hamiltonian Monte Carlo for Bayesian Hierarchical Models”

09-06

If you did not already know

09-06

Mirroring an FTP Using lftp and cron

09-06

Mirroring an FTP Using lftp and cron

09-06

Mirroring an FTP Using lftp and cron

09-06

Mirroring an FTP Using lftp and cron

09-06

In case you missed it: September 2018 roundup

10-03

Who wrote that anonymous NYT op-ed? Text similarity analyses with R

09-07

Who wrote that anonymous NYT op-ed? Text similarity analyses with R

09-07

Who wrote that anonymous NYT op-ed? Text similarity analyses with R

09-07

R Packages worth a look

12-22

These 3 problems destroy many clinical trials (in context of some papers on problems with non-inferiority trials, or problems with clinical trials in general)

11-25

Change over time is not “treatment response”

11-19

If you did not already know

10-29

Document worth reading: “The Risk of Machine Learning”

10-11

The Blessings of Multiple Causes: Causal Inference when you Can't Measure Confounders

09-07

The Blessings of Multiple Causes: Causal Inference when you Can't Measure Confounders

09-07

Document worth reading: “Causal inference and the data-fusion problem”

10-26

The Blessings of Multiple Causes: Causal Inference when you Can't Measure Confounders

09-07

Bothered by non-monotonicity? Here’s ONE QUICK TRICK to make you happy.

09-07

Bothered by non-monotonicity? Here’s ONE QUICK TRICK to make you happy.

09-07

Magister Dixit

09-07

Magister Dixit

09-07

Magister Dixit

09-07

Document worth reading: “Putting Data Science In Production”

09-07

Document worth reading: “Putting Data Science In Production”

09-07

Visualization in the 1980s, just before the rise of computers

09-07

Visualization in the 1980s, just before the rise of computers

09-07

Visualization in the 1980s, just before the rise of computers

09-07

Roll Your Own Federal Government Shutdown-caused SSL Certificate Expiration Monitor in R

01-10

Visualization in the 1980s, just before the rise of computers

09-07

Get started with automated metadata extraction using the AWS Media Analysis Solution

09-07

Get started with automated metadata extraction using the AWS Media Analysis Solution

09-07

How Data Scientists Think - A Mini Case Study

01-09

Hilary Mason and Gilad Lotan to Keynote at MADS 2019

11-08

Multithreaded in the Wild

11-01

Multithreaded in the Wild

09-07

Distilled News

09-07

Cosmos DB for Data Science

09-07

Whats new on arXiv

09-07

Connected Arms – Can AI Revolutionize Prosthetic Devices & Make them More Affordable?

09-07

R Packages worth a look

09-07

R Packages worth a look

09-07

Adversarial Examples, Explained

10-16

Naive Bayes Classifier: A Geometric Analysis of the Naivete. Part 1

09-07

Naive Bayes Classifier: A Geometric Analysis of the Naivete. Part 1

09-07

Math for Machine Learning

01-04

Math for Machine Learning

12-10

Document worth reading: “An Introduction to Mathematical Optimal Control Theory Version 0.2”

10-03

“It’s Always Sunny in Correlationville: Stories in Science,” or, Science should not be a game of Botticelli

09-08

“It’s Always Sunny in Correlationville: Stories in Science,” or, Science should not be a game of Botticelli

09-08

“It’s Always Sunny in Correlationville: Stories in Science,” or, Science should not be a game of Botticelli

09-08

“It’s Always Sunny in Correlationville: Stories in Science,” or, Science should not be a game of Botticelli

09-08

R Packages worth a look

09-08

If you did not already know

09-08

Document worth reading: “A Survey of Modern Object Detection Literature using Deep Learning”

12-03

If you did not already know

09-08

If you did not already know

09-08

If you did not already know

09-08

R Tip: Give data.table a Try

09-08

The evolution of pace in popular movies

11-24

R Tip: Give data.table a Try

09-08

Labeling Unstructured Text for Meaning to Achieve Predictive Lift

10-31

Distilled News

09-08

Document worth reading: “Accelerating CNN inference on FPGAs: A Survey”

09-08

Autonomy – Do we have the choice?

11-21

If you did not already know

09-09

If you did not already know

09-09

“Check out table 4.”

09-09

“Check out table 4.”

09-09

Magister Dixit

09-09

Magister Dixit

09-09

Pear Therapeutics: Data Scientist [San Francisco, CA]

01-11

UnitedHealth Group: Director, Data Science [Minnetonka, MN]

12-19

LoyaltyOne: Consultant Category Manager / Analyst, Client Services [Westborough, MA]

12-17

LoyaltyOne: Consultant Category Manager / Analyst, Client Services [Westborough, MA]

12-14

LoyaltyOne: Manager, CPG [Westborough, MA]

12-14

Cummins: Reliability Analytics Leader [Columbus, IN]

12-13

CBH Group: Data Scientist [Perth, Australia]

12-11

CBH Group: Sr Data Scientist [Perth, Australia]

12-11

AI, Data Science, Analytics Main Developments in 2018 and Key Trends for 2019

12-03

Mega-PAW Las Vegas Registration is Live & Super Early Bird Pricing is Now Available!

11-20

UnitedHealth Group: Sr Director, Decision Analytics [Minnetonka, MN]

11-19

Discourse Network Analysis: Undertaking Literature Reviews in R

11-15

Bank of Canada: Data Scientist [Ottawa, Canada]

10-29

Distilled News

09-24

Document worth reading: “Analytics for the Internet of Things: A Survey”

09-12

Magister Dixit

09-09

This New [AI] Software Constantly Improves – and that Makes all the Difference

09-21

How to Implement AI-First Business Models at Scale

09-21

Why Would Prosthetic Arms Need to See or Connect to Cloud AI?

09-10

Document worth reading: “Concept Tagging for Natural Language Understanding: Two Decadelong Algorithm Development”

09-10

Document worth reading: “Concept Tagging for Natural Language Understanding: Two Decadelong Algorithm Development”

09-10

Document worth reading: “Concept Tagging for Natural Language Understanding: Two Decadelong Algorithm Development”

09-10

What if a big study is done and nobody reports it?

09-10

What if a big study is done and nobody reports it?

09-10

The Long Tail of Medical Data

11-12

What if a big study is done and nobody reports it?

09-10

Document worth reading: “Quantizing deep convolutional networks for efficient inference: A whitepaper”

09-10

Document worth reading: “Quantizing deep convolutional networks for efficient inference: A whitepaper”

09-10

Document worth reading: “Quantizing deep convolutional networks for efficient inference: A whitepaper”

09-10

If you did not already know

09-10

If you did not already know

09-10

If you did not already know

09-10

R Packages worth a look

09-10

Working with US Census Data in R

11-07

Table of Contents for PIM

10-20

Researchers.one: A souped-up Arxiv with pre- and post-publication review

09-10

A Quick Appreciation of the R transform Function

09-10

A Quick Appreciation of the R transform Function

09-10

A Quick Appreciation of the R transform Function

09-10

Import AI 111: Hacking computers with Generative Adversarial Networks, Facebook trains world-class speech translation in 85 minutes via 128 GPUs, and Europeans use AI to classify 1,000-year-old graffiti.

09-10

Import AI 111: Hacking computers with Generative Adversarial Networks, Facebook trains world-class speech translation in 85 minutes via 128 GPUs, and Europeans use AI to classify 1,000-year-old graffiti.

09-10

Who is the greatest finisher in soccer?

01-10

Who is the greatest finisher in soccer?

01-10

Import AI 111: Hacking computers with Generative Adversarial Networks, Facebook trains world-class speech translation in 85 minutes via 128 GPUs, and Europeans use AI to classify 1,000-year-old graffiti.

09-10

Import AI 111: Hacking computers with Generative Adversarial Networks, Facebook trains world-class speech translation in 85 minutes via 128 GPUs, and Europeans use AI to classify 1,000-year-old graffiti.

09-10

Google Dataset Search : Google’s New Data Search Engine

09-10

Satellite imagery generation with Generative Adversarial Networks (GANs)

01-11

R Packages worth a look

09-11

R Packages worth a look

09-11

R Packages worth a look

09-11

R Packages worth a look

09-11

Mouse Among the Cats

09-11

Mouse Among the Cats

09-11

Whats new on arXiv

09-11

Paris ML E#2 S#6: Conscience, Code Analysis, Can a machine learn like a child?

11-14

Distilled News

09-11

If you did not already know

09-11

Run SQL queries from your SageMaker notebooks using Amazon Athena

09-12

Run SQL queries from your SageMaker notebooks using Amazon Athena

09-12

If not Notebooks, then what? Look to Literate Programming

09-12

If not Notebooks, then what? Look to Literate Programming

09-12

If not Notebooks, then what? Look to Literate Programming

09-12

Narcolepsy Could Be ‘Sleeper Effect’ in Trump and Brexit Campaigns

09-12

Narcolepsy Could Be ‘Sleeper Effect’ in Trump and Brexit Campaigns

09-12

Java Home Made Face Recognition Application

09-12

Java Home Made Face Recognition Application

09-12

If you did not already know

09-12

If you did not already know

09-12

Practical Data Science with R2

09-12

Practical Data Science with R2

09-12

✚ Repetitions, Data Analysis as Brainstorm

01-10

✚ Avoiding D3, Using D3, and Why I Use D3

01-03

✚ Tufte Tweet Follow-up; Visualization Tools and Resources Roundup for December 2018

12-20

AzureStor: an R package for working with Azure storage

12-18

AzureStor: an R package for working with Azure storage

12-18

Escaping the macOS 10.14 (Mojave) Filesystem Sandbox with R / RStudio

11-09

Practical Data Science with R2

09-12

Icon making with ggplot2 and magick

01-03

Practical Data Science with R2

09-12

Practical Data Science with R2

09-12

R Packages worth a look

09-12

R Packages worth a look

09-12

R Packages worth a look

09-12

The Benefits of Active Learning for Data Science Skills

09-12

The Benefits of Active Learning for Data Science Skills

09-12

Against Arianism 2: Arianism Grande

09-12

Because it's Friday: Hurricane Trackers

09-14

Hurricane Florence trackers

09-12

Because it's Friday: Hurricane Trackers

09-14

Hurricane Florence trackers

09-12

Hurricane Florence trackers

09-12

Hurricane Florence trackers

09-12

Hurricane Florence trackers

09-12

Announcing wrapr 1.6.2

09-13

Quoting Concatenate

12-16

Quoting Concatenate

12-16

coalesce with wrapr

11-03

coalesce with wrapr

11-03

Announcing wrapr 1.6.2

09-13

Announcing wrapr 1.6.2

09-13

Document worth reading: “A Taxonomy for Neural Memory Networks”

09-13

Document worth reading: “A Taxonomy for Neural Memory Networks”

09-13

Document worth reading: “A Taxonomy for Neural Memory Networks”

09-13

R Packages worth a look

09-13

The Waiting Time Paradox, or, Why Is My Bus Always Late?

09-13

The Waiting Time Paradox, or, Why Is My Bus Always Late?

09-13

“Dissolving the Fermi Paradox”

01-05

The Waiting Time Paradox, or, Why Is My Bus Always Late?

09-13

The Waiting Time Paradox, or, Why Is My Bus Always Late?

09-13

The Waiting Time Paradox, or, Why Is My Bus Always Late?

09-13

Discussion of effects of growth mindset: Let’s not demand unrealistic effect sizes.

09-13

Mapillary uses Amazon Rekognition to work towards building parking solutions for US cities

09-13

Amazon Rekognition announces updates to its face detection, analysis, and recognition capabilities

11-22

YOLO object detection with OpenCV

11-12

Mapillary uses Amazon Rekognition to work towards building parking solutions for US cities

09-13

YOLO object detection with OpenCV

11-12

Mapillary uses Amazon Rekognition to work towards building parking solutions for US cities

09-13

When cycling is faster than driving

12-11

Mapillary uses Amazon Rekognition to work towards building parking solutions for US cities

09-13

✚ Google Dataset Search Impressions, the Challenges of Looking for Data, and Other Places to Find Data

09-13

✚ Google Dataset Search Impressions, the Challenges of Looking for Data, and Other Places to Find Data

09-13

Learn how to create data-driven marketing team

10-25

N=1 survey tells me Cynthia Nixon will lose by a lot (no joke)

09-13

N=1 survey tells me Cynthia Nixon will lose by a lot (no joke)

09-13

N=1 survey tells me Cynthia Nixon will lose by a lot (no joke)

09-13

N=1 survey tells me Cynthia Nixon will lose by a lot (no joke)

09-13

Can we predict the crawling of the Google-Bot?

11-06

Winner Interview | Particle Tracking Challenge first runner-up, Pei-Lien Chou

09-14

Latest Trends in Computer Vision Technology and Applications

11-07

Winner Interview | Particle Tracking Challenge first runner-up, Pei-Lien Chou

09-14

Here are the most popular Python IDEs / Editors

12-07

Adversarial Examples, Explained

10-16

Winner Interview | Particle Tracking Challenge first runner-up, Pei-Lien Chou

09-14

Divergent and Convergent Phases of Data Analysis

09-14

How simpleshow uses Amazon Polly to voice stories in their explainer videos

01-11

Amazon Polly adds Italian and Castilian Spanish voices, and Mexican Spanish language support

11-12

Meet Zhiyu—the first Mandarin Chinese voice for Amazon Polly

09-14

Meet Zhiyu—the first Mandarin Chinese voice for Amazon Polly

09-14

An R Shiny app to recognize flower species

12-17

If you did not already know

11-12

Meet Zhiyu—the first Mandarin Chinese voice for Amazon Polly

09-14

The complex process of obtaining Puerto Rico mortality data: a timeline

09-28

How many deaths were caused by the hurricane in Puerto Rico?

09-14

How many deaths were caused by the hurricane in Puerto Rico?

09-14

How many deaths were caused by the hurricane in Puerto Rico?

09-14

Waffle House index as a storm indicator

09-14

Waffle House index as a storm indicator

09-14

Waffle House index as a storm indicator

09-14

Waffle House index as a storm indicator

09-14

Waffle House index as a storm indicator

09-14

Waffle House index as a storm indicator

09-14

Waffle House index as a storm indicator

09-14

Waffle House index as a storm indicator

09-14

Echo Chamber Incites Online Mob to Attack Math Profs

09-14

Distilled News

09-14

Document worth reading: “Closing the AI Knowledge Gap”

09-14

Because it's Friday: Hurricane Trackers

09-14

Because it's Friday: Hurricane Trackers

09-14

Limit access to a Jupyter notebook instance by IP address

09-14

On “Competition” in the R Ecosystem

09-15

On “Competition” in the R Ecosystem

09-15

Considering sensitivity to unmeasured confounding: part 1

01-02

On “Competition” in the R Ecosystem

09-15

If you did not already know

09-15

NYU Stern: 2019-20 Asst. Professor of Information, Operations & Management Sciences – Information Systems, tenure-track [New York City, NY]

11-14

3-D shadow maps in R: the rayshader package

09-26

One Drink Per Day, Your Chances of Developing an Alcohol-Related Condition

09-25

If you did not already know

09-15

R Packages worth a look

09-15

R Packages worth a look

09-15

R Packages worth a look

09-15

Better R Code with wrapr Dot Arrow

09-15

Piping into ggplot2

10-13

Piping into ggplot2

10-13

Better R Code with wrapr Dot Arrow

09-15

Better R Code with wrapr Dot Arrow

09-15

Better R Code with wrapr Dot Arrow

09-15

High-profile statistical errors occur in the physical sciences too, it’s not just a problem in social science.

09-15

Parameterizing with bquote

09-16

Parameterizing with bquote

09-16

Parameterizing with bquote

09-16

Parameterizing with bquote

09-16

R Packages worth a look

09-16

automl package: part 1/2 why and how

10-21

Distilled News

09-22

R Packages worth a look

09-16

R Packages worth a look

09-16

Document worth reading: “A practical tutorial on autoencoders for nonlinear feature fusion: Taxonomy, models, software and guidelines”

09-16

Document worth reading: “A practical tutorial on autoencoders for nonlinear feature fusion: Taxonomy, models, software and guidelines”

09-16

Document worth reading: “A practical tutorial on autoencoders for nonlinear feature fusion: Taxonomy, models, software and guidelines”

09-16

Document worth reading: “Introduction to Nonnegative Matrix Factorization”

09-16

French Mortality Poster

12-27

An R Shiny app to recognize flower species

12-17

KDnuggets™ News 18:n41, Oct 31: Introduction to Deep Learning with Keras; Easy Named Entity Recognition with Scikit-Learn

10-31

Document worth reading: “Introduction to Nonnegative Matrix Factorization”

09-16

Document worth reading: “Introduction to Nonnegative Matrix Factorization”

09-16

Don’t get fooled by observational correlations

09-16

Don’t get fooled by observational correlations

09-16

Monotonicity constraints in machine learning

09-16

Monotonicity constraints in machine learning

09-16

Distilled News

09-17

If you did not already know

09-17

Document worth reading: “A Survey on Trust Modeling from a Bayesian Perspective”

11-22

Distilled News

10-09

If you did not already know

09-17

What to do when your measured outcome doesn’t quite line up with what you’re interested in?

09-17

Distilled News

11-29

What to do when your measured outcome doesn’t quite line up with what you’re interested in?

09-17

How to Optimise Ad CTR with Reinforcement Learning

09-24

How to Optimise Ad CTR with Reinforcement Learning

09-17

If you did not already know

09-17

If you did not already know

09-17

R Packages worth a look

12-19

One Recipe Step to Rule Them All

12-03

Cuisine Ingredients

09-18

Cuisine Ingredients

09-18

R Packages worth a look

09-17

How to import a directory of csvs at once with base R and data.table. Can you guess which way is the fastest?

10-13

R Packages worth a look

09-17

R Packages worth a look

09-17

R Packages worth a look

09-17

If you did not already know

12-01

The blocks and rows theory of data shaping

11-02

R Packages worth a look

09-17

BRUNO: A Deep Recurrent Model for Exchangeable Data

09-17

BRUNO: A Deep Recurrent Model for Exchangeable Data

09-17

BRUNO: A Deep Recurrent Model for Exchangeable Data

09-17

Cuisine Ingredients

09-18

Cuisine Ingredients

09-18

Cuisine Ingredients

09-18

Cuisine Ingredients

09-18

Cuisine Ingredients

09-18

Cuisine Ingredients

09-18

Distilled News

09-18

Text Preprocessing in Python: Steps, Tools, and Examples

11-06

Why, oh why, do so many people embrace the Pacific Garbage Cleanup nonsense? (I have a theory).

09-18

Text Preprocessing in Python: Steps, Tools, and Examples

11-06

Why, oh why, do so many people embrace the Pacific Garbage Cleanup nonsense? (I have a theory).

09-18

Why, oh why, do so many people embrace the Pacific Garbage Cleanup nonsense? (I have a theory).

09-18

Not Hotdog: A Shiny app using the Custom Vision API

09-18

Not Hotdog: A Shiny app using the Custom Vision API

09-18

Not Hotdog: A Shiny app using the Custom Vision API

09-18

Document worth reading: “On-Disk Data Processing: Issues and Future Directions”

09-18

Document worth reading: “On-Disk Data Processing: Issues and Future Directions”

09-18

Document worth reading: “On-Disk Data Processing: Issues and Future Directions”

09-18

Document worth reading: “On-Disk Data Processing: Issues and Future Directions”

09-18

How to Solve the ModelOps Challenge

10-18

Dataiku: “Multimodal Force Majeure” Among Predictive Analytics & ML Platforms

09-18

If you did not already know

11-17

Dataiku: “Multimodal Force Majeure” Among Predictive Analytics & ML Platforms

09-18

R Packages worth a look

09-19

Dataiku: “Multimodal Force Majeure” Among Predictive Analytics & ML Platforms

09-18

Save time and money by filtering faces during indexing with Amazon Rekognition

09-18

A quick look at GHCN version 4

11-03

Save time and money by filtering faces during indexing with Amazon Rekognition

09-18

Variety is the Secret Sauce for Big Discoveries in Big Data

09-18

Variety is the Secret Sauce for Big Discoveries in Big Data

09-18

Variety is the Secret Sauce for Big Discoveries in Big Data

09-18

If you did not already know

09-18

If you did not already know

09-18

If you did not already know

11-06

How to generalize (algorithmically)

09-18

How to generalize (algorithmically)

09-18

Document worth reading: “The Three Pillars of Machine-Based Programming”

09-18

Training models with unequal economic error costs using Amazon SageMaker

09-18

R Packages worth a look

09-18

The hot hand—in darts!

09-18

The hot hand—in darts!

09-18

The hot hand—in darts!

09-18

A couple more papers on genetic diversity as an explanation for why Africa and remote Andean countries are so poor while Europe and North America are so wealthy

09-19

A couple more papers on genetic diversity as an explanation for why Africa and remote Andean countries are so poor while Europe and North America are so wealthy

09-19

A couple more papers on genetic diversity as an explanation for why Africa and remote Andean countries are so poor while Europe and North America are so wealthy

09-19

R Packages worth a look

09-19

New Engen improves customer acquisition marketing campaigns using Amazon Rekognition

09-19

New Engen improves customer acquisition marketing campaigns using Amazon Rekognition

09-19

Maps of the issues mentioned most in election advertising

11-05

New Engen improves customer acquisition marketing campaigns using Amazon Rekognition

09-19

Common mistakes when carrying out machine learning and data science

12-06

New Engen improves customer acquisition marketing campaigns using Amazon Rekognition

09-19

Help! I can’t reproduce a machine learning project!

09-19

Help! I can’t reproduce a machine learning project!

09-19

Document worth reading: “Decision-Making with Belief Functions: a Review”

09-19

Document worth reading: “Decision-Making with Belief Functions: a Review”

09-19

Document worth reading: “Decision-Making with Belief Functions: a Review”

09-19

If you did not already know

10-25

PyConUK 2018

09-19

How to graph a function of 4 variables using a grid

09-20

Conway’s Game of Life in R: Or On the Importance of Vectorizing Your R Code

10-28

Conway’s Game of Life in R: Or On the Importance of Vectorizing Your R Code

10-28

How to graph a function of 4 variables using a grid

09-20

Document worth reading: “Automatic Language Identification in Texts: A Survey”

09-20

Judging connectedness of American communities, based on Facebook friendships

09-20

Judging connectedness of American communities, based on Facebook friendships

09-20

Create 3D County Maps Using Density as Z-Axis

11-29

Judging connectedness of American communities, based on Facebook friendships

09-20

Should we be concerned about MRP estimates being used in later analyses? Maybe. I recommend checking using fake-data simulation.

12-09

Judging connectedness of American communities, based on Facebook friendships

09-20

Judging connectedness of American communities, based on Facebook friendships

09-20

Magister Dixit

09-20

Magister Dixit

09-20

Magister Dixit

09-20

Post-publication peer review: who’s qualified?

09-20

Post-publication peer review: who’s qualified?

09-20

Linking Data Science Activities to Business Initiatives Using the Hypothesis Development Canvas

11-29

Document worth reading: “Lectures on Statistics in Theory: Prelude to Statistics in Practice”

11-06

Data Science “Paint by the Numbers” with the Hypothesis Development Canvas

11-02

What is P-value?

09-20

What is P-value?

09-20

October 2018: “Top 40” New Packages

11-29

AI, Machine Learning and Data Science Roundup: September 2018

09-20

The Best Programming Languages for Data Science and Machine Learning in 2018

09-20

The Best Programming Languages for Data Science and Machine Learning in 2018

09-20

✚ Chart Components and Working On Your Graphics Piece-wise

09-20

✚ Chart Components and Working On Your Graphics Piece-wise

09-20

✚ Chart Components and Working On Your Graphics Piece-wise

09-20

AI-Based Virtual Tutors – The Future of Education?

09-21

Applications of R presented at EARL London 2018

09-21

How Pol Brigneti got a Data Analyst job using Dataquest at Belgrave Valley

09-21

How Pol Brigneti got a Data Analyst job using Dataquest at Belgrave Valley

09-21

How Pol Brigneti got a Data Analyst job using Dataquest at Belgrave Valley

09-21

If you did not already know

12-03

If you did not already know

09-21

R Packages worth a look

11-18

If you did not already know

09-21

The rise and plummet of the name Heather

09-21

RcppTOML 0.1.4: Now with TOML v0.5.0

10-23

The rise and plummet of the name Heather

09-21

The rise and plummet of the name Heather

09-21

Using a Column as a Column Index

09-21

R Packages worth a look

09-21

Data Projects WILL Fail - Learn to Fail Quickly & Efficiently

09-21

Data Projects WILL Fail - Learn to Fail Quickly & Efficiently

09-21

Data Projects WILL Fail - Learn to Fail Quickly & Efficiently

09-21

If you did not already know

09-21

Import AI: 123: Facebook sees demands for deep learning services in its data centers grow by 3.5X; why advanced AI might require a global policeforce; and diagnosing natural disasters with deep learning

12-03

If you did not already know

09-21

Building Surveillance System Using USB Camera and Wireless-Connected Raspberry Pi

11-06

This New [AI] Software Constantly Improves – and that Makes all the Difference

09-21

This New [AI] Software Constantly Improves – and that Makes all the Difference

09-21

Nextgov: Machine Learning Could Help Chip Away at the Security Clearance Backlog

09-21

Nextgov: Machine Learning Could Help Chip Away at the Security Clearance Backlog

09-21

Nextgov: Machine Learning Could Help Chip Away at the Security Clearance Backlog

09-21

Nextgov: Machine Learning Could Help Chip Away at the Security Clearance Backlog

09-21

Nextgov: Machine Learning Could Help Chip Away at the Security Clearance Backlog

09-21

Whats new on arXiv

12-25

Timing Column Indexing in R

09-21

A psychology researcher uses Stan, multiverse, and open data exploration to explore human memory

09-21

Multilevel data collection and analysis for weight training (with R code)

09-22

Document worth reading: “Do Deep Learning Models Have Too Many Parameters An Information Theory Viewpoint”

09-22

Document worth reading: “Do Deep Learning Models Have Too Many Parameters An Information Theory Viewpoint”

09-22

Manning Countdown to 2019 – Big Deals on AI, Data Science, Machine Learning books and videos

12-28

If you did not already know

09-22

If you did not already know

09-22

Distilled News

09-22

Statistical Modeling, Causal Inference, and Social Science Regrets Its Decision to Hire Cannibal P-hacker as Writer-at-Large

09-29

R Packages worth a look

09-22

R Packages worth a look

09-22

Statistics Sunday: Introduction to Regular Expressions

11-25

Fewer Headaches (Thanks to Data Science)

10-08

R Packages worth a look

09-23

R Packages worth a look

09-23

R Packages worth a look

09-23

R Packages worth a look

09-23

Document worth reading: “On the Learning Dynamics of Deep Neural Networks”

09-23

Document worth reading: “On the Learning Dynamics of Deep Neural Networks”

09-23

Understand Why ODSC is the Most Recommended Conference for Applied Data Science

10-04

Document worth reading: “On the Learning Dynamics of Deep Neural Networks”

09-23

Document worth reading: “On the Learning Dynamics of Deep Neural Networks”

09-23

Document worth reading: “Graph-based Ontology Summarization: A Survey”

09-23

Document worth reading: “Graph-based Ontology Summarization: A Survey”

09-23

An Intro to Deep Learning in Python

12-06

Document worth reading: “Graph-based Ontology Summarization: A Survey”

09-23

“Tweeking”: The big problem is not where you think it is.

09-23

Whats new on arXiv

09-24

Dataquest helped me get my dream job at Noodle.ai

09-24

Dataquest helped me get my dream job at Noodle.ai

09-24

R Packages worth a look

09-24

Handling Imbalanced Datasets in Deep Learning

12-04

R Packages worth a look

09-24

Phillips-Ouliaris Test For Cointegration

12-17

R Packages worth a look

09-24

Whats new on arXiv

11-30

How to de-Bias Standard Deviation Estimates

11-12

How to de-Bias Standard Deviation Estimates

11-12

R Packages worth a look

11-01

If you did not already know

10-23

R Packages worth a look

09-24

A Subtle Flaw in Some Popular R NSE Interfaces

09-24

Looking back on 2018, looking to 2019

01-07

Using pandas and pymapd for ETL into OmniSci

10-16

If you did not already know

09-24

If you did not already know

09-24

Distilled News

09-25

Amazon SageMaker automatic model tuning produces better models, faster

09-25

Document worth reading: “Data Innovation for International Development: An overview of natural language processing for qualitative data analysis”

09-25

Document worth reading: “Data Innovation for International Development: An overview of natural language processing for qualitative data analysis”

09-25

Document worth reading: “Data Innovation for International Development: An overview of natural language processing for qualitative data analysis”

09-25

Document worth reading: “Data Innovation for International Development: An overview of natural language processing for qualitative data analysis”

09-25

One Drink Per Day, Your Chances of Developing an Alcohol-Related Condition

09-25

One Drink Per Day, Your Chances of Developing an Alcohol-Related Condition

09-25

R Packages worth a look

10-24

If you did not already know

09-25

Help improve lives through Machine Learning by joining the AWS DeepLens Challenge!

09-25

Help improve lives through Machine Learning by joining the AWS DeepLens Challenge!

09-25

You’ve got data on 35 countries, but it’s really just N=3 groups.

09-25

You’ve got data on 35 countries, but it’s really just N=3 groups.

09-25

Distilled News

09-26

Distilled News

09-26

Job opening at CDC: “The Statistician will play a central role in guiding the statistical methods of all major projects of the Epidemiology and Prevention Branch of the CDC Influenza Division, and aid in designing, analyzing, and interpreting research intended to understand the burden of influenza in the US and internationally and identify the best influenza vaccines and vaccine strategies.”

09-26

Distilled News

09-26

R Packages worth a look

10-17

Distilled News

09-26

Inside Higher Ed: Pushing the Boundaries of Learning With AI

09-26

Inside Higher Ed: Pushing the Boundaries of Learning With AI

09-26

Understanding Regression Error Metrics

09-26

Can AI Generate Programs to Help Automate Busy Work?

09-26

Can AI Generate Programs to Help Automate Busy Work?

09-26

Can AI Generate Programs to Help Automate Busy Work?

09-26

Can AI Generate Programs to Help Automate Busy Work?

09-26

R Packages worth a look

12-20

R Packages worth a look

11-26

Can AI Generate Programs to Help Automate Busy Work?

09-26

If you did not already know

09-26

If you did not already know

09-26

Job opening at CDC: “The Statistician will play a central role in guiding the statistical methods of all major projects of the Epidemiology and Prevention Branch of the CDC Influenza Division, and aid in designing, analyzing, and interpreting research intended to understand the burden of influenza in the US and internationally and identify the best influenza vaccines and vaccine strategies.”

09-26

R Packages worth a look

09-26

3-D shadow maps in R: the rayshader package

09-26

3-D shadow maps in R: the rayshader package

09-26

Pivot Billions and Deep Learning enhanced trading models achieve 100% net profit

12-24

Quidditch: is it all about the Snitch?

11-24

The Price of Transformation

09-26

The Price of Transformation

09-26

The Price of Transformation

09-26

The Price of Transformation

09-26

Document worth reading: “Human-Machine Inference Networks For Smart Decision Making: Opportunities and Challenges”

09-26

CBH Group: Sr Data Engineer [Perth, Australia]

12-14

Guest Post: Galin Jones on criteria for promotion and tenture in (bio)statistics departments

10-11

Document worth reading: “Human-Machine Inference Networks For Smart Decision Making: Opportunities and Challenges”

09-26

Document worth reading: “Human-Machine Inference Networks For Smart Decision Making: Opportunities and Challenges”

09-26

R Packages worth a look

09-26

R Packages worth a look

09-26

R Packages worth a look

09-26

R Packages worth a look

09-26

Advantages of Online Data Science Courses

09-26

Using Stacking to Average Bayesian Predictive Distributions (with Discussion)

09-26

Using Stacking to Average Bayesian Predictive Distributions (with Discussion)

09-26

Using Stacking to Average Bayesian Predictive Distributions (with Discussion)

09-26

A potential big problem with placebo tests in econometrics: they’re subject to the “difference between significant and non-significant is not itself statistically significant” issue

09-26

A potential big problem with placebo tests in econometrics: they’re subject to the “difference between significant and non-significant is not itself statistically significant” issue

09-26

A potential big problem with placebo tests in econometrics: they’re subject to the “difference between significant and non-significant is not itself statistically significant” issue

09-26

The Right Kind of Internal Motivation Can Improve Your Studies

01-08

A potential big problem with placebo tests in econometrics: they’re subject to the “difference between significant and non-significant is not itself statistically significant” issue

09-26

Segmenting brain tissue using Apache MXNet with Amazon SageMaker and AWS Greengrass ML Inference – Part 1

09-26

Magister Dixit

11-07

Document worth reading: “Data Science vs. Statistics: Two Cultures”

09-27

Document worth reading: “Data Science vs. Statistics: Two Cultures”

09-27

Deploy your own TensorFlow object detection model to AWS DeepLens

09-27

R Packages worth a look

09-27

R Packages worth a look

11-26

R Packages worth a look

11-23

R Packages worth a look

09-27

R Packages worth a look

01-01

R Packages worth a look

09-27

Magister Dixit

09-27

If you did not already know

09-27

If you did not already know

09-27

If you did not already know

09-27

The Markup is a new journalism venture to examine technology through data

09-28

The Markup is a new journalism venture to examine technology through data

09-28

The Markup is a new journalism venture to examine technology through data

09-28

The Markup is a new journalism venture to examine technology through data

09-28

The Markup is a new journalism venture to examine technology through data

09-28

The complex process of obtaining Puerto Rico mortality data: a timeline

09-28

The complex process of obtaining Puerto Rico mortality data: a timeline

09-28

The complex process of obtaining Puerto Rico mortality data: a timeline

09-28

Machine Reading at Scale – Transfer Learning for Large Text Corpuses

10-17

Document worth reading: “A Survey on Expert Recommendation in Community Question Answering”

09-28

Document worth reading: “A Survey on Expert Recommendation in Community Question Answering”

09-28

R Packages worth a look

09-29

R Packages worth a look

09-29

If you did not already know

10-27

Document worth reading: “How to Maximize the Spread of Social Influence: A Survey”

09-29

Document worth reading: “How to Maximize the Spread of Social Influence: A Survey”

09-29

Document worth reading: “How to Maximize the Spread of Social Influence: A Survey”

09-29

Document worth reading: “How to Maximize the Spread of Social Influence: A Survey”

09-29

The Evolution of Build Engineering in Managing Open Source [Webinar Replay]

11-13

Document worth reading: “How to Maximize the Spread of Social Influence: A Survey”

09-29

Document worth reading: “Physically optimizing inference”

09-29

Statistical Modeling, Causal Inference, and Social Science Regrets Its Decision to Hire Cannibal P-hacker as Writer-at-Large

09-29

Statistical Modeling, Causal Inference, and Social Science Regrets Its Decision to Hire Cannibal P-hacker as Writer-at-Large

09-29

Statistical Modeling, Causal Inference, and Social Science Regrets Its Decision to Hire Cannibal P-hacker as Writer-at-Large

09-29

Overlapping Disks

09-30

Big career opportunities in big data

10-08

Distilled News

09-30

If you did not already know

09-30

If you did not already know

09-30

Present each others’ posters

10-06

If you did not already know

09-30

Document worth reading: “Importance of the Mathematical Foundations of Machine Learning Methods for Scientific and Engineering Applications”

09-30

Document worth reading: “Learning From Positive and Unlabeled Data: A Survey”

11-23

Distilled News

11-16

Discourse Network Analysis: Undertaking Literature Reviews in R

11-15

Document worth reading: “Importance of the Mathematical Foundations of Machine Learning Methods for Scientific and Engineering Applications”

09-30

R Packages worth a look

09-30

R Packages worth a look

09-30

What do you do when someone says, “The quote is, this is the exact quote”—and then misquotes you?

09-30

What do you do when someone says, “The quote is, this is the exact quote”—and then misquotes you?

09-30

What do you do when someone says, “The quote is, this is the exact quote”—and then misquotes you?

09-30

Data Feminism

11-06

What do you do when someone says, “The quote is, this is the exact quote”—and then misquotes you?

09-30

R Packages worth a look

09-30

R Packages worth a look

09-30

McKinsey Datathon: The City Cup17 November, Amsterdam, Stockholm and Zurich. Apply Now

10-19

A Three Month Data Analysis in Excel Could Have Taken Me One Day

10-01

My talk tomorrow (Tues) 4pm in the Biomedical Informatics department (at 168th St)

10-01

My talk tomorrow (Tues) 4pm in the Biomedical Informatics department (at 168th St)

10-01

Center for Ultrasound Research and Translation, Massachusetts General Hospital: Post-Doctoral Scholar / Research Scientist [Boston, MA]

12-31

Introducing medical language processing with Amazon Comprehend Medical

11-27

If you did not already know

11-03

How AI Will Change Healthcare

10-15

My talk tomorrow (Tues) 4pm in the Biomedical Informatics department (at 168th St)

10-01

Distilled News

10-01

Distilled News

10-01

Distilled News

10-01

Distilled News

10-01

PyImageConf 2018 Recap

10-01

PyImageConf 2018 Recap

10-01

Dr. Data Show Video: Why Machine Learning Is the Coolest Science

10-01

Dr. Data Show Video: Why Machine Learning Is the Coolest Science

10-01

Bob Erikson on the 2018 Midterms

10-01

Bob Erikson on the 2018 Midterms

10-01

Bob Erikson on the 2018 Midterms

10-01

Up your open source game with Hacktoberfest at Locke Data!

10-01

Gift ideas for the R lovers

12-14

Up your open source game with Hacktoberfest at Locke Data!

10-01

Import AI 114: Synthetic images take a big leap forward with BigGANs; US lawmakers call for national AI strategy; researchers probe language reasoning via HotspotQA

10-01

Modeling muti-category Outcomes With vtreat

10-01

Math for Machine Learning

01-04

Math for Machine Learning

12-10

Modeling muti-category Outcomes With vtreat

10-01

TEXATA Data Analytics Summit 2018 – Exclusive 30% KDnuggets Discount.

10-09

TEXATA Data Analytics Summit 2018 – Exclusive 30% KDnuggets Discount

10-01

Magister Dixit

10-02

Magister Dixit

10-02

“Moral cowardice requires choice and action.”

10-02

Minimum CRPS vs. maximum likelihood

12-16

“Moral cowardice requires choice and action.”

10-02

“Moral cowardice requires choice and action.”

10-02

Introducing pipe, The Automattic Machine Learning Pipeline

11-20

“Moral cowardice requires choice and action.”

10-02

“Moral cowardice requires choice and action.”

10-02

The Enterprise AI Lab: Not Your Average AI Lab

10-02

The Enterprise AI Lab: Not Your Average AI Lab

10-02

The Enterprise AI Lab: Not Your Average AI Lab

10-02

Unleash a Faster Python on Your Data

10-02

Unleash a Faster Python on Your Data

10-02

Unleash a Faster Python on Your Data

10-02

R Packages worth a look

10-02

R Packages worth a look

10-02

French Mortality Poster

12-27

Heatmaps of Mortality Rates

12-04

Shifting Causes of Death

10-02

Reduced privacy risk in exchange for accuracy in the Census count

12-06

Shifting Causes of Death

10-02

Carol Nickerson explains what those mysterious diagrams were saying

12-22

Shifting Causes of Death

10-02

Cool postdoc position in Arizona on forestry forecasting using tree ring models!

10-02

R Packages worth a look

01-06

Cool postdoc position in Arizona on forestry forecasting using tree ring models!

10-02

Cool postdoc position in Arizona on forestry forecasting using tree ring models!

10-02

If you did not already know

10-02

Math for Machine Learning

01-04

Manning Countdown to 2019 – Big Deals on AI, Data Science, Machine Learning books and videos

12-28

Exploring the Data Jungle Free eBook

12-18

Four Real-Life Machine Learning Use Cases

12-13

Math for Machine Learning

12-10

DATAx Presents: AI AND MACHINE LEARNING TRENDS IN 2019

12-06

The Quick Python Book

12-05

Introducing the First AI / Machine Learning Course With a Job Guarantee

11-30

The Evolution of Build Engineering in Managing Open Source [Webinar Replay]

11-13

Healthcare Analytics Made Simple

11-12

Webinar: Transform Your Stagnant Data Swamp into a Pristine Data Lake, Nov 8

11-01

Tomorrow, Nov 8 Webinar: Transform Your Stagnant Data Swamp into a Pristine Data Lake

11-01

Webinar – Integrate AI Across Insurance Operations to Turbocharge Tech Transformation, Nov 14

10-31

How to Mitigate Open Source License Risks

10-30

The Definitive Guide to AI’s “Black Box” Problem

10-17

DevOps 2.0: Applying Machine Learning in the CI/CD Chain

10-02

DevOps 2.0: Applying Machine Learning in the CI/CD Chain

10-02

DevOps 2.0: Applying Machine Learning in the CI/CD Chain

10-02

DevOps 2.0: Applying Machine Learning in the CI/CD Chain

10-02

Document worth reading: “A Review for Weighted MinHash Algorithms”

01-02

The Backpropagation Algorithm Demystified

01-02

Don’t Peek part 2: Predictions without Test Data

11-18

If you did not already know

10-14

How to Create a Simple Neural Network in Python

10-02

David Weakliem points out that both economic and cultural issues can be more or less “moralized.”

10-03

David Weakliem points out that both economic and cultural issues can be more or less “moralized.”

10-03

AI for Good: slides and notebooks from the ODSC workshop

11-13

In case you missed it: September 2018 roundup

10-03

Easy time-series prediction with R: a tutorial with air traffic data from Lux Airport

11-14

In case you missed it: September 2018 roundup

10-03

Build a Predictive Maintenance Engine with GIS Data

10-03

Build a Predictive Maintenance Engine with GIS Data

10-03

Build a Predictive Maintenance Engine with GIS Data

10-03

Build a Predictive Maintenance Engine with GIS Data

10-03

Mapping opportunity for children, based on where they grew up

10-03

Mapping opportunity for children, based on where they grew up

10-03

Mapping opportunity for children, based on where they grew up

10-03

Mapping opportunity for children, based on where they grew up

10-03

Industry leaders to speak at Mega-PAW, Las Vegas – June 16-20

01-09

Speak at Mega-PAW Vegas 2019 – on Machine Learning Deployment (Apply by Nov 15)

10-22

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: October and Beyond

10-03

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: October and Beyond

10-03

Spelling 2.0: Improved Markdown and RStudio Support

12-20

Python Dictionary Tutorial

10-03

Spelling 2.0: Improved Markdown and RStudio Support

12-20

Python Dictionary Tutorial

10-03

Python Dictionary Tutorial

10-03

Python Dictionary Tutorial

10-03

PyTorch 1.0 preview now available in Amazon SageMaker and the AWS Deep Learning AMIs

10-03

Document worth reading: “Bayesian model reduction”

10-03

Document worth reading: “Bayesian model reduction”

10-03

I walk the (train) line – part deux – the weight loss continues

01-12

KDnuggets™ News 18:n37, Oct 3: Mathematics of Machine Learning; Effective Transfer Learning for NLP; Path Analysis with R

10-03

If you did not already know

10-03

Top KDnuggets tweets, Sep 26 – Oct 2: Why building your own Deep Learning Computer is 10x cheaper than AWS; 6 Steps To Write Any Machine Learning Algorithm

10-03

Top 3 Trends in Deep Learning

10-03

Top 3 Trends in Deep Learning

10-03

R Packages worth a look

10-03

R Packages worth a look

10-03

Peak Non-Creepy Dating Pool

11-27

R Packages worth a look

10-03

“Snip Insights” – An Open Source Cross-Platform AI Tool for Intelligent Screen Capture

10-03

Document worth reading: “An Introduction to Mathematical Optimal Control Theory Version 0.2”

10-03

Document worth reading: “An Introduction to Mathematical Optimal Control Theory Version 0.2”

10-03

Document worth reading: “An Introduction to Mathematical Optimal Control Theory Version 0.2”

10-03

Document worth reading: “An Introduction to Mathematical Optimal Control Theory Version 0.2”

10-03

R Packages worth a look

10-03

How to use common workflows on Amazon SageMaker notebook instances

10-03

GitHub Python Data Science Spotlight: High Level Machine Learning & NLP, Ensembles, Command Line Viz & Docker Made Easy

10-16

How to use common workflows on Amazon SageMaker notebook instances

10-03

Deep Learning Without Labels

10-03

Distilled News

01-07

Top 10 Data Science Tools (other than SQL Python R)

12-21

Quantcast: Sr Applied Scientist, Audience Platform [Seattle, WA]

11-20

Big Data Day Camp: Big Data Tools & Techniques (October 25-26)

10-04

Why Vegetarians Miss Fewer Flights – Five Bizarre Insights from Data

01-12

“Six Signs of Scientism”: where I disagree with Haack

10-04

“Six Signs of Scientism”: where I disagree with Haack

10-04

“Six Signs of Scientism”: where I disagree with Haack

10-04

“Six Signs of Scientism”: where I disagree with Haack

10-04

Easy CI/CD of GPU applications on Google Cloud including bare-metal using Gitlab and Kubernetes

12-14

What does a data scientist REALLY look like?

11-09

Peter Bull discusses the importance of human-centered design in data science.

11-05

Top 10 Mistakes to Avoid to Master Data Science

10-10

Top 10 Mistakes to Avoid to Master Data Science

10-04

The role of academia in data science education

11-01

How Can Autonomous Drones Help the Energy and Utilities Industry?

10-23

UnitedHealth Group: Sr .Net Web Developer, UHC E&I [Indianapolis, IN or Green Bay, WI]

10-04

R Packages worth a look

10-04

Why Almost Everything You’ve Learned About Cheap Custom Essay Is Wrong and What You Should Know

10-04

Why Almost Everything You’ve Learned About Cheap Custom Essay Is Wrong and What You Should Know

10-04

Why Almost Everything You’ve Learned About Cheap Custom Essay Is Wrong and What You Should Know

10-04

If you did not already know

01-13

NYC buses: simple Cubist regression

11-29

R Packages worth a look

10-15

Society of Machines: The Complex Interaction of Agents

10-04

Society of Machines: The Complex Interaction of Agents

10-04

Beyond text: How Spokata uses Amazon Polly to make news and information universally accessible as real-time audio

10-04

Beyond text: How Spokata uses Amazon Polly to make news and information universally accessible as real-time audio

10-04

What Does it Take to Train Deep Learning Models On-Device?

10-04

UnitedHealth Group: Sr Director, Decision Analytics [Minnetonka, MN]

11-19

UnitedHealth Group: Data Analytics and Reporting Lead [Minnetonka, MN or Telecommute]

11-16

UnitedHealth Group: Senior Principal Data Scientist [Telecommute, Central or Eastern Time Zones]

11-16

UnitedHealth Group: UHC Digital Director of Project Management [Minnetonka, MN]

10-04

UnitedHealth Group: UHC Digital Project Manager [Minnetonka, MN]

10-04

Speed Up With Microsoft

10-04

Segmenting brain tissue using Apache MXNet with Amazon SageMaker and AWS Greengrass ML Inference – Part 2

10-04

Chromebook Data Science

10-04

Chromebook Data Science

10-04

Chromebook Data Science

10-04

Document worth reading: “A Survey: Non-Orthogonal Multiple Access with Compressed Sensing Multiuser Detection for mMTC”

12-31

Chromebook Data Science

10-04

Amazon SageMaker Neural Topic Model now supports auxiliary vocabulary channel, new topic evaluation metrics, and training subsampling

10-04

✚ This is Misleading, This is Not Really Misleading

10-04

✚ This is Misleading, This is Not Really Misleading

10-04

✚ This is Misleading, This is Not Really Misleading

10-04

If you did not already know

10-04

Short Article Reveals the Undeniable Facts About College Essay Writing Service and How It Can Affect You

10-04

R Packages worth a look

01-04

How to deploy a predictive service to Kubernetes with R and the AzureContainers package

12-12

Creating Tables Using R and Pure HTML

12-05

Automated Email Reports with R

11-01

Short Article Reveals the Undeniable Facts About College Essay Writing Service and How It Can Affect You

10-04

Using the Economics Value Curve to Drive Digital Transformation

12-27

Linking Data Science Activities to Business Initiatives Using the Hypothesis Development Canvas

11-29

Data Science “Paint by the Numbers” with the Hypothesis Development Canvas

11-02

3 Stages of Creating Smart

10-04

Semantic Segmentation: Wiki, Applications and Resources

10-04

Semantic Interoperability: Are you training your AI by mixing data sources that look the same but aren’t?

10-09

Semantic Segmentation: Wiki, Applications and Resources

10-04

Accelerate model training using faster Pipe mode on Amazon SageMaker

10-05

Accelerate model training using faster Pipe mode on Amazon SageMaker

10-05

Now use Pipe mode with CSV datasets for faster training on Amazon SageMaker built-in algorithms

11-01

Accelerate model training using faster Pipe mode on Amazon SageMaker

10-05

Accelerate model training using faster Pipe mode on Amazon SageMaker

10-05

“Ivy League Football Saw Large Reduction in Concussions After New Kickoff Rules”

10-05

“Ivy League Football Saw Large Reduction in Concussions After New Kickoff Rules”

10-05

“Ivy League Football Saw Large Reduction in Concussions After New Kickoff Rules”

10-05

Colorado State University: Assistant Professor in Industrial and Organizational (IO) Psychology [Fort Collins, CO]

10-05

Colorado State University: Assistant Professor in Industrial and Organizational (IO) Psychology [Fort Collins, CO]

10-05

Journey from Non-Technical background to an expert in Data Science

10-05

If you did not already know

10-05

If you did not already know

10-05

If you did not already know

10-05

If you did not already know

10-05

Magister Dixit

10-05

Distilled News

10-05

A few upcoming R conferences

10-05

A few upcoming R conferences

10-05

A few upcoming R conferences

10-05

A few upcoming R conferences

10-05

Document worth reading: “Detecting Dead Weights and Units in Neural Networks”

10-05

If you did not already know

10-05

If you did not already know

12-26

If you did not already know

10-05

Basic Image Data Analysis Using Python – Part 4

10-05

Challenges & Solutions for Production Recommendation Systems

10-05

Proof that 1/7 is a repeated decimal

10-05

Proof that 1/7 is a repeated decimal

10-05

University of Nebraska at Omaha: Faculty Position in Computer Science [Omaha, NE]

10-05

University of Nebraska at Omaha: Faculty Position in Computer Science [Omaha, NE]

10-05

The Distribution of Time Between Recessions: Revisited (with MCHT)

11-19

Quick Significance Calculations for A/B Tests in R

10-06

Quick Significance Calculations for A/B Tests in R

10-06

Distilled News

10-06

French Mortality Poster

12-27

Present each others’ posters

10-06

Present each others’ posters

10-06

ShinyProxy Christmas Release

12-23

Fewer Headaches (Thanks to Data Science)

10-08

Document worth reading: “An Analysis of Hierarchical Text Classification Using Word Embeddings”

10-06

Document worth reading: “An Analysis of Hierarchical Text Classification Using Word Embeddings”

10-06

R Packages worth a look

10-06

R Packages worth a look

12-26

R Packages worth a look

10-06

“Fudged statistics on the Iraq War death toll are still circulating today”

10-06

“Fudged statistics on the Iraq War death toll are still circulating today”

10-06

Sunday Morning Video (in french): Les travaux de Grothendieck.sur les espaces de Banach, Gilles. Pisier (Lectures grothendieckiennes)

10-07

LightOn: Forward We Go !

12-20

Paris ML E#2 S#6: Conscience, Code Analysis, Can a machine learn like a child?

11-14

Paris Machine Learning

10-10

A Neural Architecture for Bayesian CompressiveSensing over the Simplex via Laplace Techniques

10-08

Job: Postdoctoral Researcher in Small Data Deep Learning and Explainable Machine Learning, Livermore, CA

10-08

Sunday Morning Video (in french): Les travaux de Grothendieck.sur les espaces de Banach, Gilles. Pisier (Lectures grothendieckiennes)

10-07

Sunday Morning Video (in french): Les travaux de Grothendieck.sur les espaces de Banach, Gilles. Pisier (Lectures grothendieckiennes)

10-07

Sunday Morning Video (in french): Les travaux de Grothendieck.sur les espaces de Banach, Gilles. Pisier (Lectures grothendieckiennes)

10-07

LightOn: Forward We Go !

12-20

Paris Machine Learning

10-10

A Neural Architecture for Bayesian CompressiveSensing over the Simplex via Laplace Techniques

10-08

Sunday Morning Video (in french): Les travaux de Grothendieck.sur les espaces de Banach, Gilles. Pisier (Lectures grothendieckiennes)

10-07

R Packages worth a look

10-07

R Packages worth a look

10-07

R Packages worth a look

01-03

CRAN’s New Missing Data Task View

10-26

R Packages worth a look

10-07

If you did not already know

10-07

If you did not already know

10-07

If you did not already know

10-07

Bayesian inference and religious belief

10-07

Bayesian inference and religious belief

10-07

Bayesian inference and religious belief

10-07

Bayesian inference and religious belief

10-07

Document worth reading: “Big Data Systems Meet Machine Learning Challenges: Towards Big Data Science as a Service”

10-07

Document worth reading: “Big Data Systems Meet Machine Learning Challenges: Towards Big Data Science as a Service”

10-07

Document worth reading: “Big Data Systems Meet Machine Learning Challenges: Towards Big Data Science as a Service”

10-07

Tidyverse 'Starts_with' in M/Power Query

10-08

If you did not already know

10-08

A Neural Architecture for Bayesian CompressiveSensing over the Simplex via Laplace Techniques

10-08

Distilled News

12-31

BIG, small or Right Data: Which is the proper focus?

10-08

Fewer Headaches (Thanks to Data Science)

10-08

Statistics Sunday: What Fast Food Can Tell Us About a Community and the World

10-21

Fewer Headaches (Thanks to Data Science)

10-08

Rising test scores . . . reported as stagnant test scores

10-08

Rising test scores . . . reported as stagnant test scores

10-08

Rising test scores . . . reported as stagnant test scores

10-08

Big career opportunities in big data

10-08

Spam Detection with Natural Language Processing – Part 3

11-01

Big career opportunities in big data

10-08

Business Analysis (BA) Career Path

10-11

Big career opportunities in big data

10-08

Big career opportunities in big data

10-08

R Packages worth a look

10-08

If you did not already know

12-21

Don’t Peek: Deep Learning without looking … at test data

10-08

Don’t Peek: Deep Learning without looking … at test data

10-08

Don’t Peek: Deep Learning without looking … at test data

10-08

The One reason you should learn Python

10-11

Things you should know when traveling via the Big Data Engineering hype-train

10-08

The economic consequences of MOOCs

10-08

The economic consequences of MOOCs

10-08

Remembering Michael

10-08

Document worth reading: “An Overview of Blockchain Integration with Robotics and Artificial Intelligence”

11-08

Remembering Michael

10-08

Running the Same Task in Python and R

10-08

Running the Same Task in Python and R

10-08

Distilled News

10-08

Track the number of coffees consumed using AWS DeepLens

10-09

Track the number of coffees consumed using AWS DeepLens

10-09

Simulating the iSight Camera in the iOS Simulator

10-09

Simulating the iSight Camera in the iOS Simulator

10-09

Simulating the iSight Camera in the iOS Simulator

10-09

Simulating the iSight Camera in the iOS Simulator

10-09

Processing complicated package outputs

10-09

Processing complicated package outputs

10-09

Processing complicated package outputs

10-09

Processing complicated package outputs

10-09

R Packages worth a look

10-09

R Packages worth a look

10-09

R Packages worth a look

10-09

R Packages worth a look

12-10

R Packages worth a look

10-09

Learning Acrobatics by Watching YouTube

10-09

Learning Acrobatics by Watching YouTube

10-09

Semantic Interoperability: Are you training your AI by mixing data sources that look the same but aren’t?

10-09

Semantic Interoperability: Are you training your AI by mixing data sources that look the same but aren’t?

10-09

R Packages worth a look

12-04

A deep dive into glmnet: standardize

11-15

Semantic Interoperability: Are you training your AI by mixing data sources that look the same but aren’t?

10-09

How To Learn Data Science If You’re Broke

10-09

How To Learn Data Science If You’re Broke

10-09

State of Deep Learning and Major Advances: H2 2018 Review

12-13

How To Learn Data Science If You’re Broke

10-09

How To Learn Data Science If You’re Broke

10-09

RTutor: Driving Electric or Gasoline Cars? Comparing the Pollution Damages

11-21

Leading the Charge 🔌 🚘: 10 Charts on Electric Vehicles in Plotly

10-09

Leading the Charge 🔌 🚘: 10 Charts on Electric Vehicles in Plotly

10-09

Leading the Charge 🔌 🚘: 10 Charts on Electric Vehicles in Plotly

10-09

R Consortium grant applications due October 31

10-09

R Consortium grant applications due October 31

10-09

R Consortium grant applications due October 31

10-09

R Consortium grant applications due October 31

10-09

SiliconANGLE: Machine learning automation startup DataRobot lands $100M round

10-24

R Consortium grant applications due October 31

10-09

R Consortium grant applications due October 31

10-09

Building an Image Classifier Running on Raspberry Pi

10-09

How to sync Fastmail's CardDAV to use with mutt + abook

11-08

Building an Image Classifier Running on Raspberry Pi

10-09

Building an Image Classifier Running on Raspberry Pi

10-09

Building an Image Classifier Running on Raspberry Pi

10-09

Shopper Sentiment: Analyzing in-store customer experience

10-09

Shopper Sentiment: Analyzing in-store customer experience

10-09

If you did not already know

10-09

Technoslavia 2.5: Open Source Topography

11-07

Open Source Deep Dive with Olivier Grisel

10-29

All About Open Source

10-09

All About Open Source

10-09

All About Open Source

10-09

If you did not already know

11-19

Document worth reading: “Deep Learning for Generic Object Detection: A Survey”

10-10

Data Mining Book – Chapter Download

12-04

Data Mining Book – Chapter Download

11-02

Data Mining Book: Chapter Download.

10-10

Data Mining Book – Chapter Download

12-04

R Packages worth a look

11-24

If you did not already know

11-23

Introducing Drexel new online MS in Data Science

11-15

Data Mining Book: Chapter Download.

10-10

Data Mining Book: Chapter Download.

10-10

The Golden Rule of Nudge

10-10

The Golden Rule of Nudge

10-10

R Packages worth a look

11-28

R Packages worth a look

10-10

Multilevel Modeling Solves the Multiple Comparison Problem: An Example with R

10-31

R Packages worth a look

10-10

Top November Stories: The Most in Demand Skills for Data Scientists; What is the Best Python IDE for Data Science?

12-11

Top September Stories: Essential Math for Data Science: Why and How; Machine Learning Cheat Sheets

10-10

Top December Stories: Why You Shouldn’t be a Data Science Generalist

01-09

Top September Stories: Essential Math for Data Science: Why and How; Machine Learning Cheat Sheets

10-10

Life in Madrid seen through BiciMAD

10-10

Life in Madrid seen through BiciMAD

10-10

KDnuggets™ News 18:n38, Oct 10: Concise Explanation of Learning Algorithms; Why I Call Myself a Data Scientist; Linear Regression in the Wild

10-10

anytime – dates in R

11-08

KDnuggets™ News 18:n40, Oct 24: Graphs Are The Next Frontier In Data Science; Apache Spark Intro for Beginners

10-24

KDnuggets™ News 18:n38, Oct 10: Concise Explanation of Learning Algorithms; Why I Call Myself a Data Scientist; Linear Regression in the Wild

10-10

Top Stories of 2018: 9 Must-have skills you need to become a Data Scientist, updated; Python eats away at R: Top Software for Analytics, Data Science, Machine Learning in 2018

12-14

KDnuggets™ News 18:n45, Nov 28: Your Favorite Python IDE/editor? Intro to Data Science for Managers

11-28

Cartoon: Thanksgiving, Big Data, and Turkey Data Science.

11-22

LinkedIn Top Voices 2018: Data Science & Analytics

11-13

KDnuggets™ News 18:n41, Oct 31: Introduction to Deep Learning with Keras; Easy Named Entity Recognition with Scikit-Learn

10-31

New Poll: How Important is Understanding Machine Learning Models?

10-30

KDnuggets™ News 18:n40, Oct 24: Graphs Are The Next Frontier In Data Science; Apache Spark Intro for Beginners

10-24

KDnuggets™ News 18:n39, Oct 17: 10 Best Mobile Apps for Data Scientist; Vote in new poll: Largest dataset you analyzed?

10-17

KDnuggets™ News 18:n38, Oct 10: Concise Explanation of Learning Algorithms; Why I Call Myself a Data Scientist; Linear Regression in the Wild

10-10

How to get a Data Science Job in 6 Months

10-10

Preprocessing for Deep Learning: From covariance matrix to image whitening

10-10

Preprocessing for Deep Learning: From covariance matrix to image whitening

10-10

Preprocessing for Deep Learning: From covariance matrix to image whitening

10-10

CBH Group: Sr Data Engineer [Perth, Australia]

12-14

MINDBODY: Business Intelligence Analyst II [San Luis Obispo, CA]

12-13

CBH Group: Sr Data Scientist [Perth, Australia]

12-11

UnitedHealth Group: Clinical Data Statistical Analyst – SQL SAS (Clinician Required) [Telecommute]

11-16

Using deep learning on AWS to lower property damage losses from natural disasters

10-30

a4 Media: Manager, Machine Learning Data Engineer [Long Island City, NY]

10-10

Who is a Data Scientist?

12-27

LightOn: Forward We Go !

12-20

Paris Machine Learning

10-10

If you did not already know

10-10

Build Your Own Natural Language Models on AWS (no ML experience required)

11-19

Amazon Comprehend introduces new Region availability and language support for French, German, Italian, and Portuguese

10-10

Amazon Comprehend introduces new Region availability and language support for French, German, Italian, and Portuguese

10-10

If you did not already know

10-10

TDWI In-Person and Virtual Data and Analytics Training

10-10

One-stop-learning-shop for data pros – get exclusive access for less than a cup of coffee

12-06

TDWI In-Person and Virtual Data and Analytics Training

10-10

Document worth reading: “The Risk of Machine Learning”

10-11

Silent Duels and an Old Paper of Restrepo

12-31

University of Virginia: Faculty, Open Rank Model and Simulation at the Human-Technology Frontier [Charlottesville, VA]

12-24

NYU Stern: 2019-20 Asst. Professor of Information, Operations & Management Sciences – Information Systems, tenure-track [New York City, NY]

11-14

University of San Francisco: Postdoctoral Fellowship, Data Institute [San Francisco, CA]

10-16

Guest Post: Galin Jones on criteria for promotion and tenture in (bio)statistics departments

10-11

Day 11 – little helper trim

12-11

Guest Post: Galin Jones on criteria for promotion and tenture in (bio)statistics departments

10-11

Guest Post: Galin Jones on criteria for promotion and tenture in (bio)statistics departments

10-11

Using Confusion Matrices to Quantify the Cost of Being Wrong

10-11

Using Confusion Matrices to Quantify the Cost of Being Wrong

10-11

Using Confusion Matrices to Quantify the Cost of Being Wrong

10-11

Machine Reading Comprehension: Learning to Ask & Answer

10-11

Machine Reading Comprehension: Learning to Ask & Answer

10-11

Decolonising Artificial Intelligence

10-11

Decolonising Artificial Intelligence

10-11

Decolonising Artificial Intelligence

10-11

Decolonising Artificial Intelligence

10-11

Site Redesign

12-02

A/B Testing: The Definitive Guide to Improving Your Product

10-11

Power Bat – How Spektacom is Powering the Game of Cricket with Microsoft AI

10-11

Power Bat – How Spektacom is Powering the Game of Cricket with Microsoft AI

10-11

Power Bat – How Spektacom is Powering the Game of Cricket with Microsoft AI

10-11

Power Bat – How Spektacom is Powering the Game of Cricket with Microsoft AI

10-11

cransays - Follow your R Package Journey to CRANterbury with our Dashboard!

10-11

cransays - Follow your R Package Journey to CRANterbury with our Dashboard!

10-11

cransays - Follow your R Package Journey to CRANterbury with our Dashboard!

10-11

SQL, Python, & R: All in One Platform

10-11

SQL, Python, & R: All in One Platform

10-11

Announcing Kaggle integration with Google Data Studio

12-05

Azure ML Studio now supports R 3.4

11-01

SQL, Python, & R: All in One Platform

10-11

How R gets built on Windows

10-11

How R gets built on Windows

10-11

If you did not already know

10-11

My introductory course on Bayesian statistics

12-12

Request for Proposal: Topical Projects for January 2019

11-29

How DataCamp Handles Course Quality

10-25

DataCamp: Part-time Contract Instructors [Remote]

10-11

Why Learning Data Science Live is Better than Self-Paced Learning

01-02

Request for Proposal: Topical Projects for January 2019

11-29

DataCamp: Part-time Contract Instructors [Remote]

10-11

Intuit: Staff Data Scientist [Mountain View, CA]

12-12

DataCamp: Part-time Contract Instructors [Remote]

10-11

Document worth reading: “Automatic Rumor Detection on Microblogs: A Survey”

10-11

Document worth reading: “Automatic Rumor Detection on Microblogs: A Survey”

10-11

Document worth reading: “Automatic Rumor Detection on Microblogs: A Survey”

10-11

Document worth reading: “Automatic Rumor Detection on Microblogs: A Survey”

10-11

Why are functional programming languages so popular in the programming languages community?

10-11

We Sized Washington’s Edible Marijuana Market Using AI

10-12

Why Primary Research?

12-04

We Sized Washington’s Edible Marijuana Market Using AI

10-12

We Sized Washington’s Edible Marijuana Market Using AI

10-12

The Economist’s Big Mac Index is calculated with R

10-12

The Economist’s Big Mac Index is calculated with R

10-12

The Economist’s Big Mac Index is calculated with R

10-12

The Economist’s Big Mac Index is calculated with R

10-12

Because it's Friday: Hey, it's Enrico Pallazzo!

10-12

Because it's Friday: Hey, it's Enrico Pallazzo!

10-12

Because it's Friday: Hey, it's Enrico Pallazzo!

10-12

Because it's Friday: Hey, it's Enrico Pallazzo!

10-12

Document worth reading: “A Survey of Knowledge Representation and Retrieval for Learning in Service Robotics”

12-26

Because it's Friday: Hey, it's Enrico Pallazzo!

10-12

Because it's Friday: Hey, it's Enrico Pallazzo!

10-12

The Economist's Big Mac Index is calculated with R

10-12

The Economist's Big Mac Index is calculated with R

10-12

automl package: part 1/2 why and how

10-21

A Lazy Function

10-20

The Economist's Big Mac Index is calculated with R

10-12

The Economist's Big Mac Index is calculated with R

10-12

The Economist's Big Mac Index is calculated with R

10-12

The Economist's Big Mac Index is calculated with R

10-12

Join us for DataTech19, Scotland’s first technical data science conference as part of DataFest

10-12

Join us for DataTech19, Scotland’s first technical data science conference as part of DataFest

10-12

Gold-Mining Week 16 (2018)

12-21

Gold-Mining Week 15 (2018)

12-13

Gold-Mining Week 13 (2018)

11-29

Gold-Mining Week 12 (2018)

11-22

Gold-Mining Week 11 (2018)

11-15

Gold-Mining Week 10 (2018)

11-10

Gold-Mining Week 9 (2018)

10-31

Gold-Mining Week 8 (2018)

10-26

Gold-Mining Week 7 (2018)

10-19

Gold-Mining Week 6 (2018)

10-12

Gold-Mining Week 16 (2018)

12-21

Gold-Mining Week 15 (2018)

12-13

Gold-Mining Week 13 (2018)

11-29

Gold-Mining Week 12 (2018)

11-22

Gold-Mining Week 11 (2018)

11-15

Gold-Mining Week 10 (2018)

11-10

Gold-Mining Week 9 (2018)

10-31

Gold-Mining Week 8 (2018)

10-26

Gold-Mining Week 7 (2018)

10-19

Gold-Mining Week 6 (2018)

10-12

Residential Property Investment Visualization and Analysis Shiny App

10-22

Top Blockchain Applications Making Waves in Commercial Real Estate

10-12

Top Blockchain Applications Making Waves in Commercial Real Estate

10-12

Top Blockchain Applications Making Waves in Commercial Real Estate

10-12

If you did not already know

10-12

If you did not already know

10-12

R Packages worth a look

11-21

R Packages worth a look

10-12

R Packages worth a look

10-12

Distilled News

10-12

Sketchnotes from TWiML&AI: Evaluating Model Explainability Methods with Sara Hooker

10-12

High-performance mathematical paradigms in Python

11-22

Sketchnotes from TWiML&AI: Evaluating Model Explainability Methods with Sara Hooker

10-12

Is it time to stop using sentinel values for null / "NA" values?

10-12

Is it time to stop using sentinel values for null / "NA" values?

10-12

Is it time to stop using sentinel values for null / "NA" values?

10-12

Hitchhiker's guide to Exploratory Data Analysis

10-12

Hitchhiker's guide to Exploratory Data Analysis

10-12

2018: How did people actually vote? (The real story, not the exit polls.)

11-16

New Course: Analyzing Election and Polling Data in R

11-01

Amazing consistency: Largest Dataset Analyzed / Data Mined – Poll Results and Trends

10-29

New Poll: What was the largest dataset you analyzed / data mined?

10-12

Quoting Concatenate

12-16

Quoting Concatenate

12-16

How to work with strings in base R – An overview of 20+ methods for daily use.

11-24

Quoting in R

11-15

Quoting in R

11-15

Writing Code to Read Quotes About Writing Code

10-12

Writing Code to Read Quotes About Writing Code

10-12

Animated River Flow Revisited

10-12

Animated River Flow Revisited

10-12

Animated River Flow Revisited

10-12

Animated River Flow Revisited

10-12

Animated River Flow Revisited

10-12

linl 0.0.3: Micro release

12-15

It was twenty years ago …

12-08

RcppEigen 0.3.3.5.0

11-24

RcppGetconf 0.0.3

11-17

anytime 0.3.3

11-14

Document worth reading: “Toward a System Building Agenda for Data Integration”

11-06

Happy 10th Bday, Rcpp – and welcome release 1.0 !!

11-06

RcppAnnoy 0.0.11

11-02

binb 0.0.3: Now with Monash

10-12

binb 0.0.3: Now with Monash

10-12

Temple University: Faculty Positions (Assistant/Associate/Full Professor) [Philadelphia, PA]

10-12

Miami University: Assistant Provost for Institutional Research and Effectiveness [Oxford, OH]

12-26

Temple University: Faculty Positions (Assistant/Associate/Full Professor) [Philadelphia, PA]

10-12

Review: Excel TV’s Data Science with Power BI and R

10-12

NYC buses: company level predictors with R

11-28

Learn the top things to look for in an AI Vendor

10-12

Showing a difference in means between two groups

01-13

10 years of playback history on Last.FM: "Just sit back and listen"

01-12

Using emojis as scatterplot points

12-28

Some fun with {gganimate}

12-27

ggQC | ggplot Quality Control Charts – New Release

12-05

Open Workshop: Data Visualization in R and ggplot2, January 25th in Munich

11-26

R Packages worth a look

11-14

Detailed introduction of “myprettyreport” R package

11-10

Drawing beautiful maps programmatically with R, sf and ggplot2 — Part 1: Basics

10-25

Piping into ggplot2

10-13

Piping into ggplot2

10-13

Piping into ggplot2

10-13

Piping into ggplot2

10-13

New package in CRAN: PkgsFromFiles

10-13

New package in CRAN: PkgsFromFiles

10-13

Music listener statistics: last.fm’s last.year as an R package

01-02

Prophets of gloom: Using NLP to analyze Radiohead lyrics

10-13

Prophets of gloom: Using NLP to analyze Radiohead lyrics

10-13

Day 13 – little helper read_files

12-13

Day 11 – little helper trim

12-11

Day 01 – little helper checkdir

12-01

Open Workshop: Deep Learning in R and Keras, November 14th in Frankfurt

10-13

Open Workshop: Deep Learning in R and Keras, November 14th in Frankfurt

10-13

Open Workshop: Deep Learning in R and Keras, November 14th in Frankfurt

10-13

GitHub Streak: Round Five

10-13

linl 0.0.3: Micro release

12-15

It was twenty years ago …

12-08

anytime 0.3.3

11-14

anytime 0.3.2

11-07

Happy 10th Bday, Rcpp – and welcome release 1.0 !!

11-06

GitHub Streak: Round Five

10-13

Reading List Faster With parallel, doParallel, and pbapply

12-12

How to import a directory of csvs at once with base R and data.table. Can you guess which way is the fastest?

10-13

Whats new on arXiv

10-13

Peak Non-Creepy Dating Pool

11-27

Monash University: Research Fellow (Digital Civics) [Melbourne, Australia]

11-22

Understanding Chicago’s homicide spike; comparisons to other cities

10-13

Document worth reading: “AI Reasoning Systems: PAC and Applied Methods”

12-13

Understanding Chicago’s homicide spike; comparisons to other cities

10-13

Understanding Chicago’s homicide spike; comparisons to other cities

10-13

Understanding Chicago’s homicide spike; comparisons to other cities

10-13

Document worth reading: “Data Curation with Deep Learning [Vision]: Towards Self Driving Data Curation”

10-13

Combining apparently contradictory evidence

12-30

Document worth reading: “Deep Reinforcement Learning: An Overview”

11-14

Document worth reading: “Data Curation with Deep Learning [Vision]: Towards Self Driving Data Curation”

10-13

R Packages worth a look

10-13

He had a sudden cardiac arrest. How does this change the probability that he has a particular genetic condition?

10-14

R Packages worth a look

10-14

Document worth reading: “Vector and Matrix Optimal Mass Transport: Theory, Algorithm, and Applications”

10-14

Statistics Sunday: Some Psychometric Tricks in R

10-14

Introducing the New Zealand Trade Intelligence Dashboard

10-14

Introducing the New Zealand Trade Intelligence Dashboard

10-14

Introducing the New Zealand Trade Intelligence Dashboard

10-14

Monotonic Binning with Equal-Sized Bads for Scorecard Development

10-14

Monotonic Binning with Equal-Sized Bads for Scorecard Development

10-14

A deep dive into glmnet: standardize

11-15

“Simulations are not scalable but theory is scalable”

11-02

If you did not already know

10-14

Looking back on 2018, looking to 2019

01-07

Gender Diversity in the R and Python Communities

12-05

Gender Diversity in the R and Python Communities

12-05

Why R? 2018 Conference – After Movie and Summary

11-07

Visualising Networks in ASOIAF – Part II

10-14

Visualising Networks in ASOIAF – Part II

10-14

Random Walk of Pi – Another ggplot2 Experiment

10-14

Random Walk of Pi – Another ggplot2 Experiment

10-14

The ‘knight on an infinite chessboard’ puzzle: efficient simulation in R

12-10

Random Walk of Pi – Another ggplot2 Experiment

10-14

Random Walk of Pi – Another ggplot2 Experiment

10-14

Random Walk of Pi – Another ggplot2 Experiment

10-14

Running R scripts within in-database SQL Server Machine Learning

10-14

ABC intro for Astrophysics

10-15

If you did not already know

10-15

Deep learning, hydroponics, and medical marijuana

10-15

In Memoriam: Manfred te Grotenhuis

10-15

In Memoriam: Manfred te Grotenhuis

10-15

7 Reasons for Policy Professionals to Get Pumped About R Programming in 2019

01-07

In Memoriam: Manfred te Grotenhuis

10-15

How AI Will Change Healthcare

10-15

5 “Clean Code” Tips That Will Dramatically Improve Your Productivity

10-15

Most liked R-bloggers’ posts from last week (2018-10-07 till 2018-10-13 – based on twitter)

10-15

I fell out with tapply and in love with dplyr

10-15

Choose Your Own Adventure – Analytics On-boarding

10-15

Spam Detection with Natural Language Processing (NLP) – Part 1

10-15

Spam Detection with Natural Language Processing (NLP) – Part 1

10-15

NLP for Log Analysis – Tokenization

11-13

How Machines Understand Our Language: An Introduction to Natural Language Processing

10-31

Spam Detection with Natural Language Processing (NLP) – Part 1

10-15

Looking into 19th century ads from a Luxembourguish newspaper with R

01-04

Your and my 2019 R goals

01-01

NLP for Log Analysis – Tokenization

11-13

Spam Detection with Natural Language Processing (NLP) – Part 1

10-15

Using emojis as scatterplot points

12-28

How we use emojis

10-15

Modularize your Shiny Apps: Exercises

10-15

Modularize your Shiny Apps: Exercises

10-15

Modularize your Shiny Apps: Exercises

10-15

If you did not already know

11-30

Document worth reading: “A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress”

10-15

Obtaining the number of components from cross validation of principal components regression

10-15

Making Art in R

10-15

Machine Learning Trick of the Day (8): Instrumental Thinking

10-15

Machine Learning Trick of the Day (8): Instrumental Thinking

10-15

Package support offer

10-15

Gift ideas for the R lovers

12-14

Package support offer

10-15

Slot Machines

10-15

Voice Control your Shiny Apps

10-15

In which I demonstrate my ignorance of world literature

12-03

Toward better measurement in K-12 education research

10-15

Toward better measurement in K-12 education research

10-15

Toward better measurement in K-12 education research

10-15

R Packages worth a look

10-16

R Packages worth a look

10-16

R Packages worth a look

10-16

R Packages worth a look

12-27

R Packages worth a look

10-16

A small logical change with big impact

10-16

A small logical change with big impact

10-16

A small logical change with big impact

10-16

Exploring college major and income: a live data analysis in R

10-16

Exploring college major and income: a live data analysis in R

10-16

Optimized bubble tea consumption

10-16

Optimized bubble tea consumption

10-16

Optimized bubble tea consumption

10-16

Designing Turbofan Tycoon

12-06

simmer 4.1.0

11-09

Optimized bubble tea consumption

10-16

R Packages worth a look

12-22

Optimized bubble tea consumption

10-16

University of San Francisco: Postdoctoral Fellowship, Data Institute [San Francisco, CA]

10-16

David Brooks discovers Red State Blue State Rich State Poor State!

10-16

David Brooks discovers Red State Blue State Rich State Poor State!

10-16

Modifying Excel Files using openxlsx

10-16

Modifying Excel Files using openxlsx

10-16

Modifying Excel Files using openxlsx

10-16

New paper – Inside or outside: quantifying extrapolation across river networks

10-16

epubr 0.6.0 CRAN release

01-11

New paper – Inside or outside: quantifying extrapolation across river networks

10-16

Self-Service Analytics or Operationalization: Which Should I Implement?

10-16

Self-Service Analytics or Operationalization: Which Should I Implement?

10-16

Even more images as x-axis labels

10-16

Even more images as x-axis labels

10-16

Even more images as x-axis labels

10-16

Adversarial Examples, Explained

10-16

Amazon SageMaker notebooks now support Git integration for increased persistence, collaboration, and reproducibility

11-29

GitHub Python Data Science Spotlight: High Level Machine Learning & NLP, Ensembles, Command Line Viz & Docker Made Easy

10-16

Using pandas and pymapd for ETL into OmniSci

10-16

Using pandas and pymapd for ETL into OmniSci

10-16

Using pandas and pymapd for ETL into OmniSci

10-16

Using pandas and pymapd for ETL into OmniSci

10-16

RStudio 1.2 Preview: Stan

10-16

RStudio 1.2 Preview: Stan

10-16

Quasiquotation in R via bquote()

10-16

Quasiquotation in R via bquote()

10-16

Quasiquotation in R via bquote()

10-16

Exploring the Gender Pay Gap with Publicly Available Data

12-12

Will Compression Be Machine Learning’s Killer App?

10-16

WoRkshop in ToRonto

10-16

WoRkshop in ToRonto

10-16

WoRkshop in ToRonto

10-16

WoRkshop in ToRonto

10-16

Accelerating Your Algorithms in Production [Webinar Replay]

10-16

Magister Dixit

01-02

UnitedHealth Group: Data Analytics and Reporting Lead [Minnetonka, MN or Telecommute]

11-16

Why you need GPUs for your deep learning platform

10-16

Distilled News

10-16

Estimating Pi

10-16

Estimating Pi

10-16

The AAA tranche of subprime science, revisited

10-16

The AAA tranche of subprime science, revisited

10-16

“Economic predictions with big data” using partial pooling

11-26

The AAA tranche of subprime science, revisited

10-16

The AAA tranche of subprime science, revisited

10-16

Mindstrong Health: Sr Data Scientist / Machine Learning, Statistics, Coding [Palo Alto, CA]

10-17

5 Alternatives to the Default R Outputs for GLMs and Linear Models

10-17

5 Alternatives to the Default R Outputs for GLMs and Linear Models

10-17

5 Alternatives to the Default R Outputs for GLMs and Linear Models

10-17

Use R with Excel: Importing and Exporting Data

10-17

Use R with Excel: Importing and Exporting Data

10-17

R Packages worth a look

11-13

The Definitive Guide to AI’s “Black Box” Problem

10-17

Estimating Control Chart Constants with R

10-17

Estimating Control Chart Constants with R

10-17

Estimating Control Chart Constants with R

10-17

XmR Chart | Step-by-Step Guide by Hand and with R

01-13

The Five Best Data Visualization Libraries

01-07

Estimating Control Chart Constants with R

10-17

SatRday talks recordings

10-17

Fitting the Besag, York, and Mollie spatial autoregression model with discrete data

10-17

Fitting the Besag, York, and Mollie spatial autoregression model with discrete data

10-17

Fitting the Besag, York, and Mollie spatial autoregression model with discrete data

10-17

Fitting the Besag, York, and Mollie spatial autoregression model with discrete data

10-17

Notebooks from the Practical AI Workshop

01-03

Exploring 2018 R-bloggers & R Weekly Posts with Feedly & the ‘seymour’ package

12-31

Citizen Data Scientists | Why Not DIY AI?

10-17

Citizen Data Scientists | Why Not DIY AI?

10-17

Automated Dashboard visualizations with Deviation in R

12-06

You Can’t Do AI Without Augmented Analytics and AutoML

11-26

If you did not already know

11-15

Implementing Automated Machine Learning Systems with Open Source Tools

10-25

Citizen Data Scientists | Why Not DIY AI?

10-17

Music for Data Scientists? Music by Data Scientists? …What…?!

10-17

Music for Data Scientists? Music by Data Scientists? …What…?!

10-17

Music for Data Scientists? Music by Data Scientists? …What…?!

10-17

Four machine learning strategies for solving real-world problems

10-17

R Packages worth a look

10-17

R Packages worth a look

10-17

Building a data warehouse

10-17

Cannibus Curve with ggplot2

10-17

Cannibus Curve with ggplot2

10-17

Cannibus Curve with ggplot2

10-17

Document worth reading: “Deep Facial Expression Recognition: A Survey”

10-17

If you did not already know

01-10

Whats new on arXiv

12-28

Document worth reading: “Deep Facial Expression Recognition: A Survey”

10-17

Use AWS DeepLens to give Amazon Alexa the power to detect objects via Alexa skills

10-17

Use AWS DeepLens to give Amazon Alexa the power to detect objects via Alexa skills

10-17

University of San Francisco: Assistant Professor, Tenure Track, Mathematics and Statistics [San Francisco, CA]

10-17

Cummins: Data Engineering Apps and Solutions Architect [Columbus, IN]

12-13

Cummins: Advanced Analytics Platform Principle Engineer [Columbus, IN]

12-12

Top Data Science Hacks

11-05

Top Data Science Hacks

11-05

University of San Francisco: Assistant Professor, Tenure Track, Mathematics and Statistics [San Francisco, CA]

10-17

Stan development in RStudio

10-17

Blockchain applications in the Federal Government sector

10-17

Blockchain applications in the Federal Government sector

10-17

Distilled News

10-18

Ethics in statistical practice and communication: Five recommendations.

10-18

R Packages worth a look

10-18

R Packages worth a look

10-18

Serial and Parallel bulb puzzle

10-18

Serial and Parallel bulb puzzle

10-18

Making Machine Learning Accessible [Webinar Replay]

11-27

Data Science in 30 Minutes with Jake Porway of DataKind

11-06

BI to AI: Getting Intelligent Insights to Everyone

10-18

Adam: “It would have been much harder without Dataquest”

10-18

Adam: “It would have been much harder without Dataquest”

10-18

Adam: “It would have been much harder without Dataquest”

10-18

Adam: “It would have been much harder without Dataquest”

10-18

Philip Roth (4) vs. DJ Jazzy Jeff; Jim Thorpe advances

01-08

The seminar speaker contest begins: Jim Thorpe (1) vs. John Oliver

01-07

How to Solve the ModelOps Challenge

10-18

Spam Detection with Natural Language Processing-Part 2

10-18

Spam Detection with Natural Language Processing-Part 2

10-18

Spam Detection with Natural Language Processing-Part 2

10-18

Examining Inter-Rater Reliability in a Reality Baking Show

10-18

Examining Inter-Rater Reliability in a Reality Baking Show

10-18

Examining Inter-Rater Reliability in a Reality Baking Show

10-18

Graphs Are The Next Frontier In Data Science

10-18

Prior distributions for covariance matrices

12-10

R Packages worth a look

11-14

R Packages worth a look

11-07

R Packages worth a look

10-18

Analyzing English Team of the Year Data Since 1973

10-18

Manipulate dates easily with {lubridate}

12-15

Analyzing English Team of the Year Data Since 1973

10-18

Analyzing English Team of the Year Data Since 1973

10-18

New Jobs Sure to Emerge Alongside Artificial Intelligence

10-18

New Jobs Sure to Emerge Alongside Artificial Intelligence

10-18

New Jobs Sure to Emerge Alongside Artificial Intelligence

10-18

New Jobs Sure to Emerge Alongside Artificial Intelligence

10-18

survHE new release

10-19

survHE new release

10-19

survHE new release

10-19

Finding a house to buy, using statistics

11-14

survHE new release

10-19

Solving the chinese postman problem

10-19

Solving the chinese postman problem

10-19

Solving the chinese postman problem

10-19

Solving the chinese postman problem

10-19

Maryland's Bridge Safety, reported using R

10-19

Maryland's Bridge Safety, reported using R

10-19

Maryland's Bridge Safety, reported using R

10-19

Maryland's Bridge Safety, reported using R

10-19

If you did not already know

10-19

Holy Grail of AI for Enterprise — Explainable AI

10-19

Holy Grail of AI for Enterprise — Explainable AI

10-19

Key Takeaways from AI Conference SF, Day 1: Domain Specific Architectures, Emerging China, AI Risks

10-29

Document worth reading: “Review of Deep Learning”

10-19

Statistics Challenge Invites Students to Tackle Opioid Crisis Using Real-World Data

10-19

Statistics Challenge Invites Students to Tackle Opioid Crisis Using Real-World Data

10-19

Statistics Challenge Invites Students to Tackle Opioid Crisis Using Real-World Data

10-19

Young Investigator Special Competition for Time-Sharing Experiment for the Social Sciences

10-19

Young Investigator Special Competition for Time-Sharing Experiment for the Social Sciences

10-19

At Year's End: 2018

12-25

I’m an Analyst and the software engineers made fun of my code!

10-19

Loops and Pizzas

10-19

Loops and Pizzas

10-19

Loops and Pizzas

10-19

The Intuitions Behind Bayesian Optimization with Gaussian Processes

10-19

McKinsey Datathon: The City Cup17 November, Amsterdam, Stockholm and Zurich. Apply Now

10-19

Twas the Night Before Analysis or A Visit from the Chief Data Scientist

12-24

An actual quote from a paper published in a medical journal: “The data, analytic methods, and study materials will not be made available to other researchers for purposes of reproducing the results or replicating the procedure.”

10-19

An actual quote from a paper published in a medical journal: “The data, analytic methods, and study materials will not be made available to other researchers for purposes of reproducing the results or replicating the procedure.”

10-19

The business case for federated learning

12-28

An actual quote from a paper published in a medical journal: “The data, analytic methods, and study materials will not be made available to other researchers for purposes of reproducing the results or replicating the procedure.”

10-19

An actual quote from a paper published in a medical journal: “The data, analytic methods, and study materials will not be made available to other researchers for purposes of reproducing the results or replicating the procedure.”

10-19

Start your journey into data science today

10-19

New Course: Interactive Data Visualization with rbokeh

10-19

New Course: Visualization Best Practices in R

10-19

New Course: Visualization Best Practices in R

10-19

New Course: Visualization Best Practices in R

10-19

R Packages worth a look

10-20

R Packages worth a look

10-20

R Packages worth a look

10-20

He’s a history teacher and he has a statistics question

10-20

He’s a history teacher and he has a statistics question

10-20

He’s a history teacher and he has a statistics question

10-20

Table of Contents for PIM

10-20

R 3.5.2 now available

12-20

R 3.5.2 now available

12-20

RcppArmadillo 0.9.200.5.0

11-28

RQuantLib 0.4.6: Updated upstream, and calls for help

11-25

Rcpp now used by 1500 CRAN packages

11-15

RcppArmadillo 0.9.200.4.0

11-10

xts 0.11-2 on CRAN

11-06

Happy 10th Bday, Rcpp – and welcome release 1.0 !!

11-06

Table of Contents for PIM

10-20

A Thorough Introduction to Boltzmann Machines

10-20

Automated Dashboard visualizations with Deviation in R

12-06

A Lazy Function

10-20

A Lazy Function

10-20

Dr. Data Show Video: How Can You Trust AI?

10-20

Dr. Data Show Video: How Can You Trust AI?

10-20

Statistics Sunday: What Fast Food Can Tell Us About a Community and the World

10-21

Statistics Sunday: What Fast Food Can Tell Us About a Community and the World

10-21

Statistics Sunday: What Fast Food Can Tell Us About a Community and the World

10-21

Statistics Sunday: What Fast Food Can Tell Us About a Community and the World

10-21

2018-11 Variable-Width Bezier Splines in R

11-01

Faceted Graphs with cdata and ggplot2

10-21

Faceted Graphs with cdata and ggplot2

10-21

Faceted Graphs with cdata and ggplot2

10-21

Faceted Graphs with cdata and ggplot2

10-21

Scatterplot matrices (pair plots) with cdata and ggplot2

10-28

Scatterplot matrices (pair plots) with cdata and ggplot2

10-28

Faceted Graphs with cdata and ggplot2

10-21

Faceted Graphs with cdata and ggplot2

10-21

automl package: part 1/2 why and how

10-21

automl package: part 1/2 why and how

10-21

Multilevel models with group-level predictors

10-21

Multilevel models with group-level predictors

10-21

RApiDatetime 0.0.4: Updates and Extensions

10-21

RApiDatetime 0.0.4: Updates and Extensions

10-21

RApiDatetime 0.0.4: Updates and Extensions

10-21

Document worth reading: “Machine Learning for Spatiotemporal Sequence Forecasting: A Survey”

10-21

Document worth reading: “Machine Learning for Spatiotemporal Sequence Forecasting: A Survey”

10-21

Document worth reading: “Machine Learning for Spatiotemporal Sequence Forecasting: A Survey”

10-21

Whats new on arXiv

10-21

If you did not already know

10-21

If you did not already know

10-21

If you did not already know

10-21

Getting the data from the Luxembourguish elections out of Excel

10-21

Getting the data from the Luxembourguish elections out of Excel

10-21

Getting the data from the Luxembourguish elections out of Excel

10-21

R Packages worth a look

10-22

R Packages worth a look

10-22

Residential Property Investment Visualization and Analysis Shiny App

10-22

Beginner Data Visualization & Exploration Using Pandas

10-22

Cassie Kozyrkov discusses decision making and decision intelligence!

10-22

Does Sharing Goals Help or Hurt Your Chances of Success?

10-22

Python Patterns: max Instead of if

01-10

Day 06 – little helper statusbar

12-06

Summer Intern Projects

10-22

Summer Intern Projects

10-22

Summer Intern Projects

10-22

Document worth reading: “Declarative Statistics”

10-22

Import AI: 117: Surveillance search engines; harvesting real-world road data with hovering drones; and improving language with unsupervised pre-training

10-22

BERT: State of the Art NLP Model, Explained

12-26

Import AI: 117: Surveillance search engines; harvesting real-world road data with hovering drones; and improving language with unsupervised pre-training

10-22

Import AI: 117: Surveillance search engines; harvesting real-world road data with hovering drones; and improving language with unsupervised pre-training

10-22

Import AI: 117: Surveillance search engines; harvesting real-world road data with hovering drones; and improving language with unsupervised pre-training

10-22

“The dwarf galaxy NGC1052-DF2”

10-22

“The dwarf galaxy NGC1052-DF2”

10-22

“The dwarf galaxy NGC1052-DF2”

10-22

“The dwarf galaxy NGC1052-DF2”

10-22

Object tracking with dlib

10-22

Object tracking with dlib

10-22

MVP for Data Projects

10-22

MVP for Data Projects

10-22

How to Define a Machine Learning Problem Like a Detective

10-22

Interspeech 2018: Highlights for Data Scientists

12-24

If you did not already know

10-29

Don’t miss Big Data LDN 2018

10-22

Don’t miss Big Data LDN 2018

10-22

Document worth reading: “Fractal AI: A fragile theory of intelligence”

10-22

Document worth reading: “Fractal AI: A fragile theory of intelligence”

10-22

Working with US Census Data in R

11-07

Working with US Census Data in R

11-07

Update on the R Consortium Census Working Group

10-22

Update on the R Consortium Census Working Group

10-22

Distilled News

10-22

Understanding Amazon SageMaker notebook instance networking configurations and advanced routing options

10-24

Amazon SageMaker Batch Transform now supports Amazon VPC and AWS KMS-based encryption

10-22

Amazon SageMaker Batch Transform now supports Amazon VPC and AWS KMS-based encryption

10-22

Whats new on arXiv

10-22

Building statues of hope in augmented reality

10-22

Building statues of hope in augmented reality

10-22

Packages for Testing your R Package

10-22

Packages for Testing your R Package

10-22

If you did not already know

10-23

If you did not already know

10-23

Causal mediation estimation measures the unobservable

11-06

Cross-over study design with a major constraint

10-23

Cross-over study design with a major constraint

10-23

Cross-over study design with a major constraint

10-23

If you did not already know

10-23

R Packages worth a look

10-23

High school statistics class builds election prediction model

10-23

Computer Vision for Model Assessment

10-23

Computer Vision for Model Assessment

10-23

More on sigr

11-06

R tip: Make Your Results Clear with sigr

11-04

R tip: Make Your Results Clear with sigr

11-04

Computer Vision for Model Assessment

10-23

Computer Vision for Model Assessment

10-23

Document worth reading: “Attribute-aware Collaborative Filtering: Survey and Classification”

10-23

Document worth reading: “Attribute-aware Collaborative Filtering: Survey and Classification”

10-23

How Can Autonomous Drones Help the Energy and Utilities Industry?

10-23

Introducing gratia

10-23

Introducing gratia

10-23

Introducing gratia

10-23

Get a 2–6x Speed-up on Your Data Pre-processing with Python

10-23

RcppTOML 0.1.4: Now with TOML v0.5.0

10-23

Drilling Down on Depth Sensing and Deep Learning

10-23

Drilling Down on Depth Sensing and Deep Learning

10-23

Drilling Down on Depth Sensing and Deep Learning

10-23

Interactive Graphics with R Shiny

11-23

automl package: part 2/2 first steps how to

10-24

R Packages worth a look

10-24

R Packages worth a look

10-24

R Packages worth a look

10-24

Dealing with failed projects

11-22

M4 Forecasting Conference

10-24

M4 Forecasting Conference

10-24

M4 Forecasting Conference

10-24

R Packages worth a look

10-24

Vote suppression in corrupt NY State

10-24

Vote suppression in corrupt NY State

10-24

Vote suppression in corrupt NY State

10-24

Vote suppression in corrupt NY State

10-24

“Demystifying Data Science” remote notes

10-24

“Demystifying Data Science” remote notes

10-24

“Demystifying Data Science” remote notes

10-24

How AI Can Help Cope with Data Scientists’ Boredom

10-24

How AI Can Help Cope with Data Scientists’ Boredom

10-24

Building a Question-Answering System from Scratch

10-24

Building a Question-Answering System from Scratch

10-24

Community Call – Working with images in R

10-24

Community Call – Working with images in R

10-24

Community Call – Working with images in R

10-24

U. of Zurich: Professorship in Big Data Science (Open Rank) [Zurich, Switzerland]

10-24

U. of Zurich: Assistant Professorship in AI and Machine Learning (Non-tenure Track) [Zurich, Switzerland]

10-24

CBH Group: Sr Data Engineer [Perth, Australia]

12-14

U. of Zurich: Assistant Professorship in AI and Machine Learning (Non-tenure Track) [Zurich, Switzerland]

10-24

If you did not already know

10-24

If you did not already know

10-24

ROCK 'n' ROLL TRAFFIC ROUTING, WITH NEO4J

12-05

Understanding Amazon SageMaker notebook instance networking configurations and advanced routing options

10-24

Understanding Amazon SageMaker notebook instance networking configurations and advanced routing options

10-24

Understanding Amazon SageMaker notebook instance networking configurations and advanced routing options

10-24

When the numbers don’t tell the whole story

10-24

When the numbers don't tell the whole story

10-24

When the numbers don't tell the whole story

10-24

Center for Ultrasound Research and Translation, Massachusetts General Hospital: Post-Doctoral Scholar / Research Scientist [Boston, MA]

12-31

Interspeech 2018: Highlights for Data Scientists

12-24

T-mobile uses R for Customer Service AI

11-09

Moody’s Analytics: Machine Learning / NLP – Research Scientist / Engineer [New York, NY]

10-30

Join us at the EARL US Roadshow – a conference dedicated to the real-world usage of R

10-24

Join us at the EARL US Roadshow – a conference dedicated to the real-world usage of R

10-24

Because it's Friday: Parable of the Polygons

10-26

U. of Zurich: Professorship in Big Data Science (Open Rank) [Zurich, Switzerland]

10-24

“And when you did you weren’t much use, you didn’t even know what a peptide was”

11-29

A study fails to replicate, but it continues to get referenced as if it had no problems. Communication channels are blocked.

10-24

A study fails to replicate, but it continues to get referenced as if it had no problems. Communication channels are blocked.

10-24

A study fails to replicate, but it continues to get referenced as if it had no problems. Communication channels are blocked.

10-24

SiliconANGLE: Machine learning automation startup DataRobot lands $100M round

10-24

SiliconANGLE: Machine learning automation startup DataRobot lands $100M round

10-24

When the numbers don’t tell the whole story

10-24

When the numbers don’t tell the whole story

10-24

When the numbers don’t tell the whole story

10-24

Kent State University: Assistant/Associate Professor – Business Analytics/Information Systems [Kent, OH]

12-19

How I Learned to Stop Worrying and Love Uncertainty

10-24

Getting started Stamen maps with ggmap

10-25

Getting started Stamen maps with ggmap

10-25

Getting started Stamen maps with ggmap

10-25

Baltimore-Washington

10-25

Baltimore-Washington

10-25

Baltimore-Washington

10-25

Preview my new book: Introduction to Reproducible Science in R

11-12

AI, Machine Learning and Data Science Roundup: October 2018

10-25

A Data Scientist’s Guide to an Efficient Project Lifecycle

10-25

A Data Scientist’s Guide to an Efficient Project Lifecycle

10-25

A Data Scientist’s Guide to an Efficient Project Lifecycle

10-25

The “probability to win” is hard to estimate…

11-07

Naive Bayes from Scratch using Python only – No Fancy Frameworks

10-25

How DataCamp Handles Course Quality

10-25

Learn how to create data-driven marketing team

10-25

Learn how to create data-driven marketing team

10-25

Popular Halloween Candy on US State Grid Map

10-25

Popular Halloween Candy on US State Grid Map

10-25

Popular Halloween Candy on US State Grid Map

10-25

Popular Halloween Candy on US State Grid Map

10-25

Monash University: Research Fellow (Bioinformatics) [Melbourne, Australia]

12-03

Monash University: Lecturer/Sr Lecturer – Digital Health [Melbourne, Australia]

11-22

Monash University: Research Fellow (Digital Civics) [Melbourne, Australia]

11-22

Monash University: Academic Opportunities in Dialogue Research [Melbourne, Australia]

10-25

The Axios Turing test and the heat death of the journalistic universe

10-25

The Netflix Data War

12-19

The Axios Turing test and the heat death of the journalistic universe

10-25

The Axios Turing test and the heat death of the journalistic universe

10-25

Implementing Automated Machine Learning Systems with Open Source Tools

10-25

Implementing Automated Machine Learning Systems with Open Source Tools

10-25

Whats new on arXiv

10-25

Drawing beautiful maps programmatically with R, sf and ggplot2 — Part 3: Layouts

10-25

R Packages worth a look

10-26

R Packages worth a look

10-26

R Packages worth a look

10-26

R Packages worth a look

10-26

Are you buying an apartment? How to hack competition in the real estate market

10-26

Notes on Feature Preprocessing: The What, the Why, and the How

10-26

“Simulations are not scalable but theory is scalable”

11-02

Notes on Feature Preprocessing: The What, the Why, and the How

10-26

Marketing Analytics and Data Science

10-26

Designing Transforms for Data Reshaping with cdata

10-26

Designing Transforms for Data Reshaping with cdata

10-26

Designing Transforms for Data Reshaping with cdata

10-26

Designing Transforms for Data Reshaping with cdata

10-26

Designing Transforms for Data Reshaping with cdata

10-26

Designing Transforms for Data Reshaping with cdata

10-26

Designing Transforms for Data Reshaping with cdata

10-26

Designing Transforms for Data Reshaping with cdata

10-26

Designing Transforms for Data Reshaping with cdata

10-26

Designing Transforms for Data Reshaping with cdata

10-26

7 Best Practices for Machine Learning on a Data Lake

11-07

Spotlight on Julia Silge, Keynote Speaker EARL Seattle 7th November

10-26

Simulating simple dice games by @ellis2013nz

10-26

CRAN’s New Missing Data Task View

10-26

The Final Data Science Roadshow is Just the Beginning

10-26

R Packages worth a look

10-26

R Packages worth a look

10-26

R Packages worth a look

10-26

Because it's Friday: Parable of the Polygons

10-26

Because it's Friday: Parable of the Polygons

10-26

The Future of Management: Human Resource Analytics

10-29

RConsortium — Building an R Certification

10-26

RConsortium — Building an R Certification

10-26

Can we do better than using averaged measurements?

10-26

How to perform merges (joins) on two or more data frames with base R, tidyverse and data.table

10-27

How to perform merges (joins) on two or more data frames with base R, tidyverse and data.table

10-27

The Christmas Eve Selloff was a Classic Capitulation

12-27

The Bear is Here

12-22

The Bear is Here

12-22

How to perform merges (joins) on two or more data frames with base R, tidyverse and data.table

10-27

How to perform merges (joins) on two or more data frames with base R, tidyverse and data.table

10-27

How to perform merges (joins) on two or more data frames with base R, tidyverse and data.table

10-27

RcppRedis 0.1.9

10-27

RcppRedis 0.1.9

10-27

Maps with pie charts on top of each administrative division: an example with Luxembourg’s elections data

10-27

Maps with pie charts on top of each administrative division: an example with Luxembourg’s elections data

10-27

Maps with pie charts on top of each administrative division: an example with Luxembourg’s elections data

10-27

Debate about genetics and school performance

10-27

Debate about genetics and school performance

10-27

Debate about genetics and school performance

10-27

If you did not already know

10-27

If you did not already know

10-27

If you did not already know

10-27

Visualizing The Catholic Lectionary – Part 1

10-27

Visualizing The Catholic Lectionary – Part 1

10-27

Visualizing the Asian Cup with R!

01-11

Visualizing The Catholic Lectionary – Part 1

10-27

Document worth reading: “Opening the black box of deep learning”

10-28

Document worth reading: “Opening the black box of deep learning”

10-28

Conway’s Game of Life in R: Or On the Importance of Vectorizing Your R Code

10-28

Conway’s Game of Life in R: Or On the Importance of Vectorizing Your R Code

10-28

R Packages worth a look

10-28

R Packages worth a look

10-28

R Packages worth a look

10-28

Introducing cricpy:A python package to analyze performances of cricketers

10-28

Introducing cricpy:A python package to analyze performances of cricketers

10-28

Introducing cricpy:A python package to analyze performances of cricketers

10-28

Conway’s Game of Life in R: Or On the Importance of Vectorizing Your R Code

10-28

If you did not already know

10-28

How quickly do stock market valuations revert back to their means?

10-28

How quickly do stock market valuations revert back to their means?

10-28

How quickly do stock market valuations revert back to their means?

10-28

Document worth reading: “Machine Learning for Wireless Networks with Artificial Intelligence: A Tutorial on Neural Networks”

10-28

Document worth reading: “Machine Learning for Wireless Networks with Artificial Intelligence: A Tutorial on Neural Networks”

10-28

Scatterplot matrices (pair plots) with cdata and ggplot2

10-28

Scatterplot matrices (pair plots) with cdata and ggplot2

10-28

Simple Feed Ranking Algorithm

10-28

MRP (or RPP) with non-census variables

10-28

MRP (or RPP) with non-census variables

10-28

MRP (or RPP) with non-census variables

10-28

MRP (or RPP) with non-census variables

10-28

R Packages worth a look

10-28

R Packages worth a look

10-28

R Packages worth a look

10-28

American Association of Colleges of Osteopathic Medicine: Data Analyst [Bethesda, Maryland]

10-29

Amazon Translate now offers 113 new language pairs

10-29

Arnaub Chatterjee discusses artificial intelligence (AI) and machine learning (ML) in healthcare.

10-29

Arnaub Chatterjee discusses artificial intelligence (AI) and machine learning (ML) in healthcare.

10-29

running plot [and simulated annealing]

12-14

Very Non-Standard Calling in R

12-03

Very Non-Standard Calling in R

12-03

Arnaub Chatterjee discusses artificial intelligence (AI) and machine learning (ML) in healthcare.

10-29

Document worth reading: “Neural Approaches to Conversational AI”

10-29

crfsuite for natural language processing

10-29

crfsuite for natural language processing

10-29

Please vote

10-29

Please vote

10-29

“Recapping the recent plagiarism scandal”

11-09

The Decentralized Web

10-29

The Decentralized Web

10-29

The Decentralized Web

10-29

The Decentralized Web

10-29

NAIC: Analyst I (Capital Markets) [New York, NY]

10-29

NAIC: Analyst I (Capital Markets) [New York, NY]

10-29

Turbocharge Tech Transformation: Integrate AI Across Insurance

11-06

Using deep learning on AWS to lower property damage losses from natural disasters

10-30

NAIC: Analyst I (Capital Markets) [New York, NY]

10-29

The Future of Management: Human Resource Analytics

10-29

The Future of Management: Human Resource Analytics

10-29

The Future of Management: Human Resource Analytics

10-29

The quest continues: a look at a new initiative to explore human and machine intelligence

10-29

The quest continues: a look at a new initiative to explore human and machine intelligence

10-29

The quest continues: a look at a new initiative to explore human and machine intelligence

10-29

BH 1.69.0-0 pre-releases and three required changes

12-20

About a Curious Feature and Interpretation of Linear Regressions

10-29

Considering sensitivity to unmeasured confounding: part 2

01-10

About a Curious Feature and Interpretation of Linear Regressions

10-29

About a Curious Feature and Interpretation of Linear Regressions

10-29

Key Takeaways from AI Conference SF, Day 2: AI and Security, Adversarial Examples, Innovation

10-30

Key Takeaways from AI Conference SF, Day 1: Domain Specific Architectures, Emerging China, AI Risks

10-29

Intuit: Staff Data Scientist [Woodland Hills, CA and Mountain View, CA]

12-11

Intuit: Staff Data Scientist [Mountain View, CA]

12-11

Key Takeaways from AI Conference SF, Day 1: Domain Specific Architectures, Emerging China, AI Risks

10-29

Intuit: Staff Data Scientist [Woodland Hills, CA and Mountain View, CA]

12-11

Key Takeaways from AI Conference SF, Day 1: Domain Specific Architectures, Emerging China, AI Risks

10-29

What does it mean to talk about a “1 in 600 year drought”?

10-29

What does it mean to talk about a “1 in 600 year drought”?

10-29

What does it mean to talk about a “1 in 600 year drought”?

10-29

What does it mean to talk about a “1 in 600 year drought”?

10-29

If you did not already know

10-29

Bank of Canada: Data Scientist [Ottawa, Canada]

10-29

The business case for federated learning

12-28

Federated learning: distributed machine learning with data locality and privacy

11-14

Federated Learning: Machine Learning with Privacy on the Edge

10-29

Multi-object tracking with dlib

10-29

Multi-object tracking with dlib

10-29

Is the answer to everything Gaussian?

10-29

Peak Non-Creepy Dating Pool

11-27

Amazing consistency: Largest Dataset Analyzed / Data Mined – Poll Results and Trends

10-29

Amazing consistency: Largest Dataset Analyzed / Data Mined – Poll Results and Trends

10-29

Top Skills Needed to Work as Data Scientist in iGaming

01-10

Amazing consistency: Largest Dataset Analyzed / Data Mined – Poll Results and Trends

10-29

Bootstrap Testing with MCHT

10-29

R Packages worth a look

10-29

R Packages worth a look

10-29

Top KDnuggets tweets, Nov 14-20: 10 Free Must-See Courses for Machine Learning and Data Science; Great list of

11-21

R Packages worth a look

11-01

R Packages worth a look

10-29

How do I visualise the results of a Bayesian Model: Rugby models in Arviz

10-29

How do I visualise the results of a Bayesian Model: Rugby models in Arviz

10-29

If you did not already know

10-29

Introduction to Deep Learning with Keras

10-29

Adding Firebase Authentication to Shiny

01-03

Distilled News

10-30

Site Migration

10-30

Lehigh University: Tenure Track Positions in Foundations of Data Science [Bethlehem, PA]

10-30

Use Pseudo-Aggregators to Add Safety Checks to Your Data-Wrangling Workflow

10-30

Use Pseudo-Aggregators to Add Safety Checks to Your Data-Wrangling Workflow

10-30

Zero Counts in dplyr

11-19

Use Pseudo-Aggregators to Add Safety Checks to Your Data-Wrangling Workflow

10-30

Use Pseudo-Aggregators to Add Safety Checks to Your Data-Wrangling Workflow

10-30

Using deep learning on AWS to lower property damage losses from natural disasters

10-30

Are petrol prices in Australia fair?

10-30

Are petrol prices in Australia fair?

10-30

Are petrol prices in Australia fair?

10-30

Are petrol prices in Australia fair?

10-30

Fringe FM conversation on AI Ethics

10-30

Hey! Here’s what to do when you have two or more surveys on the same population!

11-11

New Poll: How Important is Understanding Machine Learning Models?

10-30

Key Takeaways from AI Conference SF, Day 2: AI and Security, Adversarial Examples, Innovation

10-30

Key Takeaways from AI Conference SF, Day 2: AI and Security, Adversarial Examples, Innovation

10-30

Data + Art STEAM Project: Final Results

10-30

Spooky! Gravedigger in R

10-31

Spooky! Gravedigger in R

10-31

Our Favorite Spooky AI & Data Articles

10-30

Our Favorite Spooky AI & Data Articles

10-30

How to Mitigate Open Source License Risks

10-30

How to Mitigate Open Source License Risks

10-30

Growth of Subreddits

10-30

Growth of Subreddits

10-30

In-Depth Training for the Future of Data, Orlando, Nov 11-16 – Save with code KD30

10-30

Machine Learning Explainability vs Interpretability: Two concepts that could help restore trust in AI

12-20

Highlights of 2018

12-18

Document worth reading: “A Theory of Diagnostic Interpretation in Supervised Classification”

12-08

Explainable Artificial Intelligence (Part 2) – Model Interpretation Strategies

12-06

Why Machine Learning Interpretability Matters

12-04

Interpretability is crucial for trusting AI and machine learning

12-01

Explainable ML versus Interpretable ML

10-30

Explainable ML versus Interpretable ML

10-30

Stop Installing Tensorflow Using pip for Performance Sake!

10-30

Improving model interpretability with LIME

10-31

Ensure consistency in data processing code between training and inference in Amazon SageMaker

01-11

If you did not already know

12-29

If you did not already know

12-25

Carol Nickerson explains what those mysterious diagrams were saying

12-22

The ultimate guide to starting AI

11-13

Introduction to Amazon SageMaker Object2Vec

11-08

Labeling Unstructured Text for Meaning to Achieve Predictive Lift

10-31

How to Start Learning R for Data Science

10-31

Spooky! Gravedigger in R

10-31

Spooky! Gravedigger in R

10-31

Spooky! Gravedigger in R

10-31

KDnuggets™ News 18:n41, Oct 31: Introduction to Deep Learning with Keras; Easy Named Entity Recognition with Scikit-Learn

10-31

RHL’19 St-Cergue, Switzerland, 25-27 January 2019

10-31

RHL’19 St-Cergue, Switzerland, 25-27 January 2019

10-31

RHL’19 St-Cergue, Switzerland, 25-27 January 2019

10-31

RHL’19 St-Cergue, Switzerland, 25-27 January 2019

10-31

“2010: What happened?” in light of 2018

10-31

“2010: What happened?” in light of 2018

10-31

“2010: What happened?” in light of 2018

10-31

Webinar – Integrate AI Across Insurance Operations to Turbocharge Tech Transformation, Nov 14

10-31

R 3.5.2 now available

12-20

Spooky! Gravedigger in R

10-31

R 3.5.2 now available

12-20

Spooky! Gravedigger in R

10-31

If you did not already know

10-31

If you did not already know

10-31

If you did not already know

10-31

R or Python? Why not both? Using Anaconda Python within R with {reticulate}

12-30

4 Strategies to Deal With Large Datasets Using Pandas

12-19

R Packages worth a look

11-01

namer, Automatic Labelling of R Markdown Chunks

10-31

namer, Automatic Labelling of R Markdown Chunks

10-31

namer, Automatic Labelling of R Markdown Chunks

10-31

Document worth reading: “A Comprehensive Study of Deep Learning for Image Captioning”

10-31

Document worth reading: “A Comprehensive Study of Deep Learning for Image Captioning”

10-31

If you did not already know

10-31

If you did not already know

10-31

Model Server for Apache MXNet v1.0 released

10-31

Model Server for Apache MXNet v1.0 released

10-31

Le Monde puzzle [#1072]

10-31

Le Monde puzzle [#1072]

10-31

Apps gather your location and then sell the data

12-13

Le Monde puzzle [#1072]

10-31

Le Monde puzzle [#1072]

10-31

NLP for Log Analysis – Tokenization

11-13

How Machines Understand Our Language: An Introduction to Natural Language Processing

10-31

How Machines Understand Our Language: An Introduction to Natural Language Processing

10-31

How to Highlight 3D Brain Regions

10-31

Distilled News

10-31

Distilled News

10-31

Webinar: Transform Your Stagnant Data Swamp into a Pristine Data Lake, Nov 8

11-01

Tomorrow, Nov 8 Webinar: Transform Your Stagnant Data Swamp into a Pristine Data Lake

11-01

Turn data into revenue. Wharton can show you how.

11-06

Webinar: Transform Your Stagnant Data Swamp into a Pristine Data Lake, Nov 8

11-01

Tomorrow, Nov 8 Webinar: Transform Your Stagnant Data Swamp into a Pristine Data Lake

11-01

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: November and Beyond

11-01

Extracting data from news articles: Australian pollution by postcode

11-28

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: November and Beyond

11-01

Spam Detection with Natural Language Processing – Part 3

11-01

Why AI will not replace radiologists

11-01

R Packages worth a look

11-01

RFishBC CRAN Release

11-22

Rcpp 1.0.0: The Tenth Birthday Release

11-08

What R version do you really need for a package?

11-01

What R version do you really need for a package?

11-01

What R version do you really need for a package?

11-01

Use GitHub Vulnerability Alerts to Keep Users of Your R Packages Safe

11-14

What R version do you really need for a package?

11-01

2018-11 Variable-Width Bezier Splines in R

11-01

2018-11 Variable-Width Bezier Splines in R

11-01

epubr 0.6.0 CRAN release

01-11

epubr 0.5.0 CRAN release

11-18

Data Mining Book – Chapter Download

11-02

New Course: Analyzing Election and Polling Data in R

11-01

2018: How did people actually vote? (The real story, not the exit polls.)

11-16

New Course: Analyzing Election and Polling Data in R

11-01

New Course: Analyzing Election and Polling Data in R

11-01

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: December and Beyond

12-04

Multithreaded in the Wild

11-01

RcppTOML 0.1.5: Small extensions

11-01

RcppTOML 0.1.5: Small extensions

11-01

Join AI experts from Google Brain, Open AI & Uber AI Labs in San Francisco

11-01

Raghuveer Parthasarathy’s big idea for fixing science

11-01

Raghuveer Parthasarathy’s big idea for fixing science

11-01

R Packages worth a look

11-01

R Packages worth a look

11-01

Talk: How Do We Support Under-represented Groups To Put Themselves Forward?

11-01

Talk: How Do We Support Under-represented Groups To Put Themselves Forward?

11-01

Talk: How Do We Support Under-represented Groups To Put Themselves Forward?

11-01

If you did not already know

11-01

Sharing the Recipe for rOpenSci’s Unconf Ice Breaker

11-01

Sharing the Recipe for rOpenSci’s Unconf Ice Breaker

11-01

Multi-Class Text Classification Model Comparison and Selection

11-01

Python vs R: Head to Head Data Analysis

11-01

Python vs R: Head to Head Data Analysis

11-01

Search labels and IDs from IAB-QAG and IPTC Subject Codes taxonomies

11-01

Search labels and IDs from IAB-QAG and IPTC Subject Codes taxonomies

11-01

Search labels and IDs from IAB-QAG and IPTC Subject Codes taxonomies

11-01

How Data Science Is Improving Higher Education

11-01

How Data Science Is Improving Higher Education

11-01

Data Notes: Chinese Tourism's Impact on Taiwan

11-01

Whats new on arXiv

11-02

Whats new on arXiv

11-02

The blocks and rows theory of data shaping

11-02

Document worth reading: “Transfer Metric Learning: Algorithms, Applications and Outlooks”

11-02

Document worth reading: “Transfer Metric Learning: Algorithms, Applications and Outlooks”

11-02

Quick overview on the new Bioconductor 3.8 release

11-02

Quick overview on the new Bioconductor 3.8 release

11-02

Building Gene Expression Atlases with Deep Generative Models for Single-cell Transcriptomics

12-05

Quick overview on the new Bioconductor 3.8 release

11-02

Data Representation for Natural Language Processing Tasks

11-02

Data Science “Paint by the Numbers” with the Hypothesis Development Canvas

11-02

Distilled News

11-02

“What Happened Next Tuesday: A New Way To Understand Election Results”

11-02

“What Happened Next Tuesday: A New Way To Understand Election Results”

11-02

“What Happened Next Tuesday: A New Way To Understand Election Results”

11-02

Learn how machine learning is transforming business

11-02

Learn how machine learning is transforming business, Nov 12 Webinar

11-02

Learn how machine learning is transforming business

11-02

Learn how machine learning is transforming business, Nov 12 Webinar

11-02

Learn how machine learning is transforming business

11-02

Learn how machine learning is transforming business, Nov 12 Webinar

11-02

Master R shiny: One trick to build maintainable and scalable event chains

11-02

Pulse of the Competition: November Edition

11-02

Pulse of the Competition: November Edition

11-02

collateral

11-02

collateral

11-02

How to Land a Job As a Data Scientist in 2019

12-24

The Most in Demand Skills for Data Scientists

11-02

RcppAnnoy 0.0.11

11-02

RcppAnnoy 0.0.11

11-02

RcppAnnoy 0.0.11

11-02

RcppArmadillo 0.9.200.5.0

11-28

RcppMsgPack 0.2.3

11-18

RcppAnnoy 0.0.11

11-02

“We are reluctant to engage in post hoc speculation about this unexpected result, but it does not clearly support our hypothesis”

11-03

“We are reluctant to engage in post hoc speculation about this unexpected result, but it does not clearly support our hypothesis”

11-03

“We are reluctant to engage in post hoc speculation about this unexpected result, but it does not clearly support our hypothesis”

11-03

Back by popular demand . . . The Greatest Seminar Speaker contest!

01-04

“We are reluctant to engage in post hoc speculation about this unexpected result, but it does not clearly support our hypothesis”

11-03

Document worth reading: “A User’s Guide to Support Vector Machines”

11-03

Introducing medical language processing with Amazon Comprehend Medical

11-27

If you did not already know

11-03

If you did not already know

11-03

Book Review – Sound Analysis and Synthesis with R

11-03

coalesce with wrapr

11-03

coalesce with wrapr

11-03

Using the Economics Value Curve to Drive Digital Transformation

12-27

coalesce with wrapr

11-03

coalesce with wrapr

11-03

Visualize the Business Value of your Predictive Models with modelplotr

11-03

A quick look at GHCN version 4

11-03

A quick look at GHCN version 4

11-03

Le Monde puzzle [#1075]

11-21

Le Monde puzzle [#1073]

11-03

Le Monde puzzle [#1073]

11-03

Le Monde puzzle [#1073]

11-03

Le Monde puzzle [#1073]

11-03

Le Monde puzzle [#1073]

11-03

R Packages worth a look

11-20

R Packages worth a look

11-03

R Packages worth a look

11-03

R Packages worth a look

11-03

Reflections on remote data science work

11-03

RProtoBuf 0.4.13 (and 0.4.12)

11-03

RProtoBuf 0.4.13 (and 0.4.12)

11-03

RProtoBuf 0.4.13 (and 0.4.12)

11-03

R Packages worth a look

11-04

Building a Repository of Alpine-based Docker Images for R, Part I

11-04

Building a Repository of Alpine-based Docker Images for R, Part I

11-04

Cornell prof (but not the pizzagate guy!) has one quick trick to getting 1700 peer reviewed publications on your CV

11-04

Cornell prof (but not the pizzagate guy!) has one quick trick to getting 1700 peer reviewed publications on your CV

11-04

Polished statistical analysis chapters in evidence-based software engineering

11-24

Cornell prof (but not the pizzagate guy!) has one quick trick to getting 1700 peer reviewed publications on your CV

11-04

R tip: Make Your Results Clear with sigr

11-04

R tip: Make Your Results Clear with sigr

11-04

If you did not already know

11-04

If you did not already know

11-04

Document worth reading: “Deep Learning for Image Denoising: A Survey”

11-04

Document worth reading: “Artificial Intelligence for Long-Term Robot Autonomy: A Survey”

11-04

Top 10 Books on NLP and Text Analysis

01-09

Data Science With R Course Series – Week 8

11-05

Telling Truth from Hype When Hunting for Data Science Work

11-05

Telling Truth from Hype When Hunting for Data Science Work

11-05

Coding Gradient boosted machines in 100 lines of code

11-05

Coding Gradient boosted machines in 100 lines of code

11-05

Coding Gradient boosted machines in 100 lines of code

11-05

Vanderbilt University: Lecturer in Data and Analytics [Nashville, TN]

11-05

Vanderbilt University: Sr Lecturer in Data and Analytics [Nashville, TN]

11-05

Vanderbilt University’s Peabody College: Sr. Lecturer in Data and Analytics [Nashville, TN]

11-05

NG "roll returns" – inflection point?

11-05

NG "roll returns" – inflection point?

11-05

NG "roll returns" – inflection point?

11-05

Distilled News

11-05

Bright Lights, Bright Future. TDWI Is Back in Vegas

11-14

Dr. Data Show Video: What the Hell Does “Data Science” Really Mean?

11-10

Maps, models, and analytic problem framing

11-05

Import AI 119: How to benefit AI research in Africa; German politician calls for billions in spending to prevent country being left behind; and using deep learning to spot thefts

11-05

R Packages worth a look

11-05

Vanderbilt University’s Peabody College: Lecturer in Data and Analytics [Online Teaching]

11-05

Creating GIFs with OpenCV

11-05

How a meme grew into a campaign slogan

11-05

How a meme grew into a campaign slogan

11-05

“A Guide to Working With Census Data in R” is now Complete!

11-05

“A Guide to Working With Census Data in R” is now Complete!

11-05

Maps of the issues mentioned most in election advertising

11-05

India vs US – Kaggle Users & Data Scientists

11-05

India vs US – Kaggle Users & Data Scientists

11-05

Quantum Machine Learning: A look at myths, realities, and future projections

11-05

The purported CSI effect and the retroactive precision fallacy

11-05

The purported CSI effect and the retroactive precision fallacy

11-05

The purported CSI effect and the retroactive precision fallacy

11-05

Lifecycle configuration update for Amazon SageMaker notebook instances

11-06

Lifecycle configuration update for Amazon SageMaker notebook instances

11-06

Mastering the Learning Rate to Speed Up Deep Learning

11-06

Mastering the Learning Rate to Speed Up Deep Learning

11-06

Mastering the Learning Rate to Speed Up Deep Learning

11-06

Using httr to Detect HTTP(s) Redirects

11-06

Using httr to Detect HTTP(s) Redirects

11-06

Tesseract 4 is here! State of the art OCR in R!

11-06

Tesseract 4 is here! State of the art OCR in R!

11-06

Tesseract 4 is here! State of the art OCR in R!

11-06

Can we predict the crawling of the Google-Bot?

11-06

Can we predict the crawling of the Google-Bot?

11-06

Building Surveillance System Using USB Camera and Wireless-Connected Raspberry Pi

11-06

Building Surveillance System Using USB Camera and Wireless-Connected Raspberry Pi

11-06

Data Science in 30 Minutes with Jake Porway of DataKind

11-06

Data Science in 30 Minutes with Jake Porway of DataKind

11-06

Source and List: Organizing R Shiny Apps

11-06

Fine-tuning for Natural Language Processing

12-28

Source and List: Organizing R Shiny Apps

11-06

Source and List: Organizing R Shiny Apps

11-06

“Statistical and Machine Learning forecasting methods: Concerns and ways forward”

11-06

“Statistical and Machine Learning forecasting methods: Concerns and ways forward”

11-06

R Packages worth a look

11-06

R Packages worth a look

11-06

R Packages worth a look

12-17

R Packages worth a look

11-06

Happy 10th Bday, Rcpp – and welcome release 1.0 !!

11-06

Happy 10th Bday, Rcpp – and welcome release 1.0 !!

11-06

xts 0.11-2 on CRAN

11-06

xts 0.11-2 on CRAN

11-06

xts 0.11-2 on CRAN

11-06

Data Feminism

11-06

All the (NBA) box scores you ever wanted

12-18

More on sigr

11-06

Le Monde puzzle [#1075]

11-21

Document worth reading: “Lectures on Statistics in Theory: Prelude to Statistics in Practice”

11-06

Document worth reading: “Lectures on Statistics in Theory: Prelude to Statistics in Practice”

11-06

R Packages worth a look

01-11

KDnuggets™ News 19:n02, Jan 9: The cold start problem: how to build your machine learning portfolio; 5 Best Data Visualization Libraries

01-09

Top KDnuggets tweets, Jan 02-08: 10 Free Must-Read Books for Machine Learning and Data Science

01-09

Document worth reading: “Small Sample Learning in Big Data Era”

12-14

Document worth reading: “Lectures on Statistics in Theory: Prelude to Statistics in Practice”

11-06

If you did not already know

11-06

If you did not already know

11-06

Document worth reading: “Toward a System Building Agenda for Data Integration”

11-06

Document worth reading: “Toward a System Building Agenda for Data Integration”

11-06

Postdocs and Research fellows for combining probabilistic programming, simulators and interactive AI

11-06

Postdocs and Research fellows for combining probabilistic programming, simulators and interactive AI

11-06

Postdocs and Research fellows for combining probabilistic programming, simulators and interactive AI

11-06

R plus Magento 2 REST API revisited: part 1- authentication and universal search

11-06

R plus Magento 2 REST API revisited: part 1- authentication and universal search

11-06

R plus Magento 2 REST API revisited: part 1- authentication and universal search

11-06

Customize your notebook volume size, up to 16 TB, with Amazon SageMaker

11-06

Turbocharge Tech Transformation: Integrate AI Across Insurance

11-06

Turbocharge Tech Transformation: Integrate AI Across Insurance

11-06

New: Maintained Datasets

11-06

New: Maintained Datasets

11-06

New: Maintained Datasets

11-06

Causal mediation estimation measures the unobservable

11-06

Causal mediation estimation measures the unobservable

11-06

Causal mediation estimation measures the unobservable

11-06

Causal mediation estimation measures the unobservable

11-06

7 Best Practices for Machine Learning on a Data Lake

11-07

“35. What differentiates solitary confinement, county jail and house arrest” and 70 others

11-07

Magister Dixit

11-07

Magister Dixit

11-07

Magister Dixit

11-07

Whats new on arXiv

11-07

If you did not already know

11-07

DePaul University: Professor of Practice position in Data Science [Chicago, IL]

11-07

DePaul University: Two tenure-track/tenured positions in Data Science/Computer Science [Chicago, IL]

11-07

DePaul University: Two tenure-track/tenured positions in Data Science/Computer Science [Chicago, IL]

11-07

If you did not already know

11-07

R Packages worth a look

11-07

R Packages worth a look

11-07

The “probability to win” is hard to estimate…

11-07

The “probability to win” is hard to estimate…

11-07

Python Patterns: max Instead of if

01-10

R Packages worth a look

11-17

The “probability to win” is hard to estimate…

11-07

Working with US Census Data in R

11-07

Working with US Census Data in R

11-07

Working with US Census Data in R

11-07

Why R? 2018 Conference – After Movie and Summary

11-07

Now easily perform incremental learning on Amazon SageMaker

11-07

R Packages worth a look

12-03

Introduction to PyTorch for Deep Learning

11-07

Introduction to PyTorch for Deep Learning

11-07

Integrating R and Telegram

11-07

Integrating R and Telegram

11-07

Integrating R and Telegram

11-07

Integrating R and Telegram

11-07

Top KDnuggets tweets, Oct 31 – Nov 6: 10 More Free Must-Read Books for Machine Learning and Data Science

11-07

Top KDnuggets tweets, Oct 31 – Nov 6: 10 More Free Must-Read Books for Machine Learning and Data Science

11-07

anytime 0.3.3

11-14

anytime 0.3.2

11-07

pinp 0.0.7: More small YAML options

01-11

RcppStreams 0.1.2

01-07

Personal Data Analytics

12-10

RcppMsgPack 0.2.3

11-18

anytime 0.3.3

11-14

anytime 0.3.2

11-07

anytime 0.3.2

11-07

anytime 0.3.3

11-14

anytime 0.3.2

11-07

Latest Trends in Computer Vision Technology and Applications

11-07

Hilary Mason and Gilad Lotan to Keynote at MADS 2019

11-08

Hilary Mason and Gilad Lotan to Keynote at MADS 2019

11-08

Hilary Mason and Gilad Lotan to Keynote at MADS 2019

11-08

10 Free Must-See Courses for Machine Learning and Data Science

11-08

10 Free Must-See Courses for Machine Learning and Data Science

11-08

10 Free Must-See Courses for Machine Learning and Data Science

11-08

AzureR: R packages to control Azure services

11-08

AzureR: R packages to control Azure services

11-08

In case you missed it: December 2018 roundup

01-04

AzureStor: an R package for working with Azure storage

12-18

AzureStor: an R package for working with Azure storage

12-18

AzureR: R packages to control Azure services

11-08

AzureR: R packages to control Azure services

11-08

AzureR packages now on CRAN

01-08

AzureR packages now on CRAN

01-08

AzureR: R packages to control Azure services

11-08

AzureR: R packages to control Azure services

11-08

Carlos: ‘Everything Dataquest showed me, I use in my new job’

11-08

How Miguel Got 3 Data Science Job Offers Fast With Dataquest

12-24

Carlos: ‘Everything Dataquest showed me, I use in my new job’

11-08

Best Practices for Using Notebooks for Data Science

11-08

anytime – dates in R

11-08

anytime – dates in R

11-08

The Essence of Machine Learning

12-28

Melanie Mitchell says, “As someone who has worked in A.I. for decades, I’ve witnessed the failure of similar predictions of imminent human-level A.I., and I’m certain these latest forecasts will fall short as well. “

11-08

How to sync Fastmail's CardDAV to use with mutt + abook

11-08

How to sync Fastmail's CardDAV to use with mutt + abook

11-08

How to sync Fastmail's CardDAV to use with mutt + abook

11-08

How to sync Fastmail's CardDAV to use with mutt + abook

11-08

Introducing Webhooks — Fastest Way to Collect Data

11-08

Introducing Webhooks — Fastest Way to Collect Data

11-08

Introducing Webhooks — Fastest Way to Collect Data

11-08

Rcpp 1.0.0: The Tenth Birthday Release

11-08

Rcpp 1.0.0: The Tenth Birthday Release

11-08

If you did not already know

11-22

R Packages worth a look

11-08

R Packages worth a look

11-08

Egg-Not-Egg Deep Learning Model

11-08

Egg-Not-Egg Deep Learning Model

11-08

Watch out for naively (because implicitly based on flat-prior) Bayesian statements based on classical confidence intervals! (Comptroller of the Currency edition)

11-08

Exploring the Gender Pay Gap with Publicly Available Data

12-12

Watch out for naively (because implicitly based on flat-prior) Bayesian statements based on classical confidence intervals! (Comptroller of the Currency edition)

11-08

Watch out for naively (because implicitly based on flat-prior) Bayesian statements based on classical confidence intervals! (Comptroller of the Currency edition)

11-08

5 Critical Steps to Predictive Business Analytics

11-08

Your Client Engagement Program Isn't Doing What You Think It Is.

11-08

Your Client Engagement Program Isn't Doing What You Think It Is.

11-08

Your Client Engagement Program Isn't Doing What You Think It Is.

11-08

Melanie Miller says, “As someone who has worked in A.I. for decades, I’ve witnessed the failure of similar predictions of imminent human-level A.I., and I’m certain these latest forecasts will fall short as well. “

11-08

If you did not already know

11-08

If you did not already know

11-08

Document worth reading: “An Overview of Blockchain Integration with Robotics and Artificial Intelligence”

11-08

Document worth reading: “A Tutorial on Modeling and Inference in Undirected Graphical Models for Hyperspectral Image Analysis”

11-09

Document worth reading: “A Tutorial on Modeling and Inference in Undirected Graphical Models for Hyperspectral Image Analysis”

11-09

Why would I ever NEED Bayesian Statistics?

11-09

Coding Regression trees in 150 lines of R code

11-09

Coding Regression trees in 150 lines of R code

11-09

Coding Regression trees in 150 lines of R code

11-09

Coding Regression trees in 150 lines of R code

11-09

simmer 4.1.0

11-09

simmer 4.1.0

11-09

“Recapping the recent plagiarism scandal”

11-09

“Recapping the recent plagiarism scandal”

11-09

“Recapping the recent plagiarism scandal”

11-09

Advanced Jupyter Notebooks: A Tutorial

01-02

Escaping the macOS 10.14 (Mojave) Filesystem Sandbox with R / RStudio

11-09

T-mobile uses R for Customer Service AI

11-09

Image segmentation based on Superpixels and Clustering

11-09

Image segmentation based on Superpixels and Clustering

11-09

R Packages worth a look

11-09

R Packages worth a look

11-09

R Packages worth a look

11-09

Top December Stories: Why You Shouldn’t be a Data Science Generalist

01-09

Top October Stories: 9 Must-have skills you need to become a Data Scientist, updated; 10 Best Mobile Apps for Data Scientist / Data Analysts

11-09

Top October Stories: 9 Must-have skills you need to become a Data Scientist, updated; 10 Best Mobile Apps for Data Scientist / Data Analysts

11-09

Matching (and discarding non-matches) to deal with lack of complete overlap, then regression to adjust for imbalance between treatment and control groups

11-10

R Packages worth a look

11-10

R Packages worth a look

11-10

R Packages worth a look

11-10

If you did not already know

12-27

If you did not already know

12-12

R Packages worth a look

11-10

Model evaluation, model selection, and algorithm selection in machine learning

11-10

Model evaluation, model selection, and algorithm selection in machine learning

11-10

Model evaluation, model selection, and algorithm selection in machine learning

11-10

The Gamification Of Fitbit: How an API Provided the Next Level of tRaining

11-10

The Gamification Of Fitbit: How an API Provided the Next Level of tRaining

11-10

The Gamification Of Fitbit: How an API Provided the Next Level of tRaining

11-10

RcppArmadillo 0.9.200.4.0

11-10

RcppArmadillo 0.9.200.4.0

11-10

RcppArmadillo 0.9.200.4.0

11-10

RcppArmadillo 0.9.200.4.0

11-10

Document worth reading: “Advice from the Oracle: Really Intelligent Information Retrieval”

11-10

Document worth reading: “Advice from the Oracle: Really Intelligent Information Retrieval”

11-10

Detailed introduction of “myprettyreport” R package

11-10

Detailed introduction of “myprettyreport” R package

11-10

Detailed introduction of “myprettyreport” R package

11-10

One-arm Bayesian Adaptive Trial Simulation Code

11-10

One-arm Bayesian Adaptive Trial Simulation Code

11-10

One-arm Bayesian Adaptive Trial Simulation Code

11-10

If you did not already know

11-10

2018: Who actually voted? (The real story, not the exit polls.)

11-10

2018: Who actually voted? (The real story, not the exit polls.)

11-10

4 ways to be more efficient using RStudio’s Code Snippets, with 11 ready to use examples

11-10

Voronoi diagram with ggvoronoi package with Train Station data

11-10

Voronoi diagram with ggvoronoi package with Train Station data

11-10

Voronoi diagram with ggvoronoi package with Train Station data

11-10

If you did not already know

11-10

If you did not already know

11-10

If you did not already know

11-10

Characterizing Online Public Discussions through Patterns of Participant Interactions

11-11

Characterizing Online Public Discussions through Patterns of Participant Interactions

11-11

Word associations from the Small World of Words

12-16

Characterizing Online Public Discussions through Patterns of Participant Interactions

11-11

On helping to open the inaugural PyDataPrague meetup

11-11

Cummins: Advanced Analytics Systems Architect Principle [Columbus, IN]

12-12

On helping to open the inaugural PyDataPrague meetup

11-11

On helping to open the inaugural PyDataPrague meetup

11-11

On helping to open the inaugural PyDataPrague meetup

11-11

On helping to open the inaugural PyDataPrague meetup

11-11

On receiving the Community Leadership Award at the NumFOCUS Summit 2018

11-11

On receiving the Community Leadership Award at the NumFOCUS Summit 2018

11-11

On receiving the Community Leadership Award at the NumFOCUS Summit 2018

11-11

R Packages worth a look

11-11

RATest. A Randomization Tests package is available on CRAN

11-11

Hey! Here’s what to do when you have two or more surveys on the same population!

11-11

Hey! Here’s what to do when you have two or more surveys on the same population!

11-11

Data Science in Esports

11-21

Data Science in Esports

11-12

8 Reasons to Take Data Analytics Certification Courses

11-28

Data Science in Esports

11-21

Data Science in Esports

11-12

Data Science in Esports

11-21

Data Science in Esports

11-12

Angela Bassa discusses managing data science teams and much more.

11-12

If you did not already know

11-12

If you did not already know

11-12

If you did not already know

11-12

If you did not already know

11-12

If you did not already know

11-12

More on Bias Corrected Standard Deviation Estimates

11-14

How to de-Bias Standard Deviation Estimates

11-12

How to de-Bias Standard Deviation Estimates

11-12

Which taxonomy should you use to classify news content, IAB-QAG or IPTC Subject Codes?

11-12

Which taxonomy should you use to classify news content, IAB-QAG or IPTC Subject Codes?

11-12

Which taxonomy should you use to classify news content, IAB-QAG or IPTC Subject Codes?

11-12

Which taxonomy should you use to classify news content, IAB-QAG or IPTC Subject Codes?

11-12

Which taxonomy should you use to classify news content, IAB-QAG or IPTC Subject Codes?

11-12

Distilled News

11-12

Visualization research for non-researchers

11-12

Visualization research for non-researchers

11-12

Visualization research for non-researchers

11-12

Visualization research for non-researchers

11-12

Introducing a simple and intuitive Python API for UCI machine learning repository

11-12

Introducing a simple and intuitive Python API for UCI machine learning repository

11-12

Introducing a simple and intuitive Python API for UCI machine learning repository

11-12

Data Science With R Course Series – Week 9

11-12

Document worth reading: “An Introduction to Probabilistic Programming”

11-12

Document worth reading: “An Introduction to Probabilistic Programming”

11-12

Document worth reading: “An Introduction to Probabilistic Programming”

11-12

Healthcare Analytics Made Simple

11-12

Healthcare Analytics Made Simple

11-12

Healthcare Analytics Made Simple

11-12

Time Series and MCHT

11-12

Amazon Polly adds Italian and Castilian Spanish voices, and Mexican Spanish language support

11-12

Dataiku Series C: New Year, New Chapter

12-19

Getting Started with Amazon Comprehend custom entities

11-17

Amazon Polly adds Italian and Castilian Spanish voices, and Mexican Spanish language support

11-12

Amazon Polly adds Italian and Castilian Spanish voices, and Mexican Spanish language support

11-12

Machine Learning Toronto SummitNov 20-21 – Special KDnuggets discount

11-12

Machine Learning Toronto SummitNov 20-21 – Special KDnuggets discount

11-12

Machine Learning Toronto SummitNov 20-21 – Special KDnuggets discount

11-12

Machine Learning Toronto SummitNov 20-21 – Special KDnuggets discount

11-12

“Law professor Alan Dershowitz’s new book claims that political differences have lately been criminalized in the United States. He has it wrong. Instead, the orderly enforcement of the law has, ludicrously, been framed as political.”

11-12

“Law professor Alan Dershowitz’s new book claims that political differences have lately been criminalized in the United States. He has it wrong. Instead, the orderly enforcement of the law has, ludicrously, been framed as political.”

11-12

Distilled News

11-27

The Long Tail of Medical Data

11-12

YOLO object detection with OpenCV

11-12

Whats new on arXiv

11-13

The Antarctic/Southern Ocean rOpenSci community

11-13

The Antarctic/Southern Ocean rOpenSci community

11-13

The Antarctic/Southern Ocean rOpenSci community

11-13

The Antarctic/Southern Ocean rOpenSci community

11-13

Those “other” apply functions…

11-13

Magister Dixit

11-13

LinkedIn Top Voices 2018: Data Science & Analytics

11-13

R Packages worth a look

12-29

How to Find an Entry-Level Job in Data Science

11-13

Help us understand your Data Science goals!

11-13

Site Redesign

12-02

A deep dive into glmnet: penalty.factor

11-13

A deep dive into glmnet: penalty.factor

11-13

Announcing the Winners of the 2018 AWS AI Hackathon

12-05

A deep dive into glmnet: penalty.factor

11-13

R Packages worth a look

11-13

R Packages worth a look

11-13

If you did not already know

11-13

All About Scikit-Learn, with Olivier Grisel

11-13

All About Scikit-Learn, with Olivier Grisel

11-13

Chocolate milk! Another stunning discovery from an experiment on 24 people!

11-13

My R Take in Advent of Code – Day 5

01-03

Chocolate milk! Another stunning discovery from an experiment on 24 people!

11-13

Chocolate milk! Another stunning discovery from an experiment on 24 people!

11-13

Notes on the Frank-Wolfe Algorithm, Part II: A Primal-dual Analysis

11-14

Windows Clipboard Access with R

11-14

Windows Clipboard Access with R

11-14

Distilled News

11-14

Day 22 – little helper get_files

12-22

Day 14 – little helper print_fs

12-14

KDnuggets™ News 18:n47, Dec 12: Common mistakes when doing machine learning; Here are the most popular Python IDEs / Editors

12-12

Day 08 – little helper intersect2

12-08

KDnuggets™ News 18:n43, Nov 14: To get hired as a data scientist, don’t follow the herd; LinkedIn Top Voices in Data Science & Analytics

11-14

R Packages worth a look

11-14

Change over time is not “treatment response”

11-19

R Packages worth a look

11-14

Building a Repository of Alpine-based Docker Images for R, Part II

11-14

Top KDnuggets tweets, Nov 07-13: 10 Free Must-See Courses for Machine Learning and Data Science

11-14

Easy time-series prediction with R: a tutorial with air traffic data from Lux Airport

11-14

Easy time-series prediction with R: a tutorial with air traffic data from Lux Airport

11-14

More Sandwiches, Anyone?

11-14

Document worth reading: “Deep Reinforcement Learning: An Overview”

11-14

NYU Stern: 2019-20 Asst. Professor of Information, Operations & Management Sciences – Information Systems, tenure-track [New York City, NY]

11-14

Apply to NYU Stern’s MS in Business Analytics

01-08

NYU Stern: 2019-20 Asst. Professor of Information, Operations & Management Sciences – Information Systems, tenure-track [New York City, NY]

11-14

Finding a house to buy, using statistics

11-14

Finding a house to buy, using statistics

11-14

Finding a house to buy, using statistics

11-14

What is the Best Python IDE for Data Science?

11-14

More on Bias Corrected Standard Deviation Estimates

11-14

ggmap Tutorial Updated!

12-10

More on Bias Corrected Standard Deviation Estimates

11-14

More on Bias Corrected Standard Deviation Estimates

11-14

Paris ML E#2 S#6: Conscience, Code Analysis, Can a machine learn like a child?

11-14

Robustness checks are a joke

11-14

Robustness checks are a joke

11-14

Robustness checks are a joke

11-14

Robustness checks are a joke

11-14

Document worth reading: “Visions of a generalized probability theory”

11-14

Document worth reading: “Visions of a generalized probability theory”

11-14

AdaSearch: A Successive Elimination Approach to Adaptive Search

11-14

AdaSearch: A Successive Elimination Approach to Adaptive Search

11-14

AdaSearch: A Successive Elimination Approach to Adaptive Search

11-14

Bright Lights, Bright Future. TDWI Is Back in Vegas

11-14

Strategy: Customer Analytics: Are you Profiting from your Data?

11-14

Use GitHub Vulnerability Alerts to Keep Users of Your R Packages Safe

11-14

Rdew Valley: Optimizing Farming with R

11-14

Rdew Valley: Optimizing Farming with R

11-14

Rdew Valley: Optimizing Farming with R

11-14

Part 5: Code corrections to optimism corrected bootstrapping series

12-29

More on Bias Corrected Standard Deviation Estimates

11-14

More on Bias Corrected Standard Deviation Estimates

11-14

The Crime Machine

11-15

The Crime Machine

11-15

The Crime Machine

11-15

The Crime Machine

11-15

The Crime Machine

11-15

University of Rhode Island: Data Scientist, DataSpark (2 Positions) [Kingston, RI]

12-18

What Python editors or IDEs you used the most in 2018?

11-27

Monash University: Lecturer/Sr Lecturer – Digital Health [Melbourne, Australia]

11-22

Monash University: Research Fellow (Digital Civics) [Melbourne, Australia]

11-22

UnitedHealth Group: Sr Director, Decision Analytics [Minnetonka, MN]

11-19

URI: Director, Data Analytics/DataSpark [Kingston, RI]

11-15

Mastering The New Generation of Gradient Boosting

11-15

If you did not already know

11-15

If you did not already know

11-15

How will automation tools change data science?

12-18

If you did not already know

11-15

Rcpp now used by 1500 CRAN packages

11-15

Magister Dixit

11-15

Magister Dixit

11-15

Magister Dixit

11-15

R Packages worth a look

11-15

R Packages worth a look

11-15

R Packages worth a look

11-15

R Packages worth a look

11-15

Quoting Concatenate

12-16

Quoting Concatenate

12-16

How to work with strings in base R – An overview of 20+ methods for daily use.

11-24

Quoting in R

11-15

Quoting in R

11-15

Quoting Concatenate

12-16

Quoting Concatenate

12-16

Quoting in R

11-15

Quoting in R

11-15

Quoting in R

11-15

Quoting in R

11-15

Introducing Drexel new online MS in Data Science

11-15

A deep dive into glmnet: standardize

11-15

A deep dive into glmnet: standardize

11-15

Best Deals in Deep Learning Cloud Providers: From CPU to GPU to TPU

11-15

Best Deals in Deep Learning Cloud Providers: From CPU to GPU to TPU

11-15

Best Deals in Deep Learning Cloud Providers: From CPU to GPU to TPU

11-15

Scikit-learn Tutorial: Machine Learning in Python

11-15

(Webinar) Farmers and Chubb on Humanizing Claims with AI

11-15

(Webinar) Farmers and Chubb on Humanizing Claims with AI

11-15

Visualization of NYC bus delays with R

11-27

Searching for the optimal hyper-parameters of an ARIMA model in parallel: the tidy gridsearch approach

11-15

Make Beautiful Tables with the Formattable Package

11-15

Make Beautiful Tables with the Formattable Package

11-15

Make Beautiful Tables with the Formattable Package

11-15

Make Beautiful Tables with the Formattable Package

11-15

Online Bayesian Deep Learning in Production at Tencent

11-15

My talk tomorrow (Tues) noon at the Princeton University Psychology Department

12-03

The State of the Art

11-15

The State of the Art

11-15

The State of the Art

11-15

Magister Dixit

11-16

Magister Dixit

11-16

Magister Dixit

11-16

Mirrors

11-16

Mirrors

11-16

Mirrors

11-16

Analysis of South African Funds

01-08

Mirrors

11-16

Using Uncertainty to Interpret your Model

11-16

Using Uncertainty to Interpret your Model

11-16

UnitedHealth Group: Sr Manager, Data Engineering [Minnetonka, MN]

11-19

UnitedHealth Group: Senior Principal Data Scientist [Telecommute, Central or Eastern Time Zones]

11-16

Sorry I didn’t get that! How to understand what your users want

11-16

Using a genetic algorithm for the hyperparameter optimization of a SARIMA model

11-16

Example of Overfitting

11-16

Example of Overfitting

11-16

R Packages worth a look

11-16

Hey, check this out: Columbia’s Data Science Institute is hiring research scientists and postdocs!

11-16

Hey, check this out: Columbia’s Data Science Institute is hiring research scientists and postdocs!

11-16

Hey, check this out: Columbia’s Data Science Institute is hiring research scientists and postdocs!

11-16

Because it's Friday: The physics of The Expanse

11-16

Because it's Friday: The physics of The Expanse

11-16

Because it's Friday: The physics of The Expanse

11-16

Because it's Friday: The physics of The Expanse

11-16

“On the Diagramatic Diagnosis of Data” at BudapestBI 2018

11-16

“On the Diagramatic Diagnosis of Data” at BudapestBI 2018

11-16

Distilled News

11-16

Document worth reading: “Saliency Prediction in the Deep Learning Era: An Empirical Investigation”

11-16

Document worth reading: “Saliency Prediction in the Deep Learning Era: An Empirical Investigation”

11-16

If you did not already know

11-16

If you did not already know

11-17

Tis the Season to Check your SSL/TLS Cipher List Thrice (RCurl/curl/openssl)

11-17

Tis the Season to Check your SSL/TLS Cipher List Thrice (RCurl/curl/openssl)

11-17

Document worth reading: “Multi-Agent Reinforcement Learning: A Report on Challenges and Approaches”

11-17

Convert Data Frame to Dictionary List in R

11-17

Convert Data Frame to Dictionary List in R

11-17

“Using numbers to replace judgment”

11-17

Congress Over Time

11-17

Congress Over Time

11-17

Congress Over Time

11-17

Clustering the Bible

12-27

Congress Over Time

11-17

R Packages worth a look

11-17

R Packages worth a look

11-17

Getting Started with Amazon Comprehend custom entities

11-17

RcppGetconf 0.0.3

11-17

RcppGetconf 0.0.3

11-17

Growing List vs Growing Queue

11-18

Growing List vs Growing Queue

11-18

June is applied regression exam month!

12-24

confint3: 2-Sided Confidence Interval (Extended Moodle Version)

12-08

Statistics Sunday: Reading and Creating a Data Frame with Multiple Text Files

11-18

Statistics Sunday: Reading and Creating a Data Frame with Multiple Text Files

11-18

Day 13 – little helper read_files

12-13

Day 11 – little helper trim

12-11

Day 01 – little helper checkdir

12-01

Statistics Sunday: Reading and Creating a Data Frame with Multiple Text Files

11-18

Graphs and tables, tables and graphs

11-18

Document worth reading: “Graphical Models for Processing Missing Data”

11-18

Document worth reading: “Graphical Models for Processing Missing Data”

11-18

Document worth reading: “Graphical Models for Processing Missing Data”

11-18

Using OSX? Compiling an R package from source? Issues with ‘-fopenmp’? Try this.

11-18

Using OSX? Compiling an R package from source? Issues with ‘-fopenmp’? Try this.

11-18

R Packages worth a look

11-18

R Packages worth a look

12-28

epubr 0.5.0 CRAN release

11-18

Don’t Peek part 2: Predictions without Test Data

11-18

Document worth reading: “A Study and Comparison of Human and Deep Learning Recognition Performance Under Visual Distortions”

12-18

Document worth reading: “A Learning Approach to Secure Learning”

11-19

Don’t Peek part 2: Predictions without Test Data

11-18

Easily monitor and visualize metrics while training models on Amazon SageMaker

11-19

If you did not already know

11-19

If you did not already know

11-19

UnitedHealth Group: Sr Manager, Data Engineering [Minnetonka, MN]

11-19

UnitedHealth Group: Director, Omni-Channel Analytics [Minnetonka, MN]

11-19

Hacking Bioconductor

11-19

BERT: State of the Art NLP Model, Explained

12-26

Instance segmentation with OpenCV

11-26

Mask R-CNN with OpenCV

11-19

Mask R-CNN with OpenCV

11-19

Mask R-CNN with OpenCV

11-19

Generating Synthetic Data Sets with ‘synthpop’ in R

01-13

Reduced privacy risk in exchange for accuracy in the Census count

12-06

Zero Counts in dplyr

11-19

Zero Counts in dplyr

11-19

UnitedHealth Group: Sr Manager, Data Science [Telecommute, Central or Eastern Time Zones]

11-19

Predictive Analytics in 2018: Salaries & Industry Shifts

11-19

Predictive Analytics in 2018: Salaries & Industry Shifts

11-19

The Role of Theory in Data Analysis

12-11

Neural networks to generate music

11-19

Neural networks to generate music

11-19

Neural networks to generate music

11-19

Neural networks to generate music

11-19

Amazon SageMaker Automatic Model Tuning becomes more efficient with warm start of hyperparameter tuning jobs

11-19

Amazon SageMaker Automatic Model Tuning becomes more efficient with warm start of hyperparameter tuning jobs

11-19

Analyze live video at scale in real time using Amazon Kinesis Video Streams and Amazon SageMaker

11-19

What I Learned About Machine Learning at ODSC West 2018

11-19

What I Learned About Machine Learning at ODSC West 2018

11-19

Tom Wolfe

11-19

Tom Wolfe

11-19

Tom Wolfe

11-19

Detect suspicious IP addresses with the Amazon SageMaker IP Insights algorithm

11-19

Detect suspicious IP addresses with the Amazon SageMaker IP Insights algorithm

11-19

The Distribution of Time Between Recessions: Revisited (with MCHT)

11-19

The Big Data Game Board™

11-19

Import AI 121: Sony researchers make ultra-fast ImageNet training breakthrough; Berkeley researchers tackle StarCraft II with modular RL system; and Germany adds €3bn for AI research

11-19

Build Your Own Natural Language Models on AWS (no ML experience required)

11-19

Change over time is not “treatment response”

11-19

How Important is that Machine Learning Model be Understandable? We analyze poll results

11-19

How Important is that Machine Learning Model be Understandable? We analyze poll results

11-19

UnitedHealth Group: Sr Manager, Data Engineering [Minnetonka, MN]

11-19

UnitedHealth Group: Sr Manager, Data Engineering [Minnetonka, MN]

11-19

“The hype economy”

11-20

“The hype economy”

11-20

“The hype economy”

11-20

Introducing Octoparse New Version 7.1 – web scraping for dummies is official

11-20

vitae: Dynamic CVs with R Markdown

01-10

An even better rOpenSci website with Hugo

01-09

The JapanR Conference 2018 Round-Up!

12-06

Introducing Octoparse New Version 7.1 – web scraping for dummies is official

11-20

Generating data to explore the myriad causal effects that can be estimated in observational data analysis

11-20

If you did not already know

12-09

Plotting wind highways using rWind

11-26

R Packages worth a look

11-20

Quantcast: Sr Applied Scientist, Audience Platform [Seattle, WA]

11-20

Data Shows No Increase In NYC Plowing as Storm Picked Up

11-20

Data Shows No Increase In NYC Plowing as Storm Picked Up

11-20

Data Shows No Increase In NYC Plowing as Storm Picked Up

11-20

Data Shows No Increase In NYC Plowing as Storm Picked Up

11-20

R Packages worth a look

11-20

R Packages worth a look

11-20

R Packages worth a look

11-20

Rev Summit for Data Science Leaders featuring Daniel Kahneman

01-07

Address Your Data Science Strategy at DSNY

11-20

Checklist Recipe – How we created a template to standardize species data

11-20

Checklist Recipe – How we created a template to standardize species data

11-20

Mega-PAW Las Vegas Registration is Live & Super Early Bird Pricing is Now Available!

11-20

Amazon Transcribe now supports real-time transcriptions

11-20

Introducing pipe, The Automattic Machine Learning Pipeline

11-20

Introducing pipe, The Automattic Machine Learning Pipeline

11-20

Introducing pipe, The Automattic Machine Learning Pipeline

11-20

The Right Kind of Internal Motivation Can Improve Your Studies

01-08

Forget Motivation and Double Your Chances of Learning Success

11-20

Slides from my talks about Demystifying Big Data and Deep Learning (and how to get started)

11-20

Slides from my talks about Demystifying Big Data and Deep Learning (and how to get started)

11-20

Back by popular demand . . . The Greatest Seminar Speaker contest!

01-04

An Overview of the Singapore Hiring Landscape

11-21

The best way to visit Luxembourguish castles is doing data science + combinatorial optimization

11-21

Good Feature Building Techniques and Tricks for Kaggle

12-31

The best way to visit Luxembourguish castles is doing data science + combinatorial optimization

11-21

The best way to visit Luxembourguish castles is doing data science + combinatorial optimization

11-21

The best way to visit Luxembourguish castles is doing data science + combinatorial optimization

11-21

AI, Machine Learning and Data Science Roundup: November 2018

11-21

RTutor: Driving Electric or Gasoline Cars? Comparing the Pollution Damages

11-21

RTutor: Driving Electric or Gasoline Cars? Comparing the Pollution Damages

11-21

R Packages worth a look

11-21

Site Redesign

12-02

R > Python: a Concrete Example

11-21

R Packages worth a look

11-21

Driving Success through Business Insight, One Customer at a Time

11-21

Using a Keras Long Short-Term Memory (LSTM) Model to Predict Stock Prices

11-21

A Bayesian take on ballot order effects

11-21

A Bayesian take on ballot order effects

11-21

When cycling is faster than driving

12-11

A Bayesian take on ballot order effects

11-21

Document worth reading: “The Algorithm Selection Competition Series 2015-17”

11-21

WPI: Post-Doctoral Fellow [Worcester, MA]

11-21

WPI: Research Scientist [Worcester, MA]

11-30

WPI: Post-Doctoral Fellow [Worcester, MA]

11-21

R > Python: a Concrete Example

11-21

R > Python: a Concrete Example

11-21

R > Python: a Concrete Example

11-21

Le Monde puzzle [#1075]

11-21

Join the World’s Biggest Deep Learning Summit – KDnuggets Early Cyber Monday

11-21

Join the World’s Biggest Deep Learning Summit – KDnuggets Early Cyber Monday

11-21

New Features For Amazon SageMaker: Workflows, Algorithms, and Accreditation

11-21

A short proof for Nesterov’s momentum

11-21

A short proof for Nesterov’s momentum

11-21

A short proof for Nesterov’s momentum

11-21

Scrapping data about Australian politicians with RSelenium

11-21

Building a conversational business intelligence bot with Amazon Lex

11-21

Data Tools We're Thankful For

11-22

Document worth reading: “A Survey on Trust Modeling from a Bayesian Perspective”

11-22

OpenCPU 2.1 Release: Scalable R Services

11-22

OpenCPU 2.1 Release: Scalable R Services

11-22

6 Goals Every Wannabe Data Scientist Should Make for 2019

11-22

Monash University: Lecturer/Sr Lecturer – Digital Health [Melbourne, Australia]

11-22

KNNs (K-Nearest-Neighbours) in Python

11-22

“She also observed that results from smaller studies conducted by NGOs – often pilot studies – would often look promising. But when governments tried to implement scaled-up versions of those programs, their performance would drop considerably.”

11-22

Dealing with failed projects

11-22

Dealing with failed projects

11-22

If you did not already know

11-22

Cartoon: Thanksgiving, Big Data, and Turkey Data Science.

11-22

Cartoon: Thanksgiving, Big Data, and Turkey Data Science.

11-22

Beautiful Chaos: The Double Pendulum

11-22

Beautiful Chaos: The Double Pendulum

11-22

High-performance mathematical paradigms in Python

11-22

R Packages worth a look

11-22

R Packages worth a look

11-28

R Packages worth a look

11-22

R Packages worth a look

11-22

RFishBC CRAN Release

11-22

RFishBC CRAN Release

11-22

RFishBC CRAN Release

11-22

Interactive Graphics with R Shiny

11-23

Interactive Graphics with R Shiny

11-23

Whats new on arXiv

11-23

Document worth reading: “To Cluster, or Not to Cluster: An Analysis of Clusterability Methods”

11-23

Counting digits by @ellis2013nz

11-23

Counting digits by @ellis2013nz

11-23

Counting digits by @ellis2013nz

11-23

Counting digits by @ellis2013nz

11-23

Counting digits by @ellis2013nz

11-23

If you did not already know

11-23

R Packages worth a look

11-23

Document worth reading: “Learning From Positive and Unlabeled Data: A Survey”

11-23

The evolution of pace in popular movies

11-24

The evolution of pace in popular movies

11-24

R Packages worth a look

11-24

Considering sensitivity to unmeasured confounding: part 2

01-10

Considering sensitivity to unmeasured confounding: part 1

01-02

R Packages worth a look

11-24

R Packages worth a look

11-24

R Packages worth a look

11-24

R Packages worth a look

11-24

lmer vs INLA for variance components

11-24

lmer vs INLA for variance components

11-24

lmer vs INLA for variance components

11-24

More Robust Monotonic Binning Based on Isotonic Regression

11-24

More Robust Monotonic Binning Based on Isotonic Regression

11-24

Polished statistical analysis chapters in evidence-based software engineering

11-24

Polished statistical analysis chapters in evidence-based software engineering

11-24

RcppEigen 0.3.3.5.0

11-24

RcppEigen 0.3.3.5.0

11-24

RcppEigen 0.3.3.5.0

11-24

OneR – fascinating insights through simple rules

11-25

OneR – fascinating insights through simple rules

11-24

OneR – fascinating insights through simple rules

11-24

Quidditch: is it all about the Snitch?

11-24

Quidditch: is it all about the Snitch?

11-24

Document worth reading: “Customised Structural Elicitation”

11-25

Document worth reading: “Customised Structural Elicitation”

11-25

Document worth reading: “Customised Structural Elicitation”

11-25

Document worth reading: “Customised Structural Elicitation”

11-25

When “nudge” doesn’t work: Medication Reminders to Outcomes After Myocardial Infarction

12-19

Extracting data from news articles: Australian pollution by postcode

11-28

These 3 problems destroy many clinical trials (in context of some papers on problems with non-inferiority trials, or problems with clinical trials in general)

11-25

Statistics Sunday: Introduction to Regular Expressions

11-25

Statistics Sunday: Introduction to Regular Expressions

11-25

RQuantLib 0.4.6: Updated upstream, and calls for help

11-25

RQuantLib 0.4.6: Updated upstream, and calls for help

11-25

If you did not already know

11-25

If you did not already know

11-25

New version of pqR, with major speed improvements

11-25

Improving Binning by Bootstrap Bumping

11-25

Improving Binning by Bootstrap Bumping

11-25

A tutorial on tidy cross-validation with R

11-25

Document worth reading: “Internet of Things: An Overview”

11-25

Document worth reading: “Internet of Things: An Overview”

11-25

Amazon’s own ‘Machine Learning University’ now available to all developers

11-26

Amazon’s own ‘Machine Learning University’ now available to all developers

11-26

Amazon’s own ‘Machine Learning University’ now available to all developers

11-26

Amazon’s own ‘Machine Learning University’ now available to all developers

11-26

Open Workshop: Data Visualization in R and ggplot2, January 25th in Munich

11-26

Open Workshop: Data Visualization in R and ggplot2, January 25th in Munich

11-26

Project planning with plotly

11-26

Physics-Based Learned Design: Teaching a Microscope How to Image

11-26

Cathy O’Neil discusses the current lack of fairness in artificial intelligence and much more.

11-26

Import AI: 122: Google obtains new ImageNet state-of-the-art with GPipe; drone learns to land more effectively than PD controller policy; and Facebook releases its ‘CherryPi’ StarCraft bot

11-26

Import AI: 122: Google obtains new ImageNet state-of-the-art with GPipe; drone learns to land more effectively than PD controller policy; and Facebook releases its ‘CherryPi’ StarCraft bot

11-26

Import AI: 122: Google obtains new ImageNet state-of-the-art with GPipe; drone learns to land more effectively than PD controller policy; and Facebook releases its ‘CherryPi’ StarCraft bot

11-26

R Packages worth a look

11-26

Whats new on arXiv

11-26

Instance segmentation with OpenCV

11-26

Instance segmentation with OpenCV

11-26

Global Legal Entity Identifier Foundation (GLEIF): Data Analyst [Frankfurt, Germany]

11-26

Global Legal Entity Identifier Foundation (GLEIF): Data Analyst [Frankfurt, Germany]

11-26

Global Legal Entity Identifier Foundation (GLEIF): Data Analyst [Frankfurt, Germany]

11-26

Talking on “High Performance Python” at Linuxing In London last week

11-26

Talking on “High Performance Python” at Linuxing In London last week

11-26

Data Science Strategy Safari: Aligning Data Science Strategy to Org Strategy

11-26

Data Science Strategy Safari: Aligning Data Science Strategy to Org Strategy

11-26

AzureRMR: an R interface to Azure Resource Manager

11-26

AzureRMR: an R interface to Azure Resource Manager

11-26

AzureRMR: an R interface to Azure Resource Manager

11-26

AzureRMR: an R interface to Azure Resource Manager

11-26

AzureRMR: an R interface to Azure Resource Manager

11-26

AzureRMR: an R interface to Azure Resource Manager

11-26

R Packages worth a look

11-26

Distilled News

11-26

Plotting wind highways using rWind

11-26

Plotting wind highways using rWind

11-26

Peak Non-Creepy Dating Pool

11-27

Peak Non-Creepy Dating Pool

11-27

Amazon Launches Machine Learning University

11-27

$ vs. votes

11-27

Who is the greatest finisher in soccer?

01-10

Who is the greatest finisher in soccer?

01-10

Network Centrality in R: New ways of measuring Centrality

12-12

$ vs. votes

11-27

How to Engineer Your Way Out of Slow Models

11-27

Drexel University: 2 Teaching Faculty Positions in Data Science [Philadelphia, PA]

11-27

How to Gather Your Own Data by Conducting a Great Survey

11-27

How to Gather Your Own Data by Conducting a Great Survey

11-27

Introducing medical language processing with Amazon Comprehend Medical

11-27

Introducing medical language processing with Amazon Comprehend Medical

11-27

Document worth reading: “An exploration of algorithmic discrimination in data and classification”

11-27

Document worth reading: “An exploration of algorithmic discrimination in data and classification”

11-27

Co-localization analysis of fluorescence microscopy images

11-27

How to get the homology of a antibody using R

12-02

Co-localization analysis of fluorescence microscopy images

11-27

Horses for courses, or to each model its own (causal effect)

11-28

Introducing Dynamic Training for deep learning with Amazon EC2

11-27

My introductory course on Bayesian statistics

12-12

Making Machine Learning Accessible [Webinar Replay]

11-27

Making Machine Learning Accessible [Webinar Replay]

11-27

Distilled News

11-27

Humana: Principal Data Scientist/Informatics Principal [Chicago, IL, Dallas, TX and Louisville, KY]

11-27

styler 1.1.0

11-27

styler 1.1.0

11-27

If you did not already know

11-27

If you did not already know

11-27

If you did not already know

11-27

If you did not already know

11-27

Visualization of NYC bus delays with R

11-27

Visualization of NYC bus delays with R

11-27

NYC buses: C5.0 classification with R; more than 20 minute delay?

12-01

NYC buses: simple Cubist regression

11-29

Visualization of NYC bus delays with R

11-27

Lessons from posting a fake map about pies

11-28

Lessons from posting a fake map about pies

11-28

Le Monde puzzle [#1078]

11-28

Le Monde puzzle [#1078]

11-28

Filter Clickbait from News Content with our custom Natural Language Processing Model

11-28

NYC buses: company level predictors with R

11-28

NYC buses: company level predictors with R

11-28

Introducing Amazon Translate Custom Terminology

11-28

Introducing Amazon Translate Custom Terminology

11-28

Introducing Amazon Translate Custom Terminology

11-28

Multilevel models for multiple comparisons! Varying treatment effects!

11-28

ICML 2019: Some Changes and Call for Papers

11-28

ICML 2019: Some Changes and Call for Papers

11-28

ICML 2019: Some Changes and Call for Papers

11-28

ICML 2019: Some Changes and Call for Papers

11-28

The new pqR parser, and R’s “else” problem

11-28

The new pqR parser, and R’s “else” problem

11-28

The new pqR parser, and R’s “else” problem

11-28

The new pqR parser, and R’s “else” problem

11-28

Whats new on arXiv

11-28

Extracting data from news articles: Australian pollution by postcode

11-28

R now supported in Azure SQL Database

11-28

Deep Learning Cheat Sheets

11-28

Deep Learning Cheat Sheets

11-28

Document worth reading: “Legible Normativity for AI Alignment: The Value of Silly Rules”

11-28

Document worth reading: “Legible Normativity for AI Alignment: The Value of Silly Rules”

11-28

Document worth reading: “Legible Normativity for AI Alignment: The Value of Silly Rules”

11-28

Marginal Effects for (mixed effects) regression models

11-28

Marginal Effects for (mixed effects) regression models

11-28

R Packages worth a look

11-28

R Packages worth a look

11-28

Sales Forecasting Using Facebook’s Prophet

11-28

KDnuggets™ News 18:n47, Dec 12: Common mistakes when doing machine learning; Here are the most popular Python IDEs / Editors

12-12

KDnuggets™ News 18:n45, Nov 28: Your Favorite Python IDE/editor? Intro to Data Science for Managers

11-28

Document worth reading: “An Introductory Survey on Attention Mechanisms in NLP Problems”

11-28

Document worth reading: “An Introductory Survey on Attention Mechanisms in NLP Problems”

11-28

Document worth reading: “An Introductory Survey on Attention Mechanisms in NLP Problems”

11-28

How to Build a Machine Learning Team When You Are Not Google or Facebook

11-28

The Role of the Data Engineer is Changing

01-10

How to Build a Machine Learning Team When You Are Not Google or Facebook

11-28

Top KDnuggets tweets, Nov 21-27: Intro to

11-28

Top KDnuggets tweets, Nov 21-27: Intro to

11-28

Intuit: Staff Data Scientist [Mountain View, CA]

12-12

Top KDnuggets tweets, Nov 21-27: Intro to

11-28

Plotting Scottish census data with some tidyverse magic

11-28

Plotting Scottish census data with some tidyverse magic

11-28

R now supported in Azure SQL Database

11-28

R now supported in Azure SQL Database

11-28

R now supported in Azure SQL Database

11-28

RcppArmadillo 0.9.200.5.0

11-28

How to Find Mentors for Data Science?

11-29

How to Find Mentors for Data Science?

11-29

Community Call Summary – Code Review in the Lab

11-29

Community Call Summary – Code Review in the Lab

11-29

Linking Data Science Activities to Business Initiatives Using the Hypothesis Development Canvas

11-29

Teaching kids data visualization

11-29

Teaching kids data visualization

11-29

Teaching kids data visualization

11-29

NYC buses: simple Cubist regression

11-29

If you did not already know

11-29

Create 3D County Maps Using Density as Z-Axis

11-29

Create 3D County Maps Using Density as Z-Axis

11-29

Amazon SageMaker notebooks now support Git integration for increased persistence, collaboration, and reproducibility

11-29

Students Combat MS with Data Science

11-29

Students Combat MS with Data Science

11-29

Students Combat MS with Data Science

11-29

Combating Customer Churn with AI

11-29

Combating Customer Churn with AI

11-29

Designing a Self-Learning Tic-Tac-Toe Player

11-29

Robin Pemantle’s updated bag of tricks for math teaching!

01-04

Free ebook: Exploring Data with python

11-29

Serve yourself. The Next-Generation of Data Analytics. Dec 6 Webinar

11-29

Serve yourself. The Next-Generation of Data Analytics. Dec 6 Webinar

11-29

Document worth reading: “Big Data and Fog Computing”

11-29

“And when you did you weren’t much use, you didn’t even know what a peptide was”

11-29

“And when you did you weren’t much use, you didn’t even know what a peptide was”

11-29

“And when you did you weren’t much use, you didn’t even know what a peptide was”

11-29

Anomaly detection on Amazon DynamoDB Streams using the Amazon SageMaker Random Cut Forest algorithm

12-05

TSstudio 0.1.3

12-02

Amazon SageMaker now comes with new capabilities for accelerating machine learning experimentation

11-29

Amazon SageMaker now comes with new capabilities for accelerating machine learning experimentation

11-29

University of Tennessee Knoxville: Assistant or Associate Professor in Data Science [Knoxville, TN]

11-30

Java Object Tracking for Cars

11-30

Java Object Tracking for Cars

11-30

R Packages worth a look

11-30

NYC buses: Cubist regression with more predictors

11-30

NYC buses: Cubist regression with more predictors

11-30

NYC buses: C5.0 classification with R; more than 20 minute delay?

12-01

NYC buses: Cubist regression with more predictors

11-30

Faster garbage collection in pqR

11-30

Faster garbage collection in pqR

11-30

Number of births in the twentieth century by @ellis2013nz

11-30

Number of births in the twentieth century by @ellis2013nz

11-30

Creating and saving multiple plots to Powerpoint

11-30

Creating and saving multiple plots to Powerpoint

11-30

Creating and saving multiple plots to Powerpoint

11-30

Introducing the First AI / Machine Learning Course With a Job Guarantee

11-30

The Semantic Web: Where is it now?

12-23

Introducing the First AI / Machine Learning Course With a Job Guarantee

11-30

Simulating dinosaur populations, with R

11-30

Simulating dinosaur populations, with R

11-30

Simulating dinosaur populations, with R

11-30

Babe Didrikson Zaharias (2) vs. Adam Schiff; Sid Caesar advances

01-10

Simulating dinosaur populations, with R

11-30

Simulating dinosaur populations, with R

11-30

Simulating dinosaur populations, with R

11-30

Simulating dinosaur populations, with R

11-30

The Future of AI is the Enterprise

11-30

Defining visualization literacy

11-30

Defining visualization literacy

11-30

Day 01 – little helper checkdir

12-01

Using R: the best thing I’ve changed about my code in years

12-01

Using R: the best thing I’ve changed about my code in years

12-01

A Programmer’s Introduction to Mathematics

12-01

Magister Dixit

12-01

Magister Dixit

12-01

Free Machine Learning Textbook

12-01

Free Machine Learning Textbook

12-01

Free Machine Learning Textbook

12-01

R Packages worth a look

12-01

Why R for data science – and not Python?

12-02

December Reading for Econometricians

12-02

R plus Magento 2 REST API revisited: part 3 – more complex samples of use

12-02

R plus Magento 2 REST API revisited: part 3 – more complex samples of use

12-02

Document worth reading: “Comparative Study on Generative Adversarial Networks”

12-02

Document worth reading: “Comparative Study on Generative Adversarial Networks”

12-02

If you did not already know

12-02

Site Redesign

12-02

TSstudio 0.1.3

12-02

Day 02 – little helper na_omitlist

12-02

Day 12 – little helper dive

12-12

Day 10 – little helper %nin%

12-10

Day 09 – little helper object_size_in_env

12-09

Day 08 – little helper intersect2

12-08

Day 07 – little helper count_na

12-07

Day 02 – little helper na_omitlist

12-02

The p-value is 4.76×10^−264

12-02

The p-value is 4.76×10^−264

12-02

R Packages worth a look

12-02

How to get the homology of a antibody using R

12-02

How to get the homology of a antibody using R

12-02

Leaving NYC for Nashville

12-03

Leaving NYC for Nashville

12-03

Leaving NYC for Nashville

12-03

Improve your AI and Machine Learning skills at AI NEXTCon in Seattle, Jan 23-27

01-03

Twas the Night Before Analysis or A Visit from the Chief Data Scientist

12-24

Spark + AI Summit: learn best practices in ML and DL, latest frameworks, and more – special KDnuggets offer

12-14

Leaving NYC for Nashville

12-03

Very Non-Standard Calling in R

12-03

Very Non-Standard Calling in R

12-03

Very Non-Standard Calling in R

12-03

Very Non-Standard Calling in R

12-03

AzureVM: managing virtual machines in Azure

12-03

AzureVM: managing virtual machines in Azure

12-03

AzureVM: managing virtual machines in Azure

12-03

AzureVM: managing virtual machines in Azure

12-03

AzureVM: managing virtual machines in Azure

12-03

AzureVM: managing virtual machines in Azure

12-03

Monash University: Research Fellow (Bioinformatics) [Melbourne, Australia]

12-03

Monash University: Research Fellow (Bioinformatics) [Melbourne, Australia]

12-03

Compare population age structures of Europe NUTS-3 regions and the US counties using ternary color-coding

12-03

Compare population age structures of Europe NUTS-3 regions and the US counties using ternary color-coding

12-03

Compare population age structures of Europe NUTS-3 regions and the US counties using ternary color-coding

12-03

Back by popular demand . . . The Greatest Seminar Speaker contest!

01-04

Making a Profit with Henry Wan in Arkham Horror: The Card Game

12-03

Making a Profit with Henry Wan in Arkham Horror: The Card Game

12-03

Document worth reading: “A Survey of Modern Object Detection Literature using Deep Learning”

12-03

Document worth reading: “A Survey of Modern Object Detection Literature using Deep Learning”

12-03

Import AI: 123: Facebook sees demands for deep learning services in its data centers grow by 3.5X; why advanced AI might require a global policeforce; and diagnosing natural disasters with deep learning

12-03

StanCon 2018 Helsinki talk slides, notebooks and code online

12-03

StanCon 2018 Helsinki talk slides, notebooks and code online

12-03

Authority figures in psychology spread more happy talk, still don’t get the point that much of the published, celebrated, and publicized work in their field is no good (Part 2)

12-31

An Utility Function For Monotonic Binning

12-03

An Utility Function For Monotonic Binning

12-03

An Utility Function For Monotonic Binning

12-03

Ronin: Sr Machine Learning and AI Data Scientist [San Mateo, CA]

12-03

Statistics in Glaucoma: Part I

12-03

Day 03 – little helper multiplot

12-03

Day 03 – little helper multiplot

12-03

Day 22 – little helper get_files

12-22

Day 17 – little helper to_na

12-17

Day 15 – little helper sci_palette

12-15

Day 12 – little helper dive

12-12

Day 10 – little helper %nin%

12-10

Day 09 – little helper object_size_in_env

12-09

Day 08 – little helper intersect2

12-08

Day 06 – little helper statusbar

12-06

Day 03 – little helper multiplot

12-03

restez: Query GenBank locally

12-03

restez: Query GenBank locally

12-03

restez: Query GenBank locally

12-03

If you did not already know

12-03

If you did not already know

12-03

In which I demonstrate my ignorance of world literature

12-03

Rotagrams

01-06

In which I demonstrate my ignorance of world literature

12-03

In which I demonstrate my ignorance of world literature

12-03

The State of Data in Astronomy

12-03

The State of Data in Astronomy

12-03

The State of Data in Astronomy

12-03

The State of Data in Astronomy

12-03

2018 Volatility Recap

01-06

GARCH and a rudimentary application to Vol Trading

12-03

Data Notes: Malaria Detection with FastAI

01-03

Deep Learning and Medical Image Analysis with Keras

12-03

Why Primary Research?

12-04

Bayes, statistics, and reproducibility: “Many serious problems with statistics in practice arise from Bayesian inference that is not Bayesian enough, or frequentist evaluation that is not frequentist enough, in both cases using replication distributions that do not make scientific sense or do not reflect the actual procedures being performed on the data.”

12-04

Bayes, statistics, and reproducibility: “Many serious problems with statistics in practice arise from Bayesian inference that is not Bayesian enough, or frequentist evaluation that is not frequentist enough, in both cases using replication distributions that do not make scientific sense or do not reflect the actual procedures being performed on the data.”

12-04

Bayes, statistics, and reproducibility: “Many serious problems with statistics in practice arise from Bayesian inference that is not Bayesian enough, or frequentist evaluation that is not frequentist enough, in both cases using replication distributions that do not make scientific sense or do not reflect the actual procedures being performed on the data.”

12-04

Bayes, statistics, and reproducibility: “Many serious problems with statistics in practice arise from Bayesian inference that is not Bayesian enough, or frequentist evaluation that is not frequentist enough, in both cases using replication distributions that do not make scientific sense or do not reflect the actual procedures being performed on the data.”

12-04

Bayesian Nonparametric Models in NIMBLE, Part 1: Density Estimation

12-04

Detecting spatiotemporal groups in relocation data with spatsoc

12-04

Detecting spatiotemporal groups in relocation data with spatsoc

12-04

Detecting spatiotemporal groups in relocation data with spatsoc

12-04

Detecting spatiotemporal groups in relocation data with spatsoc

12-04

Why Machine Learning Interpretability Matters

12-04

Handling Imbalanced Datasets in Deep Learning

12-04

The Quick Python Book

12-05

Handling Imbalanced Datasets in Deep Learning

12-04

Day 04 – little helper evenstrings

12-04

Day 04 – little helper evenstrings

12-04

Starspace for NLP

12-04

Starspace for NLP

12-04

Document worth reading: “A Review of Theoretical and Practical Challenges of Trusted Autonomy in Big Data”

12-04

Document worth reading: “A Review of Theoretical and Practical Challenges of Trusted Autonomy in Big Data”

12-04

Deep learning in Satellite imagery

12-26

Deep learning in Satellite imagery

12-04

Deep learning in Satellite imagery

12-26

Deep learning in Satellite imagery

12-04

Data Science Projects Employers Want To See: How To Show A Business Impact

12-04

Document worth reading: “The Dynamics of Learning: A Random Matrix Approach”

12-04

Document worth reading: “The Dynamics of Learning: A Random Matrix Approach”

12-04

Document worth reading: “The Dynamics of Learning: A Random Matrix Approach”

12-04

Upcoming Meetings in AI, Analytics, Big Data, Data Science, Deep Learning, Machine Learning: December and Beyond

12-04

R Packages worth a look

12-04

Heatmaps of Mortality Rates

12-04

rnoaa: new data sources and NCDC units

12-04

rnoaa: new data sources and NCDC units

12-04

rnoaa: new data sources and NCDC units

12-04

If you did not already know

12-04

If you did not already know

12-04

“Statistical insights into public opinion and politics” (my talk for the Columbia Data Science Society this Wed 9pm)

12-04

“Statistical insights into public opinion and politics” (my talk for the Columbia Data Science Society this Wed 9pm)

12-04

Niall Ferguson and the perils of playing to your audience

12-05

How to build a data science project from scratch

12-05

Gender Diversity in the R and Python Communities

12-05

Gender Diversity in the R and Python Communities

12-05

Announcing the Winners of the 2018 AWS AI Hackathon

12-05

Document worth reading: “A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition”

12-05

Document worth reading: “A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition”

12-05

Learn to do Data Viz in R

12-05

Learn to do Data Viz in R

12-05

Magister Dixit

12-05

My Self-Driving Presentation for TTS

12-05

Creating Tables Using R and Pure HTML

12-05

The Quick Python Book

12-05

The Quick Python Book

12-05

Extract data from a PNG/TIFF

12-05

Extract data from a PNG/TIFF

12-05

Extract data from a PNG/TIFF

12-05

Extract data from a PNG/TIFF

12-05

Automated Dashboard visualizations with Deviation in R

12-06

Automated Dashboard with various correlation visualizations in R

12-05

Automated Dashboard with various correlation visualizations in R

12-05

KDnuggets™ News 18:n48, Dec 19: Why You Shouldn’t be a Data Science Generalist; Industry Data Science & Machine Learning 2019 Predictions

12-19

KDnuggets™ News 18:n46, Dec 5: AI, Data Science, Analytics 2018 Main Developments, 2019 Key Trends; Deep Learning Cheat Sheets

12-05

ROCK 'n' ROLL TRAFFIC ROUTING, WITH NEO4J, PART 2

12-05

ROCK 'n' ROLL TRAFFIC ROUTING, WITH NEO4J

12-05

ROCK 'n' ROLL TRAFFIC ROUTING, WITH NEO4J

12-05

Day 05 – little helper get_network

12-05

Day 05 – little helper get_network

12-05

ggQC | ggplot Quality Control Charts – New Release

12-05

Distilled News

12-05

ROCK 'n' ROLL TRAFFIC ROUTING, WITH NEO4J, PART 2

12-05

Debiasing Approximate Inference

12-05

Debiasing Approximate Inference

12-05

Debiasing Approximate Inference

12-05

6 Step Plan to Starting Your Data Science Career

12-05

Community Call – Governance strategies for open source research software projects

12-05

Community Call – Governance strategies for open source research software projects

12-05

R Packages worth a look

12-05

Building Gene Expression Atlases with Deep Generative Models for Single-cell Transcriptomics

12-05

Building Gene Expression Atlases with Deep Generative Models for Single-cell Transcriptomics

12-05

The JapanR Conference 2018 Round-Up!

12-06

The JapanR Conference 2018 Round-Up!

12-06

Bitcoin and Taxes: What You May Not Know

12-06

Bitcoin and Taxes: What You May Not Know

12-06

Bitcoin and Taxes: What You May Not Know

12-06

Explainable Artificial Intelligence (Part 2) – Model Interpretation Strategies

12-06

R Packages worth a look

12-06

R Packages worth a look

12-06

R Packages worth a look

12-06

R Packages worth a look

12-06

R Packages worth a look

12-06

A parable regarding changing standards on the presentation of statistical evidence

12-06

A parable regarding changing standards on the presentation of statistical evidence

12-06

Distilled News

12-06

Distilled News

12-06

Distilled News

12-06

Automated Dashboard visualizations with Deviation in R

12-06

Four Techniques for Outlier Detection

12-06

Must-Have Resources to Become a Data Scientist

12-06

Day 06 – little helper statusbar

12-06

Build a serverless Twitter reader using AWS Fargate

12-06

An Intro to Deep Learning in Python

12-06

Designing Turbofan Tycoon

12-06

Intuition for principal component analysis (PCA)

12-06

Intuition for principal component analysis (PCA)

12-06

Intuition for principal component analysis (PCA)

12-06

Document worth reading: “Learning to Reason”

12-22

If you did not already know

12-06

If you did not already know

12-06

Running an R script on heroku

12-06

Teaching and Learning Materials for Data Visualization

12-12

Running an R script on heroku

12-06

Reduced privacy risk in exchange for accuracy in the Census count

12-06

Reduced privacy risk in exchange for accuracy in the Census count

12-06

Einops — a new style of deep learning code

12-06

DATAx Presents: AI AND MACHINE LEARNING TRENDS IN 2019

12-06

DATAx Presents: AI AND MACHINE LEARNING TRENDS IN 2019

12-06

Automated Dashboard Visualizations with Ranking in R

12-07

Automated Dashboard Visualizations with Ranking in R

12-07

Take a Look at Looker, Demo/Webinar Dec 13

12-07

Take a Look at Looker, Demo/Webinar Dec 13

12-07

Take a Look at Looker, Demo/Webinar Dec 13

12-07

Bayesian Nonparametric Models in NIMBLE, Part 2: Nonparametric Random Effects

12-07

Bayesian Nonparametric Models in NIMBLE, Part 2: Nonparametric Random Effects

12-07

Bayesian Nonparametric Models in NIMBLE, Part 2: Nonparametric Random Effects

12-07

Day 20 – little helper char_replace

12-20

Day 07 – little helper count_na

12-07

Distilled News

12-07

If you did not already know

12-07

“Increase sample size until statistical significance is reached” is not a valid adaptive trial design; but it’s fixable.

12-07

“Increase sample size until statistical significance is reached” is not a valid adaptive trial design; but it’s fixable.

12-07

Reusable Pipelines in R

12-13

Reusable Pipelines in R

12-13

“Increase sample size until statistical significance is reached” is not a valid adaptive trial design; but it’s fixable.

12-07

Babe Didrikson Zaharias (2) vs. Adam Schiff; Sid Caesar advances

01-10

Latour Sokal NYT

12-07

Latour Sokal NYT

12-07

Latour Sokal NYT

12-07

Latour Sokal NYT

12-07

A comprehensive list of Machine Learning Resources: Open Courses, Textbooks, Tutorials, Cheat Sheets and more

12-07

A comprehensive list of Machine Learning Resources: Open Courses, Textbooks, Tutorials, Cheat Sheets and more

12-07

R community update: announcing useR Delhi December meetup and CFP

12-07

R community update: announcing sessions for useR Delhi December meetup

12-13

R community update: announcing useR Delhi December meetup and CFP

12-07

Document worth reading: “Marketing Analytics: Methods, Practice, Implementation, and Links to Other Fields”

12-07

Day 22 – little helper get_files

12-22

Day 15 – little helper sci_palette

12-15

Day 14 – little helper print_fs

12-14

Day 12 – little helper dive

12-12

Day 08 – little helper intersect2

12-08

confint3: 2-Sided Confidence Interval (Extended Moodle Version)

12-08

R Packages worth a look

12-08

R Packages worth a look

12-08

If you did not already know

12-08

If you did not already know

12-08

Dr. Data Show Video: Five Reasons Computers Predict When You’ll Die

12-08

Document worth reading: “A Theory of Diagnostic Interpretation in Supervised Classification”

12-08

R Packages worth a look

12-08

My footnote about global warming

12-08

My footnote about global warming

12-08

Apache Drill 1.15.0 + sergeant 0.8.0 = pcapng Support, Proper Column Types & Mounds of New Metadata

01-02

My footnote about global warming

12-08

My footnote about global warming

12-08

My footnote about global warming

12-08

Timing Grouped Mean Calculation in R

12-08

My R take on Advent of Code – Day 1

12-17

Timing Grouped Mean Calculation in R

12-08

“Dissolving the Fermi Paradox”

01-05

Interactive panel EDA with 3 lines of code

12-09

Interesting packages taken from R/Pharma

12-09

Interesting packages taken from R/Pharma

12-09

Document worth reading: “What Do We Understand About Convolutional Networks”

12-09

Document worth reading: “What Do We Understand About Convolutional Networks”

12-09

Document worth reading: “What Do We Understand About Convolutional Networks”

12-09

An 8-hour course on R and Data Mining

12-09

An 8-hour course on R and Data Mining

12-09

Magister Dixit

12-09

Smartly select and mutate data frame columns, using dict

12-09

Smartly select and mutate data frame columns, using dict

12-09

R Packages worth a look

01-09

If you did not already know

12-09

Day 09 – little helper object_size_in_env

12-09

Canada Map

12-09

Canada Map

12-09

Should we be concerned about MRP estimates being used in later analyses? Maybe. I recommend checking using fake-data simulation.

12-09

Should we be concerned about MRP estimates being used in later analyses? Maybe. I recommend checking using fake-data simulation.

12-09

ggmap Tutorial Updated!

12-10

ggmap Tutorial Updated!

12-10

ggmap Tutorial Updated!

12-10

ggmap Tutorial Updated!

12-10

Reflections on the 10th anniversary of the Revolutions blog

12-10

Reflections on the 10th anniversary of the Revolutions blog

12-10

Reflections on the 10th anniversary of the Revolutions blog

12-10

Reflections on the 10th anniversary of the Revolutions blog

12-10

Math for Machine Learning

01-04

Math for Machine Learning

12-10

The Need for Speed Part 2: C++ vs. Fortran vs. C

12-24

The Need for Speed Part 1: Building an R Package with Fortran (or C)

12-10

The ‘knight on an infinite chessboard’ puzzle: efficient simulation in R

12-10

R Packages worth a look

12-10

Keras – Save and Load Your Deep Learning Models

12-10

5½ Reasons to Ditch Spreadsheets for Data Science: Code is Poetry

12-10

covrpage, more information on unit testing

12-10

Failure Pressure Prediction Using Machine Learning

12-10

If you did not already know

12-11

Document worth reading: “A Short Introduction to Local Graph Clustering Methods and Software”

12-10

Distilled News

12-10

Personal Data Analytics

12-10

Personal Data Analytics

12-10

Document worth reading: “Can Machines Design An Artificial General Intelligence Approach”

12-10

Document worth reading: “Can Machines Design An Artificial General Intelligence Approach”

12-10

Document worth reading: “Can Machines Design An Artificial General Intelligence Approach”

12-10

Great post Yash!

12-10

Great post Yash!

12-10

Great post Yash!

12-10

Enter the

12-10

Enter the

12-10

Prior distributions for covariance matrices

12-10

Prior distributions for covariance matrices

12-10

Le Monde puzzle [#1075]

12-11

Introduction to Named Entity Recognition

12-11

Introduction to Named Entity Recognition

12-11

Introduction to Named Entity Recognition

12-11

Day 11 – little helper trim

12-11

Intuit: Staff Experimentation Data Scientist [Mountain View, CA]

12-12

Intuit: Staff Data Scientist – Business Analytics [Mountain View, CA]

12-12

Intuit: Staff Data Scientist [Woodland Hills, CA and Mountain View, CA]

12-11

Intuit: Staff Data Scientist [Mountain View, CA]

12-11

Document worth reading: “A Survey: Non-Orthogonal Multiple Access with Compressed Sensing Multiuser Detection for mMTC”

12-31

Document worth reading: “Taxonomy of Big Data: A Survey”

12-11

Document worth reading: “Taxonomy of Big Data: A Survey”

12-11

Distilled News

12-11

DB connected R application on open-source Shiny server, part 1

12-11

DB connected R application on open-source Shiny server, part 1

12-11

DB connected R application on open-source Shiny server, part 1

12-11

DB connected R application on open-source Shiny server, part 1

12-11

DB connected R application on open-source Shiny server, part 1

12-11

A Machine Learning Deep Dive [Webinar, Dec 13]

12-11

A Machine Learning Deep Dive [Webinar, Dec 13]

12-11

Sharing Modeling Pipelines in R

12-11

Sharing Modeling Pipelines in R

12-11

Sharing Modeling Pipelines in R

12-11

Sharing Modeling Pipelines in R

12-11

Sharing Modeling Pipelines in R

12-11

Sharing Modeling Pipelines in R

12-11

Advanced News API search: leveraging DBpedia entity types

12-11

How to give money to the R project

12-11

How to give money to the R project

12-11

How to give money to the R project

12-11

How to give money to the R project

12-11

When cycling is faster than driving

12-11

When cycling is faster than driving

12-11

R Packages worth a look

12-11

If you did not already know

12-11

Historic Wildfire Data: Exploratory Visualization in R

12-11

Historic Wildfire Data: Exploratory Visualization in R

12-11

Historic Wildfire Data: Exploratory Visualization in R

12-11

InformationAge: Will 2019 See the Automation of Automation and Push Up Salaries of Data Scientists?

12-11

Single-Income Occupations

12-12

Single-Income Occupations

12-12

Single-Income Occupations

12-12

Single-Income Occupations

12-12

KDnuggets™ News 18:n47, Dec 12: Common mistakes when doing machine learning; Here are the most popular Python IDEs / Editors

12-12

How to deploy a predictive service to Kubernetes with R and the AzureContainers package

12-12

How to deploy a predictive service to Kubernetes with R and the AzureContainers package

12-12

How to deploy a predictive service to Kubernetes with R and the AzureContainers package

12-12

How to deploy a predictive service to Kubernetes with R and the AzureContainers package

12-12

My introductory course on Bayesian statistics

12-12

My introductory course on Bayesian statistics

12-12

Exploring the Gender Pay Gap with Publicly Available Data

12-12

Reading List Faster With parallel, doParallel, and pbapply

12-12

Reading List Faster With parallel, doParallel, and pbapply

12-12

I Spy with my Graphing Eye 📊 👁️

12-12

Scaling Multi-Agent Reinforcement Learning

12-12

Scaling Multi-Agent Reinforcement Learning

12-12

Using ggplot2 for functional time series

12-12

Using ggplot2 for functional time series

12-12

Who is the greatest finisher in soccer?

01-10

Who is the greatest finisher in soccer?

01-10

Network Centrality in R: New ways of measuring Centrality

12-12

Network Centrality in R: New ways of measuring Centrality

12-12

Network Centrality in R: New ways of measuring Centrality

12-12

If you did not already know

12-12

Code for case study – Customer Churn with Keras/TensorFlow and H2O

12-12

Teaching and Learning Materials for Data Visualization

12-12

How to deploy a predictive service to Kubernetes with R and the AzureContainers package

12-12

NLP Overview: Modern Deep Learning Techniques Applied to Natural Language Processing

01-08

R Packages worth a look

12-12

R Packages worth a look

12-12

Intuit: Staff Experimentation Data Scientist [Mountain View, CA]

12-12

Intuit: Staff Experimentation Data Scientist [Mountain View, CA]

12-12

Document worth reading: “Computing the Unique Information”

12-12

Document worth reading: “Computing the Unique Information”

12-12

Showing a difference in means between two groups

01-13

Document worth reading: “Computing the Unique Information”

12-12

Time series of Democratic/Republican vote share in House elections

12-12

Time series of Democratic/Republican vote share in House elections

12-12

Time series of Democratic/Republican vote share in House elections

12-12

Cummins: Data Engineering Technical Specialist [Columbus, IN]

12-13

Cummins: Reliability Analytics Leader [Columbus, IN]

12-13

Cummins: Data Engineering Apps and Solutions Architect [Columbus, IN]

12-13

Cummins: Advanced Analytics Systems Architect Principle [Columbus, IN]

12-12

Cummins: Advanced Analytics Systems Architect Principle [Columbus, IN]

12-12

Keras Hyperparameter Tuning in Google Colab Using Hyperas

12-12

Keras Hyperparameter Tuning in Google Colab Using Hyperas

12-12

What's the future of the pandas library?

12-12

Four Approaches to Explaining AI and Machine Learning

12-12

Explainable Artificial Intelligence

01-10

Machine Learning Explainability vs Interpretability: Two concepts that could help restore trust in AI

12-20

Four Approaches to Explaining AI and Machine Learning

12-12

Explainable Artificial Intelligence

01-10

Machine Learning Explainability vs Interpretability: Two concepts that could help restore trust in AI

12-20

Four Approaches to Explaining AI and Machine Learning

12-12

Explainable Artificial Intelligence

01-10

Four Approaches to Explaining AI and Machine Learning

12-12

Day 13 – little helper read_files

12-13

Day 13 – little helper read_files

12-13

Apps gather your location and then sell the data

12-13

Apps gather your location and then sell the data

12-13

von Neumann Poker Analysis

12-13

von Neumann Poker Analysis

12-13

von Neumann Poker Analysis

12-13

Reusable Pipelines in R

12-13

Reusable Pipelines in R

12-13

Reusable Pipelines in R

12-13

Reusable Pipelines in R

12-13

WNS Hackathon Solutions by Top Finishers

12-13

Top KDnuggets tweets, Dec 5-11: How to build a data science project from scratch; NeurIPS 2018 video talk collection

12-13

Top KDnuggets tweets, Dec 5-11: How to build a data science project from scratch; NeurIPS 2018 video talk collection

12-13

Yet another visualization of the Bayesian Beta-Binomial model

12-13

Yet another visualization of the Bayesian Beta-Binomial model

12-13

Yet another visualization of the Bayesian Beta-Binomial model

12-13

Yet another visualization of the Bayesian Beta-Binomial model

12-13

Recreating the NBA lead tracker graphic

12-13

Document worth reading: “AI Reasoning Systems: PAC and Applied Methods”

12-13

Document worth reading: “AI Reasoning Systems: PAC and Applied Methods”

12-13

Document worth reading: “AI Reasoning Systems: PAC and Applied Methods”

12-13

Are you ready to tackle the data-driven revolution?

12-13

Are you ready to tackle the data-driven revolution?

12-13

R community update: announcing sessions for useR Delhi December meetup

12-13

R Packages worth a look

01-06

R Packages worth a look

12-13

Oh, I hate it when work is criticized (or, in this case, fails in attempted replications) and then the original researchers don’t even consider the possibility that maybe in their original work they were inadvertently just finding patterns in noise.

12-13

Oh, I hate it when work is criticized (or, in this case, fails in attempted replications) and then the original researchers don’t even consider the possibility that maybe in their original work they were inadvertently just finding patterns in noise.

12-13

Oh, I hate it when work is criticized (or, in this case, fails in attempted replications) and then the original researchers don’t even consider the possibility that maybe in their original work they were inadvertently just finding patterns in noise.

12-13

Oh, I hate it when work is criticized (or, in this case, fails in attempted replications) and then the original researchers don’t even consider the possibility that maybe in their original work they were inadvertently just finding patterns in noise.

12-13

RTutor: Better Incentive Contracts For Road Construction

12-13

RTutor: Better Incentive Contracts For Road Construction

12-13

RTutor: Better Incentive Contracts For Road Construction

12-13

RTutor: Better Incentive Contracts For Road Construction

12-13

Solve any Image Classification Problem Quickly and Easily

12-13

A couple of thoughts regarding the hot hand fallacy fallacy

12-14

A couple of thoughts regarding the hot hand fallacy fallacy

12-14

My book ‘Deep Learning from first principles:Second Edition’ now on Amazon

12-14

Implementing ResNet with MXNET Gluon and Comet.ml for Image Classification

12-14

Implementing ResNet with MXNET Gluon and Comet.ml for Image Classification

12-14

Implementing ResNet with MXNET Gluon and Comet.ml for Image Classification

12-14

Pdftools 2.0: powerful pdf text extraction tools

12-14

Pdftools 2.0: powerful pdf text extraction tools

12-14

5 amazing free tools that can help with publishing R results and blogging

12-22

Top Stories of 2018: 9 Must-have skills you need to become a Data Scientist, updated; Python eats away at R: Top Software for Analytics, Data Science, Machine Learning in 2018

12-14

LoyaltyOne: Consultant Category Manager / Analyst, Client Services [Westborough, MA]

12-17

LoyaltyOne: Associate Director, Client Services [Westborough, MA]

12-17

LoyaltyOne: Consultant Category Manager / Analyst, Client Services [Westborough, MA]

12-14

LoyaltyOne: Manager, CPG [Westborough, MA]

12-14

Document worth reading: “Small Sample Learning in Big Data Era”

12-14

Document worth reading: “Small Sample Learning in Big Data Era”

12-14

Distilled News

12-19

AI, Machine Learning and Data Science Roundup: December 2018

12-19

In case you missed it: November 2018 roundup

12-14

R Packages worth a look

12-14

R Packages worth a look

12-14

R Packages worth a look

12-14

R Packages worth a look

12-14

running plot [and simulated annealing]

12-14

running plot [and simulated annealing]

12-14

Learning R: A gentle introduction to higher-order functions

12-14

Day 14 – little helper print_fs

12-14

Day 14 – little helper print_fs

12-14

Gift ideas for the R lovers

12-14

Top Insights from 50 Chief Data Officers

12-14

Top Insights from 50 Chief Data Officers

12-14

Easy CI/CD of GPU applications on Google Cloud including bare-metal using Gitlab and Kubernetes

12-14

Easy CI/CD of GPU applications on Google Cloud including bare-metal using Gitlab and Kubernetes

12-14

Request for comments on planned features for futile.logger 1.5

12-15

Request for comments on planned features for futile.logger 1.5

12-15

Request for comments on planned features for futile.logger 1.5

12-15

Request for comments on planned features for futile.logger 1.5

12-15

Six Sigma DMAIC Series in R – Part4

12-15

If you did not already know

12-15

If you did not already know

12-15

If you did not already know

12-15

Advent of Code: Most Popular Languages

12-15

Manipulate dates easily with {lubridate}

12-15

Manipulate dates easily with {lubridate}

12-15

Manipulate dates easily with {lubridate}

12-15

Neural Ordinary Differential Equations

12-15

Neural Ordinary Differential Equations

12-15

Data Scientist’s Dilemma – The Cold Start Problem

12-15

RStudio Pandoc – HTML To Markdown

12-15

RStudio Pandoc – HTML To Markdown

12-15

RStudio Pandoc – HTML To Markdown

12-15

linl 0.0.3: Micro release

12-15

Minimum CRPS vs. maximum likelihood

12-16

Minimum CRPS vs. maximum likelihood

12-16

Minimum CRPS vs. maximum likelihood

12-16

Surprise-hacking: “the narrative of blindness and illusion sells, and therefore continues to be the central thesis of popular books written by psychologists and cognitive scientists”

12-16

Surprise-hacking: “the narrative of blindness and illusion sells, and therefore continues to be the central thesis of popular books written by psychologists and cognitive scientists”

12-16

Word associations from the Small World of Words

12-16

Word associations from the Small World of Words

12-16

Word associations from the Small World of Words

12-16

Word associations from the Small World of Words

12-16

Document worth reading: “Gaussian Processes and Kernel Methods: A Review on Connections and Equivalences”

12-16

Document worth reading: “Gaussian Processes and Kernel Methods: A Review on Connections and Equivalences”

12-16

Document worth reading: “Gaussian Processes and Kernel Methods: A Review on Connections and Equivalences”

12-16

R Packages worth a look

12-16

R Packages worth a look

12-16

2018-13 Rendering HTML Content in R Graphics

12-16

2018-13 Rendering HTML Content in R Graphics

12-16

Document worth reading: “Coupled Ensembles of Neural Networks”

12-16

Document worth reading: “Coupled Ensembles of Neural Networks”

12-16

Document worth reading: “Coupled Ensembles of Neural Networks”

12-16

Document worth reading: “Coupled Ensembles of Neural Networks”

12-16

How AI Will Change Brick-and-Mortar Retail in 2019

12-26

LoyaltyOne: Associate Director, CPG [Westborough, MA]

12-17

Why do sociologists (and bloggers) focus on the negative? 5 possible explanations. (A post in the style of Fabio Rojas)

12-17

Why do sociologists (and bloggers) focus on the negative? 5 possible explanations. (A post in the style of Fabio Rojas)

12-17

An R Shiny app to recognize flower species

12-17

Phillips-Ouliaris Test For Cointegration

12-17

Phillips-Ouliaris Test For Cointegration

12-17

Vanguard: Senior AI Engineer [Malvern, PA]

12-17

Whats new on arXiv

12-17

The 2019 PAW Business Agenda is Live – Super Early Bird expires this Friday

12-17

Day 17 – little helper to_na

12-17

Image Stitching with OpenCV and Python

12-17

Image Stitching with OpenCV and Python

12-17

Image Stitching with OpenCV and Python

12-17

KDnuggets™ News 19:n02, Jan 9: The cold start problem: how to build your machine learning portfolio; 5 Best Data Visualization Libraries

01-09

Meta-Learning For Better Machine Learning

12-17

R Packages worth a look

12-17

R Packages worth a look

12-17

R Packages worth a look

12-17

eBook: An Introduction to Active Learning

12-17

eBook: An Introduction to Active Learning

12-17

Introduction to Statistics for Data Science

12-17

Document worth reading: “Learning to Reason”

12-22

Introduction to Statistics for Data Science

12-17

If you did not already know

12-17

Sudoku Solver

12-30

My R take on Advent of Code – Day 1

12-17

Scalable multi-node training with TensorFlow

12-17

If you did not already know

12-17

Will Julia Replace Python and R for Data Science?

12-26

If you did not already know

12-17

2018 Year-in-Review: Machine Learning Open Source Projects & Frameworks

12-17

New public course on Successfully Delivering Data Science Projects for Feb 1st

12-18

All the (NBA) box scores you ever wanted

12-18

All the (NBA) box scores you ever wanted

12-18

Magister Dixit

12-18

Magister Dixit

12-18

vtreat Variable Importance

12-18

vtreat Variable Importance

12-18

vtreat Variable Importance

12-18

vtreat Variable Importance

12-18

If you did not already know

12-18

If you did not already know

12-18

If you did not already know

12-18

If you did not already know

12-18

If you did not already know

12-18

Exploring the Data Jungle Free eBook

12-18

Comparing racism from different eras: If only Tucker Carlson had been around in the 1950s he could’ve been a New York Intellectual.

12-18

Comparing racism from different eras: If only Tucker Carlson had been around in the 1950s he could’ve been a New York Intellectual.

12-18

Comparing racism from different eras: If only Tucker Carlson had been around in the 1950s he could’ve been a New York Intellectual.

12-18

Comparing racism from different eras: If only Tucker Carlson had been around in the 1950s he could’ve been a New York Intellectual.

12-18

Comparing racism from different eras: If only Tucker Carlson had been around in the 1950s he could’ve been a New York Intellectual.

12-18

Modern reproduction of 1847 geometry books

12-18

Modern reproduction of 1847 geometry books

12-18

Modern reproduction of 1847 geometry books

12-18

Document worth reading: “Learning to Reason”

12-22

Modern reproduction of 1847 geometry books

12-18

Statistics in Glaucoma: Part III

12-18

How will automation tools change data science?

12-18

University of Rhode Island: Data Scientist, DataSpark (2 Positions) [Kingston, RI]

12-18

Classifying yin and yang using MRI

12-18

Classifying yin and yang using MRI

12-18

Document worth reading: “A Study and Comparison of Human and Deep Learning Recognition Performance Under Visual Distortions”

12-18

So you want to play a pRank in R…?

12-18

So you want to play a pRank in R…?

12-18

Document worth reading: “Are screening methods useful in feature selection? An empirical study”

12-18

R Packages worth a look

12-18

Analyzing contact center calls—Part 1: Use Amazon Transcribe and Amazon Comprehend to analyze customer sentiment

12-18

Heavy Tailed Self Regularization in Deep Neural Nets: 1 year of research

12-18

Heavy Tailed Self Regularization in Deep Neural Nets: 1 year of research

12-18

Data, movies and ggplot2

12-19

Data, movies and ggplot2

12-19

Rotary

12-19

Rotary

12-19

Rotary

12-19

When “nudge” doesn’t work: Medication Reminders to Outcomes After Myocardial Infarction

12-19

When “nudge” doesn’t work: Medication Reminders to Outcomes After Myocardial Infarction

12-19

When “nudge” doesn’t work: Medication Reminders to Outcomes After Myocardial Infarction

12-19

When “nudge” doesn’t work: Medication Reminders to Outcomes After Myocardial Infarction

12-19

Think Twice Before You Accept That Fancy Data Science Job

12-19

FAQ on ICML 2019 Code Submission Policy

12-19

FAQ on ICML 2019 Code Submission Policy

12-19

The Netflix Data War

12-19

Optimal Picture Viewing Distance

12-19

Optimal Picture Viewing Distance

12-19

Optimal Picture Viewing Distance

12-19

Document worth reading: “Mobile big data analysis with machine learning”

12-19

Document worth reading: “Mobile big data analysis with machine learning”

12-19

Document worth reading: “Mobile big data analysis with machine learning”

12-19

Top KDnuggets tweets, Dec 12-18: Deep Learning Cheat Sheets; The Nate Silver vs. Nassim Taleb Twitter War

12-19

Kent State University: Assistant/Associate Professor – Business Analytics/Information Systems [Kent, OH]

12-19

4 Strategies to Deal With Large Datasets Using Pandas

12-19

4 Strategies to Deal With Large Datasets Using Pandas

12-19

Dataiku Series C: New Year, New Chapter

12-19

Dataiku Series C: New Year, New Chapter

12-19

The brain as a neural network: this is why we can’t get along

12-19

The brain as a neural network: this is why we can’t get along

12-19

The importance of Data Analytics skills in today’s MBA roles

12-19

If you did not already know

12-20

If you did not already know

12-20

Day 20 – little helper char_replace

12-20

Day 20 – little helper char_replace

12-20

Easily train models using datasets labeled by Amazon SageMaker Ground Truth

12-20

Amazon SageMaker adds Scikit-Learn support

12-20

Amazon SageMaker adds Scikit-Learn support

12-20

Document worth reading: “A second-quantised Shannon theory”

12-20

Exploring model fit by looking at a histogram of a posterior simulation draw of a set of parameters in a hierarchical model

12-20

Exploring model fit by looking at a histogram of a posterior simulation draw of a set of parameters in a hierarchical model

12-20

Exploring model fit by looking at a histogram of a posterior simulation draw of a set of parameters in a hierarchical model

12-20

Exploring model fit by looking at a histogram of a posterior simulation draw of a set of parameters in a hierarchical model

12-20

Miami University: Director of the Center for Analytics & Data Science (CADS) [Oxford, OH]

12-20

Examining the Tweeting Patterns of Prominent Crossfit Gyms

12-20

Examining the Tweeting Patterns of Prominent Crossfit Gyms

12-20

Very shiny holidays!

12-26

Your AI journey… and Happy Holidays!

12-20

Your AI journey… and Happy Holidays!

12-20

The Key to Getting a Data Science Job, According to Briana Brownell

12-20

The Key to Getting a Data Science Job, According to Briana Brownell

12-20

The Key to Getting a Data Science Job, According to Briana Brownell

12-20

✚ Repetitions, Data Analysis as Brainstorm

01-10

✚ Avoiding D3, Using D3, and Why I Use D3

01-03

✚ Tufte Tweet Follow-up; Visualization Tools and Resources Roundup for December 2018

12-20

✚ Repetitions, Data Analysis as Brainstorm

01-10

✚ Avoiding D3, Using D3, and Why I Use D3

01-03

✚ Tufte Tweet Follow-up; Visualization Tools and Resources Roundup for December 2018

12-20

✚ Repetitions, Data Analysis as Brainstorm

01-10

✚ Avoiding D3, Using D3, and Why I Use D3

01-03

✚ Tufte Tweet Follow-up; Visualization Tools and Resources Roundup for December 2018

12-20

BH 1.69.0-1 on CRAN

01-07

BH 1.69.0-0 pre-releases and three required changes

12-20

BH 1.69.0-0 pre-releases and three required changes

12-20

How to Scrape Data from a JavaScript Website with R

12-20

How to Scrape Data from a JavaScript Website with R

12-20

How to Scrape Data from a JavaScript Website with R

12-20

How to Scrape Data from a JavaScript Website with R

12-20

Spelling 2.0: Improved Markdown and RStudio Support

12-20

Spelling 2.0: Improved Markdown and RStudio Support

12-20

Spelling 2.0: Improved Markdown and RStudio Support

12-20

Explainable Artificial Intelligence

01-10

Machine Learning Explainability vs Interpretability: Two concepts that could help restore trust in AI

12-20

Transcribe speech in three new languages: French, Italian, and Brazilian Portuguese

12-21

Transcribe speech in three new languages: French, Italian, and Brazilian Portuguese

12-21

Six Steps to Master Machine Learning with Data Preparation

12-21

Considering sensitivity to unmeasured confounding: part 2

01-10

Six Steps to Master Machine Learning with Data Preparation

12-21

The causal hype ratchet

12-21

The causal hype ratchet

12-21

The causal hype ratchet

12-21

The causal hype ratchet

12-21

R Packages worth a look

12-21

R Packages worth a look

12-21

R Packages worth a look

12-21

Top 10 Data Science Tools (other than SQL Python R)

12-21

If you did not already know

12-21

If you did not already know

12-21

If you did not already know

12-21

Blogdown – shortcode for radix-like Bibtex

12-21

Blogdown – shortcode for radix-like Bibtex

12-21

Blogdown – shortcode for radix-like Bibtex

12-21

Using the tidyverse for more than data manipulation: estimating pi with Monte Carlo methods

12-21

If you did not already know

12-21

The Riddler: Santa Needs Some Help With Math

12-22

The Riddler: Santa Needs Some Help With Math

12-22

The Riddler: Santa Needs Some Help With Math

12-22

The Christmas Eve Selloff was a Classic Capitulation

12-27

The Bear is Here

12-22

The Bear is Here

12-22

The Bear is Here

12-22

The Bear is Here

12-22

The Christmas Eve Selloff was a Classic Capitulation

12-27

The Bear is Here

12-22

The Bear is Here

12-22

Re-creating a Voronoi-Style Map with R

12-22

Re-creating a Voronoi-Style Map with R

12-22

R Packages worth a look

12-22

Bubble Packed Chart with R using packcircles package

12-22

Bubble Packed Chart with R using packcircles package

12-22

pinp 0.0.7: More small YAML options

01-11

Add a static pdf vignette to an R package

01-11

AzureR packages now on CRAN

01-08

AzureR packages now on CRAN

01-08

Bubble Packed Chart with R using packcircles package

12-22

5 amazing free tools that can help with publishing R results and blogging

12-22

5 amazing free tools that can help with publishing R results and blogging

12-22

5 amazing free tools that can help with publishing R results and blogging

12-22

Document worth reading: “Learning to Reason”

12-22

Simulating Persian Monarchs gameplay by @ellis2013nz

12-22

Simulating Persian Monarchs gameplay by @ellis2013nz

12-22

Simulating Persian Monarchs gameplay by @ellis2013nz

12-22

“When Both Men and Women Drop Out of the Labor Force, Why Do Economists Only Ask About Men?”

12-23

“When Both Men and Women Drop Out of the Labor Force, Why Do Economists Only Ask About Men?”

12-23

Certifiably Gone Phishing

12-23

Certifiably Gone Phishing

12-23

If you did not already know

12-23

If you did not already know

12-23

If you did not already know

12-23

R Packages worth a look

12-23

ShinyProxy Christmas Release

12-23

ShinyProxy Christmas Release

12-23

R Packages worth a look

12-23

R Packages worth a look

12-23

R Packages worth a look

12-23

R Packages worth a look

12-23

How Miguel Got 3 Data Science Job Offers Fast With Dataquest

12-24

Interspeech 2018: Highlights for Data Scientists

12-24

Interspeech 2018: Highlights for Data Scientists

12-24

Pivot Billions and Deep Learning enhanced trading models achieve 100% net profit

12-24

R 101

12-24

R 101

12-24

R 101

12-24

Twas the Night Before Analysis or A Visit from the Chief Data Scientist

12-24

Twas the Night Before Analysis or A Visit from the Chief Data Scientist

12-24

June is applied regression exam month!

12-24

The most practical causal inference book I’ve read (is still a draft)

12-24

The most practical causal inference book I’ve read (is still a draft)

12-24

University of Virginia: Faculty, Open Rank Model and Simulation at the Human-Technology Frontier [Charlottesville, VA]

12-24

Dreaming of a white Christmas – with ggmap in R

12-24

Objects types and some useful R functions for beginners

12-24

4 Reasons Santa Needs Machine Learning & AI

12-24

If you did not already know

12-24

Magister Dixit

12-24

“Thus, a loss aversion principle is rendered superfluous to an account of the phenomena it was introduced to explain.”

12-25

“Thus, a loss aversion principle is rendered superfluous to an account of the phenomena it was introduced to explain.”

12-25

At Year's End: 2018

12-25

At Year's End: 2018

12-25

Very shiny holidays!

12-26

Very shiny holidays!

12-26

Very shiny holidays!

12-26

R Packages worth a look

12-26

Finally, You Can Plot H2O Decision Trees in R

12-26

Finally, You Can Plot H2O Decision Trees in R

12-26

Le Monde puzzle [#1076]

12-26

Le Monde puzzle [#1076]

12-26

Le Monde puzzle [#1076]

12-26

Document worth reading: “A Survey of Knowledge Representation and Retrieval for Learning in Service Robotics”

12-26

Miami University: Assistant Provost for Institutional Research and Effectiveness [Oxford, OH]

12-26

Statistical Assessments of AUC

12-26

If you did not already know

12-26

Following your gut, following the data

12-26

Following your gut, following the data

12-26

Following your gut, following the data

12-26

Following your gut, following the data

12-26

Part 4: Why does bias occur in optimism corrected bootstrapping?

12-28

Part 3: Two more implementations of optimism corrected bootstrapping show shocking bias

12-27

Part 2: Optimism corrected bootstrapping is definitely bias, further evidence

12-26

Part 2: Optimism corrected bootstrapping is definitely bias, further evidence

12-26

How AI Will Change Brick-and-Mortar Retail in 2019

12-26

The Christmas Eve Selloff was a Classic Capitulation

12-27

The Christmas Eve Selloff was a Classic Capitulation

12-27

Part 3: Two more implementations of optimism corrected bootstrapping show shocking bias

12-27

Who is a Data Scientist?

12-27

Document worth reading: “Computational Power and the Social Impact of Artificial Intelligence”

12-27

The Christmas Eve Selloff was a Classic Capitulation

12-27

The Christmas Eve Selloff was a Classic Capitulation

12-27

The Christmas Eve Selloff was a Classic Capitulation

12-27

The Christmas Eve Selloff was a Classic Capitulation

12-27

Best Data Visualization Projects of 2018

12-27

Best Data Visualization Projects of 2018

12-27

If you did not already know

12-27

A Case For Explainable AI & Machine Learning

12-27

A Case For Explainable AI & Machine Learning

12-27

French Mortality Poster

12-27

Clustering the Bible

12-27

Clustering the Bible

12-27

Clustering the Bible

12-27

Document worth reading: “The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers”

12-27

Document worth reading: “The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers”

12-27

Document worth reading: “The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers”

12-27

Christmas elves puzzle

12-27

Christmas elves puzzle

12-27

Document worth reading: “Generalization in Machine Learning via Analytical Learning Theory”

12-28

Document worth reading: “Generalization in Machine Learning via Analytical Learning Theory”

12-28

Document worth reading: “Generalization in Machine Learning via Analytical Learning Theory”

12-28

Document worth reading: “Generalization in Machine Learning via Analytical Learning Theory”

12-28

Document worth reading: “Generalization in Machine Learning via Analytical Learning Theory”

12-28

Use AWS Machine Learning to Analyze Customer Calls from Contact Centers (Part 2): Automate, Deploy, and Visualize Analytics using Amazon Transcribe, Amazon Comprehend, AWS CloudFormation, and Amazon QuickSight

12-28

Use AWS Machine Learning to Analyze Customer Calls from Contact Centers (Part 2): Automate, Deploy, and Visualize Analytics using Amazon Transcribe, Amazon Comprehend, AWS CloudFormation, and Amazon QuickSight

12-28

Comparison of the Top Speech Processing APIs

12-28

R Packages worth a look

12-28

R Packages worth a look

12-28

Fine-tuning for Natural Language Processing

12-28

Fine-tuning for Natural Language Processing

12-28

My

12-28

My

12-28

R Packages worth a look

12-28

My R Take on Advent of Code – Day 3

12-28

My R Take on Advent of Code – Day 3

12-28

My R Take on Advent of Code – Day 3

12-28

Supervised Learning: Model Popularity from Past to Present

12-28

Supervised Learning: Model Popularity from Past to Present

12-28

Hackers beware: Bootstrap sampling may be harmful

01-07

Part 5: Code corrections to optimism corrected bootstrapping series

12-29

Part 4: Why does bias occur in optimism corrected bootstrapping?

12-28

Using emojis as scatterplot points

12-28

Using emojis as scatterplot points

12-28

Using emojis as scatterplot points

12-28

Manning Countdown to 2019 – Big Deals on AI, Data Science, Machine Learning books and videos

12-28

Manning Countdown to 2019 – Big Deals on AI, Data Science, Machine Learning books and videos

12-28

Deep Learning for Media Content

12-28

Document worth reading: “Learnable: Theory vs Applications”

12-28

Document worth reading: “Learnable: Theory vs Applications”

12-28

Part 5: Code corrections to optimism corrected bootstrapping series

12-29

Part 5: Code corrections to optimism corrected bootstrapping series

12-29

R Packages worth a look

12-29

If you did not already know

12-29

“Check yourself before you wreck yourself: Assessing discrete choice models through predictive simulations”

12-29

“Check yourself before you wreck yourself: Assessing discrete choice models through predictive simulations”

12-29

“Check yourself before you wreck yourself: Assessing discrete choice models through predictive simulations”

12-29

“Check yourself before you wreck yourself: Assessing discrete choice models through predictive simulations”

12-29

Leaf Plant Classification: Statistical Learning Model – Part 2

12-31

Leaf Plant Classification: An Exploratory Analysis – Part 1

12-29

Leaf Plant Classification: Statistical Learning Model – Part 2

12-31

Leaf Plant Classification: An Exploratory Analysis – Part 1

12-29

Leaf Plant Classification: An Exploratory Analysis – Part 1

12-29

Combining apparently contradictory evidence

12-30

Combining apparently contradictory evidence

12-30

If you did not already know

12-30

If you did not already know

12-30

Distilled News

12-30

If you did not already know

12-30

Sudoku Solver

12-30

This dance, it’s like a weapon: Radiohead’s and Beck’s danceability, valence, popularity, and more from the LastFM and Spotify APIs

12-30

This dance, it’s like a weapon: Radiohead’s and Beck’s danceability, valence, popularity, and more from the LastFM and Spotify APIs

12-30

This dance, it’s like a weapon: Radiohead’s and Beck’s danceability, valence, popularity, and more from the LastFM and Spotify APIs

12-30

This dance, it’s like a weapon: Radiohead’s and Beck’s danceability, valence, popularity, and more from the LastFM and Spotify APIs

12-30

This dance, it’s like a weapon: Radiohead’s and Beck’s danceability, valence, popularity, and more from the LastFM and Spotify APIs

12-30

Good Feature Building Techniques and Tricks for Kaggle

12-31

Papers with Code: A Fantastic GitHub Resource for Machine Learning

12-31

Papers with Code: A Fantastic GitHub Resource for Machine Learning

12-31

Center for Ultrasound Research and Translation, Massachusetts General Hospital: Post-Doctoral Scholar / Research Scientist [Boston, MA]

12-31

Exploring 2018 R-bloggers & R Weekly Posts with Feedly & the ‘seymour’ package

12-31

Exploring 2018 R-bloggers & R Weekly Posts with Feedly & the ‘seymour’ package

12-31

Exploring 2018 R-bloggers & R Weekly Posts with Feedly & the ‘seymour’ package

12-31

Keras Conv2D and Convolutional Layers

12-31

2018.

12-31

Document worth reading: “A Survey: Non-Orthogonal Multiple Access with Compressed Sensing Multiuser Detection for mMTC”

12-31

New Year's Resolution: Help Data Scientists Help You

12-31

Authority figures in psychology spread more happy talk, still don’t get the point that much of the published, celebrated, and publicized work in their field is no good (Part 2)

12-31

Authority figures in psychology spread more happy talk, still don’t get the point that much of the published, celebrated, and publicized work in their field is no good (Part 2)

12-31

Introducing RcppDynProg

12-31

Introducing RcppDynProg

12-31

Introducing RcppDynProg

12-31

Introducing RcppDynProg

12-31

Introducing RcppDynProg

12-31

Introducing RcppDynProg

12-31

R Packages worth a look

12-31

Silent Duels and an Old Paper of Restrepo

12-31

R Packages worth a look

01-01

R Packages worth a look

01-01

R Packages worth a look

01-01

R Packages worth a look

01-01

R Packages worth a look

01-01

Nimble tweak to use specific python version or virtual environment in RStudio

01-01

Simulating Multi-state Models with R

01-01

New Year's Resolutions 2019

01-01

Seeing the wood for the trees

01-01

Seeing the wood for the trees

01-01

Your and my 2019 R goals

01-01

“Principles of posterior visualization”

01-01

How to Meet Your New Years Resolutions in 2019 (Udemy Coupons $9.99)

01-01

How to Meet Your New Years Resolutions in 2019 (Udemy Coupons $9.99)

01-01

Advanced Jupyter Notebooks: A Tutorial

01-02

Dataviz Course Packet Quickstart

01-02

Dataviz Course Packet Quickstart

01-02

Dataviz Course Packet Quickstart

01-02

What to do when you read a paper and it’s full of errors and the author won’t share the data or be open about the analysis?

01-02

Entering and Exiting 2018

01-02

Entering and Exiting 2018

01-02

Magister Dixit

01-02

Magister Dixit

01-02

Apache Drill 1.15.0 + sergeant 0.8.0 = pcapng Support, Proper Column Types & Mounds of New Metadata

01-02

Apache Drill 1.15.0 + sergeant 0.8.0 = pcapng Support, Proper Column Types & Mounds of New Metadata

01-02

Apache Drill 1.15.0 + sergeant 0.8.0 = pcapng Support, Proper Column Types & Mounds of New Metadata

01-02

Why Learning Data Science Live is Better than Self-Paced Learning

01-02

Considering sensitivity to unmeasured confounding: part 1

01-02

3 More Google Colab Environment Management Tips

01-02

3 More Google Colab Environment Management Tips

01-02

How to Learn Python in 30 days

01-12

How to Learn Python in 30 days

01-02

Document worth reading: “A Review for Weighted MinHash Algorithms”

01-02

Document worth reading: “A Review for Weighted MinHash Algorithms”

01-02

Document worth reading: “Recommendation System based on Semantic Scholar Mining and Topic modeling: A behavioral analysis of researchers from six conferences”

01-03

gganimate has transitioned to a state of release

01-03

Applying for a PhD program in visualization

01-03

Applying for a PhD program in visualization

01-03

Applying for a PhD program in visualization

01-03

x-mas tRees with gganimate, ggplot, plotly and friends

01-03

R Packages worth a look

01-03

R Packages worth a look

01-03

R Packages worth a look

01-03

Icon making with ggplot2 and magick

01-03

Ensemble Learning: 5 Main Approaches

01-03

If you did not already know

01-03

‘data:’ Scraping & Chart Reproduction : Arrows of Environmental Destruction

01-03

‘data:’ Scraping & Chart Reproduction : Arrows of Environmental Destruction

01-03

Approaches to Text Summarization: An Overview

01-03

Approaches to Text Summarization: An Overview

01-03

Approaches to Text Summarization: An Overview

01-03

Distilled News

01-05

Adding Firebase Authentication to Shiny

01-03

Adding Firebase Authentication to Shiny

01-03

How to Write a Great Data Science Resume

01-03

Magister Dixit

01-03

Magister Dixit

01-03

Published in 2018

01-03

Published in 2018

01-03

Published in 2018

01-03

Published in 2018

01-03

Check Machin-like formulae with arbitrary-precision arithmetic

01-03

Check Machin-like formulae with arbitrary-precision arithmetic

01-03

Check Machin-like formulae with arbitrary-precision arithmetic

01-03

Check Machin-like formulae with arbitrary-precision arithmetic

01-03

Improve your AI and Machine Learning skills at AI NEXTCon in Seattle, Jan 23-27

01-03

2018 Traffic Data

01-03

2018 Traffic Data

01-03

2018 Traffic Data

01-03

2018 Traffic Data

01-03

My R Take in Advent of Code – Day 5

01-03

My R Take in Advent of Code – Day 5

01-03

Whats new on arXiv

01-04

Southern Illinois University Edwardsville: Director of the Center for Predictive Analytics/(Associate) Professor of Mathematics and Statistics [Edwardsville, IL]

01-04

What does it mean to write “vectorized” code in R?

01-04

Timing the Same Algorithm in R, Python, and C++

01-06

Timing the Same Algorithm in R, Python, and C++

01-06

What does it mean to write “vectorized” code in R?

01-04

Looking into 19th century ads from a Luxembourguish newspaper with R

01-04

The cold start problem: how to build your machine learning portfolio

01-04

R Packages worth a look

01-04

Strata Data SF 2019 KDnuggets Offer

01-04

In case you missed it: December 2018 roundup

01-04

In case you missed it: December 2018 roundup

01-04

R Packages worth a look

01-04

R Packages worth a look

01-04

R Packages worth a look

01-04

If you did not already know

01-04

Back by popular demand . . . The Greatest Seminar Speaker contest!

01-04

Displaying our “R – Quality Control Individual Range Chart Made Nice” inside a Java web App using AJAX – How To.

01-04

Displaying our “R – Quality Control Individual Range Chart Made Nice” inside a Java web App using AJAX – How To.

01-04

Displaying our “R – Quality Control Individual Range Chart Made Nice” inside a Java web App using AJAX – How To.

01-04

Displaying our “R – Quality Control Individual Range Chart Made Nice” inside a Java web App using AJAX – How To.

01-04

My Activities in 2018 with R and ShinyApp

01-04

My Activities in 2018 with R and ShinyApp

01-04

My Activities in 2018 with R and ShinyApp

01-04

Maryville University: Business Intelligence Analyst [St. Louis, MO]

01-04

Maryville University: Business Intelligence Analyst [St. Louis, MO]

01-04

Document worth reading: “Recent Research Advances on Interactive Machine Learning”

01-05

“Dissolving the Fermi Paradox”

01-05

Structural Analisys of Bayesian VARs with an example using the Brazilian Development Bank

01-05

Structural Analisys of Bayesian VARs with an example using the Brazilian Development Bank

01-05

Structural Analisys of Bayesian VARs with an example using the Brazilian Development Bank

01-05

If you did not already know

01-05

If you did not already know

01-05

Here’s why 2019 is a great year to start with R: A story of 10 year old R code then and now

01-05

Distilled News

01-05

2018 Winners and Losers

01-06

2018 Winners and Losers

01-06

2018 Winners and Losers

01-06

2018 Volatility Recap

01-06

2018 Volatility Recap

01-06

2018 Volatility Recap

01-06

gganimation for the nation

01-06

gganimation for the nation

01-06

Scaling H2O analytics with AWS and p(f)urrr (Part 1)

01-06

Co-integration and Mean Reverting Portfolio

01-06

Co-integration and Mean Reverting Portfolio

01-06

Co-integration and Mean Reverting Portfolio

01-06

Co-integration and Mean Reverting Portfolio

01-06

If you did not already know

01-06

Rotagrams

01-06

Rotagrams

01-06

Rotagrams

01-06

Rotagrams

01-06

Announcing the ultimate seminar speaker contest: 2019 edition!

01-06

If you did not already know

01-07

Looking back on 2018, looking to 2019

01-07

BH 1.69.0-1 on CRAN

01-07

Rev Summit for Data Science Leaders featuring Daniel Kahneman

01-07

The Data Science Event You Need in 2019

01-07

The Data Science Event You Need in 2019

01-07

Dow Jones Stock Market Index (2/4): Trade Volume Exploratory Analysis

01-07

Dow Jones Stock Market Index (2/4): Trade Volume Exploratory Analysis

01-07

Hackers beware: Bootstrap sampling may be harmful

01-07

Hackers beware: Bootstrap sampling may be harmful

01-07

Part 2, further comments on OfS grade-inflation report

01-07

Part 2, further comments on OfS grade-inflation report

01-07

Role of Computer Science in Data Science World

01-07

RTest: pretty testing of R packages

01-07

The seminar speaker contest begins: Jim Thorpe (1) vs. John Oliver

01-07

The seminar speaker contest begins: Jim Thorpe (1) vs. John Oliver

01-07

Marketing analytics with greybox

01-07

Marketing analytics with greybox

01-07

Tutorial: An app in R shiny visualizing biopsy data —  in a pharmaceutical company

01-07

Tutorial: An app in R shiny visualizing biopsy data —  in a pharmaceutical company

01-07

February 21st & 22nd: End-2-End from a Keras/TensorFlow model to production

01-07

February 21st & 22nd: End-2-End from a Keras/TensorFlow model to production

01-07

The Ultrarich's dirty secret: not paying taxes

01-07

The Ultrarich's dirty secret: not paying taxes

01-07

The Ultrarich's dirty secret: not paying taxes

01-07

The Ultrarich's dirty secret: not paying taxes

01-07

Document worth reading: “Which Knowledge Graph Is Best for Me?”

01-07

Document worth reading: “Which Knowledge Graph Is Best for Me?”

01-07

Document worth reading: “Which Knowledge Graph Is Best for Me?”

01-07

Document worth reading: “Which Knowledge Graph Is Best for Me?”

01-07

RcppStreams 0.1.2

01-07

RcppStreams 0.1.2

01-07

RcppStreams 0.1.2

01-07

AzureR packages now on CRAN

01-08

AzureR packages now on CRAN

01-08

AzureR packages now on CRAN

01-08

AzureR packages now on CRAN

01-08

Do something for yourself in 2019

01-08

Do something for yourself in 2019

01-08

Dow Jones Stock Market Index (3/4): Log Returns GARCH Model

01-08

Dow Jones Stock Market Index (3/4): Log Returns GARCH Model

01-08

Document worth reading: “I can see clearly now: reinterpreting statistical significance”

01-08

Document worth reading: “I can see clearly now: reinterpreting statistical significance”

01-08

Don’t reinvent the wheel: making use of shiny extension packages. Join MünsteR for our next meetup!

01-08

Did she really live 122 years?

01-08

Did she really live 122 years?

01-08

From a Night of Insomnia to Competition Winner | An Interview with Martin Barron

01-08

Analysis of South African Funds

01-08

Analysis of South African Funds

01-08

Analysis of South African Funds

01-08

A Beautiful 2 by 2 Matrix Identity

01-08

A Beautiful 2 by 2 Matrix Identity

01-08

If you did not already know

01-08

Where does .Renviron live on Citrix?

01-08

Where does .Renviron live on Citrix?

01-08

French Baccalaureate Results

01-08

French Baccalaureate Results

01-08

French Baccalaureate Results

01-08

A Non-Compromising Approach to Privacy-Preserving Personalized Services

01-08

A Non-Compromising Approach to Privacy-Preserving Personalized Services

01-08

A Non-Compromising Approach to Privacy-Preserving Personalized Services

01-08

Philip Roth (4) vs. DJ Jazzy Jeff; Jim Thorpe advances

01-08

Philip Roth (4) vs. DJ Jazzy Jeff; Jim Thorpe advances

01-08

You did a sentiment analysis with tidytext but you forgot to do dependency parsing to answer WHY is something positive/negative

01-08

You did a sentiment analysis with tidytext but you forgot to do dependency parsing to answer WHY is something positive/negative

01-08

Apply to NYU Stern’s MS in Business Analytics

01-08

Apply to NYU Stern’s MS in Business Analytics

01-08

Apply to NYU Stern’s MS in Business Analytics

01-08

Apply to NYU Stern’s MS in Business Analytics

01-08

Document worth reading: “Recent Advances in Deep Learning: An Overview”

01-08

Document worth reading: “Recent Advances in Deep Learning: An Overview”

01-08

“The Book of Why” by Pearl and Mackenzie

01-08

The Right Kind of Internal Motivation Can Improve Your Studies

01-08

The Right Kind of Internal Motivation Can Improve Your Studies

01-08

Ed Sullivan (3) vs. Sid Caesar; DJ Jazzy Jeff advances

01-09

Ed Sullivan (3) vs. Sid Caesar; DJ Jazzy Jeff advances

01-09

Ed Sullivan (3) vs. Sid Caesar; DJ Jazzy Jeff advances

01-09

Industry leaders to speak at Mega-PAW, Las Vegas – June 16-20

01-09

Industry leaders to speak at Mega-PAW, Las Vegas – June 16-20

01-09

Nemirovski’s acceleration

01-09

Nemirovski’s acceleration

01-09

ML and NLP Publications in 2018

01-09

An even better rOpenSci website with Hugo

01-09

An even better rOpenSci website with Hugo

01-09

Understanding the maths of Computed Tomography (CT) scans

01-09

Magister Dixit

01-09

Magister Dixit

01-09

A deep dive into glmnet: offset

01-09

A deep dive into glmnet: offset

01-09

A deep dive into glmnet: offset

01-09

Updated Review: jamovi User Interface to R

01-09

Updated Review: jamovi User Interface to R

01-09

How do Convolutional Neural Nets (CNNs) learn? + Keras example

01-09

How do Convolutional Neural Nets (CNNs) learn? + Keras example

01-09

On the Road to 0.8.0 — Some Additional New Features Coming in the sergeant Package

01-09

On the Road to 0.8.0 — Some Additional New Features Coming in the sergeant Package

01-09

On the Road to 0.8.0 — Some Additional New Features Coming in the sergeant Package

01-09

On the Road to 0.8.0 — Some Additional New Features Coming in the sergeant Package

01-09

R Packages worth a look

01-09

R Packages worth a look

01-09

R Packages worth a look

01-09

R Packages worth a look

01-09

R Packages worth a look

01-09

R Packages worth a look

01-09

R Packages worth a look

01-09

How Data Scientists Think - A Mini Case Study

01-09

How Data Scientists Think - A Mini Case Study

01-09

Python Patterns: max Instead of if

01-10

Python Patterns: max Instead of if

01-10

The Role of the Data Engineer is Changing

01-10

“discover feature relationships” – new EDA tool

01-10

Considering sensitivity to unmeasured confounding: part 2

01-10

If you did not already know

01-10

Principles of Database Management: The Practical Guide to Storing, Managing and Analyzing Big and Small Data

01-10

MRP (multilevel regression and poststratification; Mister P): Clearing up misunderstandings about

01-10

MRP (multilevel regression and poststratification; Mister P): Clearing up misunderstandings about

01-10

MRP (multilevel regression and poststratification; Mister P): Clearing up misunderstandings about

01-10

Top Skills Needed to Work as Data Scientist in iGaming

01-10

AI in Healthcare (With a case study)

01-10

Who is the greatest finisher in soccer?

01-10

Who is the greatest finisher in soccer?

01-10

Who is the greatest finisher in soccer?

01-10

Document worth reading: “Universality of Deep Convolutional Neural Networks”

01-10

Document worth reading: “Universality of Deep Convolutional Neural Networks”

01-10

Who is the greatest finisher in soccer?

01-10

Tutorial: Time Series Analysis with Pandas

01-10

Tutorial: Time Series Analysis with Pandas

01-10

Babe Didrikson Zaharias (2) vs. Adam Schiff; Sid Caesar advances

01-10

Babe Didrikson Zaharias (2) vs. Adam Schiff; Sid Caesar advances

01-10

Babe Didrikson Zaharias (2) vs. Adam Schiff; Sid Caesar advances

01-10

My presentations on ‘Elements of Neural Networks & Deep Learning’ -Part1,2,3

01-10

My presentations on ‘Elements of Neural Networks & Deep Learning’ -Part1,2,3

01-10

Hackathon Winner Interview: Friendship University of Russia | Kaggle University Club

01-10

Hackathon Winner Interview: Friendship University of Russia | Kaggle University Club

01-10

Hackathon Winner Interview: Friendship University of Russia | Kaggle University Club

01-10

Hackathon Winner Interview: Friendship University of Russia | Kaggle University Club

01-10

vitae: Dynamic CVs with R Markdown

01-10

R Packages worth a look

01-11

vitae: Dynamic CVs with R Markdown

01-10

Roll Your Own Federal Government Shutdown-caused SSL Certificate Expiration Monitor in R

01-10

Roll Your Own Federal Government Shutdown-caused SSL Certificate Expiration Monitor in R

01-10

Murmuration: Data Scientist [New York, NY]

01-10

Murmuration: Data Scientist [New York, NY]

01-10

Add a static pdf vignette to an R package

01-11

Add a static pdf vignette to an R package

01-11

R Packages worth a look

01-11

R Packages worth a look

01-11

Visualizing the Asian Cup with R!

01-11

Visualizing the Asian Cup with R!

01-11

Visualizing the Asian Cup with R!

01-11

R Tip: Use seqi() For Indexes

01-11

R Tip: Use seqi() For Indexes

01-11

R Tip: Use seqi() For Indexes

01-11

R Tip: Use seqi() For Indexes

01-11

Pear Therapeutics: Data Scientist [San Francisco, CA]

01-11

epubr 0.6.0 CRAN release

01-11

Parallelize a For-Loop by Rewriting it as an Lapply Call

01-11

Parallelize a For-Loop by Rewriting it as an Lapply Call

01-11

How to Remove Unfair Bias From Your AI

01-11

How to Remove Unfair Bias From Your AI

01-11

How to Remove Unfair Bias From Your AI

01-11

Document worth reading: “Machine Learning in Official Statistics”

01-11

If you did not already know

01-11

If you did not already know

01-11

pinp 0.0.7: More small YAML options

01-11

pinp 0.0.7: More small YAML options

01-11

The SIAM Book Series on Data Science

01-11

The SIAM Book Series on Data Science

01-11

How simpleshow uses Amazon Polly to voice stories in their explainer videos

01-11

How simpleshow uses Amazon Polly to voice stories in their explainer videos

01-11

Document worth reading: “Deep Neural Network Approximation Theory”

01-12

Document worth reading: “Deep Neural Network Approximation Theory”

01-12

Document worth reading: “Deep Neural Network Approximation Theory”

01-12

Practical Data Science with R, 2nd Edition discount!

01-12

Practical Data Science with R, 2nd Edition discount!

01-12

Practical Data Science with R, 2nd Edition discount!

01-12

10 years of playback history on Last.FM: "Just sit back and listen"

01-12

10 years of playback history on Last.FM: "Just sit back and listen"

01-12

How to combine Multiple ggplot Plots to make Publication-ready Plots

01-12

How to combine Multiple ggplot Plots to make Publication-ready Plots

01-12

I walk the (train) line – part deux – the weight loss continues

01-12

I walk the (train) line – part deux – the weight loss continues

01-12

I walk the (train) line – part deux – the weight loss continues

01-12

I walk the (train) line – part deux – the weight loss continues

01-12

CES 2019

01-12

Showing a difference in means between two groups

01-13

XmR Chart | Step-by-Step Guide by Hand and with R

01-13

XmR Chart | Step-by-Step Guide by Hand and with R

01-13

Making sense of the METS and ALTO XML standards

01-13

Making sense of the METS and ALTO XML standards

01-13

Making sense of the METS and ALTO XML standards

01-13

Generating Synthetic Data Sets with ‘synthpop’ in R

01-13

Generating Synthetic Data Sets with ‘synthpop’ in R

01-13

Magister Dixit

01-13

Magister Dixit

01-13

If you did not already know

01-13

If you did not already know

01-13
SunJackson

SunJackson

3974 日志
5 分类
© 2018 - 2019 SunJackson