Deep Learning Dead-End?
Customizing Docker Images in Cloudera Data Science Workbench
This article shows how to build and publish a customized Docker image for use as an engine in Cloudera Data Science Workbench. Customizing the engine this way lets you work with your favorite toolchain inside the web-based application.
QuantConnect – the only Game in Town
At least in my town. Some time back I decided on Quantopian as my backtesting platform of choice, with QuantConnect a close second. Now, a few months later, Quantopian has decided to end live trading, so there is no choice but to go back to the second option. QuantConnect has several advantages of its own; it was a pretty close decision even when there was a choice. Now there is little choice, which makes things easier.
Joining ASAPP
The open-source “masters” has come to a close. I’m now joining ASAPP, Inc. as a Machine Learning Engineer.
Semantic trees for training word embeddings with hierarchical softmax
Word vector models represent each word in a vocabulary as a vector in a continuous space such that words that share the same context are “close” together. Closeness is measured using a distance metric or similarity measure such as the Euclidean distance or cosine similarity. Once word vectors have been trained on a large corpus, one can form document vectors to compare documents based on their content similarity. A central question is how to obtain “good” word vectors in the first place. For this, various models based on neural networks have been proposed, one of the most popular being word2vec. In the “continuous-bag-of-words” (CBOW) architecture of word2vec, word vectors are trained by predicting the central word of a sliding window given its neighbouring words. This is formulated as a classification problem, where the correct central word has to be selected from the full vocabulary given the context. Usually one would use a softmax classifier as the top layer of such a network. However, for the softmax the training time grows linearly in the number of possible outcomes, making the method unsuitable for large vocabularies.
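As a toy illustration of this scaling argument (the function names and tiny inputs below are invented for the sketch, not taken from word2vec's implementation): a flat softmax has to touch every vocabulary entry to normalise a single prediction, whereas hierarchical softmax only evaluates one sigmoid per internal node on the path from the tree root to the target word — O(log V) instead of O(V).

```python
import math

def flat_softmax_prob(scores, target):
    # Full softmax: the normalisation sum visits every vocabulary
    # entry, so each prediction costs O(V).
    exps = [math.exp(s) for s in scores]
    return exps[target] / sum(exps)

def hierarchical_softmax_prob(path_scores, path_directions):
    # Hierarchical softmax: one sigmoid per internal node on the
    # root-to-leaf path, so each prediction costs O(depth) = O(log V).
    # path_directions[i] is +1 to branch one way (probability
    # sigma(score)) and -1 to branch the other (sigma(-score)).
    prob = 1.0
    for score, d in zip(path_scores, path_directions):
        prob *= 1.0 / (1.0 + math.exp(-d * score))
    return prob
```

With a balanced binary tree over V leaves the path length is about log2(V), so a million-word vocabulary needs roughly 20 sigmoid evaluations per word instead of a million-term normalisation sum; since sigma(x) + sigma(-x) = 1 at every node, the leaf probabilities still sum to 1.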
Making Smart Phones Dumb Again
** Thu 07 September 2017
What Killed the Curse of Dimensionality?
How does Deep Learning overcome this hurdle in machine learning and why?
Deep Learning with Intel’s BigDL and Apache Spark
Cloudera recently published a blog post on how to use Deeplearning4J (DL4J) along with Apache Hadoop and Apache Spark to get state-of-the-art results on an image recognition task. Continuing in a similar vein, in this post we discuss a viable alternative that is specifically designed to be used with Spark, and with data available in Spark and Hadoop clusters, via a Scala or Python API.
Software patents are evil, but BSD+Patents is probably not the solution
** Tue 05 September 2017