SunJackson Blog


  • 首页

  • 分类

  • 关于

  • 归档

  • 标签

  • 站点地图

  • 公益404

Data Notes: Drought and the War in Syria

转载自:http://blog.kaggle.com/2018/08/23/data-notes-drought-and-the-war-in-syria/

Paul Mooney


发表于 2018-08-23

ARIMA, Syria, and mapping The Cure: Enjoy these new, intriguing, and overlooked datasets and kernels.

阅读全文 »

Document worth reading: “An Information-Theoretic Analysis of Deep Latent-Variable Models”

转载自:https://advanceddataanalytics.net/2018/08/24/document-worth-reading-an-information-theoretic-analysis-of-deep-latent-variable-models/

Michael Laux


发表于 2018-08-23

We present an information-theoretic framework for understanding trade-offs in unsupervised learning of deep latent-variables models using variational inference. This framework emphasizes the need to consider latent-variable models along two dimensions: the ability to reconstruct inputs (distortion) and the communication cost (rate). We derive the optimal frontier of generative models in the two-dimensional rate-distortion plane, and show how the standard evidence lower bound objective is insufficient to select between points along this frontier. However, by performing targeted optimization to learn generative models with different rates, we are able to learn many models that can achieve similar generative performance but make vastly different trade-offs in terms of the usage of the latent variable. Through experiments on MNIST and Omniglot with a variety of architectures, we show how our framework sheds light on many recent proposed extensions to the variational autoencoder family. An Information-Theoretic Analysis of Deep Latent-Variable Models

阅读全文 »

If you did not already know

转载自:https://advanceddataanalytics.net/2018/08/23/if-you-did-not-already-know-461/

Michael Laux


发表于 2018-08-23

Hazelcast Hazelcast, a leading open source in-memory data grid (IMDG) with hundreds of thousands of installed clusters and over 17 million server starts per month, launched Hazelcast Jet – a distributed processing engine for big data streams. With Hazelcast’s IMDG providing storage functionality, Hazelcast Jet is a new Apache 2 licensed open source project that performs parallel execution to enable data-intensive applications to operate in near real-time. Using directed acyclic graphs (DAG) to model relationships between individual steps in the data processing pipeline, Hazelcast Jet is simple to deploy and can execute both batch and stream-based data processing applications. Hazelcast Jet is appropriate for applications that require a near real-time experience such as sensor updates in IoT architectures (house thermostats, lighting systems), in-store e-commerce systems and social media platforms. …

阅读全文 »

Video: Azure Machine Learning in plain English

转载自:http://blog.revolutionanalytics.com/2018/08/aml-video.html

David Smith


发表于 2018-08-23

Data Scientist and author Siraj Raval recently released a 12-minute video overview of Azure Machine Learning (embedded at the end of this post). The video begins with a overview of cloud computing and Microsoft Azure generally, before getting into the details of some specific Azure services for machine learning:

阅读全文 »

Distilled News

转载自:https://advanceddataanalytics.net/2018/08/23/distilled-news-844/

Michael Laux


发表于 2018-08-23

A Practical Introduction to K-Nearest Neighbors Algorithm for Regression (with Python code)

阅读全文 »

3-D-Printed Time Series Plates

转载自:http://flowingdata.com/2018/08/23/3-d-printed-time-series-plates/

Nathan Yau


发表于 2018-08-23

Ever since my recent experience with 3-D printing, I’ve been itching for an excuse to print more. It’s a slow process that takes much longer than graphs rendered on a computer screen. But maybe that’s why it’s so satisfying.

阅读全文 »

R Packages worth a look

转载自:https://advanceddataanalytics.net/2018/08/23/r-packages-worth-a-look-1251/

Michael Laux


发表于 2018-08-23

Flexible and Efficient Evaluation of Principal Surrogates/Treatment Effect Modifiers (pssmooth)Implements estimation and testing procedures for evaluating an intermediate biomarker response as a principal surrogate of a clinical response to treat …

阅读全文 »

Getting Started with Competitions - A Peer to Peer Guide

转载自:http://blog.kaggle.com/2018/08/22/machine-learning-kaggle-competition-part-one-getting-started/

William Koehrsen


发表于 2018-08-22

Think of this as a standard Jupyter Notebook with slightly different aesthetics. You can write Python code and text (using Markdown syntax) just like in Jupyter and run the code completely in the cloud on Kaggle’s servers. However, Kaggle kernels have some unique features not available in Jupyter Notebook. Hit the leftward facing arrow in the upper right to expand the kernel control panel which brings up three tabs (if the notebook is not in fullscreen, then these three tabs may already be visible next to the code).

阅读全文 »

Why you can't have privacy on the internet

转载自:https://www.chrisstucchio.com/blog/2018/the_price_of_privacy.html?utm_medium=rss&utm_source=rss&utm_campaign=rss

Chris Stucchio


发表于 2018-08-22

I recently attended a discussion at Fifth Elephant on privacy. During the panel, one of the panelists asked the audience: “how many of you are concerned about your privacy online, and take steps to protect it?”

阅读全文 »

Distilled News

转载自:https://advanceddataanalytics.net/2018/08/22/distilled-news-843/

Michael Laux


发表于 2018-08-22

Causal Information Theory – Formal Introduction of Key Concepts

阅读全文 »
1 … 242 243 244 … 398
SunJackson

SunJackson

3974 日志
5 分类
© 2018 - 2019 SunJackson