What is Content Services?
R Packages worth a look
Read, Validate, Analyze, and Map Files in the General Transit Feed Specification (tidytransit): Read General Transit Feed Specification (GTFS) zipfiles into a list of R dataframes. Perform validation of the data structure against the specification …
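As a quick, hedged illustration of the package's core workflow — the file path below is a placeholder, not a file shipped with the package:

library(tidytransit)

# A minimal sketch, assuming a GTFS feed saved locally as "gtfs.zip"
gtfs <- read_gtfs("gtfs.zip")

names(gtfs)        # the GTFS tables, e.g. "stops", "routes", "trips", "stop_times"
head(gtfs$stops)   # each table is an ordinary data frame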
Discussion of the value of a mathematical model for the dissemination of propaganda
A couple of people pointed me to this article, “How to Beat Science and Influence People: Policy Makers and Propaganda in Epistemic Networks,” by James Weatherall, Cailin O’Connor, and Justin Bruner, which was also covered in a news article. Their paper begins:
If you did not already know
Jubatus
Jubatus is a distributed processing framework and streaming machine learning library. Jubatus includes these functionalities:
· Online Machine Learning Library: Classification, Regression, Recommendation (Nearest Neighbor Search), Graph Mining, Anomaly Detection, Clustering
· Feature Vector Converter (fv_converter): Data Preprocessing and Feature Extraction
· Framework for Distributed Online Machine Learning with Fault Tolerance …
The Magic of LSTM
Preface
Machine Learning Interviews
Topic Models
GBM
GBM (gradient boosting machine)
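The body of the post is not excerpted here; as a hedged illustration of the technique named in the title, here is a minimal gradient boosting fit using the R gbm package. The data and all parameter settings are invented for the example, not drawn from the post:

library(gbm)

# Simulated data, purely illustrative
set.seed(1)
df <- data.frame(x1 = rnorm(200), x2 = rnorm(200))
df$y <- as.integer(df$x1 + df$x2 + rnorm(200) > 0)

fit <- gbm(y ~ x1 + x2, data = df,
           distribution = "bernoulli",   # logistic loss for a 0/1 outcome
           n.trees = 100, interaction.depth = 2, shrinkage = 0.1)

summary(fit)                                              # relative variable influence
head(predict(fit, df, n.trees = 100, type = "response"))  # predicted probabilities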
Jeremy Freese was ahead of the curve
Here’s sociologist Jeremy Freese writing, back in 2008:
Document worth reading: “Learning to Succeed while Teaching to Fail: Privacy in Closed Machine Learning Systems”
Security, privacy, and fairness have become critical in the era of data science and machine learning. More and more we see that achieving universally secure, private, and fair systems is practically impossible. We have seen, for example, how generative adversarial networks can be used to learn about the expected private training data; how the exploitation of additional data can reveal private information in the original data; and how seemingly unrelated features can reveal information about each other. Confronted with this challenge, in this paper we open a new line of research in which security, privacy, and fairness are learned and used in a closed environment. The goal is to ensure that a given entity (e.g., a company or a government), trusted to infer certain information from our data, is blocked from inferring protected information from it. For example, a hospital might be allowed to produce a diagnosis for a patient (the positive task) without being able to infer the gender of the subject (the negative task). Similarly, a company can guarantee that internally it is not using the provided data for any undesired task, an important goal that does not contradict the virtually impossible challenge of blocking everybody from the undesired task. We design a system that learns to succeed at the positive task while simultaneously failing at the negative one, and illustrate this with challenging cases where the positive task is actually harder than the negative one being blocked. Fairness with respect to the information in the negative task is often obtained automatically as a result of the proposed approach. The particular framework and examples open the door to security, privacy, and fairness in important closed scenarios, ranging from private data-accumulation companies such as social networks to law enforcement and hospitals. Learning to Succeed while Teaching to Fail: Privacy in Closed Machine Learning Systems
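One common way to formalize "succeed at the positive task while failing at the negative one" is an adversarial min-max objective. The formulation below is a sketch in my own notation, in the spirit of adversarial multi-task training, and not necessarily the paper's exact objective:

\min_{\theta,\,\psi}\;\max_{\phi}\;\Big[\,\mathcal{L}_{+}\big(g_{\psi}(f_{\theta}(x)),\,y_{+}\big)\;-\;\lambda\,\mathcal{L}_{-}\big(h_{\phi}(f_{\theta}(x)),\,y_{-}\big)\,\Big],\qquad \lambda > 0,

where f_\theta is a shared encoder, g_\psi is the head for the positive task (e.g., diagnosis), h_\phi is an adversary trying to recover the protected attribute (e.g., gender), and \mathcal{L}_{+}, \mathcal{L}_{-} are the two task losses. Training the encoder against the best adversary keeps the positive loss low while driving the achievable negative loss up, so the learned representation supports the allowed inference but resists the blocked one.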
Create video subtitles with translation using machine learning
Businesses around the globe require fast and reliable ways to transcribe audio or video files, often in multiple languages. This audio and video content can range from news broadcasts and call center phone interactions to job interviews, product demonstrations, and even court proceedings. The traditional transcription process is both expensive and lengthy, often involving the hiring of dedicated staff or services and a high degree of manual effort. This effort is compounded when a multi-language transcript is required, often leaving customers to over-dub the original content with a new audio track.
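As a hedged sketch of the kind of automated pipeline the post describes, here is how one might start a transcription job and translate the result from R using the paws AWS SDK. The bucket, file name, job name, and language codes are all placeholders, and the post itself may use different tooling:

library(paws)   # AWS SDK for R

transcribe <- transcribeservice()
translator <- translate()

# Start an asynchronous transcription job on a video stored in S3.
# "my-bucket", "demo.mp4", and "subtitle-demo" are placeholder names.
transcribe$start_transcription_job(
  TranscriptionJobName = "subtitle-demo",
  LanguageCode = "en-US",
  MediaFormat = "mp4",
  Media = list(MediaFileUri = "s3://my-bucket/demo.mp4")
)

# Once get_transcription_job() reports the job as complete, each caption
# line from the transcript can be translated, e.g. into Spanish:
res <- translator$translate_text(
  Text = "Hello, and welcome to the demonstration.",
  SourceLanguageCode = "en",
  TargetLanguageCode = "es"
)
res$TranslatedText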