I’ve been making my way through the recently released Deep Learning textbook (which is absolutely excellent), and I came upon the section on Universal Approximation Properties. The Universal Approximation Theorem (UAT) essentially proves that neural networks are capable of approximating any continuous function (subject to some constraints and with upper bounds on compute).
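A minimal numpy sketch of the flavor of that result (my own toy example, not one from the book or the post): take a single hidden layer of randomly-initialized sigmoid units, fit only the output weights by least squares, and the network already drives the error on a wiggly target down to nearly nothing at the training points:

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(-np.pi, np.pi, 200)[:, None]
target = np.sin(3 * x).ravel()  # an arbitrary continuous target

# One hidden layer of sigmoid units with random (frozen) weights;
# only the linear output layer is fit, by least squares.
n_hidden = 200
W = rng.normal(scale=3.0, size=(1, n_hidden))
b = rng.normal(scale=3.0, size=n_hidden)
H = 1.0 / (1.0 + np.exp(-(x @ W + b)))
coef, *_ = np.linalg.lstsq(H, target, rcond=None)

approx = H @ coef
print(np.max(np.abs(approx - target)))  # max error is tiny
```

Of course, the theorem guarantees that *some* set of weights achieves any desired accuracy; it says nothing about how many units you need or whether training will find those weights.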
Cryptogram Puzzle
There are a variety of tools that can be employed to solve cryptograms, just like there are lots of techniques that can be used to solve puzzles like Sudoku. These include: algebraic techniques, number theory, factorization, digital roots, elimination, carries, divisibility rules, modular arithmetic, the Chinese Remainder Theorem, and even simple trial and error.
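As a hypothetical illustration of the trial-and-error end of that toolbox (plus a pinch of carry reasoning), here is a brute-force Python solver for the classic SEND + MORE = MONEY cryptarithm; the puzzle and solver are my own example, not taken from the post:

```python
from itertools import permutations

def solve_send_more_money():
    """Brute-force the classic SEND + MORE = MONEY cryptarithm."""
    # Carry reasoning prunes the search: M is the carry out of a
    # four-digit sum, so M must be 1 (and 1 is excluded below).
    digits = [0, 2, 3, 4, 5, 6, 7, 8, 9]
    for perm in permutations(digits, 7):
        d = dict(zip("SENDORY", perm))
        d["M"] = 1
        if d["S"] == 0:  # a leading digit cannot be zero
            continue
        num = lambda word: int("".join(str(d[c]) for c in word))
        if num("SEND") + num("MORE") == num("MONEY"):
            return num("SEND"), num("MORE"), num("MONEY")

print(solve_send_more_money())  # (9567, 1085, 10652)
```

Fixing M = 1 shrinks the search from 10P8 to 9P7 assignments, a good example of how a little arithmetic reasoning makes raw trial and error tractable.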
Model AUC depends on test set difficulty
The AUC score is a popular summary statistic that is often used to communicate the performance of a classifier. However, we illustrate here that this score depends not only on the quality of the model in question, but also on the difficulty of the test set considered: If samples are added to a test set that are easily classified, the AUC will go up — even if the model studied has not improved. In general, this behavior implies that isolated, single AUC scores cannot be used to meaningfully quantify a model’s performance. Instead, the AUC should be considered a score that is primarily useful for comparing and ranking multiple models — each at a common test set difficulty.
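A quick numerical illustration of that point (the scores below are made up for the example, not from the post): compute AUC as the fraction of correctly-ranked (positive, negative) pairs, then append a few trivially-classified samples and watch the score jump with no change to the model:

```python
import numpy as np

def auc(pos_scores, neg_scores):
    """AUC = fraction of (positive, negative) pairs ranked correctly.
    (No ties in this example, so ties are ignored.)"""
    pos = np.asarray(pos_scores)[:, None]
    neg = np.asarray(neg_scores)[None, :]
    return float((pos > neg).mean())

# A model with real positive/negative overlap on a hard test set.
pos = [0.9, 0.6, 0.45]
neg = [0.3, 0.5, 0.7]
print(auc(pos, neg))  # 6 of 9 pairs correct: ~0.667

# The SAME model scored on that test set plus easy samples.
easy_pos = pos + [0.99, 0.98]
easy_neg = neg + [0.01, 0.02]
print(auc(easy_pos, easy_neg))  # 22 of 25 pairs correct: 0.88
```

The model is identical in both cases; only the test set changed, and the AUC rose from roughly 0.67 to 0.88.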
Ordered Categorical GLMs for Product Feedback Scores
TL;DR: there’s a Shiny app too.
Building Safe A.I.
TLDR: In this blogpost, we’re going to train a neural network that is fully encrypted during training (trained on unencrypted data). The result will be a neural network with two beneficial properties. First, the neural network’s intelligence is protected from those who might want to steal it, allowing valuable AIs to be trained in insecure environments without risking theft of their intelligence. Second, the network can only make encrypted predictions (which presumably have no impact on the outside world because the outside world cannot understand the predictions without a secret key). This creates a valuable power imbalance between a user and a superintelligence. If the AI is homomorphically encrypted, then from its perspective, the entire outside world is also homomorphically encrypted. A human controls the secret key and has the option to either unlock the AI itself (releasing it on the world) or just individual predictions the AI makes (which seems safer).
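To make the homomorphic-encryption idea concrete, here is a toy Paillier cryptosystem in Python — an additively homomorphic scheme chosen by me purely for illustration; it is not the scheme the post builds on, and these parameters are far too small to be secure. The key property is that multiplying ciphertexts adds the underlying plaintexts, so arithmetic can be done on data nobody can read:

```python
import math, random

# Toy Paillier cryptosystem with tiny primes (illustration only).
p, q = 17, 19
n, n2 = p * q, (p * q) ** 2
g = n + 1
lam = math.lcm(p - 1, q - 1)
# mu is the modular inverse used during decryption.
mu = pow((pow(g, lam, n2) - 1) // n, -1, n)

def enc(m):
    """Encrypt m with fresh randomness r coprime to n."""
    r = random.randrange(1, n)
    while math.gcd(r, n) != 1:
        r = random.randrange(1, n)
    return pow(g, m, n2) * pow(r, n, n2) % n2

def dec(c):
    return (pow(c, lam, n2) - 1) // n * mu % n

# Additive homomorphism: multiplying ciphertexts adds plaintexts.
a, b = enc(2), enc(3)
print(dec(a * b % n2))  # decrypts to 2 + 3 = 5
```

A network whose weights and activations live in this encrypted space can be stepped forward without the party running it ever seeing the numbers — the holder of `lam` and `mu` decides what, if anything, gets decrypted.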
How to mine newsfeed data and extract interactive insights in Python
It appears that there is a dominant cluster scattered all over the space: this is mainly due to the general category of news.
Millions of social bots invaded Twitter!
Our work titled Online Human-Bot Interactions: Detection, Estimation, and Characterization has been accepted for publication at the prestigious International AAAI Conference on Web and Social Media (ICWSM 2017) to be held in Montreal, Canada in May 2017!
Random-Walk Bayesian Deep Networks: Dealing with Non-Stationary Data
Download the NB: https://github.com/twiecki/WhileMyMCMCGentlySamples/blob/master/content/downloads/notebooks/random_walk_deep_net.ipynb
Discarded Hard Drives: Data Science as Debugging
As a university professor, when setting data-oriented projects for Computer Science undergraduates, I used to find it difficult to get students to interact properly with the data. Students tended to write programs to process the data and produce a couple of plots, but failed to develop any understanding of what useful information was in the data or how best to extract it.
Intercausal Reasoning in Bayesian Networks
This might look a little funky for those used to Bayesian parameter estimation for univariate systems - estimating $P(\text{heads})$ given the results of 15 coinflips, for example.
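For reference, that univariate coin-flip case has a closed-form conjugate answer; a minimal sketch with made-up data (11 heads in 15 flips, flat Beta(1, 1) prior):

```python
# Beta-Binomial conjugate update: with a Beta(a0, b0) prior on
# P(heads) and h heads in n flips, the posterior is
# Beta(a0 + h, b0 + n - h).
heads, flips = 11, 15          # made-up data
a0, b0 = 1, 1                  # flat Beta(1, 1) prior
a, b = a0 + heads, b0 + flips - heads
posterior_mean = a / (a + b)
print(posterior_mean)  # 12 / 17 ≈ 0.7059
```

The multivariate, intercausal setting in the post has no such closed form, which is exactly why it looks funkier.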