Exploratory data analysis is important; everybody knows that. With R, it is also easy. Below are three lines of code that let you interactively explore the Preston Curve, the prominent association of country-level real income per capita with life expectancy.
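(The post's own three lines are not reproduced in this extract; a minimal sketch in that spirit, assuming the gapminder and plotly packages, could look like this.)

    # Preston Curve: GDP per capita vs. life expectancy, animated by year
    library(gapminder)
    library(plotly)
    plot_ly(gapminder, x = ~gdpPercap, y = ~lifeExp, color = ~continent,
            frame = ~year, type = "scatter", mode = "markers") %>%
      layout(xaxis = list(type = "log"))  # income is usually shown on a log axis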
Timing Grouped Mean Calculation in R
This note is a comment on some of the timings shared in the dplyr-0.8.0 pre-release announcement.
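The note's own benchmark code is not shown in this extract; a minimal sketch of that kind of timing, assuming the dplyr and microbenchmark packages, might be:

    library(dplyr)
    library(microbenchmark)

    # one million rows spread over one thousand groups
    d <- data.frame(g = sample(1e3, 1e6, replace = TRUE), x = runif(1e6))

    # time a grouped mean in dplyr against a base-R tapply()
    microbenchmark(
      dplyr = d %>% group_by(g) %>% summarise(m = mean(x)),
      base  = tapply(d$x, d$g, mean),
      times = 10
    )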
My footnote about global warming
At the beginning of my article, How to think scientifically about scientists’ proposals for fixing science, which we discussed yesterday, I wrote:
R Packages worth a look
Visualize R Data Structures with Trees (lobstr): A set of tools for inspecting and understanding R data structures inspired by str(). Includes ast() for visualizing abstract syntax trees, ref() for sh …
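A quick taste of the two functions the blurb names (a sketch of mine, assuming lobstr is installed from CRAN):

    library(lobstr)

    # draw the abstract syntax tree of an unevaluated call
    ast(f(x, "y", 1))

    # show that two bindings reference the same underlying object
    x <- 1:10
    y <- x
    ref(x, y)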
Document worth reading: “A Theory of Diagnostic Interpretation in Supervised Classification”
Interpretable deep learning is a fundamental building block towards safer AI, especially now that the deployment of deep learning-based computer-aided medical diagnostic systems is so imminent. However, without a computational formulation of black-box interpretation, general interpretability research relies heavily on subjective bias. The clear decision structure of medical diagnostics lets us approximate the decision process of a radiologist as a model, removed from subjective bias. We define the process of interpretation as a finite communication between a known model and a black-box model that optimally maps the black box's decision process onto the known model. Consequently, we define interpretability as the maximal information gain over the initial uncertainty about the black box's decision within finite communication. We relax this definition based on the observation that diagnostic interpretation is typically achieved by a process of minimal querying, and we derive an algorithm to calculate diagnostic interpretability. The usual question of the accuracy-interpretability tradeoff, i.e. whether a black-box model's prediction accuracy depends on its ability to be interpreted by a known source model, does not arise in this theory. With multiple simulation experiments of various complexity levels, we demonstrate how such a theoretical model works in synthetic supervised classification scenarios. A Theory of Diagnostic Interpretation in Supervised Classification
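One plausible formal reading of that definition (notation mine, not taken from the paper): with H denoting Shannon entropy, Y the black box's decision, and Q a set of at most k queries answered through the known model,

    \mathcal{I} = \max_{Q :\; |Q| \le k} \big[ H(Y) - H(Y \mid Q) \big]

that is, interpretability is the largest reduction in uncertainty about the black box's decision achievable within a finite communication budget.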
It was twenty years ago …
… this week that I made a first cameo in the debian/changelog for the Debian R package:
Dr. Data Show Video: Five Reasons Computers Predict When You’ll Die
Watch the latest episode of The Dr. Data Show, which answers the question, “Why do computers predict when you’ll die?” – with five example reasons!
What's new on arXiv
Interpretable Graph Convolutional Neural Networks for Inference on Noisy Knowledge Graphs
If you did not already know
Context-aware Sentiment Word Identification (sentiword2vec) Traditional sentiment analysis often uses a sentiment dictionary to extract sentiment information from text and classify documents. However, emerging informal words and phrases in user-generated content call for analysis that is aware of context, since such words often have special meanings in a particular context. Because of their strong performance in representing inter-word relations, we use sentiment word vectors to identify these special words. Building on the distributed language model word2vec, this paper presents a novel method for representing the sentiment of a word in a particular context; specifically, it identifies words with abnormal sentiment polarity in long answers. Results show the improved model performs better at representing words with special meanings, while still doing well on special idiomatic patterns. Finally, we discuss the meaning of vector representations in the field of sentiment, which may differ from general object-based conditions. …
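A toy illustration of the underlying idea (mine, not the paper's code; the vectors below are random stand-ins for trained word2vec embeddings): score a word's contextual polarity by its cosine similarity to a centroid of seed sentiment words, and flag words whose similarity disagrees with their dictionary polarity.

    # cosine similarity between two vectors
    cosine <- function(a, b) sum(a * b) / (sqrt(sum(a * a)) * sqrt(sum(b * b)))

    set.seed(1)
    # random 10-dimensional stand-ins for trained word embeddings
    vecs <- matrix(rnorm(5 * 10), nrow = 5,
                   dimnames = list(c("good", "great", "bad", "sick", "terrible"), NULL))
    pos_centroid <- colMeans(vecs[c("good", "great"), ])

    # a slang word like "sick" may sit closer to the positive centroid than
    # its dictionary polarity suggests; compare it against a clear negative
    cosine(vecs["sick", ], pos_centroid)
    cosine(vecs["terrible", ], pos_centroid)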