- Programming
Extract data from a PNG/TIFF
Sometimes it’s useful to be able to extract data from a published figure. If the figure isn’t in a vector-based format (in which case the numeric data is probably still embedded in the file), it’s possible to digitize the image in R: click the points and extract the values that way. The digitize package is simple to use for this purpose…
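A minimal sketch of the idea. The interactive call assumes the digitize package's workflow (it requires clicking in a graphics device, so it is commented out here); the linear pixel-to-data calibration below is my own illustrative stand-in for the mapping such a tool performs after you click known axis reference points:

```r
# Interactive use of the digitize package (shown, not run):
# install.packages("digitize")
# library(digitize)
# pts <- digitize("figure.png")  # click axis reference points, then data points

# The core idea is a linear calibration from pixel to data coordinates.
# Given the pixel positions of two reference ticks with known values,
# any clicked pixel can be mapped back to data units:
pixel_to_data <- function(px, px_ref, data_ref) {
  # px_ref: pixel positions of two reference ticks; data_ref: their true values
  data_ref[1] + (px - px_ref[1]) * diff(data_ref) / diff(px_ref)
}

# Example: ticks at pixels 100 and 500 correspond to x = 0 and x = 10
pixel_to_data(300, px_ref = c(100, 500), data_ref = c(0, 10))  # 5
```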
The Quick Python Book
Sponsored Post. The Quick Python Book, Third Edition is a comprehensive guide to the Python language by Naomi Ceder, Founder of the Python Education Summit. With the personal touch of a skilled teacher, she beautifully balances details of the language with the insights and advice you need to handle any task. The extensive, relevant examples and exercises inside further help you master each important concept, whether you’re scraping websites or playing around with nested tuples!
Creating Tables Using R and Pure HTML
A problem with R is that its tables are not good enough to share with non-R users, both in terms of visual attractiveness and ease of reading – particularly when the table is large. Quite a few different packages, tools, and workflows have been developed to address this problem, from formattable through to R Markdown and Displayr, to name a few. Over the past few months I have found myself increasingly using R to write tables in pure HTML. Why? Because pure HTML gives the greatest level of control. In this post I am going to work through a simple but easily generalizable example, which can be used both within R and RStudio and when building interactive dashboards.
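As a minimal base-R sketch of the pure-HTML approach (a hypothetical helper, not the post's exact code): build the table tags with paste0, then render the resulting string as HTML in RStudio or a dashboard.

```r
# Turn a data frame into a pure-HTML table string using only base R.
make_html_table <- function(df) {
  header <- paste0("<tr>", paste0("<th>", names(df), "</th>", collapse = ""), "</tr>")
  rows <- apply(df, 1, function(r) {
    paste0("<tr>", paste0("<td>", r, "</td>", collapse = ""), "</tr>")
  })
  paste0("<table>", header, paste0(rows, collapse = ""), "</table>")
}

html <- make_html_table(head(mtcars[, 1:3], 2))
# In a Shiny or flexdashboard context the string can be rendered with,
# e.g., htmltools::HTML(html); here we just inspect it:
cat(html)
```

Because the output is a plain string of HTML, every attribute, class, and style is under your control, which is the point of skipping higher-level table packages.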
My Self-Driving Presentation for TTS
Here is the presentation I gave at the Mozilla All-Hands in Orlando about https://github.com/mozilla/TTS
Anomaly detection on Amazon DynamoDB Streams using the Amazon SageMaker Random Cut Forest algorithm
Have you considered introducing anomaly detection technology to your business? Anomaly detection is a technique used to identify rare items, events, or observations which raise suspicion by differing significantly from the majority of the data you are analyzing. The applications of anomaly detection are wide-ranging, including the detection of abnormal purchases or cyber intrusions in banking, spotting a malignant tumor in an MRI scan, identifying fraudulent insurance claims, finding unusual machine behavior in manufacturing, and even detecting strange patterns in network traffic that could signal an intrusion.
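The post itself uses Amazon SageMaker's Random Cut Forest algorithm; as a much simpler stand-in that illustrates the underlying idea (flagging observations that differ sharply from the rest), here is a robust z-score detector in R. This is not the RCF algorithm, just a sketch of the concept:

```r
# Flag observations whose robust z-score exceeds a threshold.
detect_anomalies <- function(x, threshold = 3.5) {
  med <- median(x)
  mad_x <- mad(x)  # median absolute deviation, scaled to be sd-consistent
  z <- abs(x - med) / mad_x
  which(z > threshold)
}

set.seed(42)
traffic <- c(rnorm(100, mean = 50, sd = 5), 120)  # one obvious spike at the end
detect_anomalies(traffic)  # the spike at index 101 is flagged
```

Random Cut Forest works very differently (it scores points by how easily random partitions isolate them, so it handles multi-dimensional streams), but the input/output contract is similar: observations in, anomaly scores or flags out.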
Magister Dixit
“Current machine learning systems operate, almost exclusively, in a statistical, or model-free mode, which entails severe theoretical limits on their power and performance. Such systems cannot reason about interventions and retrospection and, therefore, cannot serve as the basis for strong AI. To achieve human level intelligence, learning machines need the guidance of a model of reality, similar to the ones used in causal inference tasks.” Judea Pearl (July 2018)
Kick Start Your Data Career! Tips From the Frontline
By Vaishali Lambe, Data Scientist
Learn to do Data Viz in R
One of the reasons that R is a top language for data science is that it’s great for data visualization. R users can take advantage of the wildly popular ggplot2 package to turn massive data sets into easily readable charts in just a few lines of code. That can be incredibly valuable for presenting your data, but more importantly, when it’s done right, data viz is a tool for helping you understand what the data is telling you.
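A few-lines example of what that looks like with ggplot2, using the built-in mtcars data set:

```r
library(ggplot2)

# Scatter plot of weight vs. fuel efficiency, colored by cylinder count
p <- ggplot(mtcars, aes(x = wt, y = mpg, colour = factor(cyl))) +
  geom_point(size = 2) +
  labs(x = "Weight (1000 lbs)", y = "Miles per gallon", colour = "Cylinders")
p  # printing the object draws the chart
```

Three short layers – data mapping, geometry, labels – and the grouping by cylinder count is immediately visible, which is the kind of "what is the data telling you" insight the paragraph above is describing.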
Document worth reading: “A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition”
Attention-based recurrent neural encoder-decoder models present an elegant solution to the automatic speech recognition problem. This approach folds the acoustic model, pronunciation model, and language model into a single network and requires only a parallel corpus of speech and text for training. However, unlike in conventional approaches that combine separate acoustic and language models, it is not clear how to use additional (unpaired) text. While there has been previous work on methods addressing this problem, a thorough comparison among methods is still lacking. In this paper, we compare a suite of past methods and some of our own proposed methods for using unpaired text data to improve encoder-decoder models. For evaluation, we use the medium-sized Switchboard data set and the large-scale Google voice search and dictation data sets. Our results confirm the benefits of using unpaired text across a range of methods and data sets. Surprisingly, for first-pass decoding, the rather simple approach of shallow fusion performs best across data sets. However, for Google data sets we find that cold fusion has a lower oracle error rate and outperforms other approaches after second-pass rescoring on the Google voice search data set. A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition