Text Similarity (similr)Using brute-force string comparator algorithms, this package facilitates finding a particular string’s closest match amongst a target vector of strings.
Distilled News
Let’s Talk About AI Ethics; We’re On A Deadline
The causal hype ratchet
Six Steps to Master Machine Learning with Data Preparation
By David Levinger, VP Dev and Cloud Operations at Paxata
Transcribe speech in three new languages: French, Italian, and Brazilian Portuguese
We’re excited to announce that Amazon Transcribe now supports automatic speech recognition in three new languages: French, Italian, and Brazilian Portuguese. These new languages expand upon the 5 languages already available in Amazon Transcribe: US English, US Spanish, Australian English, British English, and Canadian French.
Feature engineering, Explained
By Paweł Grabiński
Machine Learning Explainability vs Interpretability: Two concepts that could help restore trust in AI
By Richard Gall, Packt
Spelling 2.0: Improved Markdown and RStudio Support
rOpenSci - open tools for open science
发表于
We have released updates for the rOpenSci text analysis tools. This technote will highlight some of the major improvements in the spelling package and also the underlying hunspell package, which provides the spelling engine for the spelling package.
R 3.5.2 now available
R 3.5.2, the latest version of the R language for statistical computation and graphics from the R Foundation, was released today. (This release is codenamed “Eggshell Igloo”, likely in reference to this or this Peanuts cartoon.) Compared to R 3.5.1, this update includes only bug fixes, so R scripts and packages compatible with R 3.5.0 or R 3.5.1 should work without modification.
How to Scrape Data from a JavaScript Website with R
In September 2017, I found myself working on a project that required odds data for football. At the time I didn’t know about resources such as Football-Data or the odds-api, so I decided to build a scraper to collect data directly from the bookmakers. However, most of them used JavaScript to display their odds, so I couldn’t collect the data with R and rvest alone. In this article, I’ll demonstrate how PhantomJS can be used with R to scrape JS-rendered content from the web.