Learning a new language can seem like an insurmountable challenge. However, approaching the problem with data science and a few smart techniques can ease the learning curve and give you a starting point on which to base your studies.
Search labels and IDs from IAB-QAG and IPTC Subject Codes taxonomies
IAB-QAG and IPTC Subject Codes are two really useful taxonomies that you can use to classify news content as part of your workflow. They sort huge amounts of text content into categories that are readable to both humans and machines, so you can create value from the sorted content.
Python vs R: Head to Head Data Analysis
Which is better for data analysis?
Multi-Class Text Classification Model Comparison and Selection
By Susan Li, Sr. Data Scientist
Sharing the Recipe for rOpenSci’s Unconf Ice Breaker
rOpenSci - open tools for open science
While many people groan at the thought of participating in a group ice breaker activity, we’ve gotten consistent feedback from people who have been to recent rOpenSci unconferences.
Azure ML Studio now supports R 3.4
Azure ML Studio, the collaborative drag-and-drop data science workbench, now supports R 3.4 in the Execute R Script module. You can now combine the built-in data manipulation and analysis modules of ML Studio with R scripts to accomplish other data tasks, for example, as in this workflow for oil and gas tank forecasting.
If you did not already know
Nested Association Mapping (NAM)
Nested association mapping (NAM) is a technique designed by the labs of Edward Buckler, James Holland, and Michael McMullen for identifying and dissecting the genetic architecture of complex traits in corn (Zea mays). It is important to note that nested association mapping (unlike association mapping) is a specific technique that cannot be performed outside of a specifically designed population such as the Maize NAM population. …
Talk: How Do We Support Under-represented Groups To Put Themselves Forward?
I gave a talk at the Royal Society’s 2018 annual diversity conference. This is the text of my contribution.
R Packages worth a look
Portfolio Management with R (PMwR): Functions and examples for ‘Portfolio Management with R’: backtesting investment and trading strategies, computing profit/loss and returns, analysing t …
Raghuveer Parthasarathy’s big idea for fixing science
The scientific enterprise has never been larger, or more precarious. Can we reshape publicly funded science, matching trainees to viable careers, fostering reproducibility, and encouraging risk?