Using Apache Impala (incubating) on top of Apache Kudu (incubating) has significant performance benefits
Wesley Crushes Ratings
Dear Wesley Crusher,
dotify: Recommending Spotify Music Through Country Arithmetic
Ever since the release of word2vec I’ve been fascinated with embedding things - words, places, people - into vector space. Though not a mathematical historian, I don’t believe this concept is at all new: matrix factorization methods like Singular Value Decomposition have given us this ability for years. This said, one of the most exciting revelations of word2vec is the remarkably intuitive results of taking arithmetic combinations of these vectors - adding, subtracting, multiplying, etc. - with
First 3rd party notebook for Databricks Community Edition
Yuki Katoh (yukiegosapporo@gmail.com)
I’m happy to announce that my analysis of the Golden State Warriors’ passing network is featured as the first 3rd party notebook for Databricks Community Edition in the latest Databricks blog post.
Exploring convolutional neural networks with DL4J
Andrew Brooks (andrewbrooksct@gmail.com)
TL;DR version: This post walks through an image classification problem hosted on Kaggle for Yelp. I use Scala, DeepLearning4J and convolutional neural networks. For a self-guided tour, check out the project on GitHub.
Create a Chrome extension to modify a website’s HTML or CSS
A technique we use to visualise how Lateral recommendations would look and work on a website is to create a Chrome extension that inserts the recommendations at load time. This is useful because:
“Redshift View Materializer” Now on GitHub
Although Amazon Redshift is a fantastic data warehousing product, it lacks several of the features available in other database systems. One such feature is materialized views, which are just what they sound like… views that have been pre-calculated and physically stored on disk for faster querying, and which are updated on request.
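As a rough illustration of the idea described above, a view can be "materialized" by copying its current result into a physical table and rebuilding that table on request. The sketch below uses Python's built-in sqlite3 module as a stand-in for Redshift, and the table, view, and function names are hypothetical; it is not the code of the Redshift View Materializer itself.

```python
import sqlite3


def refresh_materialized(conn, view_name, table_name):
    """Rebuild table_name from the current contents of view_name.

    Assumption: both names are trusted constants, since SQL identifiers
    cannot be passed as bound parameters.
    """
    with conn:  # commit both statements together, or roll back on error
        conn.execute(f"DROP TABLE IF EXISTS {table_name}")
        conn.execute(f"CREATE TABLE {table_name} AS SELECT * FROM {view_name}")


def read_snapshot(conn, table_name):
    """Return the stored rows of the materialized table, ordered by region."""
    query = f"SELECT region, total FROM {table_name} ORDER BY region"
    return conn.execute(query).fetchall()


# Hypothetical sample data: a small sales table and an aggregating view.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("east", 10), ("west", 5), ("east", 7)],
)
conn.execute(
    "CREATE VIEW sales_by_region AS "
    "SELECT region, SUM(amount) AS total FROM sales GROUP BY region"
)

# First materialization: the view's result is now physically stored.
refresh_materialized(conn, "sales_by_region", "sales_by_region_mat")
first = read_snapshot(conn, "sales_by_region_mat")

# New data does not reach the stored copy until a refresh is requested.
conn.execute("INSERT INTO sales VALUES ('west', 20)")
stale = read_snapshot(conn, "sales_by_region_mat")

refresh_materialized(conn, "sales_by_region", "sales_by_region_mat")
fresh = read_snapshot(conn, "sales_by_region_mat")
```

Queries against the stored table skip the aggregation entirely, at the cost of reading data that is only as fresh as the last refresh.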
Becoming a Data Scientist Podcast Episode 09: Justin Kiggins
Podcast Audio Links: Link to podcast Episode 9 audio; Podcast’s RSS feed for podcast subscription apps; Podcast on Stitcher; Podcast on iTunes
Where Will Your Country Stand in World War III?
In the recent Panama Papers scandal, journalists analyzed 11.5 million documents using network graphs to trace the use of offshore tax structures. In this chapter, we use a network graph technique called Social Network Analysis (SNA) to map weapons transfers between countries. By analyzing bilateral weapons trade, a network of multilateral ties can be distilled, providing insights into the complex arena of international politics.
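To make the idea of distilling a network from bilateral trade a little more concrete, here is a minimal sketch that builds a directed graph from (exporter, importer) pairs and computes normalized degree centrality, one of the simplest SNA measures. The country labels and transfers are hypothetical placeholders, not data from the chapter, and the chapter's own method may differ.

```python
from collections import defaultdict


def degree_centrality(transfers):
    """Compute normalized in- and out-degree for each country.

    transfers: iterable of (exporter, importer) pairs. Repeated pairs
    count once, because only distinct partners are tracked.
    """
    recipients_of = defaultdict(set)  # exporter -> countries it supplies
    suppliers_of = defaultdict(set)   # importer -> countries supplying it
    countries = set()
    for exporter, importer in transfers:
        countries.update((exporter, importer))
        recipients_of[exporter].add(importer)
        suppliers_of[importer].add(exporter)

    # Normalize by the number of possible partners (n - 1).
    scale = max(len(countries) - 1, 1)
    return {
        country: {
            "out": len(recipients_of[country]) / scale,
            "in": len(suppliers_of[country]) / scale,
        }
        for country in sorted(countries)
    }


# Hypothetical transfers between three placeholder countries.
sample_transfers = [("A", "B"), ("A", "C"), ("B", "C"), ("A", "B")]
centrality = degree_centrality(sample_transfers)
```

A high "out" score marks a country that supplies many partners, while a high "in" score marks one that buys from many suppliers.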