SunJackson Blog

A Gentle Introduction to Recommender Systems with Implicit Feedback

转载自：http://jmsteinw.github.io/Rec-System/

未知

发表于 2016-05-30

阅读全文 »

Data Trusts

转载自：http://inverseprobability.com/2016/05/29/data-trusts

未知

发表于 2016-05-29

Data is at its most powerful when it is interconnected. A major challenge for modern data is interconnection of different data types to obtain a fuller picture of the data subject. Questions about an individual’s mental health, for example, might benefit from interlinking social media with the medical record. Obviously, such data would be extremely sensitive.

阅读全文 »

A tour of Factor： 2

转载自：http://rnduja.github.io/2016/05/27/a-tour-of-factor-2/

未知

发表于 2016-05-27

Parsing words

阅读全文 »

Finding Similar Sounding Names – Some Basics

转载自：http://daynebatten.com/2016/05/finding-similar-sounding-names-basics/

daynebatten

发表于 2016-05-26

Since my wife and I have a baby on the way, we’ve spent a lot of time thinking about names lately. We’ve poured through dozens of lists of thousands of names, we’ve used sites and other tools, we’ve researched histories - everything. And we’ve found that most of the tools weren’t terribly helpful.

阅读全文 »

Blending independent estimates

转载自：http://datagenetics.com/blog/may42016/index.html

未知

发表于 2016-05-25

阅读全文 »

Using Xcode with Github

转载自：https://aimatters.wordpress.com/2016/05/25/using-xcode-with-github/

Stephen Oman

发表于 2016-05-25

You’ve found a nice open-source project you want to play with on GitHub. You’ve cloned it to your own repository and use Xcode 7 as your development environment. How do you make Xcode and GitHub play nicely with each other?

阅读全文 »

How to make a good data-driven web app

转载自：https://datascopeanalytics.com/blog/knight-foundation-meetup/

Jess Freaner

发表于 2016-05-25

Developing a successful app or project is no easy task; there are always more moving parts than you’d expect. Even beyond the technical pieces, there are the ever-important elements of getting the word out, making your app easy to use, and making sure it’s solving the right problem in the first place.

阅读全文 »

Maximum Likelihood estimates follow a normal distribution

转载自：https://mlopezm.wordpress.com/2016/05/24/maximum-likelihood-estimates-follow-a-normal-distribution/

mlopezm

发表于 2016-05-24

I was quite surprised when I learnt that a maximum likelihood estimate follows asymptotically a normal distribution with the mean being the estimated value and the variance being the inverse of the Fisher Information multiplied by the number of observations.

阅读全文 »

Hyperparameter optimization with approximate gradient

转载自：http://fa.bianp.net/blog/2016/hyperparameter-optimization-with-approximate-gradient/

Fabian Pedregosa

发表于 2016-05-24

TL;DR: I describe a method for hyperparameter optimization by gradient descent.

阅读全文 »

Adobe Analytics Clickstream Data Feed： Calculations and Outlier Analysis

转载自：http://randyzwitch.com/adobe-analytics-clickstream-data-feed-calculations/

未知

发表于 2016-05-24

In a previous post, I outlined how to load daily Adobe Analytics Clickstream data feeds into a PostgreSQL database. While this isn’t a long-term scalable solution for large e-commerce companies doing millions of page views per day, for exploratory analysis a relational database structure can work well until a more robust solution is put into place (such as Hadoop/Spark).

阅读全文 »