Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Max Shron, OkCupid Data Scientist here.

In an older version of the post, we did have the actual numbers, but they didn't seem to add much. Black women use "soul food" 20 times (!) more frequently than the site-wide average; for black men, "soul food" is 11 times as frequent.

AFAIK, nothing we put up for this article is less than twice as frequent for that group as it is for the general (OkC) population.



RE: Soul Food, one of the comments on the OKC site mentioned that Soul Food applies not only to a style of cuisine, but also to a movie and subsequent spin-off series on HBO.

The reason it pops up so frequently is because the movie is hailed within the black community as one of the few Hollywood productions that presents black people as multi-dimensional humans that deal with a number of problems related to race, class, and life in general. It's a classic.


Just for completeness, you guys should compute g-test statistics for this so that the statisticians see something they're used to.

http://en.wikipedia.org/wiki/G-test


I'll check it out. Thanks!


I think it should actually give you better results. It's monotonic in kl divergence, and does a much better job of taking into account how common the feature is rather than just how different it is. You no longer need to do things like throwing out phrases that appear less than x times if you use it.


Fair enough. OKC should really release an api for people to pull data for their own analyses. Make your data viral!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: