I built a pagerank implementation in Julia many years ago. It was operating on an adjacency list with half a billion edges, and I had to resort to writing a custom reader to get reasonable read speeds over the csv. In case anyone is interested: https://github.com/purzelrakete/Pagerank.jl/blob/070a193826f...
on a somewhat unrelated note, you can write a noisy channel model spell checker with a lot more data and a bit more code: http://norvig.com/spell-correct.html
SoundCloud is hiring! Back-end Developers, Front-End Developers, API Developers, VP Eng, Developer Evangelist, Partner Integration Manager, Systems Administrator, and Music Information Retrieval Developer.
Founded in late 2007, SoundCloud is an international start-up headquartered in Berlin with smaller satellite offices in London and San Francisco. With the 50+ people onboard, we’ve got 11+ nationalities covered and a range of interests so diverse that you’ll fit in all over the place!