
But it's not really necessary for Reddit. Their API is fairly robust, and there are numerous options for scraping the site.


Reddit will IPO soon, which is why this is frontpaged now, I presume. Folks expect the API to become much more restrictive soon. Other mirror projects don't have the ideology or reputation of ArchiveTeam.


AT is not limited to Reddit scraping; take a look at their wiki.


I'm aware of AT and the work they do. It's a great initiative.

I'm just saying that other projects like https://pushshift.io/ have been capturing all Reddit posts and comments for years now.



