Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Is there actual case law that prohibits the use of copyrighted material for corpora and other training data?

Sure distribution can have issues, but do you have any references for simple possession as training and test data?



If you're gathering data for your own business, it would be difficult for anyone to know, sure.

But if the data/analysis is published, then the data source would need to be disclosed.

See more on the OKCupid case I mentioned above: http://www.vox.com/platform/amp/2016/5/12/11666116/70000-okc...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: