
More previous tokens in this case would mean more previous frames. But there's really no reason to just stick to rendered pixels as input (except for novelty's sake) because we could train directly on snapshots of full game state.
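To make the comparison concrete, here is a rough sketch of how a context of previous steps could be built from state snapshots instead of frames. Everything in it is an assumption for illustration: the state fields (`x`, `y`, `vx`, `vy`, `health`), the 64x64 RGB frame size, and the helper names `context_windows` and `encode_state` are hypothetical and not taken from any particular model or game.

```python
from collections import deque


def context_windows(steps, k):
    """Yield (context, target) pairs: the k previous steps and the step to predict."""
    window = deque(maxlen=k)
    for step in steps:
        if len(window) == k:
            yield list(window), step
        window.append(step)


def encode_state(state):
    # Hypothetical snapshot layout: position, velocity and health as a flat vector.
    return [state["x"], state["y"], state["vx"], state["vy"], state["health"]]


# Hypothetical rendered frame: a 64x64 RGB image flattened to one vector.
FRAME_VALUES = 64 * 64 * 3
STATE_VALUES = 5

# Five made-up timesteps of a player moving right at constant speed.
states = [{"x": t, "y": 0, "vx": 1, "vy": 0, "health": 100} for t in range(5)]
pairs = list(context_windows([encode_state(s) for s in states], k=3))

# Values the model has to look at for a 3-step context, per input type.
values_per_context_state = 3 * STATE_VALUES
values_per_context_frames = 3 * FRAME_VALUES
```

Under these assumptions, the same windowing code works for either input; only the per-step encoding changes, and the snapshot encoding is far smaller per step than a flattened frame.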


Yeah, but then it's not generalizable.


Doesn't that depend on how such game state is modeled?



