The time (horizon) here is not that of the model completing the task, but a huma...

		scellus 62 days ago \| parent \| context \| favorite \| on: Measuring AI Ability to Complete Long Tasks The time (horizon) here is not that of the model completing the task, but a human completing the task.