These OCR improvements will almost certainly be brought to Google Books, which i...

levocardia · 2025-12-05T23:05:20 1764975920

This is a really interesting "data flywheel" -- better model >> more usable data >> even better model

tills13 · 2025-12-06T00:17:06 1764980226

Surely there's an upper limit to this, though, with models literally eating themselves.

Choco31415 · 2025-12-06T07:04:19 1765004659

We can wait for that to start appearing in tests or benchmarks first.

jeffbee · 2025-12-06T00:34:15 1764981255

When a human student learns to read more carefully, we don't consider that a negative.

kridsdale3 · 2025-12-05T21:16:29 1764969389

More Data for the Data Gods!