Hacker News

I don't think there's a direct link to the tokenizer - it's a higher-level capability. You can stitch together a nonsense word out of common "word fragment" tokens and see whether that impairs the LLM's ability to recognize the word as nonsense.
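A minimal stdlib sketch of that experiment - the fragment list here is illustrative, not actual tokenizer output, and a real test would pull fragments from the model's own vocabulary:

```python
import random

# Hypothetical list standing in for common BPE "word fragment" tokens;
# real tokenizer vocabularies differ.
fragments = ["ing", "tion", "pre", "ter", "con", "ble", "ment", "ous"]

random.seed(0)  # fixed seed so the example is reproducible
# Glue three fragments into a plausible-looking nonsense word,
# then ask the model whether it's a real English word.
nonsense = "".join(random.sample(fragments, 3))
print(nonsense)
```

Because every piece is a common subword, the tokenizer sees only familiar tokens, so any failure to flag the word as nonsense can't be blamed on unfamiliar token boundaries.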


That is wrong. I just generated 5 random letters in Python and sent them to GPT-5, and it totally failed to answer properly: it said "Got it, what's up :)" even though what I wrote isn't recognizable at all.

The "capability" you see is for the LLM to recognize its a human typed random string since human typed random strings are not very random. If you send it an actual random word then it typically fails.


I tried this four times, every time it recognized it as nonsense.


Same



