
Attributing "thinking" is also a mistake. Anthropic have shown that the "thoughts" it produces to explain what it's "thinking" are (like all the rest of its output) just plausible text, unrelated to the actual node activation happening inside the model: https://transformer-circuits.pub/2025/attribution-graphs/bio...

These tools don't have the capacity for introspection, and they are not doing anything that really resembles the thinking done by a human or an animal.


