All LLMs Research Tools Industry Tutorials

Something happened to Opus 4.6's reasoning effort

LLMs 2.1K points 349 comments 1 month ago

It now fails the car wash test consistently (5/5 tries) and doesn't display a thinking block. Sonnet 4.6 and Opus 4.5 still manage to get it right. This matches with my experience of it now making occasional stupid mistakes in boring data analysis tasks.

External link:

https://i.redd.it/vhaz98mvjztg1.jpeg

View Discussion on Reddit

Something happened to Opus 4.6's reasoning effort

More from r/ClaudeAI

You're right to push back.

Taught Claude to talk like a caveman to use 75% less tokens.

Opus tryna be TOO human