Something happened to Opus 4.6's reasoning effort

LLMs 2.1K points 349 comments 1 month ago

It now fails the car wash test consistently (5/5 tries) and doesn't display a thinking block. Sonnet 4.6 and Opus 4.5 still manage to get it right. This matches with my experience of it now making occasional stupid mistakes in boring data analysis tasks.

More from r/ClaudeAI