CLAUDE OPUS 4.6 IS NERFED!!

Industry 2.4K points 342 comments 1 month ago

("Nerfed" here means Anthropic has reduced the model's capability since launch.) Last week Claude Opus 4.6 ranked #2 on the hallucination benchmark with an accuracy of 83.3%. Today it was retested and fell to #10 on the leaderboard with an accuracy of only 68.3%. That was reported as a 98% increase in hallucination, although if the hallucination rate is taken as 100% minus accuracy, it went from 16.7% to 31.7%, which is closer to a 90% relative increase. bridgebench.ai just confirmed that Claude Opus 4.6 has reduced reasoning levels and is nerfed.
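For anyone who wants to check the percentage, here is a quick sketch of the arithmetic in Python. It assumes the hallucination rate is simply 100% minus the reported accuracy, which the post does not state, so the benchmark may define it differently. The function and variable names are made up for illustration.

```python
def relative_increase(before: float, after: float) -> float:
    """Return the relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100.0


# Accuracy figures quoted in the post.
accuracy_before = 83.3
accuracy_after = 68.3

# Assumption: hallucination rate = 100% - accuracy (not confirmed by the source).
halluc_before = 100.0 - accuracy_before
halluc_after = 100.0 - accuracy_after

increase = relative_increase(halluc_before, halluc_after)

print(f"Hallucination rate: {halluc_before:.1f}% -> {halluc_after:.1f}%")
print(f"Relative increase: {increase:.1f}%")
```

Under that assumption the accuracy drop is 15 percentage points, and the relative rise in hallucination depends entirely on the starting rate, so a different definition of "hallucination" would give a different percentage.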
