qwen3.6 performance jump is real, just make sure you have it properly configured

Tools 754 points 308 comments 1 month ago

I've been running workloads that I typically only trust Opus and Codex with, and I can confirm 3.6 is really capable. Of course, it's not at the level of those models, but it's definitely crossing the barrier of usefulness, plus the speed is amazing running this on an M5 Max 128GB 8bit 3K PP, 100 TG on oMLX + Pi.dev Just ensure you have \`preserve\_thinking\` turned on. Check out details here.

External link:

https://i.redd.it/wq76z71k9wvg1.jpeg

View Discussion on Reddit

qwen3.6 performance jump is real, just make sure you have it properly configured

More from r/LocalLLaMA

This is where we are right now, LocalLLaMA

Anthropic: "We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax." 🚨

I feel personally attacked