r/LocalLLaMA
Posted by u/Mysterious_Finish543
PR opened for Qwen3.5!!
613 points
73 comments
1 month ago
https://github.com/huggingface/transformers/pull/43830/

Looking at the code in `src/transformers/models/qwen3_5/modeling_qwen3_5.py`, it seems the Qwen3.5 series will include vision-language models (VLMs) right off the bat!
External link:
https://i.redd.it/r10pwm02y7ig1.png