r/LocalLLaMA
Posted by u/__Maximum__
My gpu poor comrades, GLM 4.7 Flash is your local agent
442 points
150 comments
4 months ago
I tried many MoE models at 30B or under, and all of them failed sooner or later in an agentic framework. If z.ai is not redirecting my requests to another model, then GLM 4.7 Flash is finally the reliable (soon local) agent that I desperately wanted. I have been running it for more than half an hour on opencode, and it has produced hundreds of thousands of tokens in one session (with context compacting, obviously) without any tool-calling errors. It clones GitHub repos, runs all kinds of commands, edits files, and commits changes, all perfectly, with not a single error yet. Can't wait for GGUFs to try this locally.