r/LocalLLaMA
Posted by u/ayylmaonade
GLM 4.7 Flash official support merged in llama.cpp
Tools · 352 points · 61 comments · Yesterday
External link: https://github.com/ggml-org/llama.cpp/pull/18936