r/LocalLLaMA
Posted by u/Fear_ltself
Google Research announces Sequential Attention: Making AI models leaner and faster without sacrificing accuracy
Tools 600 points
45 comments
3 months ago
External link:
https://research.google/blog/sequential-attention-making-ai-models-leaner-and-faster-without-sacrificing-accuracy/More from r/LocalLLaMA
r/LocalLLaMA · u/jacek2023
Recent
Hot
This is where we are right now, LocalLLaMA
the future is now
Tools
3.2K 439 0 months ago
r/LocalLLaMA · u/KvAk_AKPlaysYT
Hot
Anthropic: "We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax." 🚨
Tools
3.1K 674 2 months ago
r/LocalLLaMA · u/HeadAcanthisitt...
Hot
I feel personally attacked
Tools
3.0K 151 2 months ago