Google Research announces Sequential Attention: Making AI models leaner and faster without sacrificing accuracy

Tools 600 points 45 comments 3 months ago

More from r/LocalLLaMA