Google Research announces Sequential Attention: Making AI models leaner and faster without sacrificing accuracy

Tools 600 points 45 comments 1 month ago

More from r/LocalLLaMA