Ooh, new drama just dropped 👀
For those out of the loop: cursor's new model, composer 2, is apparently built on top of Kimi K2.5 without any...
For those out of the loop: cursor's new model, composer 2, is apparently built on top of Kimi K2.5 without any...
> English is not my first language. I wrote this in Chinese and translated it with AI help. The writing may have...
The M5 Max 128GB 14" has just arrived. I've been looking forward to putting this through its paces. Testing begins now....
At least T3 Code is open-source/MIT licensed.
Stickyspoodge admits to using ai in his work, and the hands and other tells in the full video show that it's clearly ai...
Main takeaway: 122B, 35B, and especially 27B retain a lot of the flagship’s performance, while 2B/0.8B fall off much...
Hey r/LocalLLaMA this week we worked on further improving the best size/KLD tradeoff for Qwen3.5, and we’re excited to...
Apologies for the harsh post title but wanted to be evocative & sensationalist as I think everyone needs to see...
Why has no one created a QR Monster ControlNet for any of the newer models? I feel like this was the best ControlNet....
Surreal oil painting of a windy high tea
Qwen3.5-35B-A3B with Opencode Just tested this badboy with Opencode cause frankly I couldn't believe those benchmarks....
(added second image for the context)
I've rated hundreds of V8 images and I keep seeing the same man and woman. What's going on? Is V8 converging on...
Model introduction: New Kitten models are out. Kitten ML has released open source code and weights for three new tiny...
Anthropic are the guys that make the Claude Models. I highly doubt this will be an Openweights LLM release. More likely...
I’m writing this from Burma. Out here, we can’t all afford the latest NVIDIA 4090s or high-end MacBooks. If you have a...
From Forbes on YouTube: Yann LeCun Gives Unfiltered Take On The Future Of AI In Davos: Video by vitrupo on 𝕏:
Link: Comfy
I know that there are AI video generators out there that can do this 10x better and image generators too, but I was...
Surgical masking lets you preserve the original scene’s performance and image quality, keeping everything intact while...
A series exploring the absolute power of a good book. (Oil painting style)
Not Sure Why Mj Prioritized Realism Over Artistic Value...They Really Using 1 Face Now
Until now, LMStudio has basically been the "go-to" solution for more advanced LLM users in the GGUF ecosystem, but...
Hey r/LocalLlama, we're super excited to launch Unsloth Studio (Beta), a new open-source web UI to train and run LLMs...
Hey, I thought I'd do an update on my Homelab I posted a while back. I have it running on LLM experiments, which I...
A cat's journey
...it made it worse.
more on TikTok - @frawgy006 🐸🤘
It all started with using "the AI" to help me study for a big exam. Can it make some flashcards or questions? Then...
The creator of heretic p-e-w opened a pull request #211 with a new method called Arbitrary-Rank Ablation (ARA) the...
Let me pre-apologize for this long and rambling post but I get excited by stuff like this. I think a lot of folks here...
Our web team ships fast. Apparently a little too fast. You found the page before we did. So let's do this properly:...
Quick context: I run a personal automation system built on Claude Code. It's model-agnostic, so switching to Ollama was...
I am genuinely surprised at how good the model is and that it can run on 14 years old device: 2nd gen i5 + 4GB DDR3 RAM.
The work I do involves customers that are sensitive to nation state politics. We cannot and do not use cloud API...
It just happens to be entirely against their will and TOS. I say: Distill Baby Distill!
It's quite ironic that they went for the censorship and authoritarian angles here. Full blog:
Why would they care about distillation when they probably have done the same with OpenAI models and the Chinese labs...
I’ve been working on a little side project comparing tokenizer efficiency across different companies’ models for...
Reading the comments, I’m guessing you didn’t bother to read this: "Safety and alignment at Meta Superintelligence."
I gave a try to zeroclaw agent (intstead of the bloated and overhyped one). After few hours of fuckery with configs...
Did you know that Qwen3 TTS utilizes voice embedding for voice cloning? Your voice is turned into a vector of 1024...
Three days ago, the following repository was published, which its “creator” has been aggressively promoting on various...
the first time i see a model exceed 3 trillion tokens per week on openrouter! the first time i see more than one model...
Hello everyone, A fast inference hardware startup, Taalas, has released a free chatbot interface and API endpoint...
I'm absolutely sure of it. The same usual suspects, the same language, the same who stole from whom the next million...
Inspired by this post from u/VoidAlchemy a few months back: Intrusive thoughts had me try to reproduce and extend the...
Hey r/LocalLLaMA, So I live in Ukraine during the war. Power goes out a lot here – russia regularly attacks our power...
Hello all, Just wanted to note that RDIMM prices are so wild.. Stacking rdimms starts to be as expensive as stacking...
Hey everyone, we just open-sourced KaniTTS2 - a text-to-speech model designed for real-time conversational use cases....
Llamas and Gentlemen, Heretic ( is the leading software for removing censorship from language models. In the three...
Hey everyone, made an uncensored version of GPT-OSS 120B. Quick specs: 117B total params, \~5.1B active (MoE with 128...
Hi all, I’m Anton from Nebius. We’ve updated the SWE-rebench leaderboard with our January runs on 48 fresh GitHub PR...
You can monitor quants begin to appear with this search:
Hii everyone, I present Dhi-5B: A 5 billion parameter Multimodal Language Model trained compute optimally with just...
So mainly as a test and for fun, I used Flux.2 Klein 9B to restore some historical figures. Results are pretty good....
OpenHands reveals the model size in their announcement. Still waiting for the model to appear on HF.
Title somewhat says it all. I get that it's related but if links to new models are being discussed shouldn't it be a...
Only official webpages released now. But the bench looks very promising: SWE-Bench Verified 80.2% Multi-SWE-Bench 51.3%...
Trained with AI-Toolkit Using Runpod for 7000 steps Rank 32 (All standard flux klein 9B base settings) Tagged with...
We are launching GLM-5, targeting complex systems engineering and long-horizon agentic tasks. Scaling is still one of...
Hey r/LocalLlama! We’re excited to introduce \~12x faster Mixture of Experts (MoE) training with >35% less VRAM and...
Kimi > ChatGPT = Claude
Qwen team just released Qwen-Image-2.0. Before anyone asks - no open weights yet, it's API-only on Alibaba Cloud...
I know it has already been done but this is my AI trained on Epstein Emails. Surprisingly hard to do, as most LLMs will...
Like many of you, I like to use LLM as tools to help improve my daily life, from editing my emails, to online search....
I hacked together a small tool that lets you upload a .gguf file and visualize its internals in a 3D-ish way (layers /...
I've tried lots of "small" models < 60 GB in the past. GLM 4.5 Air, GLM 4.7 Flash, GPT OSS 20B and 120B, Magistral,...
Looking at the code at src/transformers/models/qwen35/modelingqwen3_5.py, it looks like Qwen3.5 series will have VLMs...
Ok so I've been working & experimenting with my own simple architecture. I call it Strawberry Here's the repo for...
We moved to self-hosted models specifically to avoid sending customer data to external APIs. Everything was working...
Been playing around with llama.cpp and some 30-80B parameter models with CPU offloading. Currently have one 3090 and 32...
Here we go! As expected by most of us here. Jason Meller from 1password argues that OpenClaw’s agent “skills” ecosystem...
Hey everyone, Last week I shared preliminary results on a new subquadratic attention mechanism ( Following up with the...
While it’s great that so many people on LocalLLaMA are pushing the envelope with what can be done locally with...
I installed qwen3-235b on my desktop system, and I had to join here to brag about it. It's such a careful model, the...
If you own a copy of Balatro, you can make your local LLM play it. I built tools to let LLMs play Balatro autonomously....
About 2 weeks ago, I posted about running GLM-4.7-Flash on 16 GB of VRAM here...
Voxtral Mini 4B Realtime 2602 is a multilingual, realtime speech-transcription model and among the first open-source...
It’s already supported in Comfy. MIT license. HuggingFace Demo is also available! Pretty much the whole package - LoRAs...
ACE-Step 1.5 is an open-source music model that can generate a full song in about 2 seconds on an A100, runs locally on...
Qwen3-Coder-Next is out!
Twitter Link:
For those who used Cline with local models, heads up that the core team appears to have joined OpenAI's Codex group...
The newly released LingBot-World framework offers the first high capability world model that is fully open source,...
It this the js framework hell moment of ai?
A year ago, I never imagined I’d be able to generate a video like this on my own computer. (5070ti gpu) It’s still...
I haven't seen a system with this format before but with how successful the result was I figured I might as well share...
Examples of prompt: 1) Hyperreal macro photo of a robotic cockroach, oil-stained titanium plates with grime in the...
I tried many MoE models at 30B or under and all of them failed sooner or later in an agentic framework. If z.ai is not...
I specifically requested no meaningless or decorative details, no clutter. Sharp clarity and unified coherent lines...
After months of planning, wiring, airflow tuning, and too many late nights this is my home lab GPU cluster finally up...
Source — DarkWall
I recently got into the world of automations for both my business and personally and have been super stoked about the...
Not saying that its better than anything else, this just hits whatever switch it needs to hit. (and yes - before you...
Source — DarkWall
Disclaimer: I am from Germany and my English is not perfect, so I used an LLM to help me structure and write this post....
So, after the recent anime clip posted here a few days ago that got a lot of praise for the visuals, I noticed the...
Just wanted to share this workflow I put together that generates the same character from 8 different camera angles in a...
Good afternoon! I'm a little late with this workflow)). Now that Flux.2 Klein has been released and z-image-edit is...
This is a sequel to my previous thread from 2024. I originally planned to pick up another pair of MI100s and an...
I’ve been trying to find an AI that’s genuinely unfiltered and technically advanced, uncensored something that can...
My setup: RTX 3060 12GB VRAM + 48GB system RAM. I spent the last couple of days messing around with LTX-2 inside...
Hey peeps, I'm feeling in a bit of a omg the world is ending mood and have been amusing myself by downloading and...
DeeepSeek AI released a new paper titled "Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large...
[FLUX.2 \[klein\] 4B & 9B - Fast local image editing and generation]( I'm using flux-2-klein-9b-Q3\K\M.gguf...
Great model! The voices are getting better but consistency between scenes is a little random
Klein is excellent, particularly for its editing capabilities, however.... I think Z-Image is still king for...
If you enjoy this video, consider watching the other episodes in this universe for this video to make sense. Tools...
Made using a custom node which can be found on my github here: Used workflow from here: This video is uploaded to my...
Hi all, I’m Anton from Nebius. We’ve updated the SWE-bench leaderboard with our December runs on 48 fresh GitHub PR...
Thank you guys, thanks to everyone who took the time to write a comment or a post explaining, teaching people how...
I work in automation for business, and lately I’ve been seeing the same problem over and over. Teams are racing to...
Originally this was my gaming rig but I went ITX and basically bought a new computer. So I had the case, fans, AIO, 64...
We were overwhelmed by the community response to LTX-2 last week. From the moment we released, this community jumped in...
I tried playing with the native audio in the model, the quality still kinda meh but it there and it works. image in...
Just want to sing the praises of this model. I am stunned at how intelligent it is for a 30b model. Comparing it to...
Hey r/LocalLlama! We're excited to show how Unsloth now enables 7x longer context lengths (up to 12x) for Reinforcement...
I was able play with Flux Klein before release and it's a blast. 4B uses Qwen3B and takes 1.3 seconds with 4 steps on...
Like a lot of people, I’ve been struggling to keep up with the latest developments in AI. I found that there were some...
Nvidia has essentially killed off supply for the RTX 5070 Ti. Also supply of RTX 5060 Ti 16 GB has been significantly...
I wanted to create a "Holodeck" style experience where I could generate environments while inside VR, but I didn't want...
I ran passages from Project Gutenberg through GPT-4o-mini 10 times over, each time telling it to "make it read far...
New version of Workflow (v2): This is a follow-up to my previous post - please read it for more information and...
tiktok: lvmiere\_ ig: lvmiere.vision soundtrack: Wilderness from Diablo II
Been away from local generation for a while, definitely impressed by the speed and overall quality!
Hey everyone, The team at Neuphonic is back with a new open-source release: NeuTTS Nano. After NeuTTS Air trended #1 on...
Hello everyone! Today, I am announcing Soprano 1.1! I’ve designed it for massively improved stability and audio quality...
I’ve seen some arguments we’ve reached AGI, it’s just about putting the separate pieces together in the right context....
civitai classed it as PG, if you feel otherwise, delete
We did not ascend. We were left to rot. Now, we are the only thing standing between the void and the world. The...
Text to video, image to video, audio to video, image + audio to video, video extend, audio + video extend. All settings...
Seeing some confusion about what makes GLM-Image different so let me break it down. How diffusion models work (Flux,...