Heretic has been served a legal notice by Meta, Inc.
To Whomsoever it May Concern, The individual behind the Heretic Free Software Project (henceforth called "Heretic",...
To Whomsoever it May Concern, The individual behind the Heretic Free Software Project (henceforth called "Heretic",...
2.3 TB of ram in here. 400+ vCores. All thats left is plugging it to the blackwell with the driver to do RDMA, and it’s...
How? It kept getting chained bash commands wrong, with wrong escapes. So it created many bad directories, and tried...
Let’s build the biggest ever DGX Spark Cluster at home. This is going into my home lab server rack, 2TB of unified...
the future is now
Is Qwen just incredibly good at doing dense and not so good at doing MoE? I get that dense is generally better than MoE...
Time to switch to Kimi k2.6 guys if you haven't already. For $20 a month you can buy the OpenCode Go coding plan (its...
After testing it and getting some customer feedback too, its the first model I'd confidently recommend to our customers...
I gave it a task to build a tower defense game. use screenshots from the installed mcp to confirm your build. My God...
Meet Qwen3.6-35B-A3B:Now Open-Source!🚀🚀 A sparse MoE model, 35B total params, 3B active. Apache 2.0 license. \-...
If you haven't seen it yet, a model called see-through dropped last week. It takes a single static anime image and...
What’s new in Gemma 4 Gemma is a family of open models built by Google DeepMind. Gemma 4 models are multimodal,...
From Chaofan Shou on 𝕏 (files):
TurboQuant (Zandieh et al. 2025) has been all the rage in the past two days, and I've seen lots of comments here...
Hi everyone, we just ran an experiment. We patched llama.cpp with Google’s new TurboQuant compression method and then...
VentureBeat: Mistral AI just released a text-to-speech model it says beats ElevenLabs — and it's giving away the...
It seems Intel will release a GPU with 32 GB of VRAM on March 31, which they would sell directly for $949. Bandwidth...
\\NO VIRUS\\ LM studio has stated it was a false positive and Microsoft dealt with it I'm no expert, just a tinkerer...
Seen while walking through Singapore’s Changi airport earlier this week. Alibaba Cloud spending up big on advertising.
For those out of the loop: cursor's new model, composer 2, is apparently built on top of Kimi K2.5 without any...
> English is not my first language. I wrote this in Chinese and translated it with AI help. The writing may have...
The M5 Max 128GB 14" has just arrived. I've been looking forward to putting this through its paces. Testing begins now....
At least T3 Code is open-source/MIT licensed.
Stickyspoodge admits to using ai in his work, and the hands and other tells in the full video show that it's clearly ai...
Main takeaway: 122B, 35B, and especially 27B retain a lot of the flagship’s performance, while 2B/0.8B fall off much...
Hey r/LocalLLaMA this week we worked on further improving the best size/KLD tradeoff for Qwen3.5, and we’re excited to...
Apologies for the harsh post title but wanted to be evocative & sensationalist as I think everyone needs to see...
Why has no one created a QR Monster ControlNet for any of the newer models? I feel like this was the best ControlNet....
Surreal oil painting of a windy high tea
Qwen3.5-35B-A3B with Opencode Just tested this badboy with Opencode cause frankly I couldn't believe those benchmarks....
(added second image for the context)
I've rated hundreds of V8 images and I keep seeing the same man and woman. What's going on? Is V8 converging on...
Model introduction: New Kitten models are out. Kitten ML has released open source code and weights for three new tiny...
Anthropic are the guys that make the Claude Models. I highly doubt this will be an Openweights LLM release. More likely...
I’m writing this from Burma. Out here, we can’t all afford the latest NVIDIA 4090s or high-end MacBooks. If you have a...
From Forbes on YouTube: Yann LeCun Gives Unfiltered Take On The Future Of AI In Davos: Video by vitrupo on 𝕏:
Link: Comfy
I know that there are AI video generators out there that can do this 10x better and image generators too, but I was...
Surgical masking lets you preserve the original scene’s performance and image quality, keeping everything intact while...
I think gave it a fair shot over the past few weeks, forcing myself to use local models for non-work tech asks. I use...
It is crazy that Qwen3.6 27B now matches Sonnet 4.6 on AA's Agentic Index, overtaking Gemini 3.1 Pro Preview, GPT 5.2...
I have a 5090 Laptop from work, 24GB VRAM. I have been testing every model that comes out, and I can confidently say...
Heya guys and gals, Around a year ago I released and posted about Persona Engine as a fun side project, trying to get...
finally with files inside :)
Meet Qwen3.6-27B, our latest dense, open-source model, packing flagship-level coding power! Yes, 27B, and Qwen3.6-27B...
A short follow-up to my previous post, where I showed that changing the scaffold around the same 9B Qwen model moved...
It seems to me that OpenClaw and all its clones are almost useless tools for those who know what they're doing. It's...
I’ve been testing Google’s Gemma-4-E2B-it as a local, offline resource for emergency preparedness. The idea was to have...
sycophancy: deleted efficiency per token:+1000% friendship: just beginning edit: “sup” got cut off at top
There's no way this is real and ebay is doing nothing to stop those scams. Why, people are actually bidding and buying...
of course this is just a trust me bro post but I've been testing various local models (a couple gemma4s, qwen3 coder...
Spent an evening dialing in Qwen3.6-35B-A3B on consumer hardware. Fun side note: I had Claude Opus 4.7 (just the $20...
I've been running workloads that I typically only trust Opus and Codex with, and I can confirm 3.6 is really capable....
Hey guys, we ran Qwen3.6-35B-A3B GGUF KLD performance benchmarks to help you choose the best quant. Unsloth quants have...
I spent some time yesterday after work trying out the new qwen3.6-35b-a3b model, and at least for me it's the first...
Latest (non-comfyui) releases you (might of) missed in March 2026: 🧠 LLMs 1. NVIDIA gpt-oss-puzzle-88B \- NVIDIA...
A Lora trained on photos taken with the original Apple iPhone (2007). Works with FLUX.2 Klein Base and FLUX.2 Klein....
Your monthly "Anzhc's Posts" issue have arrived. Today im introducing - Mugen \- continuation of the Flux 2 VAE...
Test of the new lora found on CivitAi LTX 2.3 - Video Reasoning lora VBVR - v1.0 | LTXV23 LoRA | Civitai Both clips...
we are doomed
Standard ComfyUI template. Klein 9b fp16 model. Prompt: "Transform all to greyed out 3d mesh" EDIT: Perhaps better one...
Github | CivitAI Point this workflow at a directory of clips and it will automatically stitch them together, fixing...
Paper: PixelSmile: Toward Fine-Grained Facial Expression Editing Model: A new LoRA for Qwen-Image called PixelSmile...
Yooo Buff here. I've been working on running LTX-2.3 as efficiently as possible directly in Scope on consumer hardware....
Been tinkering with the official LTX 2.3 ComfyUI workflows and stumbled onto some changes that made a pretty dramatic...
If only it works well with work flow. Nvidia have CUDA, AMD have ROCM, I don't even know what Intel have aside from...
I made the video using ltx, can anybody tell me how I can improve it
Can you beleive I almost bought two of them?? (oh, and they gave me 10% cashback for Prime Day)
I'm not affiliated with this team/model, but I have been doing some early testing. I believe it's very promising. Hope...
Hi everyone! I want to get in to vibe coding to make my very own ai wrapper, what are the best models that can run on...
We have a new 15B opensourced fast Audio-Video model called daVinci-MagiHuman claiming to beat LTX 2.3 Check out the...
Hey everyone I recently decided to test out the new Qwen 2512 model. I previously had a Samsung-style LoRA for the...
I don’t have a problem paying for AI software if it’s really good. I’m don’t use open source software because I’m...
Composer 2-Flash has been saved! (For legal reasons that's a joke)
Are we getting Wan2.5/2.6 open-source?!
ID-LoRA (Identity-Driven In-Context LoRA) jointly generates a subject's appearance and voice in a single model, letting...
I am a figurative artist based in New York with work in the collections of the Metropolitan Museum of Art, MoMA,...
Sounds like Fireworks had a partnership with Moonshot, and Cursor went through them. Kinda makes sense that Moonshot...
I’m a lawyer who got Claude code pilled about 90 days ago, then thought about what I wanted to do with AI tools, and...
Omg, this thing is amazing. I have tried all its smaller silbings 122b/35b/27b, gpt-oss 120b, StepFun 3.5, MiniMax...
I decided to try making some comfyui nodes for the first time. Here's the first batch of nodes I made in past couple...
A dark surrealist oil painting in negative space
Most of the time I rely on the default ComfyUI workflows. They're producing results just as good as 90% of the...
Hey everyone! I've been working on this for months and today's the day. MacinAI Local is a complete local AI inference...
Main notes SDXL/Illustrious for design and ideas ControlNet for pose stability Prompt for cel shading and use flat...
I saw someone say recently something to the effect of: “that man is a working dog. if you don’t give him a job, he’ll...
LoRA designed to reduce the typical smooth/plastic AI look and add more natural skin texture and realism to images. It...
I know about the silverware, weird looking candle, necklace, should have iterate a few times but this is a zero-shot...
Workflow: Video with Full Resolution: Four days of intensive optimization, I finally got LTX 2.3 running efficiently on...
Disappointed in the performance myself too :/ The last good Mistral model I can remember was Nemo, which led to a lot...
I posted a question about this idea here two weeks ago, kept working on it, and now I finally have a beta to show. It’s...
A series exploring the absolute power of a good book. (Oil painting style)
My workplace just got a server equipped with 2x Nvidia H200 GPUs (141GB HBM3e each). I've been asked to test LLMs on it...
Not Sure Why Mj Prioritized Realism Over Artistic Value...They Really Using 1 Face Now
This is crazy. As a heavy Claude code user, who has used over 12 billion tokens in the last few months, and never tried...
Until now, LMStudio has basically been the "go-to" solution for more advanced LLM users in the GGUF ecosystem, but...
Hey r/LocalLlama, we're super excited to launch Unsloth Studio (Beta), a new open-source web UI to train and run LLMs...
I know we all love using opencode, I just recently found out about it and my experience is generally positive so far....
I’m building an app with this model locally, and I’ve been genuinely surprised by how naturally it reasons through...
Hey, I thought I'd do an update on my Homelab I posted a while back. I have it running on LLM experiments, which I...
tl;dr the new license doesn't include the rug pull clauses and removes restrictions on modifications, guardrails,...
A cat's journey
as a heavy user of CC / Codex, i honestly find this interface to be better than both of them. and since it's open...
...it made it worse.
more on TikTok - @frawgy006 🐸🤘
It all started with using "the AI" to help me study for a big exam. Can it make some flashcards or questions? Then...
The creator of heretic p-e-w opened a pull request #211 with a new method called Arbitrary-Rank Ablation (ARA) the...
Let me pre-apologize for this long and rambling post but I get excited by stuff like this. I think a lot of folks here...
Our web team ships fast. Apparently a little too fast. You found the page before we did. So let's do this properly:...
Quick context: I run a personal automation system built on Claude Code. It's model-agnostic, so switching to Ollama was...
I am genuinely surprised at how good the model is and that it can run on 14 years old device: 2nd gen i5 + 4GB DDR3 RAM.
The work I do involves customers that are sensitive to nation state politics. We cannot and do not use cloud API...
It just happens to be entirely against their will and TOS. I say: Distill Baby Distill!
It's quite ironic that they went for the censorship and authoritarian angles here. Full blog:
Why would they care about distillation when they probably have done the same with OpenAI models and the Chinese labs...
I’ve been working on a little side project comparing tokenizer efficiency across different companies’ models for...
Reading the comments, I’m guessing you didn’t bother to read this: "Safety and alignment at Meta Superintelligence."
I gave a try to zeroclaw agent (intstead of the bloated and overhyped one). After few hours of fuckery with configs...
Did you know that Qwen3 TTS utilizes voice embedding for voice cloning? Your voice is turned into a vector of 1024...
Three days ago, the following repository was published, which its “creator” has been aggressively promoting on various...
the first time i see a model exceed 3 trillion tokens per week on openrouter! the first time i see more than one model...
Hello everyone, A fast inference hardware startup, Taalas, has released a free chatbot interface and API endpoint...
I'm absolutely sure of it. The same usual suspects, the same language, the same who stole from whom the next million...
Inspired by this post from u/VoidAlchemy a few months back: Intrusive thoughts had me try to reproduce and extend the...
Hey r/LocalLLaMA, So I live in Ukraine during the war. Power goes out a lot here – russia regularly attacks our power...
Hello all, Just wanted to note that RDIMM prices are so wild.. Stacking rdimms starts to be as expensive as stacking...
Hey everyone, we just open-sourced KaniTTS2 - a text-to-speech model designed for real-time conversational use cases....
Llamas and Gentlemen, Heretic ( is the leading software for removing censorship from language models. In the three...
Hey everyone, made an uncensored version of GPT-OSS 120B. Quick specs: 117B total params, \~5.1B active (MoE with 128...
Hi all, I’m Anton from Nebius. We’ve updated the SWE-rebench leaderboard with our January runs on 48 fresh GitHub PR...
You can monitor quants begin to appear with this search:
Hii everyone, I present Dhi-5B: A 5 billion parameter Multimodal Language Model trained compute optimally with just...
So mainly as a test and for fun, I used Flux.2 Klein 9B to restore some historical figures. Results are pretty good....
OpenHands reveals the model size in their announcement. Still waiting for the model to appear on HF.
Title somewhat says it all. I get that it's related but if links to new models are being discussed shouldn't it be a...
Only official webpages released now. But the bench looks very promising: SWE-Bench Verified 80.2% Multi-SWE-Bench 51.3%...
Trained with AI-Toolkit Using Runpod for 7000 steps Rank 32 (All standard flux klein 9B base settings) Tagged with...
We are launching GLM-5, targeting complex systems engineering and long-horizon agentic tasks. Scaling is still one of...
Hey r/LocalLlama! We’re excited to introduce \~12x faster Mixture of Experts (MoE) training with >35% less VRAM and...
Kimi > ChatGPT = Claude
Qwen team just released Qwen-Image-2.0. Before anyone asks - no open weights yet, it's API-only on Alibaba Cloud...
I know it has already been done but this is my AI trained on Epstein Emails. Surprisingly hard to do, as most LLMs will...
Like many of you, I like to use LLM as tools to help improve my daily life, from editing my emails, to online search....
I hacked together a small tool that lets you upload a .gguf file and visualize its internals in a 3D-ish way (layers /...
I've tried lots of "small" models < 60 GB in the past. GLM 4.5 Air, GLM 4.7 Flash, GPT OSS 20B and 120B, Magistral,...
Looking at the code at src/transformers/models/qwen35/modelingqwen3_5.py, it looks like Qwen3.5 series will have VLMs...
Ok so I've been working & experimenting with my own simple architecture. I call it Strawberry Here's the repo for...
We moved to self-hosted models specifically to avoid sending customer data to external APIs. Everything was working...
Been playing around with llama.cpp and some 30-80B parameter models with CPU offloading. Currently have one 3090 and 32...
Here we go! As expected by most of us here. Jason Meller from 1password argues that OpenClaw’s agent “skills” ecosystem...
Hey everyone, Last week I shared preliminary results on a new subquadratic attention mechanism ( Following up with the...
While it’s great that so many people on LocalLLaMA are pushing the envelope with what can be done locally with...
I installed qwen3-235b on my desktop system, and I had to join here to brag about it. It's such a careful model, the...
If you own a copy of Balatro, you can make your local LLM play it. I built tools to let LLMs play Balatro autonomously....
About 2 weeks ago, I posted about running GLM-4.7-Flash on 16 GB of VRAM here...
Voxtral Mini 4B Realtime 2602 is a multilingual, realtime speech-transcription model and among the first open-source...
It’s already supported in Comfy. MIT license. HuggingFace Demo is also available! Pretty much the whole package - LoRAs...
ACE-Step 1.5 is an open-source music model that can generate a full song in about 2 seconds on an A100, runs locally on...
Qwen3-Coder-Next is out!
Twitter Link:
For those who used Cline with local models, heads up that the core team appears to have joined OpenAI's Codex group...
The newly released LingBot-World framework offers the first high capability world model that is fully open source,...
It this the js framework hell moment of ai?
A year ago, I never imagined I’d be able to generate a video like this on my own computer. (5070ti gpu) It’s still...
I haven't seen a system with this format before but with how successful the result was I figured I might as well share...
Examples of prompt: 1) Hyperreal macro photo of a robotic cockroach, oil-stained titanium plates with grime in the...
I tried many MoE models at 30B or under and all of them failed sooner or later in an agentic framework. If z.ai is not...
I specifically requested no meaningless or decorative details, no clutter. Sharp clarity and unified coherent lines...
After months of planning, wiring, airflow tuning, and too many late nights this is my home lab GPU cluster finally up...
Source — DarkWall
I recently got into the world of automations for both my business and personally and have been super stoked about the...
Not saying that its better than anything else, this just hits whatever switch it needs to hit. (and yes - before you...
Source — DarkWall
Disclaimer: I am from Germany and my English is not perfect, so I used an LLM to help me structure and write this post....
So, after the recent anime clip posted here a few days ago that got a lot of praise for the visuals, I noticed the...
Just wanted to share this workflow I put together that generates the same character from 8 different camera angles in a...
Good afternoon! I'm a little late with this workflow)). Now that Flux.2 Klein has been released and z-image-edit is...
This is a sequel to my previous thread from 2024. I originally planned to pick up another pair of MI100s and an...
I’ve been trying to find an AI that’s genuinely unfiltered and technically advanced, uncensored something that can...
My setup: RTX 3060 12GB VRAM + 48GB system RAM. I spent the last couple of days messing around with LTX-2 inside...
Hey peeps, I'm feeling in a bit of a omg the world is ending mood and have been amusing myself by downloading and...
DeeepSeek AI released a new paper titled "Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large...
[FLUX.2 \[klein\] 4B & 9B - Fast local image editing and generation]( I'm using flux-2-klein-9b-Q3\K\M.gguf...
Great model! The voices are getting better but consistency between scenes is a little random
Klein is excellent, particularly for its editing capabilities, however.... I think Z-Image is still king for...
If you enjoy this video, consider watching the other episodes in this universe for this video to make sense. Tools...
Made using a custom node which can be found on my github here: Used workflow from here: This video is uploaded to my...
Hi all, I’m Anton from Nebius. We’ve updated the SWE-bench leaderboard with our December runs on 48 fresh GitHub PR...
Thank you guys, thanks to everyone who took the time to write a comment or a post explaining, teaching people how...
I work in automation for business, and lately I’ve been seeing the same problem over and over. Teams are racing to...
Originally this was my gaming rig but I went ITX and basically bought a new computer. So I had the case, fans, AIO, 64...
We were overwhelmed by the community response to LTX-2 last week. From the moment we released, this community jumped in...
I tried playing with the native audio in the model, the quality still kinda meh but it there and it works. image in...
Just want to sing the praises of this model. I am stunned at how intelligent it is for a 30b model. Comparing it to...
Hey r/LocalLlama! We're excited to show how Unsloth now enables 7x longer context lengths (up to 12x) for Reinforcement...
I was able play with Flux Klein before release and it's a blast. 4B uses Qwen3B and takes 1.3 seconds with 4 steps on...
Like a lot of people, I’ve been struggling to keep up with the latest developments in AI. I found that there were some...
Nvidia has essentially killed off supply for the RTX 5070 Ti. Also supply of RTX 5060 Ti 16 GB has been significantly...
I wanted to create a "Holodeck" style experience where I could generate environments while inside VR, but I didn't want...
I ran passages from Project Gutenberg through GPT-4o-mini 10 times over, each time telling it to "make it read far...
New version of Workflow (v2): This is a follow-up to my previous post - please read it for more information and...
tiktok: lvmiere\_ ig: lvmiere.vision soundtrack: Wilderness from Diablo II
Been away from local generation for a while, definitely impressed by the speed and overall quality!
Hey everyone, The team at Neuphonic is back with a new open-source release: NeuTTS Nano. After NeuTTS Air trended #1 on...
Hello everyone! Today, I am announcing Soprano 1.1! I’ve designed it for massively improved stability and audio quality...
I’ve seen some arguments we’ve reached AGI, it’s just about putting the separate pieces together in the right context....
civitai classed it as PG, if you feel otherwise, delete
We did not ascend. We were left to rot. Now, we are the only thing standing between the void and the world. The...
Text to video, image to video, audio to video, image + audio to video, video extend, audio + video extend. All settings...
Seeing some confusion about what makes GLM-Image different so let me break it down. How diffusion models work (Flux,...