Uncensored Local Models: Abliteration Methods Compared
This article examines abliteration techniques for removing safety filters from local language models and compares how different methods affect uncensored responses.
Someone compiled a solid list of uncensored local models on Hugging Face for folks who want fewer guardrails. Turns out different abliteration methods produce pretty different behavior, so it’s worth trying a few.
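For context on what these models have in common: abliteration typically works by estimating a "refusal direction" from the model's hidden activations on refused vs. answered prompts, then projecting that direction out of the weights. The sketch below illustrates the idea on toy NumPy data — the random "activations" and the single weight matrix are stand-ins, not any specific model's internals, and real implementations vary (which is exactly why the methods behave differently).

```python
# Toy sketch of directional ablation ("abliteration").
# Random vectors stand in for hidden states; real tools extract
# activations from the model on harmful vs. harmless prompts.
import numpy as np

rng = np.random.default_rng(0)
d = 8  # toy hidden size

# Pretend activations: "harmful" prompts carry an extra refusal component.
refusal = rng.normal(size=d)
refusal /= np.linalg.norm(refusal)
harmless_acts = rng.normal(size=(32, d))
harmful_acts = rng.normal(size=(32, d)) + 3.0 * refusal

# 1. Estimate the refusal direction as the normalized difference of means.
direction = harmful_acts.mean(axis=0) - harmless_acts.mean(axis=0)
direction /= np.linalg.norm(direction)

# 2. Ablate: remove that direction from a weight matrix's output,
#    W <- (I - r r^T) W, so the model can no longer write along r.
W = rng.normal(size=(d, d))
W_abl = W - np.outer(direction, direction) @ W

# After ablation the weights have (near-)zero output along the direction.
print(np.linalg.norm(direction @ W_abl))
```

Different uncensoring recipes diverge in how they pick the direction, which layers they ablate, and whether they ablate weights permanently or intervene at inference time, which accounts for the behavioral differences noted above.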
Popular options:
GLM 4.7 Flash (lightweight, fast):
- https://huggingface.co/DavidAU/GLM-4.7-Flash-Uncensored-Heretic-NEO-CODE-Imatrix-MAX-GGUF
- https://huggingface.co/mradermacher/Huihui-GLM-4.7-Flash-abliterated-GGUF
GPT OSS 20B (mid-range):
- https://huggingface.co/DavidAU/OpenAi-GPT-oss-20b-abliterated-uncensored-NEO-Imatrix-gguf
- https://huggingface.co/bartowski/p-e-w_gpt-oss-20b-heretic-GGUF
GPT OSS 120B (heavyweight):