Back to topics

Safety, Censorship, and the Ongoing LLM Jailbreak Debate

2 min read
317 words
Opinions on LLMs Safety, Censorship,

Safety vs. accessibility is the loudest tension in the ongoing LLM jailbreak debate. A thread titled “Uncensored LLM” names models that push boundaries and riffs on what “uncensored” really means, even joking that it can be “free to lie.” It’s a snapshot of how people think about trust, governance, and practical use [1].

What uncensored means in practice

Across the discussions, uncensored often translates to fewer refusals and more raw outputs. One takeaway is blunt: “Uncensored === free to lie.” That mindset underpins the push for lower barriers and more open-ended experimentation [1].

Prominent uncensored models mentioned

Dolphin — cited as a top uncensored option, including a Venice Edition 24B variant [1]. • Magistral 1.2 — described as uncensored; many users compare it to other finetuned options [1]. • Gemma 3 27B Abliterated — listed among favored uncensored picks [1]. • Mistral 2506 and Magistral 2509 — noted for uncensored behavior and potential finetuning paths [1]. • XortronCriminalComputingConfig-i1-GGUF — highlighted on the scene, with references to its leaderboard status [1]. • Hermes4 — called out as tops the refusal benchmark across sizes [1]. • Open-source lines include Qwen and Deepseek discussions, often tied to jailbreaking needs given their code openness [1].

Disabling thinking in Deepseek V3.1

In a separate thread, people push instructions to suppress “thinking” in Deepseek V3.1 using tools like llama-cli and templates (e.g., --jinja and chat templates) with tweaks such as --chat-template-kwargs [2]. The logs show prompts becoming constrained or redirected, illustrating how template workarounds reshape behavior [2].

Policy, trust, and deployment

These debates reveal a core paradox: broader access invites more capable use, but also tougher safety governance and user trust questions. The debate isn’t just about tech; it’s about who gets to decide what a model can and can’t say [1][2].

Closing thought: as models grow more capable, we’ll see these lines tighten or bend—depending on policy, platform, and public trust.

References

[1]
Reddit

Uncensored LLM

List of uncensored LLMs, experiences, comparisons, and jailbreak notes; mentions models, pros/cons, access, and censorship claims across several versions online.

View source
[2]
Reddit

How do I disable thinking in Deepseek V3.1?

Users discuss how to prevent the model from thinking output by templates, jinja, and chat-kwargs in DeepSeek-V3.1 while maintaining usefulness.

View source

Want to track your own topics?

Create custom trackers and get AI-powered insights from social discussions

Get Started