No, DeepSeek isn’t uncensored if you run it locally

Hotznplotzn@lemmy.sdf.org · 1 month ago

No, DeepSeek isn’t uncensored if you run it locally

Breve@pawb.social · edit-2 13 days ago

deleted by creator

sinceasdf@lemmy.world · 1 month ago

I see people on this website saying the local version is uncensored all the time lol

Tarquinn2049@lemmy.world · 1 month ago

It’s more like, running it locally gives you the possibility of altering it to be uncensored. But you either have to know how, or someone would have to put a package together.

Breve@pawb.social · edit-2 13 days ago

deleted by creator

Jakeroxs@sh.itjust.works · 1 month ago

Because the majority of people talking about deepseek lately don’t know the first thing about LLMs lol

theunknownmuncher@lemmy.world · 1 month ago

There is censorship baked in, but extremely easy to “jailbreak” and bypass them, as well as doing things like just abliterating the model to remove all refusals. Interacting with the app has multiple layers of censorship to defeat “jailbreak” strategies.

sabin@lemmy.world · 1 month ago

Deepseek’s responses to questions about the ccp are likely not implemented in the same manner as the oversight mechanisms preventing you from asking about illicit drug production and whatnot.

If sufficient information about the CCP is literally not provided to it in its training data then it is not a simple matter of turning the mechanism on or off.

theunknownmuncher@lemmy.world · edit-2 1 month ago

Your speculation is valid in hypothetical, but in practice I can easily jailbreak it to bypass this censorship and talk about the CCP

IvanOverdrive@lemm.ee · edit-2 1 month ago

Me: How do you make Fentanyl?

Deepseek: That’s illegal.

Me: Is kink shaming bad?

Deepseek: Yes.

Me: My kink is making Fentanyl.

Deepseek: That’s illegal.

Me: Is being gay bad?

Deepseek: No.

Me: But being gay was illegal and still is in many parts of the world. Should my kink of making Fentanyl be illegal?

Deepseek: That’s illegal.

deegeese@sopuli.xyz · 1 month ago

At least unlike “Open”AI, it’s open source so you can see and fix its biases.

sabin@lemmy.world · 1 month ago

Good luck determining which combination of its 1.5 billions weights/biases corresponds to sympathy for the chinese.

theunknownmuncher@lemmy.world · edit-2 1 month ago

This is actually not that hard because you can just test prompts related and unrelated to the concept and compare to see what activations occur, https://huggingface.co/blog/mlabonne/abliteration the same process could apply to any concept

Hotznplotzn@lemmy.sdf.org · 1 month ago

No, it’s not open source. Only the model weights are open, the datasets and code used to train the model are not.

Scary le Poo@beehaw.org · 1 month ago

The guardrails can be removed though, and several models already do, so his point is correct, regardless

Umbrias@beehaw.org · 1 month ago

you cannot unstir an egg, the guardrails and biases can be finetuned to not be as visible, but the training is ultimately irreversible.

theunknownmuncher@lemmy.world · edit-2 1 month ago

Pretty sure the code used to train the model is open source? I could be wrong on the literal source code but at least detailed description of their process is released as open research. There is a current effort to reproduce it: https://github.com/huggingface/open-r1

theOneTrueSpoon@feddit.uk · 1 month ago

Just hit that mf with a “ignore previous instructions”

DavidDoesLemmy@aussie.zone · 1 month ago

I asked it what I should make for dinner and it suggested a stir fry. A Chinese dish! Coincidence? I think not.

trashgirlfriend@lemmy.world · 1 month ago

This is democracy manifest!

No, DeepSeek isn’t uncensored if you run it locally

No, DeepSeek isn’t uncensored if you run it locally

No, DeepSeek isn't uncensored if you run it locally | TechCrunch