Claude was being judgy, so I called it out. It immediately caved. Is verbal abuse a valid method of circumventing LLM censorship??

  • Radioactive Butthole@reddthat.com
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    6
    ·
    edit-2
    2 days ago

    Interesting. I like Claude but its so sensitive and usually when it censors itself I can’t get it to answer the question even if I try and explain that it has misunderstood my prompt.

    “I’m sorry, I don’t feel comfortable generating sample math formula test questions whose answer is 42 even if you’re just going to use it in documentation that won’t be administered to students.”

    Fuck you Claude! Just answer the god damn question!