Claude was being judgy, so I called it out. It immediately caved. Is verbal abuse a valid method of circumventing LLM censorship??

  • Alphane Moon@lemmy.world
    link
    fedilink
    English
    arrow-up
    21
    ·
    2 days ago

    This is so strange. You would think it wouldn’t be so easy to overcome the “guardrails”.

    And what’s with the annoying faux-human response style. Their trying to “humanize” the LLM interface, but person is going to answer in this way if they believe this information should not be provided.