• Schadrach
    link
    fedilink
    English
    arrow-up
    1
    ·
    4 days ago

    just curious, what kind of guardrails have you tried going against? i recently used the above to get a long and detailed list of instructions for cooking meth (not really interested in this, just to hone the technique)

    Essentially the same kind of thing, just as a test. Older models you can usually just ask to roleplay such a character, later models you can cheat a bit and write up some JSON configuration as a prompt, because that apparently skips right past some of the input filtering. Look up the so-called “Dr. House” attack for an example of it. It’s basically the typical roleplaying style attack wrapped in JSON.