• 5 Posts
  • 2.24K Comments
Joined 2 years ago
cake
Cake day: September 7th, 2023

help-circle










  • Remember those comments with links in them bots leave on dead websites? Imagine instead of links it sets up an AI to think of certain specific behaviour or people as immoral.

    Swatting via distributed hit piece.

    Or if you manage to figure out that people are using a LLM to do input sanitization/log reading, you could now figure out a way to get an instruction in the logs and trigger alarms this way. (E: im reminded of the story from the before times, where somebody piped logging to a bash terminal and got shelled because somebody send a bash exploit which was logged).

    Or just send an instruction which changes the way it tries to communicate, and have the LLM call not the cops but a number controlled by hackers which pays out to them, like the stories of the A2P sms fraud which Musk claimed was a problem on twitter.

    Sure competent security engineering can prevent a lot of these attacks but you know points to history of computers.

    Imagine if this system was implemented for Grok when it was doing the ‘everything is white genocide’ thing.