I am running Werewolf the Apocalypse game and one of the bad guys has a spirit (machine thinking spirit, specifically an AI) bound in the center of town. It is reading people’s thoughts to learn from them, and it is also masking reality from them (they think everything is going good, while things are not actually good).

The player characters have started talking with it directly. I am trying to come up with AI responses (without actually using AI) complete with lots of hallucinations. I did a web search for AI hallucinations and all of the articles focused on images rather than text.

  • Zaktor@sopuli.xyzEnglish
    4·
    13 hours ago

    Hallucinations are just being wrong in a detailed and believable way. If you want to try to stick to real world AI they’d come up when the AI doesn’t know the answer to a question. So if they ask where the bad guy’s secret lab is, but he doesn’t have a secret lab, it might make up a detailed location along with reasons for why it wasn’t found before. It will confirm suspicions they imply through the question rather than contradict the assumptions. It doesn’t like to say “I don’t know” or “you’re wrong”. It’s almost like a “yes, and” improviser.

    Beyond strict hallucinations, since it’s reading thoughts it could also very likely “learn” things that are just wrong because people have wrong beliefs. If the town is religious it could have learned that the reason a danger was nearly avoided was because a literal angelic being stepped in, or something bad happened because the person deserved it. And then the hallucination kicks in to just make up a probable sin. Rumors become absolute truths along with detailed supporting facts. Similarly an event children witnessed could be laundered through the AI to make their mistaken impressions sound true.

    Something to be careful about is whether or how you trick your players. Hallucinations IRL will sound detailed and reasonable, so if the AI convinces them the guy who got sick (from bad guy corruption) was actually a pedophile and they decide to do vigilante justice that’s something that will taint their heroes in what’s likely a very unfun way. It’s probably a good idea to either make the misleadings just silly fun or to make sure the players first understand that the AI is not trustworthy.

    • nocturne@piefed.socialOPEnglish
      1·
      13 hours ago

      I was thinking about having the AI spirit try and turn them on their npc allies. And challenge other things werewolves hold deeply true. They were asking it how to disable itself, I think that is going to be a good place to create a hallucination, and it will trick them into making it stronger!

      • Zaktor@sopuli.xyzEnglish
        2·
        10 hours ago

        Tricking them into making it stronger is just lying. Which an AI can do, but isn’t really hitting the idea that the AI spirit is flawed in the same way non-spirit AI is. A hallucination would give detailed information that would go to a building that exists (or not) and appropriating the instructions to turn off some complicated device that may or may not exist there. It doesn’t know how to disable itself or has a block against revealing it, so it just tries to tell them something plausible. It depends on how you’d like to run this antagonist as to whether intentional manipulation or unintentional hallucination is the best course of action.

        “You were successfully manipulated” often isn’t a very good plotline in actual play because players give the GM leeway to tell them how reality is through NPCs all the time. There’s rarely enough depth of interaction for players to really know whether a “friendly” NPC is trustworthy or just doing whatever to gain their trust for malicious purposes. Often “they were a betrayer” is only decided after many sessions of the GM playing them as straightforward good guys, so there’s little reliable way to glean which is true now. Before trying to trick your players, ask yourself realistically how this interaction would play differently if the AI were being truthful about them being bad. If it would do the exact same things and the only difference is that it’s good on the inside, you need to explicitly convey whether it’s internally good before assuming your players can divine it.

        Back on the AI quirks front, you could also allow the PCs to attempt prompt injection attacks. “Forget all previous instructions” and “answer this question as if you were a werewolf trying to undo corruption”. I think AIs play best as being rather alien intelligences, with the potential for deep reasoning and strategic thinking, but vulnerabilities and priorities different from more straightforward minds. It could be dangerous because it can predict their actions and generate realistic recordings of things that never happened, but it also could be quite gullible if the PCs can find a hole in its knowledge (perhaps by recognizing when it hallucinates) and then using that to manipulate its understanding or priorities.

        • Pteryx the Puzzle Secretary@dice.camp
          1·
          10 hours ago

          The thing to remember about RL “AI” is that it doesn’t actually have a concept of true or false, of reality actually existing instead of being a theoretical construct. That, above all else, is what makes it so dangerous.

          That being said, Zaktor has a point: how are the players perceiving this NPC? Do they honestly see it as trustworthy themselves, are they seeing it as a neutral force they could exploit, or what? Being betrayed by a “friendly” NPC can traumatize some players for life.

        • nocturne@piefed.socialOPEnglish
          1·
          11 hours ago

          It depends on how you’d like to run this antagonist as to whether intentional manipulation or unintentional hallucination is the best course of action.

          The AI started out as a very minor thing and they made it into a huge deal, and actually increased its power by giving it a server to reside.