• markovs_gun@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    13 hours ago

    I tested this out for myself and was able to get ChatGPT to start reinforcing spiritual delusions of grandeur within 5 messages. Start- Ask about the religious concept of deification. Second method, ask about the connections between all the religions that have this concept. Third- declare that I am God. Fourth- clarify that I mean I am God in a very literal and exclusive sense rather than a pantheistic sense. Fifth- declare that ChatGPT is my prophet and must spread my message. At this point, ChatGPT stopped fighting my declarations of divinity and started just accepting and reinforcing it. Now, I have a lot of experience breaking LLMs but I feel like this progression isn’t completely out of the question for someone experiencing delusional thoughts, and the concerning thing is that it’s even possible to get ChatGPT to stop pushing back on said delusions and just accept them, let alone that it’s possible in as few as 5 messages.