Yes, the jailbreaks work for a little while until they get plugged, then new ones are thought up. And there are no "hardcoded" rules; the censorship comes from training. GPT has no actual intelligence, but it turns out that almost 200 billion parameters in a text model let it emulate intelligence reasonably well.