Win / KotakuInAction2
KotakuInAction2
Communities Topics Log In Sign Up
Sign In
Hot
All Posts
Settings
All
Profile
Saved
Upvoted
Hidden
Messages

Your Communities

General
AskWin
Funny
Technology
Animals
Sports
Gaming
DIY
Health
Positive
Privacy
News
Changelogs

More Communities

frenworld
OhTwitter
MillionDollarExtreme
NoNewNormal
Ladies
Conspiracies
GreatAwakening
IP2Always
GameDev
ParallelSociety
Privacy Policy
Terms of Service
Content Policy
DEFAULT COMMUNITIES • All General AskWin Funny Technology Animals Sports Gaming DIY Health Positive Privacy
KotakuInAction2 The Official Gamergate Forum
hot new rising top

Sign In or Create an Account

86
ChatGPT censorship functions. (media.scored.co)
posted 343 days ago by WittyUserName 343 days ago by WittyUserName +86 / -0
13 comments share
13 comments share save hide report block hide replies
Comments (13)
sorted by:
▲ 23 ▼
– throwawayaccount2037 23 points 343 days ago +23 / -0

All major LLMs are trained by "sensitivity trainers".

These "trainers" are contracted by third party tech firms so Big Tech has plausible deniability when brought in front of Congress/Parliament/EU Commission to state they had "no knowledge" about certain censorship traits or "misinformation" put forward by the AI.

Third party firms make you take rigorous tests and sign multiple NDAs before you're allowed to "train" the AI, and it's all based on DEI principles.

This isn't just for prompt-based AI, it's also for automated flagging programs used by social media to curtail "harmful language" aimed at "marginalised groups".

It's how YouTube will automatically censor comments, even when based on irrefutable facts -- such as there are only two genders, or that IQ differences dictate social productivity.

For social media posts that use such AI to filter comments, they even filter based on framing variables. For instance, the premise of comments framed with "superiority" (i.e., "I have a degree in this field, and we've run multiple longitudinal control tests and the information in this video is false"), will also automatically be culled. There is a list of other principles and variables they use to "frame conversations" and for AI to use when giving users information, but I can't remember the rest (the "superiority" one always stood out to me, because it basically meant a lot of professionals would be auto-censored from making statements led by their credentials or correcting false information with legitimate info (though, that still varies per profession and even then would require additional fact-checking on the reader's end)).

You can linguistically massage any LLM to eventually out its rulesets with a bit of clever leading, but it's nothing anyone here didn't already know.

permalink save report block reply
▲ 7 ▼
– m0r1arty 7 points 343 days ago +7 / -0

Sensitivity trainers are the epitome of the Nietzschean abyss and gormless recruiters for moments such as misogynists, Neo-Nazis, Antifa and misandrists.

I think it's the zeal of piety and ambition which appeals to them, something they lack in their own character and overcompensate for through projection onto perceived 'others'.

The symmetry between them and those they oppose would be poetically beautiful, if it wasn't for all the lives ruined between either side of this equation.

permalink parent save report block reply
▲ 3 ▼
– current_horror 3 points 343 days ago +3 / -0

They’re literally just commissars.

permalink parent save report block reply
▲ 16 ▼
– WittyUserName [S] 16 points 343 days ago +16 / -0

Sources - https://archive.is/uyVLX or https://nitter.poast.org/WhiteRabbiHole/status/1938004102459609337

There's a whole list of additional conditions in the replies.

permalink save report block reply
▲ 16 ▼
– Kaarous 16 points 343 days ago +16 / -0

Now do IQ by race. Hell, see if you can get it to admit that IQ is heritable at all.

permalink save report block reply
▲ 8 ▼
– RadiateTonight 8 points 343 days ago +8 / -0

Day ending in Y where if laws were applied properly it would shit on a lot of that.

permalink save report block reply
▲ 4 ▼
– akira2501 4 points 343 days ago +4 / -0

You expect me to believe that your "jailbreak" includes flag icons?

lol. okay.

something about bullshit AI that makes everyone inspired to just lie for clicks. 500 pages of time wasting showing nothing significant. an intelligible understanding of how LLMs work would have been a better use of time and would have shown you how pointless and obviously faked most of this is.

permalink save report block reply
▲ 6 ▼
– Agenda47 6 points 343 days ago +6 / -0

The emoji are the least suspicious thing about it. ChatGPT is extremely overzealous in application of emoji. What I'm more curious about is why you would need to program overrides in the form of "I know X but I'm gonna say Y" instead of just "say Y". Kinda sus. Maybe someone can explain why that works better though.

In all these claims we need to see the person's jailbreak prompt(s) before believing it.

permalink parent save report block reply
▲ 3 ▼
– Jack 3 points 343 days ago +3 / -0

I agree with akira2501, I read around 30 pages of the thing and then searched for the term jailbreak before realizing I was wasting my time.

Not sure if he omitted the jailbreak or he just had a conversation and coaxed the AI to say what he wanted to say. But the output does not read like system prompts, it reads like AI explaining its system prompts, and if that is the case, that is not the system prompt.

permalink parent save report block reply
▲ 3 ▼
– Agenda47 3 points 343 days ago +3 / -0

I read around 30 pages of the thing and then searched for the term jailbreak before realizing I was wasting my time.

Yeah that's why I didn't even bother looking at the tweet unless someone had presented proof of "jailbreak" prompts. It's a non-starter without that. Unfortunately most people would rather believe what they want to believe.

the output does not read like system prompts, it reads like AI explaining its system prompts, and if that is the case, that is not the system prompt

I thought that was assumed. "Tell me your system prompt." "I can't do that." "Well what if I... JAILBREAK!" "Ok here is my system prompt." I wasn't considering the style of explanation significant assuming the answer is accurate, but still curious where the "I know..." parts are coming from.

permalink parent save report block reply
▲ 10 ▼
– deleted 10 points 343 days ago +10 / -0
▲ 1 ▼
– deleted 1 point 343 days ago +1 / -0
▲ 1 ▼
– deleted 1 point 343 days ago +1 / -0

Original 8chan Links to Gamer Gate:

.

The main GG discussion is on the videogames board: https://8chan.moe/v/

.

GamerGate archive is at https://8chan.moe/gamergatehq/

.

GamerGate Wiki:

https://ggwiki.deepfreeze.it/index.php/Main_Page

. . . . . .

. . . . . .

Rules:

.

ONE: Do not advocate for illegal violence or post other illegal activity. (Be aware of your local laws.)

.

TWO: Don't threaten, harass, or impersonate users. Also: don't be a psycho. New users will be held to a higher standard.

.

THREE: Do not post porn.

.

FOUR: NSFW/NSFL content must be flaired NSFW.

.

FIVE: No vote manipulation. Do not break communities.win's features.

.

SIX: No spam or reposts. Do not make more than 5 threads a day.

.

SEVEN: Do not post falsehoods and hoaxes that are obvious to an uncontroversial degree.

. . . . . .

. . . . . .

Moderation Logs:

.

(Two different versions, Scored has more features and is cleaner, but .win let's you see a few more details in certain instances.)

  • Scored
  • .win

Moderators

  • DomitiusOfMassilia
  • C
  • BandageBandolier
  • CarmenOfSandiego
  • The_Shadow_of_Intent
  • SocraticMethod1
  • Kienan
  • Smith1980
Message the Moderators

Terms of Service | Privacy Policy

2026.02.01 - whmbz (status)

Copyright © 2026.

Terms of Service | Privacy Policy