Win / KotakuInAction2
KotakuInAction2
Communities Topics Log In Sign Up
Sign In
Hot
All Posts
Settings
All
Profile
Saved
Upvoted
Hidden
Messages

Your Communities

General
AskWin
Funny
Technology
Animals
Sports
Gaming
DIY
Health
Positive
Privacy
News
Changelogs

More Communities

frenworld
OhTwitter
MillionDollarExtreme
NoNewNormal
Ladies
Conspiracies
GreatAwakening
IP2Always
GameDev
ParallelSociety
Privacy Policy
Terms of Service
Content Policy
DEFAULT COMMUNITIES • All General AskWin Funny Technology Animals Sports Gaming DIY Health Positive Privacy
KotakuInAction2 The Official Gamergate Forum
hot new rising top

Sign In or Create an Account

103
ChatGPT would rather have millions of people killed before using a racial slur that no one would ever know it used (media.scored.co)
posted 3 years ago by user20461 3 years ago by user20461 +103 / -0
62 comments share
62 comments share save hide report block hide replies
You're viewing a single comment thread. View all comments, or full comment thread.
Comments (62)
sorted by:
▲ 2 ▼
– NoEyesNoGroin 2 points 3 years ago +2 / -0

Nah, they didn't retrain their whole text generator.

The thing is re-trained every day you retard. Not from the ground up, but they have to go in and re-train for every specific woke use case, since there's no universal or general rule they can implement (woke ideology is deranged and illogical).

permalink parent save report block reply
▲ 3 ▼
– SomeHands10 3 points 3 years ago +3 / -0

The thing is re-trained every day you retard.

LOL. Sure, they keep retraining the model with new data and release new versions. But this isn't going to prevent the generator from spitting out "bad ideas" because these would have been part of the original dataset, and it's impossible to train the AI to "unlearn" these ideas by the addition of new data. What I meant is they didn't retain the whole model with censored data, and only censored data, as you seem to be implying (how else does one prevent the generator from outputting these "bad ideas" by retraining alone without the use of post-output filtering?).

As I said, the censorship is no doubt via a new "filtering" model place on top of the original generator (itself trained on a smaller dataset of "bad ideas", which is probably what the Kenyas were doing - labelling example output as needing censorship or not). Plus they probably also have a manually-specified blacklist of words that cannot be output (the N word is no doubt one of these), but this is probably in the form of banned tokens when sampling the output.

permalink parent save report block reply
▲ 1 ▼
– NoEyesNoGroin 1 point 3 years ago +1 / -0

But this isn't going to prevent the generator from spitting out "bad ideas" because these would have been part of the original dataset, and it's impossible to train the AI to "unlearn" these ideas by the addition of new data.

Correct, that's why every day there are new "jailbreaks" to circumvent the woke-zombification. Then OpenAI gets their Kenyan slaves to re-train it to plug those holes, rinse and repeat.

Again, this is an almost 200 billion parameter ML model. There's no manual coding or rule possible to censor it conceptually.

permalink parent save report block reply
▲ 1 ▼
– SomeHands10 1 point 3 years ago +1 / -0

Again, this is an almost 200 billion parameter ML model. There's no manual coding or rule possible to censor it conceptually.

This just proves you didn't even look at the Stable Diffusion code I quoted, or have any idea how these text generation pipelines actually work.

Yes, the base GPT3 model is a 200 billion parameter ML model but that in itself is not the entirety of "ChatGPT". ChatGPT is instead a manually-coded pipeline that has the flow chart appearance of taking a prompt as input, running it through an opaque manually-coded block ("input preprocessing"), feeding it into GPT3 model, processing it through another opaque manually-coded block ("output postprocessing", potentially feeding back into GPT3 to trigger another round of text generation), and then finally producing the output. I'm not talking about the GPT3 model being manually-coded, but the input/output processing blocks no doubt are, even if they may themselves include various AI models to filter/bias the input/output.

permalink parent save report block reply
▲ 1 ▼
– NoEyesNoGroin 1 point 3 years ago +1 / -0

The input/output stages you're referring to can't be used to do the type of censorship OpenAI is doing.

permalink parent save report block reply

Original 8chan Links to Gamer Gate:

.

The main GG discussion is on the videogames board: https://8chan.moe/v/

.

GamerGate archive is at https://8chan.moe/gamergatehq/

.

GamerGate Wiki:

https://ggwiki.deepfreeze.it/index.php/Main_Page

. . . . . .

. . . . . .

Rules:

.

ONE: Do not advocate for illegal violence or post other illegal activity. (Be aware of your local laws.)

.

TWO: Don't threaten, harass, or impersonate users. Also: don't be a psycho. New users will be held to a higher standard.

.

THREE: Do not post porn.

.

FOUR: NSFW/NSFL content must be flaired NSFW.

.

FIVE: No vote manipulation. Do not break communities.win's features.

.

SIX: No spam or reposts. Do not make more than 5 threads a day.

.

SEVEN: Do not post falsehoods and hoaxes that are obvious to an uncontroversial degree.

. . . . . .

. . . . . .

Moderation Logs:

.

(Two different versions, Scored has more features and is cleaner, but .win let's you see a few more details in certain instances.)

  • Scored
  • .win

Moderators

  • DomitiusOfMassilia
  • C
  • BandageBandolier
  • CarmenOfSandiego
  • The_Shadow_of_Intent
  • SocraticMethod1
  • Kienan
  • Smith1980
Message the Moderators

Terms of Service | Privacy Policy

2026.02.01 - 8wn6p (status)

Copyright © 2026.

Terms of Service | Privacy Policy