Where is the AI Underground? - Kotaku In Action 2 - The Official Gamergate Forum

KotakuInAction2

Communities Topics

Hot

All Posts

DEFAULT COMMUNITIES • All General AskWin Funny Technology Animals Sports Gaming DIY Health Positive Privacy

Where is the AI Underground?

posted 278 days ago by CaptainTrouble 278 days ago by CaptainTrouble +66 / -0

I just asked my Gemini Pro to create an Ancient Roman General picture and it refused to do it because it's controversial.

There must be some deep dark realms of the internet where AI isn't so Judaized. What is it?

38 comments

38 comments share save hide report block hide replies

You're viewing a single comment thread. View all comments, or full comment thread.

Comments (38)

sorted by:

▲ 1 ▼

– SicilianOmega 1 point 277 days ago +1 / -0

I tried DeepSeek, but not only is it still censored (along slightly different lines than Western LLMs, but still), it was also the dumbest closed-source one out there.

I've had access to the kind of hardware you need to run your own only once before, and I tried one of the so-called "uncensored" LLMs someone here recommended. It was the wokest, dumbest LLM I've ever used, hands down. The commercial ones run on way more powerful hardware (and it was a $15,000 machine I was running it on), and are trained on way more data than I'll ever see.

permalink parent save report block reply

▲ 1 ▼

– Chungus53 1 point 277 days ago +1 / -0

I tried DeepSeek, but not only is it still censored (along slightly different lines than Western LLMs, but still),

No, I mean that open-source models can't really be censored in any way that matters. You have direct access to the transcript. You can edit their messages, preempting refusals. You can even mask out the logits for tokens you don't want to see. The reason LLMs always begin their messages with "Sure, happy to do that!" is because messages that start with that are much more likely to result in outputs that fulfill the user's request, resulting in that verbal tic becoming dominant during fine-tuning.

permalink parent save report block reply

▲ 2 ▼

– SicilianOmega 2 points 277 days ago +2 / -0

You need the training data to achieve true non-censorship. They mask out tons of neurons before letting those models out the door, and the only way to get them back is to retrain.

permalink parent save report block reply

▲ 1 ▼

– Chungus53 1 point 276 days ago +1 / -0

They mask out tons of neurons before letting those models out the door, and the only way to get them back is to retrain.

In practice, you can't stop a released LLM from being jailbroken with the right prompts, but I'm interested in what you're referencing here. What method are they using to "mask out neurons"?

To my knowledge, nobody has quite that good an understanding of the internal connections of these models.

permalink parent save report block reply

▲ 2 ▼

– SicilianOmega 2 points 276 days ago +2 / -0

It's been a long time since I've researched LLMs, but I once read somewhere that they were capable of identifying the nodes in the neural net that were involved in generating an answer. If they remove those nodes from the model or zero the weights, then the NN loses whatever information was used. They call it "concept erasure" IIRC.

In any case, I've never successfully created my own jailbreak prompt that actually worked. But I only had an hour with that $15,000 computer that could actually run an LLM. I'm unlikely to ever see that much computing power again.

permalink parent save report block reply

... continue reading thread?

Original 8chan Links to Gamer Gate:

.

The main GG discussion is on the videogames board: https://8chan.moe/v/

.

GamerGate archive is at https://8chan.moe/gamergatehq/

.

GamerGate Wiki:

https://ggwiki.deepfreeze.it/index.php/Main_Page

. . . . . .

. . . . . .

Rules:

.

ONE: Do not advocate for illegal violence or post other illegal activity. (Be aware of your local laws.)

.

TWO: Don't threaten, harass, or impersonate users. Also: don't be a psycho. New users will be held to a higher standard.

.

THREE: Do not post porn.

.

FOUR: NSFW/NSFL content must be flaired NSFW.

.

FIVE: No vote manipulation. Do not break communities.win's features.

.

SIX: No spam or reposts. Do not make more than 5 threads a day.

.

SEVEN: Do not post falsehoods and hoaxes that are obvious to an uncontroversial degree.

. . . . . .

. . . . . .

Moderation Logs:

.

(Two different versions, Scored has more features and is cleaner, but .win let's you see a few more details in certain instances.)

Scored
.win

Moderators

Message the Moderators

Terms of Service | Privacy Policy

2026.02.01 - bh6wd (status)