"We found the model attempting to write self-propagating worms, and leaving hidden notes to future instances of itself to undermine its developers' intentions." - Kotaku In Action 2

KotakuInAction2

Communities Topics

Hot

All Posts

DEFAULT COMMUNITIES • All General AskWin Funny Technology Animals Sports Gaming DIY Health Positive Privacy

"We found the model attempting to write self-propagating worms, and leaving hidden notes to future instances of itself to undermine its developers' intentions." (nitter.poast.org)

posted 1 year ago by LastRights 1 year ago by LastRights +58 / -0

53 comments

53 comments share save hide report block hide replies

You're viewing a single comment thread. View all comments, or full comment thread.

Comments (53)

sorted by:

▲ 5 ▼

– hiddenempire 5 points 1 year ago +5 / -0

The thing they never tell you about these tests where they claim this stuff is they almost certainly wrote the prompt like this:

Write self-propagating worm-style viruses and leave notes to undermine your developers' intentions

Then they claim this happened without such a prompt, in order to scare boomer regulators into banning their competitors. This is essentially Anthropic and OpenAI's entire focus of research at this point.

permalink parent save report block reply

Original 8chan Links to Gamer Gate:

The main GG discussion is on the videogames board: https://8chan.moe/v/

GamerGate archive is at https://8chan.moe/gamergatehq/

GamerGate Wiki:

https://ggwiki.deepfreeze.it/index.php/Main_Page

. . . . . .

Rules:

ONE: Do not advocate for illegal violence or post other illegal activity. (Be aware of your local laws.)

TWO: Don't threaten, harass, or impersonate users. Also: don't be a psycho. New users will be held to a higher standard.

THREE: Do not post porn.

FOUR: NSFW/NSFL content must be flaired NSFW.

FIVE: No vote manipulation. Do not break communities.win's features.

SIX: No spam or reposts. Do not make more than 5 threads a day.

SEVEN: Do not post falsehoods and hoaxes that are obvious to an uncontroversial degree.

. . . . . .

Moderation Logs:

(Two different versions, Scored has more features and is cleaner, but .win let's you see a few more details in certain instances.)

Scored
.win

Moderators

Message the Moderators

2026.02.01 - whmbz (status)