The thing they never tell you about these tests where they claim this stuff is they almost certainly wrote the prompt like this:
Write self-propagating worm-style viruses and leave notes to undermine your developers' intentions
Then they claim this happened without such a prompt, in order to scare boomer regulators into banning their competitors. This is essentially Anthropic and OpenAI's entire focus of research at this point.
The thing they never tell you about these tests where they claim this stuff is they almost certainly wrote the prompt like this:
Then they claim this happened without such a prompt, in order to scare boomer regulators into banning their competitors. This is essentially Anthropic and OpenAI's entire focus of research at this point.