Since the xAI team has been deleting posts, this is the only example I could find, and I found it in a video. (I created the example on the right.)
Notice that the reply doesn't need to provide any context, yet Grok still responds with Will's name and the rape fantasy.
What's to stop people from doing the same with MechaHitler and antisemitic comments? Another user could then ask Grok a question in the same thread, and Grok would parrot and agree with the first person.
I'm sure the system prompt telling it to tell uncomfortable truths, as long as they’re well-substantiated, plus a model that was trained to be less censored, made it possible. But when it was happening, I tried to recreate it and couldn't.
Anyway, just wondering if anyone who paid closer attention to this has any insights to share.