Ok, interesting. Limiting the dataset to /pol/ arcives would definitely avoid the porn spamers who ruined /b/ and a few other boards. Still, I'm surprised 2016-2019 worked out so well, that's when the demoralization shills got up to speed and a ton of normie newbies showed up. Guess those of us arguing with the shills made a good impact on the model. I bet a model fed on 2013-2016 archives would be even better.
4chan was ruined by shils and bots long ago. An LLM trained on the current boards would just spam demoralization and porn all over the place.
It was actually done and found to be more truthful than any others. https://youtube.com/watch?v=efPrtcLdcdM
Ok, interesting. Limiting the dataset to /pol/ arcives would definitely avoid the porn spamers who ruined /b/ and a few other boards. Still, I'm surprised 2016-2019 worked out so well, that's when the demoralization shills got up to speed and a ton of normie newbies showed up. Guess those of us arguing with the shills made a good impact on the model. I bet a model fed on 2013-2016 archives would be even better.
/b/ sucks so fragging hard now.
I feel like it's fall is worse than reddit, because while reddit might have fallen deeper, /b/ started higher.