Reddit in AI content licensing deal with Google
(archive.ph)
Comments (17)
sorted by:
AI being force-fed data moderated heavily by tyrannical trannies. Rough!
“Diabetes in symptom licensing deal with Cancer”
They’re a perfect match for each other.
And then cite each other as sources of truth.
Google the people that gated their AI so hard it wouldn't produce images of a 'white scientist' is now slurping from lgbT-eddit?
There must be a genderless gap in the head and the tail at play.
Every day I am more and more glad I edited & deleted my entire reddit account.
Wish I knew how to do that.
There were a number of scripts that would take your login, work backwards, editing each comment in your history to something generic and useless before deleting that comment. After all the comments were edited & deleted, the script randomized your PW and deleted your account.
There were sites to do this during the '14-'18 reddit revolts. (I believe these sites used the reddit API, prior to the API being locked down.) These days, you'd have to go looking in gits for the raw code to execute, which gets a bit sketchy.
Bots training bots.
The mob-mentality brainrot of left-progressives will fester in the core of all these AI's.
Honestly not that big of a deal. OpenAI used it as well for GPT.
GPT 2 used outgoing reddit links for their training set. "OpenAI developed a new corpus, known as WebText; rather than scraping content indiscriminately from the World Wide Web, WebText was generated by scraping only pages linked to by Reddit posts that had received at least three upvotes prior to December 2017."
GPT-3 then used Reddit posts itself. "OpenAI's GPT series was built with data from the Common Crawl dataset, a conglomerate of copyrighted articles, internet posts, web pages, and books scraped from 60 million domains over a period of 12 years. TechCrunch reports this training data includes copyrighted material from the BBC, The New York Times, Reddit, the full text of online books, and more".
They haven't published what was used for GPT-4 afaik.
It is a big deal, as we've already seen just how it poisons the output.
It's perfect controlled content. I've already seen the "bots" start to proliferate in this election season (accounts less than a year old suddenly achieving 10,000 plus upvotes and posting 100+ messages a day) all with identical messaging and always on the narrative (GOP bad, capitalism bad, socialism gud, and always daily posts for the morning hate fest.
So you feed the machine there, google "AI" consumes it and just regurgitates what these paid for bots want the message to be and google gets to claim its all organic. Which is BS anyway because they've already shown that they would block non-narrative answers ANYWAY. "I'm sorry Dave, but this is hate speech and I cannot give the answer at this time. It's a conundrum."
Yeah at least the companies are making money off the scam. The retarded user base and tranny jannies do it for free.
that's because there's so few places to have discussion in modern society.