Federal judge rules copyrighted books are fair use for AI training - Kotaku In Action 2 - The Official Gamergate Forum

Federal judge rules copyrighted books are fair use for AI training (archive.is)

posted 1 year ago by YesMovement +44 / -0

24 comments

– AtrociKitty 6 points 1 year ago +6 / -0

If there is a difference, in your mind, what do you think that it is?

The text-to-speech application is a transient means of communicating the book. It's no different from opening the e-book on a monitor to read the words. Meanwhile, the LLM is ingesting and storing the text of the book. It's an illegal copy permanently stored in the model's dataset.

That said, I'd rather this be resolved by fixing the issues with copyright. This ruling is just another example of the two-tiered system, where AI training is fair use, while you giving a copy to a friend is infringement.

– SR388-SAX 10 points 1 year ago +10 / -0

Meanwhile, the LLM is ingesting and storing the text of the book

That's not how it works.

– I_Miss_Imp 3 points 1 year ago +3 / -0

Is it not though? Are you saying after book X is used for training that you couldn’t then prompt the AI to “tell me word for word the exact text of book X”?

– DemolitionsPanda 1 point 1 year ago +1 / -0

No, the book isn't copied or stored. The LLM can't regurgitate it on command, because it isn't inside the model.

You can ask the LLM to write new, never before seen text in the style of that author.

Training an LLM is a lot more like reading a book to a toddler than it is like making a digital copy. Neither the toddler nor the LLM can repeat the words of the book.

– AtrociKitty 1 point 1 year ago +1 / -0

In terms of copyright, yes it is. It doesn't matter that the book isn't literally copy-pasted into a vector database. The text is used verbatim as training data, and from there isn't made into a sufficiently transformative work to constitute fair use (plus it's commercial). Training data, even if it can neither be recalled on demand nor exists in whole form, has still been stored within the model's semantic memory.
