Is it not though? Are you saying after book X is used for training that you couldn’t then prompt the AI to “tell me word for word the exact text of book X”?
No, the book isn't copied or stored. The LLM can't regurgitate it on command, because it isn't inside the model.
You can ask the LLM to write new, never before seen text in the style of that author.
Training a LLM is a lot more like reading a book to a toddler than it is like making a digital copy. Neither the toddler or the LLM can repeat the words of the book.
Is it not though? Are you saying after book X is used for training that you couldn’t then prompt the AI to “tell me word for word the exact text of book X”?
No, the book isn't copied or stored. The LLM can't regurgitate it on command, because it isn't inside the model.
You can ask the LLM to write new, never before seen text in the style of that author.
Training a LLM is a lot more like reading a book to a toddler than it is like making a digital copy. Neither the toddler or the LLM can repeat the words of the book.