Any programmers understand why LLMs makes stuff up so much?

posted 126 days ago by CaptainTrouble 126 days ago by CaptainTrouble +19 / -0

Like, it literally creates answers out of thin air then sells it as if it's correct. It doesn't even try to get it right. What sort of redundancy is there in analyzing if the answer is correct before spewing it out? I thought LLMs were supposed to discern what the best answer is given what was said to it based on its training, yet it'll give answers that don't exist based on any training. It's not like it learned the wrong answer from a Reddit post and just posted what Reddit said. It legit is making up wrong answers then citing correct answers. It just outright gets it wrong almost on purpose.

Anyone understand why LLMs fail so much?

I understand they run correlations but how does it determine a wrong answer is the most correlated to the correct response given the prompt instead of the actual correct answer...

37 comments

37 comments share save hide report block hide replies

You're viewing a single comment thread. View all comments, or full comment thread.

Comments (37)

sorted by:

▲ 3 ▼

– ItLivesInTheWind 3 points 125 days ago +3 / -0

I'll get into it. I'll try it with some metaphors but the TLDR; is that they're probabilistic, not deterministic, and that probability plays out in the growing and training phases as well as the prompting phases. I'll leave out talk about transformers and some of the nitty gritty around prompting.

The metaphor for it might be:

imagine millions of islands
on each island, there are parts of words
while it's learning, the LLM stores pieces of words (or pictures/video/sound or whatever it's studying) on islands and tries to sort them so that they flow well into one another and make words and sentences and ideas
there are not enough islands for every piece of words so it has to try and populate the islands with bits that won't conflict
now imagine that there are thousands of types of portals that connect the islands (dimensions)
the LLM, while learning, tries to package content so that it makes sense along a given dimension from island to island

So your LLM studies. It has weights that help it ground and sort initially, but ultimately it's grown. It may or may not "understand" something. Like it might see numbers and go monkeys on typewriters trying to write a black box bit of code that represents a relationship. Adding or subtracting or what not. It might do it like us, but it's unlikely. Usually it comes up with thousands or millions of explanation functions and goes with the shortest and simplest, Occam's Razor.

so now you have islands and portals and black box subroutines that can be executed
then there's a post-training tuning where it gets asked questions and it tries to score
further sorting ensues during the tuning, a new layer

YOU PROMPT IT

your prompt is analyzed for statistical applicability to black boxes, islands, and portals
a path is drawn and from each island a path is drawn based on probabilities of the whole
until the probability indicates that it's done

A LOT CAN GO WRONG

maybe things get ambiguous on an island and it matches Empi instead of Eiff and tells you that the Eiffel Tower is in New York (this is the typical cause of hallucination)
maybe the black box doesn't actually do the operation in a way that's correct so the content is just wrong or used at the wrong time
maybe it selects the incorrect portals (dimensional relationship) between islands

These things are getting more correct though as you throw more money at it, you buy more islands and more portal networks. There is less need for ambiguity and superposition on the islands (less need for tokens to occupy the same position in the vector database + dimensions). There are also more monkeys on more typewriters, so there are more lottery tickets and simpler and smaller Occam's Razors emerge as functions.

Also there are better legacy models to train the new models so if the tuning is known to be good, it can fire millions or billions of questions to post-train refine the model.

*They do relate content to other content weighted but mostly on their own. They do have algorithms that "understand" data in their way. They certainly learn. They probably think in something akin to a reflex arc that responds to stimuli but is not sustained, unprovoked and emergent, or certain other properties we consider to be thinking.

*But they are doing something and that something will augment humans to where they can probably get it to self-improve. Also assisted humans will find a more human path more quickly if a self-improved LLM isn't good enough to stop making stuff up so much.

permalink save report block reply