Replying to fiatjaf

What prevents LLM data from being poisoned by sheer quantity of garbage?

If they're crawling the internet for data to feed into the LLMs, doesn't that mean that data that appears _more_ often will carry more weight than data that is "better"?

In other words, what is the "pagerank" of LLMs?

Scoundrel 9mo ago

The answer is fine-tuning.
