What prevents LLM data from being poisoned by sheer quantity of garbage?
If they're crawling the internet for data to be fed into the LLMs doesn't that mean that data that appears _more_ will have more importance, instead of data that is "better"?
In other words, what is the "pagerank" of LLMs?
Maybe in the future we go full circle. Old printed books only, after a Butlerian Jihad of sorts
Please Login to reply.
No replies yet.