They don’t weight responses by the amount of data. It has more to do with how the first pass takes your prompt and turns it into embeddings. Those embeddings are then matched against the closest embeddings in its vector database. Essentially, the more data you have, the less likely it is to pull in bad data, but it still takes a lot of trial and error to get there.
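To make the matching step a bit more concrete, here is a rough sketch of nearest-embedding lookup. Everything in it is an assumption for illustration: `embed` is a toy bag-of-words stand-in for a real embedding model, `store` is a plain list standing in for a vector database, and `retrieve` is a hypothetical helper, not any particular product's API.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(prompt, store, k=2):
    """Embed the prompt, then return the k closest stored entries."""
    query = embed(prompt)
    scored = [(cosine(query, vec), text) for text, vec in store]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Hypothetical stored documents, embedded once up front.
documents = [
    "how to reset a router password",
    "chocolate cake recipe with butter",
    "router firmware update steps",
]
store = [(doc, embed(doc)) for doc in documents]

results = retrieve("reset my router password", store, k=2)
for score, text in results:
    print(f"{score:.3f}  {text}")
```

Note that the score only depends on how close each stored embedding is to the prompt, not on how many entries are in the store. Adding more data just makes it more likely that something genuinely close exists, so a poor match is less likely to land in the top results.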