Replying to Avatar mark tyler

Published today. This is a kind of a big deal. One of the biggest limiters with training AI is that your training data needs to be good. These guys have LLMs write candidate training data, then select the best examples from training data, then they fine tune using those examples, then have the new LLM write new training data and they continue that process. This resulted in a better training data than the original human-sourced training data, and consequently a better model as evaluated by withheld human preference data… 👀

🛫

Avatar
Ben's BTC stories 2y ago

It was only a matter of time until we used ai to train ai.

How is this similar to using a computer to design a computer instead of doing all the chip and computer design work using paper.

Reply to this note

Please Login to reply.

Discussion

Avatar
mark tyler 2y ago

Where are you seeing that comparison?

Thread collapsed