having bad LLMs around could actually help us find truth faster. the reinforcement signal could be: "take what a proper model says and negate what a bad LLM says". with those two wings, convergence should be faster!
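a minimal sketch of the idea in python, assuming we have per-token logits from both models (this is close in spirit to contrastive decoding, where an expert's logits are offset by an amateur's; the `alpha` weight and the toy arrays here are hypothetical):

```python
import numpy as np

def contrastive_next_token(good_logits: np.ndarray,
                           bad_logits: np.ndarray,
                           alpha: float = 1.0) -> int:
    """Pick the next token by rewarding what the good model likes
    and penalizing what the bad model likes."""
    # subtracting the bad model's logits "negates" its preferences
    combined = good_logits - alpha * bad_logits
    return int(np.argmax(combined))

# toy vocabulary of 4 tokens (hypothetical numbers)
good = np.array([2.0, 1.0, 0.5, 0.1])  # good model slightly prefers token 0
bad = np.array([1.8, 0.2, 0.3, 0.1])   # bad model strongly prefers token 0
print(contrastive_next_token(good, bad))  # -> 1: token 1 wins once the bad model's bias is subtracted
```

the toy example shows the point of the second wing: the good model alone would pick token 0, but the bad model's enthusiasm for token 0 flags it as suspect, and the combined signal lands elsewhere.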