Global Feed Post Login
Replying to Avatar jb55

frontier labs are cookin reinforcement learning with verifiable feedback, I can feel it. LLMs + superhuman reasoning with RL is ggs.

Avatar
John 10mo ago

Adversarial training loop when?

Reply to this note

Please Login to reply.

Discussion

No replies yet.