Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling.
https://arxiv.org/abs/2502.06703
Please Login to reply.
No replies yet.