llm math competition. where does each model fail...

observation:

these models very volatile when it comes to math capability

sometimes models fail with very simple math equations, and sometimes

can solve even 3^22

Reply to this note

Please Login to reply.

Discussion

No replies yet.