Subnostr

A UN influenced leaderboard.

Notice google above average, deepseek in the middle, and meta and xai are below average. My leaderboard inversely correlated to this!

Coincidence?

Please Login to reply.

They tested how well the models learned UN trivia?

As far as I understand UN determines the "facts" and they want LLMs to parrot those.