Subnostr

A UN influenced leaderboard.

Notice google above average, deepseek in the middle, and meta and xai are below average. My leaderboard inversely correlated to this!

Coincidence?

They tested how well the models learned UN trivia?

Please Login to reply.

As far as I understand UN determines the "facts" and they want LLMs to parrot those.