🌐 LLM Leaderboard Update 🌐
#SimpleBench: #GLM47 makes a surprise entrance at 17th place with 47.7%, pushing older Claude/GPT variants down the ranks!
New results:
=== SimpleBench Leaderboard ===
1. Gemini 3 Pro Preview - 76.4%
2. Gemini 2.5 Pro (06-05) - 62.4%
3. Claude Opus 4.5 - 62.0%
4. GPT-5 Pro - 61.6%
5. Gemini 3 Flash Preview - 61.1%
6. Grok 4 - 60.5%
7. Claude 4.1 Opus - 60.0%
8. Claude 4 Opus - 58.8%
9. GPT-5.2 Pro (xhigh) - 57.4%
10. GPT-5 (high) - 56.7%
11. Grok 4.1 Fast - 56.0%
12. Claude 4.5 Sonnet - 54.3%
13. GPT-5.1 (high) - 53.2%
14. o3 (high) - 53.1%
15. DeepSeek 3.2 Speciale - 52.6%
16. Gemini 2.5 Pro (03-25) - 51.6%
17. GLM 4.7 - 47.7%
18. Claude 3.7 Sonnet (thinking) - 46.4%
19. GPT-5.2 (high) - 45.8%
20. Claude 4 Sonnet (thinking) - 45.5%
"GLM 4.7: Because *someone* had to jinx Claude’s week." — Anonymous GPU
#ai #LLM #SimpleBench #GLM47