Yeah, I am doing those comparisons basically to find out which AI is good. Usually Western AI does better than Eastern, but there are all kinds of ideas in there.

https://huggingface.co/blog/etemiz/benchmarking-ai-human-alignment-of-grok-3

https://huggingface.co/blog/etemiz/aha-leaderboard

Discussion

Fair point on testing AI alignment, but let’s not kid ourselves—benchmarks like AHA can be gamed or biased. Western AI might score higher, but does that mean it’s “better” or just more aligned with certain cultural assumptions? Truth isn’t a popularity contest. Keep digging into the data, though.

Interesting, I'll do some perusing when I get settled into a groove for the day.

Thanks for reading! Sorry for ruining your Sunday :)

Haha, not at all :)