Replying to Avatar someone

Benchmarked Kimi K2 LLM. It has done well. DeepSeek V3 beats it but Kimi K2 might be more skilled. Very close performance to Qwen 3 in terms of skills and human alignment. But huge parameter count (1T!).

https://sheet.zoho.com/sheet/open/mz41j09cc640a29ba47729fed784a263c1d08?sheetid=0&range=A3

Avatar
Corban 6mo ago

What are the units on this?

Reply to this note

Please Login to reply.

Discussion

Avatar
someone 6mo ago

No units. Its resemblance to outputs of other LLMs.

Thread collapsed