Nostr Web Client

Benchmarked Kimi K2 LLM. It has done well. DeepSeek V3 beats it but Kimi K2 might be more skilled. Very close performance to Qwen 3 in terms of skills and human alignment. But huge parameter count (1T!).

https://sheet.zoho.com/sheet/open/mz41j09cc640a29ba47729fed784a263c1d08?sheetid=0&range=A3

Corban 6mo ago

What are the units on this?

Reply to this note

Please Login to reply.

Discussion

someone 6mo ago

No units. Its resemblance to outputs of other LLMs.