Kimi K2 is an open-source LLM that requires $1 million to self-host.

nostr:nevent1qvzqqqqqqypzqprpljlvcnpnw3pejvkkhrc3y6wvmd7vjuad0fg2ud3dky66gaxaqydhwumn8ghj7emvv4shxmmwv96x7u3wv3jhvtmjv4kxz7gqyrdkr6a0cd2ekgdmqk4rtlhnzeqs9r7787enprtmdrvwree2tvyq522ygvc

Discussion

You need to qualify that with a token speed. As soon as the Ollama fixes are in, I'm going to run it on $4k of chrome (1TB DDR4 / 128 cores). Probably Q4_K_XL, but maybe Q8_K_XL... just to find out.
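That token-speed caveat is easy to measure, since Ollama's generate endpoint reports token counts and timings in its non-streaming response. A minimal sketch, assuming Ollama is serving on its default local port; the model tag "kimi-k2" is a placeholder for whatever tag your pulled build actually exposes:

```python
# Minimal sketch: measure generation speed from Ollama's local HTTP API.
# Assumes Ollama is running on its default port (11434); "kimi-k2" is a
# placeholder model tag, not a confirmed registry name.
import json
import urllib.request

payload = json.dumps({
    "model": "kimi-k2",  # placeholder tag
    "prompt": "Explain MoE routing in two sentences.",
    "stream": False,
}).encode()

req = urllib.request.Request(
    "http://localhost:11434/api/generate",
    data=payload,
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    result = json.load(resp)

# Ollama reports eval_count (generated tokens) and eval_duration (ns),
# which is enough to put a tokens/sec figure next to the dollar figure.
tok_per_sec = result["eval_count"] / (result["eval_duration"] / 1e9)
print(f"{result['eval_count']} tokens at {tok_per_sec:.1f} tok/s")
```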

You should also be able to use two $10k Nvidia rigs, or one $10k Mac Studio.

Fucking G-Nasa uses Ollama and not any of the real engines

Ollama wraps llama.cpp, which is fantastic for single-node inference. If you have a cluster or a specific arrangement that aligns with one of the other frameworks, you might do better, but if you just want to run the latest models, it's the place to be.
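For a sense of what "wraps llama.cpp" means in practice, the same GGUF file Ollama manages can be loaded through llama.cpp's Python bindings directly. A sketch assuming llama-cpp-python is installed; the model path is a placeholder for whichever quant you actually download:

```python
# Minimal sketch of driving llama.cpp directly via the llama-cpp-python
# bindings (pip install llama-cpp-python). The GGUF path below is a
# placeholder -- point it at the Kimi K2 quant you actually have.
from llama_cpp import Llama

llm = Llama(
    model_path="./kimi-k2-Q4_K_XL.gguf",  # placeholder filename
    n_ctx=4096,      # context window
    n_threads=16,    # CPU threads; tune to your core count
    n_gpu_layers=0,  # 0 = pure CPU; raise to offload layers to a GPU
)

out = llm("Explain why MoE models quantize well.", max_tokens=128)
print(out["choices"][0]["text"])
```

The n_gpu_layers knob is the single-node tradeoff in miniature: leave it at 0 on a big-RAM CPU box, or raise it to push as many layers as fit onto whatever GPU you have.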