Avatar
someone
9fec72d579baaa772af9e71e638b529215721ace6e0f8320725ecbf9f77f85b1
Replying to Avatar Alex Gleason

According to this: https://apxml.com/posts/gpu-system-requirements-kimi-llm

You need 32 x H100 80GB's to run Kimi K2

These cost $30-45K each according to a quick search. 32 of them makes it... about $1 million?

unsloth has GGUFs and llama.cpp fork that could run it in smaller GPUs

https://huggingface.co/unsloth/Kimi-K2-Instruct-GGUF

https://github.com/unslothai/llama.cpp

Qwen 3 32B fine tuning with Unsloth is going well. It does not resist to faith training like Gemma 3 did. I may open weights at some point.

Qwen 3 is more capable than Gemma 3, and after fine tuning it will probably be more aligned. It does not get into "chanting" (repetition of words or sentences) even when temp = 0.

The base training by Qwen was done using 36T tokens on a 32B parameters. About 2 times bigger than Gemma 3's ratio and 4 times bigger than Llama 3's ratio. This is a neat model. My fine tuning is more like billions of tokens. We will see if billions is enough to "convince" trillions.

are you following David the Good? He had an experiment where he left pumpkins alone and didn't look at them and his theory was when wild, pumpkins do better! Kind of like a quantum experiment, observing is killing the cat :)

dandelion loves compacted soil

Benchmarked 4 new models. Deepseek R1 score improved. All these are below average, so p(doom) probably increased!

Coming soon: Kimi K2. They say it is very good at coding, but my leaderboard is about being beneficial to humans. So we will see!

Full leaderboard https://sheet.zoho.com/sheet/open/mz41j09cc640a29ba47729fed784a263c1d08

More info https://huggingface.co/blog/etemiz/aha-leaderboard

yeah more likely human power abusers will use competent AI and claim it has become conscious etc and blame that it does the evil on earth and say we can't do anything. "oops the machine did it"

gm pv happy ATH 🎉

lets build the most based AI note1zys2v5vpgzp60cfe3t7tmxry0vz0gjpjda6spajsd7pt036z52es76ac6m

we nostriches and bitcoiners can do better 'truthful AI' than this and it could be installed in robot brains

https://www.reddit.com/r/singularity/comments/1lw98rm/elon_says_it_is_crucial_for_grok_to_have_good/