Which LLM are you running? Mine is ridiculously slow

Discussion

Right now, I’m running Mistral through ollama. It takes a while to get responses back. My machine is a 6-core, Intel-based, 2018-era Alienware gaming PC with 16 GB of RAM.

I’ve got 64 GB of RAM arriving today. I don’t expect it to help with performance, but it should let me run bigger models.

Are you using Start9?

No, this is just a repurposed, 2018-era Alienware PC.

What kind of GPU, and how much VRAM does it have? AFAIK, running inference is constrained by GPU VRAM.

8B models run very fast on my 8 GB VRAM GPU.
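The rule of thumb behind that: weight memory is roughly parameter count × bits per weight ÷ 8, and ollama's default model tags are typically 4-bit quantized. A rough sketch (ignoring KV-cache and activation overhead, so real usage is somewhat higher):

```python
# Back-of-envelope estimate of LLM weight memory at a given quantization.
# These are rules of thumb, not exact figures for any specific model/runtime;
# KV cache and activation overhead are ignored.

def weights_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of model weights in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30

# A 7B model (e.g. Mistral) at 4-bit quantization: ~3.3 GiB,
# which fits comfortably in 8 GB of VRAM.
print(round(weights_gib(7, 4), 1))

# The same model at 16-bit: ~13 GiB, which would spill out of
# an 8 GB card and fall back to much slower CPU/RAM inference.
print(round(weights_gib(7, 16), 1))
```

That spillover is why a model that barely exceeds VRAM feels ridiculously slow, while one that fits runs fast.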

Good question! I’m not sure, I’ll have to check.

You’re the one who is slow. πŸ˜†

πŸ˜‚πŸ˜‚