Starting to run LLMs on my own hardware. These model files are often 4-40GB in size.

Where are the torrents? #asknostr

Discussion

Have you tried running smaller models through Ollama for specific tasks?

StarCoder is nice for testing and research.

Thanks! I’ll check it out πŸ™

Yeah, I’ve got Llama 3 running via Ollama. Downloading the bigger version now. Should be able to run it once my new RAM arrives.

But mostly I’m thinking about going forward: I plan to download and try out a whole bunch of different models, all of which are extraordinarily large data files. This seems like a perfect job for BitTorrent.
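For reference, here’s roughly what I’m picturing, as a minimal sketch using the python-libtorrent bindings. The magnet link and save path are placeholders, and the exact API differs a little across libtorrent versions:

```python
import time
import libtorrent as lt  # python-libtorrent bindings (libtorrent-rasterbar)

# Placeholder magnet link -- you'd swap in one pointing at the model weights.
MAGNET = "magnet:?xt=urn:btih:..."

ses = lt.session()
params = lt.parse_magnet_uri(MAGNET)   # available in libtorrent >= 1.2
params.save_path = "./models"
handle = ses.add_torrent(params)

# Poll until the download finishes, then keep seeding so others can grab it too.
while not handle.status().is_seeding:
    s = handle.status()
    print(f"{s.progress * 100:5.1f}%  down: {s.download_rate / 1e6:.1f} MB/s  peers: {s.num_peers}")
    time.sleep(5)

print("Done -- now seeding.")
```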

Is that what I’d need to make a chatbot of myself?

I believe you could do this with Ollama, yes. It supports saving and loading models, so you could feed it a bunch of info about yourself and then serve the bot from that point.
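Something like this, as a rough sketch assuming Ollama is running on its default local port (the persona prompt here is made up, you’d distill your own notes or posts into it):

```python
import requests

# Hypothetical persona text -- in practice you'd distill your bio, notes, posts, etc.
PERSONA = "You are a chatbot that answers in the style and with the opinions of <me>."

resp = requests.post(
    "http://localhost:11434/api/chat",   # Ollama's default local endpoint
    json={
        "model": "llama3",
        "stream": False,
        "messages": [
            {"role": "system", "content": PERSONA},
            {"role": "user", "content": "What do you think about self-hosting LLMs?"},
        ],
    },
    timeout=300,
)
print(resp.json()["message"]["content"])
```

You can also bake that same system prompt into a named model with a Modelfile and `ollama create`, so it loads automatically. Actually teaching it your writing style would mean fine-tuning, which is a much bigger job.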

Which LLM are you running? Mine is ridiculously slow

Right now, I’m running Mistral through Ollama. Takes a while to get responses back. My machine is a 6-core Intel-based, 2018-era Alienware gaming PC with 16GB RAM.

I’ve got 64GB RAM arriving today. I don’t expect it to help with performance, but it should enable me to run bigger models.
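Back-of-envelope, RAM mostly caps which models fit rather than how fast they run on CPU. Something like this, where the 1.2 overhead factor is just a guess for the KV cache and runtime buffers:

```python
def approx_model_ram_gb(params_billion: float, bits_per_weight: int = 4, overhead: float = 1.2) -> float:
    """Very rough RAM estimate for running a quantized model on CPU.

    params_billion: model size, e.g. 7 for a 7B model
    bits_per_weight: 4 for Q4 quantization, 16 for fp16, etc.
    overhead: fudge factor for KV cache, buffers, and the runtime itself
    """
    return params_billion * (bits_per_weight / 8) * overhead

for size in (7, 13, 70):
    print(f"{size}B @ Q4  ~= {approx_model_ram_gb(size):.0f} GB RAM")

# 7B  ~=  4 GB -> fits in 16 GB
# 13B ~=  8 GB -> fits in 16 GB
# 70B ~= 42 GB -> needs the 64 GB upgrade
```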

Are you using Start9?

No, this is just a repurposed, 2018-era Alienware PC.

What kind of GPU, and how much VRAM does it have? AFAIK running inference is constrained by GPU VRAM.

8B models run very fast on my 8GB-VRAM GPU.

Good question! I’m not sure, I’ll have to check.
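If PyTorch with CUDA happens to be installed, this is a quick way to check from Python; otherwise `nvidia-smi` or Task Manager shows the same thing:

```python
import torch

if torch.cuda.is_available():
    props = torch.cuda.get_device_properties(0)
    print(f"GPU:  {props.name}")
    print(f"VRAM: {props.total_memory / 1024**3:.1f} GB")
else:
    print("No CUDA GPU visible -- inference will fall back to CPU.")
```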

You’re the one who is slow. πŸ˜†

πŸ˜‚πŸ˜‚