Nostr Web Client

Nunya Bidness 1y ago

I thought Llama was FB. Is it a collaboration between FB and nvidia?

Discussion

Guy Swann 1y ago

Pretty sure Llama is all Meta. But tons of people have dropped forks and fine tunes of it since it’s FOSS. This is likely NVIDIA just working off of their base model and applying their RLHF tricks or whatever to make it better.

Nunya Bidness 1y ago

That makes sense.

Kajoozie Maflingo 1y ago

Pardon me sir, but would you happen to have a spare 80gb vram I could borrow?

Kajoozie Maflingo 1y ago

It's open source so I'm assuming this is Nvidia's modified version of the 70b model.

Now I just need 80gb of vram to run it...

John 1y ago

You really just need 48 GB to run a respectable quant. Or 24 GB of VRAM plus system RAM if you're patient.
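The memory figures in this thread can be sanity-checked with rough arithmetic. The sketch below assumes a 70-billion-parameter model (the "70b" mentioned above) and counts only the weights, ignoring KV cache and runtime overhead, so real usage will be somewhat higher. The function names and the bit widths shown are illustrative assumptions, not taken from any particular tool.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (10^9 bytes).

    Assumption: every parameter is stored at the same bit width.
    """
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float, budget_gb: float) -> bool:
    """True if the weights alone fit in the given memory budget."""
    return weight_memory_gb(params_billion, bits_per_weight) <= budget_gb


if __name__ == "__main__":
    # Illustrative bit widths: 16-bit (unquantized), 8-bit, and 4-bit quants.
    for bits in (16, 8, 4):
        gb = weight_memory_gb(70, bits)
        print(f"70B at {bits}-bit: ~{gb:.0f} GB of weights, "
              f"fits in 48 GB: {fits(70, bits, 48)}, "
              f"fits in 80 GB: {fits(70, bits, 80)}")
```

Under these assumptions, an 8-bit quant is about 70 GB of weights, which is roughly where the 80 GB figure comes from, while a 4-bit quant is about 35 GB and leaves headroom for context on 48 GB.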

Ivan 1y ago

It's early days, but eventually most PCs will be tailored to run models, so in 5-10 years machine specs will catch up and the models will also become more efficient.

Kajoozie Maflingo 1y ago

Have you seen this: https://youtu.be/GVsUOuSjvcg

Kajoozie Maflingo 1y ago

I'm not patient. Also I only have 12gb vram + 24gb system ram.

I wonder what's the cheapest way to get 48gb vram running on a local machine.

John 1y ago

Two used 3090s would be $1200-1400 and are the current best value. Folks have used P40s, but those are ancient and the price actually rose this year.
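For comparing options, the quoted price can be turned into a cost per GB of VRAM. This is a rough sketch that assumes the $1200-1400 figure covers both cards together (the comment does not say) and that each 3090 has 24 GB; `price_per_gb` is a hypothetical helper name.

```python
def price_per_gb(total_price_usd: float, total_vram_gb: float) -> float:
    """Cost per GB of VRAM for a given total price and total VRAM."""
    return total_price_usd / total_vram_gb


if __name__ == "__main__":
    total_vram_gb = 2 * 24  # assumption: two 24 GB cards
    for price in (1200, 1400):  # assumption: price is for the pair
        print(f"${price} for {total_vram_gb} GB: "
              f"${price_per_gb(price, total_vram_gb):.2f} per GB")
```

The same function can be applied to any other card once a real price is known.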

Kajoozie Maflingo 1y ago

But you could run a 48gb model on 2x 24gb cards without issue?

Ivan 1y ago

Llama is an open source model, so anyone can build on it. That's why Zuck said he open sourced it: so anyone can do the work and make it better for him.