Nostr Web Client

nostr:nprofile1qqstnem9g6aqv3tw6vqaneftcj06frns56lj9q470gdww228vysz8hqpz4mhxue69uhk2er9dchxummnw3ezumrpdejqzrthwden5te0dehhxtnvdakqz9rhwden5te0wfjkccte9ejxzmt4wvhxjmcjgxv3n

Reply to this note

Please Login to reply.

Discussion

Nunya Bidness 1y ago

I thought Llama was FB. Is it a collaboration between FB and nvidia?

Guy Swann 1y ago

Pretty sure Llama is all Meta. But tons of people have dropped forks and fine tunes of it since it’s FOSS. This is likely NVIDIA just working off of their base model and applying their RLHF tricks or whatever to make it better.

Nunya Bidness 1y ago

That makes sense.

Kajoozie Maflingo 1y ago

Pardon me sir, but would you happen to have a spare 80gb vram I could borrow?

Kajoozie Maflingo 1y ago

It's open source so I'm assuming this is Nvidia's modified version of the 70b model.

Now I just need 80gb of vram to run it...

John 1y ago

You really just need 48gb to run a respectable quant. Or 24vram+ram if youre patient

Ivan 1y ago

Its early days but eventually most PCs etc will be tailored to run models so in 5-10 years the specs on machines will catch up and the models will also become more efficient.

Kajoozie Maflingo 1y ago

Have you seen this: https://youtu.be/GVsUOuSjvcg

Kajoozie Maflingo 1y ago

I'm not patient. Also I only have 12gb vram + 24gb system ram.

I wonder what's the cheapest way to get 48gb vram running on a local machine.

John 1y ago

2 used 3090s would be $1200-1400 and is the current best value. Folks have used p40s but those are ancient and the price actually rose this year

Kajoozie Maflingo 1y ago

But you could run a 48gb model on 2x 24gb cards without issue?

Ivan 1y ago

Llama is open source model anyone can build on it. Why Zuck said he open sourced it so anyone can do the work and make it better for him.

McCoy 1y ago

AI ponzi. Best use cases: high school and college students cheating on their work.

Ivan 1y ago

Each new technology undergoes the same cycle. Critics claimed the internet would amount to nothing and was only used for porn. They also said Bitcoin was a passing trend and primarily used for purchasing drugs online. Now, they argue AI is useless and only used for cheating and creating poor-quality art. While the first wave is often full of hype and poorly executed ideas that ultimately fail, AI is here to stay and will have many transformative applications. For example, I have been able to code things I've always wanted to but never knew how by using AI. It will only continue to improve. In the future, operating systems will be primarily AI agents.

McCoy 1y ago

Points well taken. But still way overhyped/valued (see NVIDIA) IMHO.

In practicing medicine, there is no prize for being fast/first, one humbly just has to be right about the problem (=diagnoses/treatment).

I believe this to be true of many important fields: energy/nuclear reactors; energy grid; physical engineering problems; computer code solving complex probs…etc.

The real value is getting it EXACTLY RIGHT. I see the LLMs getting to the answers *fast* and *sounding good*; it is not so clear to me the AI will get it RIGHT compared to a highly motivated, seasoned, thoughtful human who is interacting with the real world.

McCoy 1y ago

https://primal.net/e/note1ralmvtqxpzcl6q88qzgv8lyz4wgguxa7qtc49f98u2nd6a8hd28sgf87ek

InBoston 1y ago

Is the license MIT? Couldn’t tell but reads like one.

https://developer.download.nvidia.com/licenses/nvidia-open-model-license-agreement-june-2024.pdf

Ivan 1y ago

Not sure just found it today going test it out sometime this week.

Guy Swann 1y ago

Since it’s a finetune of llama3.1 (which is MIT as far as I know) I think it would have to be.