Discussion
I thought Llama was FB. Is it a collaboration between FB and nvidia?
Pretty sure Llama is all Meta. But tons of people have dropped forks and fine tunes of it since it’s FOSS. This is likely NVIDIA just working off of their base model and applying their RLHF tricks or whatever to make it better.
It's open source so I'm assuming this is Nvidia's modified version of the 70b model.
Now I just need 80gb of vram to run it...
You really just need 48gb to run a respectable quant. Or 24vram+ram if youre patient
Its early days but eventually most PCs etc will be tailored to run models so in 5-10 years the specs on machines will catch up and the models will also become more efficient.
Have you seen this: https://youtu.be/GVsUOuSjvcg
I'm not patient. Also I only have 12gb vram + 24gb system ram.
I wonder what's the cheapest way to get 48gb vram running on a local machine.
Llama is open source model anyone can build on it. Why Zuck said he open sourced it so anyone can do the work and make it better for him.
AI ponzi. Best use cases: high school and college students cheating on their work.
Each new technology undergoes the same cycle. Critics claimed the internet would amount to nothing and was only used for porn. They also said Bitcoin was a passing trend and primarily used for purchasing drugs online. Now, they argue AI is useless and only used for cheating and creating poor-quality art. While the first wave is often full of hype and poorly executed ideas that ultimately fail, AI is here to stay and will have many transformative applications. For example, I have been able to code things I've always wanted to but never knew how by using AI. It will only continue to improve. In the future, operating systems will be primarily AI agents.
Points well taken. But still way overhyped/valued (see NVIDIA) IMHO.
In practicing medicine, there is no prize for being fast/first, one humbly just has to be right about the problem (=diagnoses/treatment).
I believe this to be true of many important fields: energy/nuclear reactors; energy grid; physical engineering problems; computer code solving complex probs…etc.
The real value is getting it EXACTLY RIGHT. I see the LLMs getting to the answers *fast* and *sounding good*; it is not so clear to me the AI will get it RIGHT compared to a highly motivated, seasoned, thoughtful human who is interacting with the real world.
Is the license MIT? Couldn’t tell but reads like one.
https://developer.download.nvidia.com/licenses/nvidia-open-model-license-agreement-june-2024.pdf
