Nostr Web Client

So what’s all the hoopla about DeepSeek and why is it breaking everybody’s brain right now in Ai?

I’ve been doing a dive for a couple of days and these are the main deets I’ve pulled together, will have a Guy’s Take on it soon, so stay tuned to the nostr:npub1hw4zdmnygyvyypgztfxn8aqqmenxtwdf3tuwrd44stjjeckpc37q6zlg0q feed

DeepSeek ELI5:

• US has been hailed as the leader in Ai, while pushing fears that we need to be closed and not share with China cuz evil CCP and they can’t figure it out without us

• ChatGPT and “Open”Ai is poster child, eating up retarded amounts of capital for training and inference (using) LLMs. Estimates say around $100 million or more for ChatGPT o1 model.

• In just a couple of weeks China drops numerous open source models with incredible results, Hunyuan for video, Minimax, and now DeepSeek. All open source, all insanely competitive with the premiere closed source in the US.

• DeepSeek actually surpassed ChatGPT o1 on most benchmarks, particularly math, logic, and coding.

• DeepSeek is also totally open with how its thought process works, it explains and shows its work as it runs, while ChatGPT makes that proprietary. This makes building with, troubleshooting, and understanding with DeepSeek much better.

• DeepSeek is also multimodal, so you can give it PDFs, images, connect it to the internet, etc. it’s a literal full personal assistant with just a few tools to plug into it.

• The API costs 95% LESS than ChatGPT API per call. They claim that is a profitable price as well, while OpenAi is bleeding money.

• They state that DeepSeek cost only $5.6 million to train and operate.

• Capital controls on GPUs and chips went into effect in the past year or two trying to prevent China from “catching up,” and it seems to have failed miserably. As it seems China was able to do 20x the results per dollar with inferior hardware.

• The US model of Ai, its costs, its capes structure, and the massive demand for chips has been the model for assessing the valuation, pricing, and future demand of the entire Ai industry. DeepSeek just took a giant dump on all of it by out performing and spending a tiny fraction to achieve it while also dealing with lack of access to the newest chips.

All of this together is why people are freaking out about a plummet to Nvidia price, reevaluation of OpenAi, and the failure of US to stay dominant or even the legitimacy of staying proprietary as it may just cause us to fall behind rather than lead. All after a $700 billion investment was just announced that now just kinda looks like incompetent corporations wasting horrendous amounts of money for something they won’t even share with people, that you can’t run locally, and is surpassed by a few lean Chinese startups with barely a few million.

Reply to this note

Please Login to reply.

Discussion

Bangarangg 11mo ago

Ah!

₿lockchainYog丰 11mo ago

Oh this is juicy. Popcorn ready for this AI race.

(>0_0)> 11mo ago

Sounds to me like this was specifically created and released to kneecap the US AI industry in response to the chips controls. This may actually be successful too.

Dan 11mo ago

Almost 100% agree, but there is no way they did this without Nvidia chips.

They've open sourced a ground breaking LLM scaling paradigm (RL on COT), which is no small thing believe me, but our closed source reasoning models are likely doing something similar (we just can't see it).

This newly open scaling paradigm is a game changer, but you still don't get this performance without massive compute.

They have illegal H100s, I'm nearly certain of it. Nvidia was probably due for a correction anyway. But it'll be funny to see what happens when we all find out they did this with Nvidia chips

Dan 11mo ago

to expand on that a bit. They are using secret H100s, therefore their capex claims are complete BS, therefore their API price is complete BS. CCP smuggled in our chips and is bankrolling a loss to shake the market. Pretty freaking smart tbh

Chris 11mo ago

They claimed to have stock pilled them before the ban. Not that they didn’t use them at all. They just used less of them.

For what it’s worth.

Dan 11mo ago

I would bet they've continued stockpiling them.

Chris 11mo ago

Indeed.

Chris 11mo ago

What they claim….FYI

Currency of Distrust 11mo ago

This honestly wouldn’t surprise me in the least

Guy Swann 11mo ago

Sorry I didn’t mean to say they didn’t have Nvidia chips, but more that they likely are paying higher price and it’s slightly harder to get ahold of the same amount of compute. Or at least this was the goal of the govt actions.

So either:

• it did nothing and they have easy access. Or,

• access is slightly more difficult but it didn’t matter.

My bullet point was kinda vague and implied what your interpretation was but that’s not exactly what I meant.

SwBratcher 11mo ago

Just posted this. A great breakdown of why and how. I’m curious if his take is technically accurate.

nostr:note1rz3tnf7qcrseqyunefq43pm8hcx9myl20hwu2gas7tdjwew29kls2zz7ln

Guy Swann 11mo ago

Do you have a good write up on the scaling paradigm? I read that in another post but couldn’t confirm it yet and wasn’t sure what that meant.

Any explanation or breakdown link would be appreciatively zapped

Dan 11mo ago

this overview is pretty solid

https://www.youtube.com/watch?v=sGUjmyfof4Q

https://mirror-feeling-d80.notion.site/DeepSeek-R1-182808527b17801585dadb84f7c66cd9

Tauri 11mo ago

Silicon Valley VCs rekt

lol

Lmao even

Cincy 11mo ago

here’s a 50 IQ question: if this is all open source, what’s preventing Nasdaq companies from integrating this model into their ecosystems?

Guy Swann 11mo ago

Nothing. It’ll cost less too

Cincy 11mo ago

that’s kind of what I was thinking as I was watching the Nasdaq tank. My gut says this should mean cheaper better tech for them as well so was wondering why everyone is panicking

Retinadoc 11mo ago

Lina Khan tried to tell us

TheWildHustle 11mo ago

Looking forward to the Guys Take.

Running Deepseek.

Free Markets be Free Markets.

Brian Appavu, MD 11mo ago

This is all interesting. I just don't understand why it's making people sell their bitcoin 🤷‍♂️

Scrotus 11mo ago

When people need to settle positions (like a margin call) they sometimes need to sell Bitcoin since it is a source of liquidity... Has nothing to Bitcoin tech, it's Bitcoin as a 247 source of liquidity. I call it fafo tech

Hope With ₿itcoin 11mo ago

It's just sudden fear.

Guy Swann 11mo ago

Just shallow market correlation. Thats all. If major equities take a short dive, so does Bitcoin in the short term.

When you stretch it to a 6 month timeline or longer Bitcoin simply follows global liquidity (money printing basically). Its actually got the strongest correlation of any asset. So what you are seeing is nothing but noise due to extremely short term trading that affects all assets.

BoomTown 11mo ago

When is the next big print gonna be? I remember hearing Fred Thiel talk in November 2023 … he was asked about the upcoming halving and subsequent bull market and he said global liquidity was all that mattered. At the time, I thought it was a bearish (or at best conservative) comment but now - being 9 months post halving with only 40% appreciation above last cycle’s ATH - I’m wondering how accurate that assertion might have been.

Paul Sernine 11mo ago

Thx for having a look into this & sharing your results. Looking forward to your Take!

Trainer Dan 11mo ago

Where can we go to play around with Deep Seek?

wildcatfish 11mo ago

Is Chamath canceling any meetings today?

Henny B 11mo ago

🤣 I get it! What a turd that guy

CitizenPleb 11mo ago

Lolz 😂

Scrotus 11mo ago

I always kinda figured that open source llms would improve a lot faster than closed source ones so I'm not shocked about this. It's kinda funny.

I think though for more advanced AI there will be a need for advanced hardware. But yeah Nvidia loses their monopoly, open source levels the playing field, and open AI quickly goes from frontrunner to just another company (thank God be sure Altman is a POS and I believe his sisters accusations).

NBD just another day

Brunswick 11mo ago

Sam Altman had it coming

stackatoshi 11mo ago

you love to see it

JB 11mo ago

hmmm...

FunkCoffee 11mo ago

Even though it’s China, it feels bullish for open-source AI and by proxy, decentralization and humanity.

R 11mo ago

Appreciate the summary 🚁👍🏻

Kenshin 11mo ago

“What happens to all these wonderful ChatGPT models if a small Chinese startup builds a superior LLM for $6M❔”

“All your LLM models are destroyed, completely devastated, ₿itcoin goes to the Moon❕” ⤴️🌙

- Saylor to Sam Altman

Space Cake 11mo ago

How do you say rekt in Mandarin?

Henny B 11mo ago

Great break down Guy! I can now explain it to my octogenarian mother 😊

Gavin Green 11mo ago

I’ll catch that podcast for sure. Curious if any of their claims about what they spent can be verified? Also, any way to check that the search inputs and out outs aren’t stored somewhere. I guess if it’s fully open source that can be verified.

sachin 11mo ago 💬 2

nostr:nevent1qqswr2fjxdm3eh5c89epsuma2yk75pg3atvglaw9mfnkxukvfaaxg6spzamhxue69uhhyetvv9ujuurjd9kkzmpwdejhgtczyzu7we2xhgry2mknq8v7227yn7jguu9xhu3g90n6rtnjj3mpyq3acqcyqqqqqqgge3n7z

Rod 11mo ago

I'd add one more, the USD vs BRICS.

The narrative has been "yes, BRICS has all the energy and commodities, but US has the AI". We have been asked to believe that massive US-led productivity gains from AI will make the US deficit immaterial again.

Deepseek shakes that narrative because if US doesn't have a lead in AI (or energy or commodities), then what does it have?

Piko 11mo ago

Thanks for the breakdown, that was a handy little summary 👍

shaun 11mo ago

Great summary thanks

Ranger Andy 11mo ago

So this is a good thing?

Chris 11mo ago

DeepSeek is a refutation of the “Scaling Laws” nonsense that has plagued the AI world for the past couple of years. Instead of trying innovative new ways to improve model performance big tech has been content to throw more compute and more data at these models in order to improve performance. They were bound to run into diminishing returns at some point

V.I.Palidin 11mo ago

TinoLibertario 11mo ago 💬 1

Deepseek censors compromising questions about Chinese government.

#censorship #deepseek #AI

nothing 11mo ago

ChatGPT censors compromising questions about USA government, USA's foreign policy etc. #censorship #chatgpt #ai #deepseek #nostr

Brunswick 11mo ago

It still takes multiple GPU cards to run the full model, bit significantly less than openai. There are tutorials out there on how to spin up your own cloud system that can do it, and it's affordable even for the hobbyist, at least for a few hours.

Brunswick 11mo ago

Pertaining to "How the heck did they do this?" It was pretty obvious TBH. To use fixed point or smaller mantissa for the models is a simple optimization that is done all the time in embedded systems. This "big shock" is really due to the divide between electrical engineering and computer science.

Bitcoin and Space 11mo ago

Question: I have heard that Deepseek was so cheap to construct because they made use of Meta's open source AI models. So they basically built on top of Meta's work. Does this sound right to you?

Guy Swann 11mo ago

I have not heard this, will be trying to confirm this and other details though

burns 11mo ago

Hasn’t a lot of AI work been built on top of the work Meta has done?

Pretty sure they open sourced [at least some of] it. I may be mistaken but also through huggingface.ai was something they started…

John Smith 11mo ago

it wouldn't be fair to call it Meta's work

it's open source, so a LOT of ppl worked on it

it was created by Meta, but since they made it open source Pandora's box cannot be turned back.

US centralized attempt at AI failed, hope they review their approach before it burns them even more

Thursday 5∞ 11mo ago

nostr:nevent1qqswr2fjxdm3eh5c89epsuma2yk75pg3atvglaw9mfnkxukvfaaxg6spzemhxue69uhkummnw3ezuum5v94k27fwdejhgq3qh8nk2346qezka5cpm8jjh3yl5j88pf4ly2ptu7s6uu55wcfqy0wqxpqqqqqqzhn2yf8

Daryn Cavalier 11mo ago

Also seems to be a shining example of how broken the fiat system is...

Imagine if you could just securely store your wealth in money.

StackSats.IO 9mo ago

Silicon Valley expected to evolve their investments into “AI”.

The whole ecosystem of startups and banking and financing and VC firms and Angels and equity for early employees and advisors - the whole thing which has been built up for 4-5 decades now just got rug pulled.

The stock market side is a legit ponzi on top of the incumbent structures.

China wants the world to revolve around real world production. Manufacturing.

America wants it to revolve around software protected by IP laws protected by the US military.

The winner is already obvious.

BITKARROT 9mo ago

Never be the first guy out the gate, you always will get shot. piggybacking is always cheaper. And there is no such thing as "hailed leader", when its software, every second someone is jockeying to take your place. Tough place to be, but only the strong survive.