This sounds like cope.

Possibly they fine-tuned DeepSeek on an OpenAI model's outputs (cheaper than using human annotators), but it makes no sense for that to be the primary approach when self-supervised learning and RL are far more efficient. Also, DeepSeek outperforms OpenAI on several benchmarks - you can't achieve that purely by distilling a teacher model.
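For context, "distilling a teacher model" usually means training the student to match the teacher's soft output distribution rather than hard labels. A minimal sketch of the standard temperature-scaled distillation loss in plain NumPy (all names and values here are illustrative, not anyone's actual training code):

```python
import numpy as np

def softmax(logits, T=1.0):
    # Temperature-scaled softmax; higher T softens the distribution.
    z = logits / T
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, T=2.0):
    # Cross-entropy between the teacher's softened distribution and the
    # student's: the student learns to mimic the teacher's probabilities.
    p_teacher = softmax(teacher_logits, T)
    log_p_student = np.log(softmax(student_logits, T) + 1e-12)
    return -(p_teacher * log_p_student).sum(axis=-1).mean()
```

The point being: this loss can only pull the student toward the teacher's behaviour, which is why pure distillation can't explain a student beating its teacher on benchmarks.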

Likely they made a technical breakthrough and the US "AI czar" is seething.


Discussion

They have said themselves that they did not make a technical breakthrough. They just open-sourced everything.

Here are some of the specific technical things they did to achieve lower costs: a more efficient training procedure, memory compression, heavy reliance on RL, and low-level code optimisation.

https://www.analyticsvidhya.com/blog/2025/01/how-deepseek-trained-ai-30-times-cheaper/
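"Memory compression" here plausibly refers to something like DeepSeek's multi-head latent attention, where the model caches a small low-rank latent per token and reconstructs keys/values from it, instead of caching full K/V tensors. A rough sketch of just the caching idea (dimensions and matrix names are made up for illustration, not DeepSeek's actual ones):

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, d_latent, seq_len = 1024, 128, 4096

# Hypothetical projection matrices (these would be learned in a real model).
W_down = rng.standard_normal((d_model, d_latent)) / np.sqrt(d_model)
W_up_k = rng.standard_normal((d_latent, d_model)) / np.sqrt(d_latent)
W_up_v = rng.standard_normal((d_latent, d_model)) / np.sqrt(d_latent)

h = rng.standard_normal((seq_len, d_model))  # token hidden states

# Cache only the small latent instead of full K and V...
latent = h @ W_down                 # (seq_len, d_latent) -- this is what's stored
# ...and reconstruct K and V on the fly at attention time.
k = latent @ W_up_k                 # (seq_len, d_model)
v = latent @ W_up_v                 # (seq_len, d_model)

full_cache = 2 * seq_len * d_model  # floats needed to cache full K + V
compressed = seq_len * d_latent     # floats needed for the latent
print(f"cache reduction: {full_cache / compressed:.0f}x")  # prints "cache reduction: 16x"
```

With these toy dimensions the KV cache shrinks 16x, at the cost of two extra matrix multiplies per attention call - a classic memory-for-compute trade.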

open source > stupid copyright bullshit

the best part of #deepsnek is that this is gonna crater all the closed-source projects' prospects for future funding

investors will be like, "closed source means expensive, pass"

oh, there was another thing that is gonna come out of this that is awesome too

AMD was lagging in the general-purpose compute space despite its simpler, cheaper hardware and open-source AI software

now they will be looked at again for further cost benefits

AI is now at that stage in its development like when you are writing code and it works, but it's slow, expensive and a bit clunky

it works!

but now the optimizations start and the race is on for the most streamlined implementations

personally, i am looking forward to when someone builds a model out of the nostr and all of the web pages embedded in the links on it, this will be epic

Even Meta has been giving up on closed source.

Closed-source products open to the public are sort of pointless, anyway. Either something is a secret, and then you keep it to yourself, or it's not and then who cares.