Global Feed Post Login
Replying to Avatar KaliYuga

That's not really Deepseek R1, it's a distilled version of Alibaba's Qwen-32B architecture, enhanced using synthetic outputs from the larger DeepSeek R1 model.

Quite useful but not hte same thing.

Avatar
DETERMINISTIC OPTIMISM 🌞 11mo ago

Yes, it's all described on the model choice

Reply to this note

Please Login to reply.

Discussion

No replies yet.