That's not really Deepseek R1, it's a distilled version of Alibaba's Qwen-32B architecture, enhanced using synthetic outputs from the larger DeepSeek R1 model.
Quite useful but not hte same thing.
Yes, it's all described on the model choice
Please Login to reply.
No replies yet.