DeepSeek R2's specifications:

- Unit cost reduced by 97.3%, ready for immediate release

- Its self-developed distributed training framework achieves 82% utilization of 910B chip clusters

- Reaches 512 PetaFLOPS computing power under FP16 precision

- Achieves 91% efficiency compared to same-scale A100 clusters (data verified by Huawei laboratory)

Reply to this note

Please Login to reply.

Discussion

No replies yet.