True, and there are still lots of rumors going around about the actual training hardware for deepseek, but I still think open sourcing a SOTA model is a huge step forward. One of the things the community has been best at is reducing the compute cost for various models.

Reply to this note

Please Login to reply.

Discussion

No replies yet.