The new M3 Max is pretty crazy. I can get quick inference on a 13b Llama 2 chat model.

Reply to this note

Please Login to reply.

Discussion

No replies yet.