M2 Ultra can run 128 streams of Llama 2 7B in parallel

Comments ( https://news.ycombinator.com/item?id=37846387 )

https://github.com/ggerganov/llama.cpp/pull/3228

Reply to this note

Please Login to reply.

Discussion

No replies yet.