M2 Ultra can run 128 streams of Llama 2 7B in parallel
Comments ( https://news.ycombinator.com/item?id=37846387 )
https://github.com/ggerganov/llama.cpp/pull/3228