How much fucking VRAM do you need to run this model?

Reply to this note

Please Login to reply.

Discussion

Dunno, but here a pure C Llama2 model that runs crazy fast on cpu

https://github.com/karpathy/llama2.c