70B Llama 2 at 35tokens/second on 4090

Link: https://github.com/turboderp/exllamav2

Discussion: https://news.ycombinator.com/item?id=37492986

Reply to this note

Please Login to reply.

Discussion

No replies yet.