mistral-7b-instruct feels like chatgpt, its running fast on my macbook, and its only a 5GB model. Wow!
https://cdn.jb55.com/s/9d74cabac4f4c15f.txt
ollama is also good. i think it is using GPU better than llama.cpp.
Please Login to reply.
No replies yet.