Subnostr

mistral-7b-instruct feels like ChatGPT. It's running fast on my MacBook, and it's only a 5GB model. Wow!

Ollama is also good. I think it uses the GPU better than llama.cpp.
