How do I run ostrich-70 in ollama? What are the different versions of each quantization?
Discussion
download one of the GGUF files based on your VRAM or RAM size.
apparently there is web ui now:
https://github.com/open-webui/open-webui
if that doesnt work there is some explanation here
https://www.reddit.com/r/ollama/comments/1bgej4v/can_you_run_custom_models/