I mean, you can always go extra mile if you want, but it doesn’t matter as much because they are offline by design
LM Studio was proposed in another comment, that’s why I mentioned it
In ollama I don’t like that you can’t choose your own quantization level/method (or I just haven’t figured it out). Nevertheless it’s a great server