What size is useful for what tasks?
I'm looking for use cases for models up to 12B parameters (the largest that runs on my Air).
So far I've found just one: summarising meeting transcripts.
Even MCP tool calling is very unreliable with models of this size.
Unfortunately, not much. Can't you run anything larger, even quantized?
No, 16 GB of memory can't fit anything larger.
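For anyone wanting to check the arithmetic behind that limit, here is a rough sketch. It only counts the weights (parameters times bits per weight, divided by 8), so treat the numbers as a lower bound: the KV cache, runtime overhead, and the OS all need memory too, and real quantized formats use slightly more than their nominal bits per weight. The function name and the model sizes in the loop are just illustrative choices, not taken from any tool.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights only, in GB (10^9 bytes).

    Assumption: every weight is stored at the same bit width, and nothing
    else (KV cache, activations, runtime, OS) is counted.
    """
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


if __name__ == "__main__":
    # Illustrative model sizes (billions of parameters) and bit widths.
    for params in (7, 12, 27):
        for bits in (16, 8, 4):
            size = weight_memory_gb(params, bits)
            print(f"{params}B @ {bits}-bit: ~{size:.1f} GB of weights")
```

By this estimate a 12B model needs about 24 GB of weights at 16-bit and about 6 GB at 4-bit, while a 27B model at 4-bit is already around 13.5 GB, which leaves very little of a 16 GB machine for everything else.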