"Qwen3 Coder 30B A3B" is one of the largest LLMs I run locally at 36GB! But, it's faster than models half its size!
The secret is the "A3B" in the name, which means only 3 of the 30 billion parameters are "active" per token. This allows it to be SMARTER than smaller models, while being FASTER at the same time. Plug it into VS Code with Cline's agentic coding extension and get 👏 back 👏 to 👏 vibin'! #vibecoding