Also extremely bullish on QwQ and the upcoming QWEN models
QwQ allegedly on par with deepseek R1 (real, not the tiny fine tunes)
Can get 50 T/s on a 3090 with SGLang
Librechat looks like it has good support for MCP so the weekend project is looking like replace as much of my internet usage with MCP tools