New model: Qwen3 30B A3B 2507 is now available!

https://nano-gpt.com/conversation?model=qwen3-30b-a3b-instruct-2507

This 30.5B-parameter mixture-of-experts language model from Qwen activates only 3.3B parameters per token, roughly a tenth of its weights, which keeps inference efficient. Compared to its predecessor, it brings significant improvements in general capabilities, including instruction following, logical reasoning, text comprehension, mathematics, science, coding, and tool usage.
