Skip to main content
ModelSizesNotes
Qwen 30.6B, 1.7B, 4B, 8B, 14B, 32BDefault for demos; supports tool-calling
Qwen 3 (Advanced)4B-2507, 30B-A3B, 235B-A22B*, 480B-A35B*Enhanced variants with Instruct/Thinking modes; MoE support
Qwen 3 Coder30B-A3B, 480B-A35B*Specialized for code generation
Note: 235B and 480B models must be sharded across multiple GPUs for inference and training.
I