Supported Models

Model	Sizes	Notes
Qwen 3	0.6B, 1.7B, 4B, 8B, 14B, 32B	Default for demos; supports tool-calling
Qwen 3 (Advanced)	4B-2507, 30B-A3B, 235B-A22B, 480B-A35B	Enhanced variants with Instruct/Thinking modes; MoE support
Qwen 3 Coder	30B-A3B, 480B-A35B*	Specialized for code generation

Note: The 235B-A22B and 480B-A35B models are too large for a single GPU and must be sharded across multiple GPUs for both inference and training.
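
To see why the largest models need sharding, a rough weight-memory estimate helps. The sketch below is illustrative only and rests on assumptions not stated in this page: weights stored in 16-bit precision (2 bytes per parameter) and 80 GB of memory per GPU. The helper names `weight_memory_gb` and `min_gpus_for_weights` are hypothetical. The estimate counts weights only, so real requirements (activations, KV cache, optimizer state during training) will be higher. For MoE models the total parameter count is what matters here, since all experts must be resident in memory even though only a subset is active per token.

```python
import math

BYTES_PER_PARAM = 2   # assumption: 16-bit (bf16/fp16) weights
GPU_MEMORY_GB = 80    # assumption: 80 GB of memory per GPU


def weight_memory_gb(params_billions: int, bytes_per_param: int = BYTES_PER_PARAM) -> int:
    """Approximate weight memory in GB (1 GB = 1e9 bytes).

    params_billions * 1e9 params * bytes_per_param / 1e9 bytes-per-GB
    simplifies to params_billions * bytes_per_param.
    """
    return params_billions * bytes_per_param


def min_gpus_for_weights(params_billions: int, gpu_memory_gb: int = GPU_MEMORY_GB) -> int:
    """Lower bound on GPU count needed just to hold the weights."""
    return math.ceil(weight_memory_gb(params_billions) / gpu_memory_gb)


if __name__ == "__main__":
    # Total parameter counts (in billions) taken from the model names above.
    for name, size_b in [("235B-A22B", 235), ("480B-A35B", 480)]:
        print(
            f"{name}: ~{weight_memory_gb(size_b)} GB of weights, "
            f"at least {min_gpus_for_weights(size_b)} GPUs"
        )
```

Under these assumptions the weights alone far exceed a single GPU's memory, which is consistent with the sharding requirement in the note. Treat the GPU counts as a lower bound, not a deployment recommendation.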
