Qwen: Qwen3 30B A3B 2507

Qwen3-30B-A3B-2507

Qwen3 30B A3B 2507 Instruct is a 30.5B-parameter Mixture-of-Experts language model from the Qwen series, of which 3.3B parameters are active at inference time. It operates in non-thinking mode only and is optimized for instruction following, multilingual comprehension, and agentic tool use. With further training on instruction data, it delivers strong results across benchmarks in reasoning (AIME, ZebraLogic), coding (MultiPL-E, LiveCodeBench), and alignment (IFEval, WritingBench). Compared to its base non-instruct variant, it performs significantly better on open-ended and subjective tasks while maintaining robust factual accuracy and coding capabilities.

Creator: Alibaba
Release Date: July 2025
License: Apache 2.0
Context Window: 262,144 tokens
Image Input Support: No
Open Source (Weights): Yes
Parameters: 30.5B total, 3.3B active at inference time
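As a quick sanity check on the figures listed above, the following minimal sketch works out what share of the parameters is active at inference time and how the context window relates to powers of two. The constant names are illustrative and not part of any Qwen tooling; the values are copied from this page.

```python
# Figures copied from the model card above; names are illustrative only.
TOTAL_PARAMS_B = 30.5     # total parameters, in billions
ACTIVE_PARAMS_B = 3.3     # parameters active at inference time, in billions
CONTEXT_WINDOW = 262_144  # context window, in tokens

# Share of the parameters that is active at inference time.
active_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B

print(f"Active share of parameters: {active_fraction:.1%}")
print(f"Context window in units of 1,024 tokens: {CONTEXT_WINDOW // 1024}")
print(f"Context window equals 2**18: {CONTEXT_WINDOW == 2 ** 18}")
```

Roughly one parameter in nine is active at inference time, which is why the model is described by both a total and an active parameter count.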

Performance

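| Benchmark | Deepseek-V3-0324 | GPT-4o-0327 | Gemini-2.5-Flash Non-Thinking | Qwen3-235B-A22B Non-Thinking | Qwen3-30B-A3B Non-Thinking | Qwen3-30B-A3B-Instruct-2507 |
|---|---|---|---|---|---|---|
| Knowledge | | | | | | |
| MMLU-Pro | 81.2 | 79.8 | 81.1 | 75.2 | 69.1 | 78.4 |
| MMLU-Redux | 90.4 | 91.3 | 90.6 | 89.2 | 84.1 | 89.3 |
| GPQA | 68.4 | 66.9 | 78.3 | 62.9 | 54.8 | 70.4 |
| SuperGPQA | 57.3 | 51.0 | 54.6 | 48.2 | 42.2 | 53.4 |
| Reasoning | | | | | | |
| AIME25 | 46.6 | 26.7 | 61.6 | 24.7 | 21.6 | 61.3 |
| HMMT25 | 27.5 | 7.9 | 45.8 | 10.0 | 12.0 | 43.0 |
| ZebraLogic | 83.4 | 52.6 | 57.9 | 37.7 | 33.2 | 90.0 |
| LiveBench 20241125 | 66.9 | 63.7 | 69.1 | 62.5 | 59.4 | 69.0 |
| Coding | | | | | | |
| LiveCodeBench v6 (25.02-25.05) | 45.2 | 35.8 | 40.1 | 32.9 | 29.0 | 43.2 |
| MultiPL-E | 82.2 | 82.7 | 77.7 | 79.3 | 74.6 | 83.8 |
| Aider-Polyglot | 55.1 | 45.3 | 44.0 | 59.6 | 24.4 | 35.6 |
| Alignment | | | | | | |
| IFEval | 82.3 | 83.9 | 84.3 | 83.2 | 83.7 | 84.7 |
| Arena-Hard v2* | 45.6 | 61.9 | 58.3 | 52.0 | 24.8 | 69.0 |
| Creative Writing v3 | 81.6 | 84.9 | 84.6 | 80.4 | 68.1 | 86.0 |
| WritingBench | 74.5 | 75.5 | 80.5 | 77.0 | 72.2 | 85.5 |
| Agent | | | | | | |
| BFCL-v3 | 64.7 | 66.5 | 66.1 | 68.0 | 58.6 | 65.1 |
| TAU1-Retail | 49.6 | 60.3# | 65.2 | 65.2 | 38.3 | 59.1 |
| TAU1-Airline | 32.0 | 42.8# | 48.0 | 32.0 | 18.0 | 40.0 |
| TAU2-Retail | 71.1 | 66.7# | 64.3 | 64.9 | 31.6 | 57.0 |
| TAU2-Airline | 36.0 | 42.0# | 42.5 | 36.0 | 18.0 | 38.0 |
| TAU2-Telecom | 34.0 | 29.8# | 16.9 | 24.6 | 18.4 | 12.3 |
| Multilingualism | | | | | | |
| MultiIF | 66.5 | 70.4 | 69.4 | 70.2 | 70.8 | 67.9 |
| MMLU-ProX | 75.8 | 76.2 | 78.3 | 73.2 | 65.1 | 72.0 |
| INCLUDE | 80.1 | 82.1 | 83.8 | 75.6 | 67.8 | 71.9 |
| PolyMATH | 32.2 | 25.5 | 41.9 | 27.0 | 23.3 | 43.1 |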
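
To make the comparison with the earlier Qwen3-30B-A3B Non-Thinking model easier to read, the sketch below computes the score difference on a handful of rows from the table above. The selection of rows and the helper name `score_deltas` are illustrative choices, not part of the original evaluation; the numbers are copied from the table.

```python
# Scores copied from the table above, as
# (Qwen3-30B-A3B Non-Thinking, Qwen3-30B-A3B-Instruct-2507).
# Only an illustrative subset of rows is included.
scores = {
    "MMLU-Pro": (69.1, 78.4),
    "AIME25": (21.6, 61.3),
    "ZebraLogic": (33.2, 90.0),
    "LiveCodeBench v6": (29.0, 43.2),
    "Arena-Hard v2": (24.8, 69.0),
    "BFCL-v3": (58.6, 65.1),
    "TAU2-Telecom": (18.4, 12.3),
    "MultiIF": (70.8, 67.9),
}


def score_deltas(table):
    """Return new-minus-old score differences, rounded to one decimal place."""
    return {name: round(new - old, 1) for name, (old, new) in table.items()}


deltas = score_deltas(scores)

# Print the largest gains first; negative values are regressions.
for name, delta in sorted(deltas.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:<18} {delta:+.1f}")
```

On this subset the largest gains are in reasoning and alignment, while TAU2-Telecom and MultiIF show small regressions, so the update is not a uniform improvement across every row.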