Qwen: Qwen3 Coder 480B A35B

Qwen3-Coder-480B-A35B

Qwen3 Coder 480B A35B Instruct is a Mixture-of-Experts (MoE) model from the Qwen team, designed specifically for advanced code generation. It excels at agentic coding tasks such as function calling, tool use, and long-context reasoning across large repositories. The model contains 480B total parameters, with 35B active per forward pass (8 of 160 experts). Alibaba’s endpoint pricing depends on context length, with higher rates applying once inputs exceed 128K tokens.
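The "8 of 160 experts" figure can be illustrated with a minimal sketch of top-k expert routing. This is a generic illustration of the technique, not the Qwen implementation: the function name `route_token`, the random router logits, and the choice to apply a softmax over only the selected experts are all assumptions made for the example. Only the counts 160 and 8 come from the description above.

```python
import math
import random

NUM_EXPERTS = 160   # from the model description
TOP_K = 8           # experts active per token, from the model description


def route_token(router_logits, top_k=TOP_K):
    """Return (expert_index, weight) pairs for the top_k highest-scoring experts.

    Hypothetical helper: weights are a softmax over the selected logits only,
    so they sum to 1. The real model's normalization may differ.
    """
    if len(router_logits) < top_k:
        raise ValueError("need at least top_k router logits")
    # Indices of the top_k largest logits.
    chosen = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)[:top_k]
    # Numerically stable softmax restricted to the chosen experts.
    peak = max(router_logits[i] for i in chosen)
    exps = [math.exp(router_logits[i] - peak) for i in chosen]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(chosen, exps)]


if __name__ == "__main__":
    rng = random.Random(0)  # made-up logits, for illustration only
    logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
    for expert, weight in route_token(logits):
        print(f"expert {expert:3d}  weight {weight:.3f}")
    print(f"fraction of experts used: {TOP_K / NUM_EXPERTS:.1%}")
```

Each token is processed by only the selected experts, which is why the active parameter count (35B) is far smaller than the total (480B).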

Creator: Alibaba
Release Date: July 2025
License: Apache 2.0
Context Window: 262,144 tokens
Image Input Support: No
Open Source (Weights): Yes
Parameters: 480B total, 35B active at inference time