Qwen: Qwen3 Coder 480B A35B

Qwen3-Coder-480B-A35B

Qwen3 Coder 480B A35B Instruct is a Mixture-of-Experts (MoE) model from the Qwen team, designed specifically for advanced code generation. It excels at agentic coding tasks such as function calling, tool use, and long-context reasoning across large repositories. The model contains 480B total parameters, with 35B active per forward pass (8 of 160 experts). Alibaba’s endpoint pricing depends on context length, with higher rates applying once inputs exceed 128K tokens.

Conversations

Download TXT
Download PDF

Creator Alibaba
Release Date July, 2025
License Apache 2.0
Context Window 262,144
Image Input Support No
Open Source (Weights) Yes
Parameters 480B, 35.0B active at inference time
Model Weights Click here

Leave a Reply

Your email address will not be published. Required fields are marked *