
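This March 2025 release of DeepSeek V3 is a Mixture-of-Experts model and the newest generation in DeepSeek’s flagship chat model family. Its published checkpoint totals 685B parameters: 671B in the main model, with the remainder in the multi-token prediction module. As the successor to the original DeepSeek V3, it delivers strong performance across a wide range of tasks.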
Creator | DeepSeek |
Release Date | March 2025 |
License | MIT |
Context Window | 128,000 tokens |
Image Input Support | No |
Open Source (Weights) | Yes |
Parameters | 671B total, 37B active at inference time |
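
To put the parameter figures in the table above in perspective, here is a minimal sketch of the arithmetic behind "37B active" out of 671B. The function names and the `bytes_per_param` values are illustrative assumptions, not part of any DeepSeek tooling, and the memory estimate covers weights only (no KV cache or activations).

```python
# Figures from the table above, in billions of parameters.
TOTAL_PARAMS_B = 671
ACTIVE_PARAMS_B = 37


def active_fraction(active_b: float, total_b: float) -> float:
    """Share of the model's parameters used to process a single token."""
    return active_b / total_b


def weight_memory_gb(params_b: float, bytes_per_param: float) -> float:
    """Rough weight storage in GB (1 GB = 1e9 bytes).

    bytes_per_param is an assumption about the storage precision,
    e.g. 1 for 8-bit weights or 2 for 16-bit weights.
    """
    # Billions of parameters times bytes per parameter gives gigabytes.
    return params_b * bytes_per_param


if __name__ == "__main__":
    share = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
    print(f"Active share per token: {share:.1%}")
    print(f"Weights at 1 byte/param: ~{weight_memory_gb(TOTAL_PARAMS_B, 1):.0f} GB")
    print(f"Weights at 2 bytes/param: ~{weight_memory_gb(TOTAL_PARAMS_B, 2):.0f} GB")
```

Roughly 5.5% of the parameters are active for each token, which is why a Mixture-of-Experts model of this size has a per-token compute cost closer to that of a much smaller dense model, even though all of the weights must still be stored.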