DeepSeek-V3-0324
DeepSeek-V3-0324 demonstrates notable improvements over its predecessor, DeepSeek-V3, in several key aspects, including enhanced reasoning, improved function calling, and superior code generation capabilities.
Direct from Azure models are a select portfolio curated for their market-differentiated capabilities:
- Secure and managed by Microsoft: Purchase and manage models directly through Azure with a single license, consistent support, and no third-party dependencies, backed by Azure's enterprise-grade infrastructure.
- Streamlined operations: Benefit from unified billing, governance, and seamless provisioned throughput unit (PTU) portability across models hosted on Azure, all part of Microsoft Foundry.
- Future-ready flexibility: Access the latest models as they become available, and easily test, deploy, or switch between them within Microsoft Foundry, reducing integration effort.
- Cost control and optimization: Scale on demand with pay-as-you-go flexibility or reserve PTUs for predictable performance and savings.
About this model
DeepSeek-V3-0324 shows significant improvements over its predecessor, DeepSeek-V3, in several key aspects.
Key model capabilities
Reasoning Capabilities
Significant improvements in benchmark performance:
- MMLU-Pro: 75.9 → 81.2 (+5.3)
- GPQA: 59.1 → 68.4 (+9.3)
- AIME: 39.6 → 59.4 (+19.8)
- LiveCodeBench: 39.2 → 49.2 (+10.0)
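The point gains quoted for each benchmark can be rechecked with a few lines of arithmetic. The following is a minimal sketch: the scores are copied from the list above, while the `scores` table, the `deltas` helper, and the relative-gain column are illustrative additions rather than part of the model card.

```python
# Benchmark scores as (DeepSeek-V3, DeepSeek-V3-0324), copied from the list above.
scores = {
    "MMLU-Pro": (75.9, 81.2),
    "GPQA": (59.1, 68.4),
    "AIME": (39.6, 59.4),
    "LiveCodeBench": (39.2, 49.2),
}


def deltas(table):
    """Return (absolute point gain, relative gain in %) for each benchmark."""
    result = {}
    for name, (before, after) in table.items():
        gain = round(after - before, 1)  # rounding hides float noise
        relative = round(100 * (after - before) / before, 1)
        result[name] = (gain, relative)
    return result


if __name__ == "__main__":
    for name, (gain, relative) in deltas(scores).items():
        print(f"{name}: +{gain} points ({relative}% relative)")
```

The absolute gains reproduce the figures in parentheses; the relative gain is extra context and depends on the baseline score, so it should not be compared across benchmarks.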
Front-End Web Development
- Improved code executability
- More aesthetically pleasing web pages and game front-ends
Chinese Writing Proficiency
Enhanced style and content quality:
- Aligned with the R1 writing style
- Better quality in medium-to-long-form writing
Feature enhancements:
- Improved multi-turn interactive rewriting
- Optimized translation quality and letter writing
Chinese Search Capabilities
- Enhanced report analysis requests with more detailed outputs
Function Calling Improvements
- Increased function calling accuracy, fixing issues found in previous V3 versions
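To make the function calling capability concrete, here is a minimal sketch of the application side of a tool call. It assumes the model's call arrives as a tool name plus a JSON-encoded argument string, a convention common to chat-completion APIs; the exact wire format depends on the endpoint and is not specified in this model card. The `get_weather` tool, the `TOOLS` registry, and the `dispatch` helper are all hypothetical names used for illustration, and no network request is made.

```python
import json

# Hypothetical tool description in the JSON Schema style commonly used by
# chat-completion APIs; the name and fields are illustrative assumptions.
GET_WEATHER_TOOL = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Return the current temperature for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}


def get_weather(city: str) -> dict:
    """Stand-in implementation; a real tool would query a weather service."""
    return {"city": city, "temperature_c": 21}


# Local registry mapping tool names to Python callables.
TOOLS = {"get_weather": get_weather}


def dispatch(tool_call: dict) -> str:
    """Look up the requested tool, decode its arguments, and run it."""
    name = tool_call["name"]
    if name not in TOOLS:
        raise ValueError(f"unknown tool: {name}")
    args = json.loads(tool_call["arguments"])  # arguments arrive as JSON text
    return json.dumps(TOOLS[name](**args))


if __name__ == "__main__":
    # Simulated tool call, standing in for what a model might emit.
    call = {"name": "get_weather", "arguments": '{"city": "Paris"}'}
    print(dispatch(call))
```

Rejecting unknown tool names before decoding arguments keeps a malformed or hallucinated call from reaching application code; the JSON result string would then be sent back to the model as the tool's reply.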
Quick facts
Model provider: DeepSeek
Type: Chat completion
Lifecycle: Generally available (GA)
Input type: text
Output type: text
Context window: 131.072k
Token limits: 131.072k output
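The context window and output limit in the quick facts can be turned into a simple budget check before sending a request. This is a sketch under two stated assumptions: that "131.072k" means 131,072 tokens, and that prompt and completion tokens share the same context window. The `max_output_tokens` helper and its parameter names are hypothetical.

```python
CONTEXT_WINDOW = 131_072  # assumed reading of "131.072k" from the quick facts
OUTPUT_LIMIT = 131_072    # assumed reading of "131.072k output"


def max_output_tokens(prompt_tokens: int,
                      context_window: int = CONTEXT_WINDOW,
                      output_limit: int = OUTPUT_LIMIT) -> int:
    """Return how many completion tokens still fit after the prompt.

    Assumes prompt and completion share one context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = context_window - prompt_tokens
    # Never exceed the output limit, and never return a negative budget.
    return max(0, min(output_limit, remaining))


if __name__ == "__main__":
    print(max_output_tokens(100_000))
```

A result of 0 means the prompt alone fills or exceeds the assumed window and should be shortened before the request is made.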