DeepSeek V3.2
Version: 1
Fireworks on Foundry
Models available for use with Fireworks on Foundry deliver optimized, best-in-class performance on the Fireworks Inference Cloud. Fireworks on Foundry is a Preview subject to Azure Preview terms and the following supplemental terms: When you use Fireworks on Foundry, data is shared between Microsoft and Fireworks AI, and different compliance and data handling rules will apply. See documentation for details. Customers are responsible for evaluating whether data sharing between Microsoft and Fireworks is appropriate for their organization’s compliance requirements.
Key capabilities
About this model
DeepSeek-V3.2 is a large language model from DeepSeek AI that combines high computational efficiency with strong reasoning and agent performance. It uses a Mixture-of-Experts (MoE) architecture with 675.2 billion total parameters and supports a context length of 163.8k tokens. DeepSeek-V3.2 is calibrated and supports function calling for agentic workflows; a request sketch follows the capability list below.
Key model capabilities
- High computational efficiency via Mixture-of-Experts (MoE) architecture
- Superior reasoning and agent performance
- Function calling support for tool use and agentic workflows
- 163.8k token context window
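A minimal sketch of a function-calling request, assuming an OpenAI-compatible chat completions API. The endpoint URL, model identifier, and the get_order_status tool are placeholders for illustration, not values documented by the provider; substitute the details from your Fireworks on Foundry deployment.

```python
# Minimal function-calling sketch. The base_url, api_key, and model id are
# placeholders/assumptions; use the values from your own deployment.
import json
from openai import OpenAI

client = OpenAI(
    base_url="https://example-endpoint/v1",  # assumed OpenAI-compatible endpoint
    api_key="YOUR_API_KEY",
)

# Describe a tool the model may choose to call (hypothetical example tool).
tools = [{
    "type": "function",
    "function": {
        "name": "get_order_status",
        "description": "Look up the shipping status of an order by id.",
        "parameters": {
            "type": "object",
            "properties": {"order_id": {"type": "string"}},
            "required": ["order_id"],
        },
    },
}]

response = client.chat.completions.create(
    model="deepseek-v3p2",  # placeholder model id
    messages=[{"role": "user", "content": "Where is order 8123?"}],
    tools=tools,
)

# If the model chose to call the tool, the arguments arrive as a JSON string.
message = response.choices[0].message
if message.tool_calls:
    call = message.tool_calls[0]
    print(call.function.name, json.loads(call.function.arguments))
else:
    print(message.content)
```

After executing the tool locally, its result is typically sent back to the model in a follow-up message with role "tool" so the model can compose the final answer.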
Use cases
See Responsible AI for additional considerations for responsible use.
Key use cases
- Conversational AI
- Code assistance
- Agentic systems
- Enterprise RAG (retrieval-augmented generation); a prompt-assembly sketch follows this list
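A minimal retrieval-augmented generation sketch for the enterprise RAG use case, assuming the same OpenAI-compatible endpoint as above. The retrieve function, passages, endpoint URL, and model identifier are stand-ins for illustration only.

```python
# Minimal RAG prompt-assembly sketch. Retrieval is stubbed; endpoint and
# model id are placeholders, as in the function-calling sketch above.
from openai import OpenAI

client = OpenAI(base_url="https://example-endpoint/v1", api_key="YOUR_API_KEY")

def retrieve(query: str) -> list[str]:
    # Stand-in for a real vector-search step over enterprise documents.
    return [
        "Policy 4.2: Laptops are refreshed every 36 months.",
        "Policy 4.3: Refresh requests go through the IT portal.",
    ]

question = "How often are laptops replaced?"
context = "\n".join(f"- {p}" for p in retrieve(question))

response = client.chat.completions.create(
    model="deepseek-v3p2",  # placeholder model id
    messages=[
        {"role": "system",
         "content": "Answer using only the provided context. If the answer "
                    "is not in the context, say so.\n\nContext:\n" + context},
        {"role": "user", "content": question},
    ],
)
print(response.choices[0].message.content)
```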
Out of scope use cases
The provider has not supplied this information.
Pricing
Pricing is based on a number of factors, including deployment type and tokens used. See pricing details here.
Technical specs
The provider has not supplied this information.
Training cut-off date
The provider has not supplied this information.
Training time
The provider has not supplied this information.
Input formats
Text
Output formats
Text
Supported languages
English, Chinese
Sample JSON response
The provider has not supplied this information.
Model architecture
DeepSeek V3.2 is a Mixture-of-Experts (MoE) language model.

| Property | Value |
|---|---|
| Total Parameters | 675.2B |
| Architecture | Mixture-of-Experts (MoE) |
Long context
Context Length: 163.8k tokens
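With a 163,840-token window, it can help to budget prompt size before sending a request. The sketch below uses a rough characters-per-token heuristic rather than the model's actual tokenizer, so treat it only as an approximation.

```python
# Rough context-budget check. The ~4 characters-per-token ratio is a crude
# heuristic, not DeepSeek's actual tokenizer; use it only for ballpark checks.
CONTEXT_WINDOW = 163_840          # tokens, per the model specifications
RESERVED_FOR_OUTPUT = 4_096       # tokens kept free for the completion (assumed)
CHARS_PER_TOKEN = 4               # rough heuristic for English text

def approx_tokens(text: str) -> int:
    return max(1, len(text) // CHARS_PER_TOKEN)

def fits_in_window(prompt: str) -> bool:
    return approx_tokens(prompt) + RESERVED_FOR_OUTPUT <= CONTEXT_WINDOW

prompt = "example prompt text " * 1000  # stand-in for an assembled prompt
print(approx_tokens(prompt), fits_in_window(prompt))
```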
Optimizing model performance
The provider has not supplied this information.
Additional assets
The provider has not supplied this information.
Training disclosure
Training, testing and validation
The provider has not supplied this information.
Distribution
Distribution channels
The provider has not supplied this information.
More information
Model Specifications

| Property | Value |
|---|---|
| Context Length | 163,840 tokens |
| License | Other |
| Last Updated | April 2026 |
| Input Type | Text |
| Output Type | Text |
| Provider | Fireworks |
| Languages | 2 languages |