qwen3-embedding-0.6b-generic-cpu
Version: 1
Qwen3 Embedding 0.6B Generic CPU
This is the CPU-optimized variant of qwen3-embedding-0.6b, a text embedding model from the Qwen3 family developed by Alibaba Cloud and optimized by Microsoft.
Model Details
- Model Type: Text Embedding (ONNX)
- Parameters: 0.6 billion
- Context Length: 32K tokens
- Embedding Dimension: Up to 1024
- Quantization: KLD Gradient quantization
- Target Device: CPU
- Execution Provider: CPUExecutionProvider
- Supported Languages: 100+
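The "Up to 1024" embedding dimension implies that shorter vectors can be used when storage or search speed matters. The following is a minimal sketch of one common way to do that: keep the leading components and rescale to unit length. It assumes the model's embeddings tolerate this kind of prefix truncation, which the details above do not state explicitly. The function name `truncate_and_normalize` and the toy vector are hypothetical and stand in for real model output.

```python
import math


def truncate_and_normalize(vec, dim):
    """Keep the first `dim` components of `vec` and rescale to unit L2 norm.

    Hypothetical helper: assumes prefix truncation is valid for this model.
    """
    if not 0 < dim <= len(vec):
        raise ValueError("dim must be between 1 and len(vec)")
    head = vec[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        # An all-zero prefix cannot be normalized; return it unchanged.
        return list(head)
    return [x / norm for x in head]


# Toy 4-dimensional vector standing in for a 1024-dimensional embedding.
full = [3.0, 4.0, 12.0, 0.0]
short = truncate_and_normalize(full, 2)
```

Renormalizing after truncation keeps cosine similarity and dot product equivalent for the shortened vectors.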
Intended Use
This model is optimized for local execution on CPU devices using Foundry Local.
Capabilities
- Text retrieval and semantic search
- Code retrieval
- Text classification and clustering
- Bitext mining
- Multilingual and cross-lingual retrieval
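Retrieval and semantic search with an embedding model usually come down to comparing a query vector against document vectors. The sketch below ranks documents by cosine similarity. It is illustrative only: the three-dimensional vectors are made-up stand-ins for real embeddings, and the function names are hypothetical rather than part of any Foundry Local or Qwen API.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return (document index, score) pairs, most similar first."""
    scored = [(i, cosine_similarity(query_vec, v)) for i, v in enumerate(doc_vecs)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Made-up vectors standing in for embeddings produced by the model.
docs = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.7, 0.7, 0.0],
]
query = [1.0, 0.1, 0.0]
ranking = rank_documents(query, docs)
```

The same ranking applies to code retrieval and cross-lingual retrieval; only the text that gets embedded changes.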
License
This model is licensed under Apache 2.0. See license details.
Source
- HuggingFace: Qwen3-Embedding-0.6B
Model Specifications
- License: Apache-2.0
- Last Updated: April 2026
- Input Type: Text
- Output Type: Text
- Provider: Microsoft