Qwen3 Embedding 0.6B Generic Cpu

This is the CPU-optimized variant of qwen3-embedding-0.6b, a text embedding model from the Qwen3 family developed by Alibaba Cloud and optimized by Microsoft.

Model Details

Model Type: Text Embedding (ONNX)
Parameters: 0.6 billion
Context Length: 32K tokens
Embedding Dimension: Up to 1024
Quantization: KLD Gradient quantization
Target Device: CPU
Execution Provider: CPUExecutionProvider
Supported Languages: 100+

Intended Use

This model is optimized for local execution on devices with CPU hardware acceleration using Foundry Local.

Capabilities

Text retrieval and semantic search
Code retrieval
Text classification and clustering
Bitext mining
Multilingual and cross-lingual retrieval

License

This model is licensed under Apache 2.0. See license details .

Source

HuggingFace: Qwen3-Embedding-0.6B

Model Specifications

LicenseApache-2.0

Last UpdatedApril 2026

Input TypeText

Output TypeText

ProviderMicrosoft

Quick Start