qwen3-embedding-0.6b-generic-cpu
qwen3-embedding-0.6b-generic-cpu
Version: 1
MicrosoftLast updated April 2026

Qwen3 Embedding 0.6B Generic Cpu

This is the CPU-optimized variant of qwen3-embedding-0.6b, a text embedding model from the Qwen3 family developed by Alibaba Cloud and optimized by Microsoft.

Model Details

  • Model Type: Text Embedding (ONNX)
  • Parameters: 0.6 billion
  • Context Length: 32K tokens
  • Embedding Dimension: Up to 1024
  • Quantization: KLD Gradient quantization
  • Target Device: CPU
  • Execution Provider: CPUExecutionProvider
  • Supported Languages: 100+

Intended Use

This model is optimized for local execution on devices with CPU hardware acceleration using Foundry Local.

Capabilities

  • Text retrieval and semantic search
  • Code retrieval
  • Text classification and clustering
  • Bitext mining
  • Multilingual and cross-lingual retrieval

License

This model is licensed under Apache 2.0. See license details .

Source

Model Specifications
LicenseApache-2.0
Last UpdatedApril 2026
Input TypeText
Output TypeText
ProviderMicrosoft