FW-Qwen3-14B

FW-Qwen3-14B

Qwen3 14B is a 14.8B-parameter dense causal language model from Alibaba featuring seamless switching between thinking and non-thinking modes, strong reasoning, coding, and agent capabilities, support for 100+ languages, and a 131K token context window with
Fireworks
Version: 1
Models available for use with Fireworks on Foundry deliver optimized, best-in-class performance on the Fireworks Inference Cloud. Fireworks on Foundry is a Non-Microsoft Product. The following terms apply to a Customer's use of Fireworks on Foundry: When you use Fireworks on Foundry, data is shared between Microsoft and Fireworks AI, Customer Data will be sent outside of Microsoft systems, Customer Data will not be processed pursuant to any Foundry data residency documentation, and different compliance and data handling rules will apply. See Trust Center - Fireworks AI for details. Customers are responsible for evaluating whether data sharing between Microsoft and Fireworks is appropriate for their organization's compliance requirements.

About this model

Qwen3-14B is a 14.8 billion parameter dense causal language model from Alibaba's Qwen3 series. Built on extensive pre-training and post-training, Qwen3-14B delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support. It uniquely supports seamless switching between thinking mode (for complex logical reasoning, math, and coding) and non-thinking mode (for efficient, general-purpose dialogue) within a single model, ensuring optimal performance across various scenarios.

Key model capabilities

  • Seamless switching between thinking mode and non-thinking mode within a single model
  • Significantly enhanced reasoning capabilities, surpassing previous QwQ (in thinking mode) and Qwen2.5 instruct models (in non-thinking mode) on mathematics, code generation, and commonsense logical reasoning
  • Superior human preference alignment, excelling in creative writing, role-playing, multi-turn dialogues, and instruction following
  • Expertise in agent capabilities, with precise integration with external tools in both thinking and non-thinking modes, achieving leading performance among open-source models on complex agent-based tasks
  • Support for 100+ languages and dialects with strong multilingual instruction following and translation
  • 131K token extended context length via YaRN
  • Streaming and function calling support

Quick facts

Model providerFireworks
TypeChat completion
LifecycleGenerally available (GA)
Input typetext
Output typetext
Context window131.072k