FW-Qwen3-14B

Qwen3 14B is a 14.8B-parameter dense causal language model from Alibaba featuring seamless switching between thinking and non-thinking modes, strong reasoning, coding, and agent capabilities, support for 100+ languages, and a 131K token context window with

Fireworks

Version: 1

Fireworks on Foundry

Models available for use with Fireworks on Foundry deliver optimized, best-in-class performance on the Fireworks Inference Cloud. Fireworks on Foundry is a Non-Microsoft Product. The following terms apply to a Customer's use of Fireworks on Foundry: When you use Fireworks on Foundry, data is shared between Microsoft and Fireworks AI, Customer Data will be sent outside of Microsoft systems, Customer Data will not be processed pursuant to any Foundry data residency documentation, and different compliance and data handling rules will apply. See Trust Center - Fireworks AI for details. Customers are responsible for evaluating whether data sharing between Microsoft and Fireworks is appropriate for their organization's compliance requirements.

Key capabilities

About this model

Qwen3-14B is a 14.8 billion parameter dense causal language model from Alibaba's Qwen3 series. Built on extensive pre-training and post-training, Qwen3-14B delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support. It uniquely supports seamless switching between thinking mode (for complex logical reasoning, math, and coding) and non-thinking mode (for efficient, general-purpose dialogue) within a single model, ensuring optimal performance across various scenarios.

Key model capabilities

Seamless switching between thinking mode and non-thinking mode within a single model
Significantly enhanced reasoning capabilities, surpassing previous QwQ (in thinking mode) and Qwen2.5 instruct models (in non-thinking mode) on mathematics, code generation, and commonsense logical reasoning
Superior human preference alignment, excelling in creative writing, role-playing, multi-turn dialogues, and instruction following
Expertise in agent capabilities, with precise integration with external tools in both thinking and non-thinking modes, achieving leading performance among open-source models on complex agent-based tasks
Support for 100+ languages and dialects with strong multilingual instruction following and translation
131K token extended context length via YaRN
Streaming and function calling support

Use cases

Pricing

Technical specs

Training disclosure

Distribution

More information

Quick facts

Model providerFireworks

TypeChat completion

LifecycleGenerally available (GA)

Input typetext

Output typetext

Context window131.072k

PricingView pricing

FW-Qwen3-14B

About this model

Key model capabilities

Quick facts

Quick start