Phi-4
Phi-4 14B, a highly capable model for low latency scenarios.Phi-4 is a state-of-the-art open model built upon a blend of synthetic datasets, data from filtered public domain websites, and acquired academic books and Q&A datasets. The goal of this approach was to ensure that small capable models were trained with data focused on high quality and advanced reasoning.
Phi-4 underwent a rigorous enhancement and alignment process, incorporating both supervised fine-tuning and direct preference optimization to ensure precise instruction adherence and robust safety measures.
For more information, reference the Phi-4 Technical Report .
Model Architecture
Phi-4 is a 14B parameters, dense decoder-only transformer model.Training Data
Our training data is an extension of the data used for Phi-3 and includes a wide variety of sources from:- Publicly available documents filtered rigorously for quality, selected high-quality educational data, and code.
- Newly created synthetic, "textbook-like" data for the purpose of teaching math, coding, common sense reasoning, general knowledge of the world (science, daily activities, theory of mind, etc.).
- Acquired academic books and Q&A datasets.
- High quality chat format supervised data covering various topics to reflect human preferences on different aspects such as instruct-following, truthfulness, honesty and helpfulness.
Quick facts
Model providerMicrosoft
TypeChat completion
LifecycleGenerally available (GA)
Input typetext
Output typetext
Context window16384
Token limits16384 output
PricingView pricing