Llama-4-Scout-17B-16E

Llama-4-Scout-17B-16E

Llama 4 Scout 17B 16E is great at multi-document summarization, parsing extensive user activity for personalized tasks, and reasoning over vast codebases.
Meta
Version: 1
The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding. These Llama 4 models mark the beginning of a new era for the Llama ecosystem. This release includes two efficient models in the Llama 4 series, Llama 4 Scout, a 17 billion parameter model with 16 experts, and Llama 4 Maverick, a 17 billion parameter model with 128 experts. Model developer: Meta Model Architecture: The Llama 4 models are auto-regressive language models that use a mixture-of-experts (MoE) architecture and incorporate early fusion for native multimodality.
Model NameTraining DataParamsInput modalitiesOutput modalitiesContext lengthToken countKnowledge cutoff
Llama 4 Scout (17Bx16E)A mix of publicly available, licensed data and information from Meta’s products and services. This includes publicly shared posts from Instagram and Facebook and people’s interactions with Meta AI. Learn more in our Privacy Center .17B (Activated) 109B (Total)Multilingual text and imageMultilingual text and code10M~40TAugust 2024
Llama 4 Maverick (17Bx128E)17B (Activated) 400B (Total)Multilingual text and imageMultilingual text and code1M~22TAugust 2024
Supported languages: Arabic, English, French, German, Hindi, Indonesian, Italian, Portuguese, Spanish, Tagalog, Thai, and Vietnamese. Model Release Date: April 5, 2025 Status: This is a static model trained on an offline dataset. Future versions of the tuned models may be released as Meta improves model behavior with community feedback. License Notice:
This is a Llama 4 multimodal modal. Under the License and AUP, the rights granted under Section 1(a) of the Llama 4 Community License Agreement are not granted to any individual domiciled in, or any company with a principal place of business in, the European Union. This restriction does not apply to end users of a product or service that incorporates any multimodal models.
Where to send questions or comments about the model: Instructions on how to provide feedback or comments on the model can be found in the Llama README . For more technical information about generation parameters and recipes for how to use Llama 4 in applications, please go here .

Quick facts

Model providerMeta
TypeChat completion
LifecycleGenerally available (GA)
Input typetext, image
Output typetext
Context window10000k
Token limits4096 output