FW-Llama-v3.1-8B-Instruct

Llama 3.1 8B Instruct is an 8B-parameter multilingual instruction-tuned language model optimized for dialogue, with a 128K-token context window, tool-calling support, and multilingual capabilities across 8 languages.

Fireworks

Version: 1

Fireworks on Foundry

Models available for use with Fireworks on Foundry deliver optimized, best-in-class performance on the Fireworks Inference Cloud. Fireworks on Foundry is a Non-Microsoft Product. The following terms apply to a Customer's use of Fireworks on Foundry: When you use Fireworks on Foundry, data is shared between Microsoft and Fireworks AI, Customer Data will be sent outside of Microsoft systems, Customer Data will not be processed pursuant to any Foundry data residency documentation, and different compliance and data handling rules will apply. See Trust Center - Fireworks AI for details. Customers are responsible for evaluating whether data sharing between Microsoft and Fireworks is appropriate for their organization's compliance requirements.

Key capabilities

About this model

Llama 3.1 8B Instruct is an 8B-parameter multilingual instruction-tuned large language model from Meta's Llama 3.1 collection. It uses an optimized auto-regressive transformer architecture with Grouped-Query Attention (GQA), fine-tuned with supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). The model supports a 128K-token context window and is optimized for multilingual dialogue use cases across 8 languages.

Key model capabilities

Multilingual dialogue across 8 languages (English, German, French, Italian, Portuguese, Hindi, Spanish, Thai)
128K-token context window
Tool calling and function calling support
Code generation and understanding
Instruction following with strong system-prompt adherence

Use cases

Pricing

Technical specs

Training disclosure

Distribution

More information

Quick facts

Model providerFireworks

TypeChat completion

LifecycleGenerally available (GA)

Input typetext

Output typetext

Context window131.072k

PricingView pricing

FW-Llama-v3.1-8B-Instruct

About this model

Key model capabilities

Quick facts

Quick start