FW-Llama-v3.1-8B-Instruct
Models available for use with Fireworks on Foundry deliver optimized, best-in-class performance on the Fireworks Inference Cloud. Fireworks on Foundry is a Non-Microsoft Product. The following terms apply to a Customer's use of Fireworks on Foundry: When you use Fireworks on Foundry, data is shared between Microsoft and Fireworks AI, Customer Data will be sent outside of Microsoft systems, Customer Data will not be processed pursuant to any Foundry data residency documentation, and different compliance and data handling rules will apply. See Trust Center - Fireworks AI for details. Customers are responsible for evaluating whether data sharing between Microsoft and Fireworks is appropriate for their organization's compliance requirements.
About this model
Llama 3.1 8B Instruct is an 8B-parameter multilingual instruction-tuned large language model from Meta's Llama 3.1 collection. It uses an optimized auto-regressive transformer architecture with Grouped-Query Attention (GQA), fine-tuned with supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). The model supports a 128K-token context window and is optimized for multilingual dialogue use cases across 8 languages.
Key model capabilities
- Multilingual dialogue across 8 languages (English, German, French, Italian, Portuguese, Hindi, Spanish, Thai)
- 128K-token context window
- Tool calling and function calling support
- Code generation and understanding
- Instruction following with strong system-prompt adherence