mistralai-Mistral-7B-v01

mistralai-Mistral-7B-v01

Mistral AI
Version: 19
The Mistral-7B-v0.1 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters.
Mistral-7B-v0.1 outperforms Llama 2 13B on all benchmarks tested.
For full details of this model please read paper and release blog post .

Model Architecture

Mistral-7B-v0.1 is a transformer model, with the following architecture choices:
  • Grouped-Query Attention
  • Sliding-Window Attention
  • Byte-fallback BPE tokenizer
Mistral 7B v0.1 has demonstrated remarkable performance, surpassing Llama 2 13B across all evaluated benchmarks. Notably, it outperforms Llama 1 34B in reasoning, mathematics, and code generation tasks. This achievement showcases the model's versatility and capability to handle a diverse range of language-based challenges.

Notice

Mistral 7B is a pretrained base model and therefore does not have any moderation mechanisms.
TaskUse caseDatasetPython sample (Notebook)CLI with YAML
Text Generationquestion-answeringtruthful_qa abstractive_qna_with_text_gen.ipynb text-generation.sh

Quick facts

Model providerMistral AI
TypeText generation
LifecycleGenerally available (GA)