mistralai-Mistral-7B-v01
The Mistral-7B-v0.1 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters.
Mistral-7B-v0.1 outperforms Llama 2 13B on all benchmarks tested. For full details of this model please read paper and release blog post .
Mistral-7B-v0.1 outperforms Llama 2 13B on all benchmarks tested. For full details of this model please read paper and release blog post .
Model Architecture
Mistral-7B-v0.1 is a transformer model, with the following architecture choices:- Grouped-Query Attention
- Sliding-Window Attention
- Byte-fallback BPE tokenizer
Notice
Mistral 7B is a pretrained base model and therefore does not have any moderation mechanisms.| Task | Use case | Dataset | Python sample (Notebook) | CLI with YAML |
|---|---|---|---|---|
| Text Generation | question-answering | truthful_qa | abstractive_qna_with_text_gen.ipynb | text-generation.sh |