Cohere
Offers language models optimized for retrieval-augmented generation and enterprise applications.

Overview

Cohere Inc. is a Canadian-founded generative-AI company launched in 2019 by former Google Brain researchers. Built around large-language-model families such as Command R/R+ and Command A, along with high-precision Rerank APIs, Cohere focuses on enterprise and regulated-industry use cases (finance, healthcare, manufacturing, energy, and the public sector) rather than consumer chatbots. The founding team's research heritage, including co-authorship of the “Attention Is All You Need” transformer paper, continues to steer Cohere’s mission to deliver secure, domain-tuned LLMs for real-world business workflows.

Key Azure AI Foundry Models (July 2025)

  • Command R+ – Instruction‑following LLM optimized for long‑context chat.
  • Command A (03‑2025) – High‑throughput model for production copilots.
  • Rerank 3.5 – Plug‑and‑play reranker for precision search and tool use.

Why Cohere on Azure

Deploy Command or Rerank side‑by‑side with Azure AI Search, store embeddings in Azure Vector DBs, and enjoy unified billing plus data‑residency controls.
Total Models: 11
cohere-command-a

Command A is a highly efficient generative model that excels at agentic and multilingual use cases.

chat-completion
embed-v-4-0

Embed 4 transforms text and images into numerical vectors.

embeddings
summarization
Cohere-rerank-v3.5

Cohere’s Rerank 3.5 provides a significant boost to the relevance of search results. This AI model, also known as a cross-encoder, precisely sorts lists of documents according to their semantic similarity to a provided query. This allows information retrieval systems to go beyond keyword search and

text-classification
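
The reranking idea described above (score each document against the query, then sort by score) can be sketched in a few lines. This is a toy illustration only: `toy_relevance` is a hypothetical token-overlap scorer standing in for a neural cross-encoder, and neither it nor `rerank` is part of Cohere's or Azure's API.

```python
from typing import Callable, List, Optional, Tuple


def toy_relevance(query: str, document: str) -> float:
    """Hypothetical stand-in scorer: fraction of query tokens found in the document.

    A real cross-encoder reads the query and document together and predicts
    a relevance score; simple token overlap is used here only so the sketch
    runs without a model.
    """
    query_tokens = set(query.lower().split())
    doc_tokens = set(document.lower().split())
    if not query_tokens:
        return 0.0
    return len(query_tokens & doc_tokens) / len(query_tokens)


def rerank(
    query: str,
    documents: List[str],
    score: Callable[[str, str], float] = toy_relevance,
    top_n: Optional[int] = None,
) -> List[Tuple[int, float]]:
    """Return (original_index, score) pairs sorted from most to least relevant."""
    scored = [(i, score(query, doc)) for i, doc in enumerate(documents)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored if top_n is None else scored[:top_n]


docs = [
    "Quarterly revenue grew in the energy sector",
    "How to reset a forgotten account password",
    "Password policy for employee accounts",
]
ranked = rerank("reset account password", docs, top_n=2)
for index, relevance in ranked:
    print(f"{relevance:.2f}  {docs[index]}")
```

In a typical retrieval pipeline, a first-stage search returns a candidate list and a reranker of this shape reorders it before the top results are passed to a generative model.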
Cohere-embed-v3-multilingual

Cohere Embed Multilingual is the market's leading text representation model used for semantic search, retrieval-augmented generation (RAG), classification, and clustering.

embeddings
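
As a rough illustration of how embeddings support semantic search, the sketch below ranks stored vectors by cosine similarity to a query vector. The three-dimensional vectors and the document names are made up for the example; real embedding models return much longer vectors, and this code does not call any Cohere or Azure service.

```python
import math
from typing import Dict, List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def nearest(query_vec: Sequence[float], index: Dict[str, List[float]], k: int = 1) -> List[str]:
    """Return the names of the k stored vectors most similar to the query."""
    ranked = sorted(
        index.items(),
        key=lambda item: cosine_similarity(query_vec, item[1]),
        reverse=True,
    )
    return [name for name, _ in ranked[:k]]


# Hypothetical pre-computed document embeddings (assumed values).
index = {
    "invoice_policy": [0.9, 0.1, 0.0],
    "travel_policy": [0.1, 0.9, 0.0],
    "security_policy": [0.0, 0.2, 0.9],
}
query_vec = [0.8, 0.2, 0.1]  # assumed embedding of the user's query
best = nearest(query_vec, index, k=2)
print(best)
```

The same comparison underlies classification and clustering: items whose vectors point in similar directions are treated as semantically close.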
Cohere-command-r-08-2024

Command R is a scalable generative model targeting RAG and Tool Use to enable production-scale AI for enterprise.

chat-completion
Cohere-command-r-plus-08-2024

Command R+ is a state-of-the-art RAG-optimized model designed to tackle enterprise-grade workloads.

chat-completion
Cohere-rerank-v3-english

Cohere Rerank English is the market’s leading reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank enables you to significantly improve search quality by augmenting traditional keyword-based search systems with a semantic-based reranking system which can context

text-classification
Cohere-rerank-v3-multilingual

Cohere Rerank Multilingual is the market’s leading reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank Multilingual supports 100+ languages and can be used to search within a language (e.g., search with a French query on French documents) and across languages (e

text-classification
Cohere-embed-v3-english

Cohere Embed English is the market's leading text representation model used for semantic search, retrieval-augmented generation (RAG), classification, and clustering.

embeddings
Cohere-command-r-plus

Command R+ is a state-of-the-art RAG-optimized model designed to tackle enterprise-grade workloads.

chat-completion
Cohere-command-r

Command R is a scalable generative model targeting RAG and Tool Use to enable production-scale AI for enterprise.

chat-completion