NVIDIA
NVIDIAOffers GPU-optimized models and tools for high-performance AI applications across various domains.

Overview

NVIDIA’s Nemotron family supplies open‑weight reasoning models tuned for agentic workflows. The lineup scales from Nemotron Nano (8 B) for edge devices to Nemotron Ultra (253 B)—currently the top open model on reasoning leaderboards—while keeping permissive licensing.

Why NVIDIA on Azure

Combine Nemotron with Azure GPU SKUs, Triton inference, and ML ops tooling to build high‑throughput agents without commercial licensing hurdles.
Total Models: 36
Nemotron-3-8B-Chat-4k-SteerLMText generation
Llama-3.3-70B-Instruct-NIM-microservice
Llama-3.3-70B-Instruct-NIM-microserviceChat completion
Trellis-NIM-microservice
Trellis-NIM-microserviceImage to 3D,Text to 3D,3D generation
NVIDIA-Nemotron-Parse-NIM-microservice
NVIDIA-Nemotron-Parse-NIM-microserviceDocumentAnalysis
Llama-3.2-NV-embedqa-1b-v2-NIM-microservice
Llama-3.2-NV-embedqa-1b-v2-NIM-microserviceEmbeddings
Deepseek-R1-Distill-Llama-8B-NIM-microservice
Deepseek-R1-Distill-Llama-8B-NIM-microserviceChat completion
Llama-3.1-Nemotron-Nano-VL-8B-v1-NIM-microservice
Llama-3.1-Nemotron-Nano-VL-8B-v1-NIM-microserviceImage classification,Image to text,Summarization,Visual question answering,Zero shot image classification
Openfold3_1_2_0-NIM-microservice
Openfold3_1_2_0-NIM-microserviceBiomolecular complex structure prediction
Rfdiffusion-NIM-microservice
Rfdiffusion-NIM-microserviceProtein binder
MSA-search-NIM-microservice
MSA-search-NIM-microserviceProtein binder
Llama-3.3-Nemotron-Super-49B-v1-NIM-microservice
Llama-3.3-Nemotron-Super-49B-v1-NIM-microserviceChat completion
Boltz2-NIM-microservice
Boltz2-NIM-microserviceStructure Prediction
NVIDIA-Nemotron-3-Content-Safety-NIM-microservice
NVIDIA-Nemotron-3-Content-Safety-NIM-microserviceText classification,Image classification
Mixtral-8x7B-Instruct-v0.1-NIM-microservice
Mixtral-8x7B-Instruct-v0.1-NIM-microserviceChat completion
Evo2-40b-NIM-microservice
Evo2-40b-NIM-microserviceGenomics
Nemotron-3-8B-Chat-SteerLMText generation
Cosmos-reason1-NIM-microservice
Cosmos-reason1-NIM-microserviceTask completion verification,Action affordance,Next plausible action prediction
Llama-3.3-Nemotron-Super-49B-v1.5-NIM-microservice
Llama-3.3-Nemotron-Super-49B-v1.5-NIM-microserviceChat completion,Summarization
Llama-3.1-8B-Instruct-NIM-microservice
Llama-3.1-8B-Instruct-NIM-microserviceChat completion
NVIDIA-Nemotron-3-Ultra-NIM-microservice
NVIDIA-Nemotron-3-Ultra-NIM-microserviceChat completion,Question answering,Summarization,Text generation,Text summarization
NVIDIA-Nemotron-Nano-12B-v2-VL-NIM-microservice
NVIDIA-Nemotron-Nano-12B-v2-VL-NIM-microserviceChat completion
Nemotron-3-8B-Base-4kText generation
Openfold2-NIM-microservice
Openfold2-NIM-microserviceProtein binder
Llama-3.2-NV-rerankqa-1b-v2-NIM-microservice
Llama-3.2-NV-rerankqa-1b-v2-NIM-microserviceText classification
Mistral-7B-Instruct-v0.3-NIM-microservice
Mistral-7B-Instruct-v0.3-NIM-microserviceChat completion
NVIDIA-Nemotron-Content-Safety-Reasoning-4B-NIM-microservice
NVIDIA-Nemotron-Content-Safety-Reasoning-4B-NIM-microserviceText classification
Llama-3.1-Nemotron-Nano-8B-v1-NIM-microservice
Llama-3.1-Nemotron-Nano-8B-v1-NIM-microserviceChat completion
NVIDIA-Nemotron-Nano-9b-v2-NIM-microservice
NVIDIA-Nemotron-Nano-9b-v2-NIM-microserviceChat completion
ProteinMPNN-NIM-microservice
ProteinMPNN-NIM-microserviceProtein binder
NVIDIA-Nemotron-3-Nano-NIM-microservice
NVIDIA-Nemotron-3-Nano-NIM-microserviceChat completion
Nemotron-3-8B-Chat-RLHFText generation
Nemotron-3-8B-Chat-SFTText generation
Nemotron-3-8B-QA-4kText generation
earth2studio-fcn3-stormscope
earth2studio-fcn3-stormscopeWeather forecasting
earth2studio-fcn3
earth2studio-fcn3Weather forecasting
NVIDIA-Nemotron-3-Super-NIM-microservice
NVIDIA-Nemotron-3-Super-NIM-microserviceChat completion,Question answering,Summarization,Text generation,Text summarization