AI Model Catalog | Microsoft Foundry Models

Openfold2

Version: 3

Nvidia•Last updated May 2025

Openfold2 is a protein structure prediction model from the OpenFold Consortium and the Alquraishi Laboratory . The model is a PyTorch re-implementation of Google Deepmind’s AlphaFold2 , with support for both training and inference. OpenFold2 demonstrates parity accuracy with AlphaFold2, and improved speed, see the project home for more detail aqlaboratory/openfold . The NVIDIA OpenFold2 NIM can:

Predict a protein structure given an input protein sequence, and accepts optional inputs such as multiple sequence alignments and templates.

This NIM implements the ‘monomer’ version of OpenFold2, and uses the model parameter sets trained with Google Deepmind’s original jax implemenation of AlphaFold2:

params_model_1.npz
params_model_2.npz
params_model_3.npz
params_model_4.npz
params_model_5.npz

For more information about OpenFold2, see the OpenFold2 paper in Nature .

Intended Use

Primary Use Cases

NIMs offer a simple and easy-to-deploy route for self-hosted AI applications. Two major advantages that NIMs offer for system administrators and developers are:

Increased productivity: NIMs allow developers to build generative AI applications quickly, in minutes rather than weeks, by providing a standardized way to add AI capabilities to their applications.
Simplified deployment: NIMs provide containers that can be easily deployed on various platforms, including clouds, data centers, or workstations, making it convenient for developers to test and deploy their applications.

The OpenFold2 NIM provides a fast, accurate model behind a consistent API for predicting protein structure. As part of the broader NVIDIA NIM Ecosystem, OpenFold2 can be used in conjunction with other NIMs to build pipelines that generate and assess the structure and function of entirely new proteins and small molecules.

Responsible AI Considerations

NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their supporting model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse. Please report security vulnerabilities or NVIDIA AI Concerns here .

Training Data

The model parameter sets were trained by Google Deepmind as part of AlphaFold2 development. A description of the training dataset and relevant download links are available at Highly Accurate ... Data Availability . This data was not collected by NVIDIA. Data Collection Method by dataset

Hybrid: Automatic/Sensors, Human
See the description at Highly Accurate ... Data Availability .

Labeling Method by dataset

Hybrid: Automatic/Sensors, Human
See the description at Highly Accurate ... Data Availability .

Properties (Quantity, Dataset Descriptions, Sensor(s)): Uniclust dataset of 355,993 sequences with the full MSAs. These predictions were then used to train a final model with identical hyperparameters, except for sampling examples 75% of the time from the Uniclust prediction set, with sub-sampled MSAs, and 25% of the time from the clustered PDB set.

Openfold2 NIM is optimized to run best on the following compute:

GPU	Total GPU memory	Azure VM compute	#GPUs on VM	Link
A100	80	Standard_NC24ads_A100_v4	1	link
A100	160	Standard_NC48ads_A100_v4	2	link
A100	320	Standard_NC96ads_A100_v4	4	link
A100	640	STANDARD_ND96AMSR_A100_V4	8	link
H100	94	STANDARD_NC40ADS_H100_V5	1	link
H100	188	STANDARD_NC80ADIS_H100_V5	2	link

Model Specifications

LicenseCustom

Last UpdatedMay 2025

PublisherNvidia

Quick Start