AI Model Catalog | Microsoft Foundry Models

ibm-granite-granite-4.0-1b-speech

Version: 1

Hugging Face • Last updated March 2026

ibm-granite/granite-4.0-1b-speech powered by vLLM

Transcriptions API

Send Request

You can use cURL or any REST client to send a request to the Azure ML endpoint, authenticating with your Azure ML token.

curl <AZUREML_ENDPOINT_URL> \
    -X POST \
    -H "Authorization: Bearer <AZUREML_TOKEN>" \
    -H "Content-Type: multipart/form-data" \
    -F model="ibm-granite/granite-4.0-1b-speech" \
    -F file="@/path/to/file/audio.mp3"

Supported Parameters

The following are the only mandatory parameters to send in the HTTP POST request to v1/audio/transcriptions.

model (string): ID of the model used to generate the response. Because only a single model is deployed on this endpoint, you can either set it to ibm-granite/granite-4.0-1b-speech or leave it blank.
file (file): The audio file object (not file name) to transcribe, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.
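
As a small illustration of the format list above, the sketch below checks a file's extension before uploading it. The helper name is_supported_audio is hypothetical and not part of the endpoint or any SDK, and it inspects only the file name, not the audio content.

```shell
# Hypothetical helper: succeeds when the file extension is one of the
# formats listed above. It checks the name only, not the file contents.
is_supported_audio() {
  # Reject names that have no extension at all.
  case "$1" in
    *.*) ;;
    *) return 1 ;;
  esac
  # Take the text after the last dot and lowercase it.
  ext=$(printf '%s' "${1##*.}" | tr '[:upper:]' '[:lower:]')
  case "$ext" in
    flac|mp3|mp4|mpeg|mpga|m4a|ogg|wav|webm) return 0 ;;
    *) return 1 ;;
  esac
}

# Usage: decide whether to upload based on the extension.
if is_supported_audio "/path/to/file/audio.mp3"; then
  echo "extension looks supported"
else
  echo "unsupported extension" >&2
fi
```

A check like this only avoids an obviously wrong upload; the endpoint remains the authority on whether a given file can be decoded.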

The remaining parameters are optional. This model is served by vLLM, which exposes an OpenAI-compatible Audio Transcriptions API for automatic speech recognition, so the I/O interfaces for both transcription and streaming are the same as in the OpenAI Audio Transcriptions API. The full specification of the allowed parameters is available in the OpenAI Audio Transcriptions API Specification, or alternatively at the /openapi.json endpoint of the current Azure ML endpoint.
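
To show how optional parameters might be passed, here is a minimal sketch that wraps the request in a shell function. The wrapper name transcribe is hypothetical, and the fields language, response_format, and temperature are taken from the OpenAI Audio Transcriptions API; whether this deployment accepts each of them, and with which values, is an assumption to confirm against /openapi.json.

```shell
# Sketch only: the optional field names come from the OpenAI Audio
# Transcriptions API; confirm them in /openapi.json for your endpoint.
transcribe() {
  # $1 = endpoint URL, $2 = Azure ML token, $3 = path to the audio file
  # No Content-Type header is passed: curl sets the multipart header
  # (including the boundary) itself whenever -F is used.
  curl "$1" \
    -X POST \
    -H "Authorization: Bearer $2" \
    -F model="ibm-granite/granite-4.0-1b-speech" \
    -F file="@$3" \
    -F language="en" \
    -F response_format="json" \
    -F temperature="0"
}

# Usage (same placeholders as in the request above):
# transcribe "<AZUREML_ENDPOINT_URL>" "<AZUREML_TOKEN>" "/path/to/file/audio.mp3"
```

Keeping the optional fields in one function makes it easy to drop any that the endpoint rejects without touching the mandatory model and file fields.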

Model Specifications

License: Apache-2.0

Last Updated: March 2026

Provider: Hugging Face

Quick Start