nvidia-parakeet-ctc-1.1b
Version: 5
nvidia/parakeet-ctc-1.1b powered by Hugging Face + NeMo
Send Request
You can use cURL or any REST Client to send a request to the AzureML endpoint with your AzureML token.curl <AZUREML_ENDPOINT_URL> \
-X POST \
-H "Authorization: Bearer <AZUREML_TOKEN>" \
-H "Content-Type: multipart/form-data" \
-F file=@/path/to/file/audio.mp3 \
-F model=nvidia/parakeet-ctc-1.1b
Supported Parameters
- file (file): The audio file object (not file name) to transcribe, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.
- model (string): ID of the deployed model as in the Hugging Face Hub.
- language (string, optional): The language of the input audio (ISO-639-1 code, e.g., "en"). Supplying this improves accuracy and latency.
- prompt (string, optional): An optional text to guide the model's style or continue a previous audio segment. The prompt should match the audio language.
- temperature (float, optional): Defaults to
0.- The sampling temperature, between 0 and 1. Higher values (e.g., 0.8) make output more random; lower values (e.g., 0.2) make it more focused and deterministic.
- If set to 0, the model will use log probability to automatically increase the temperature until certain thresholds are hit.
- response_format (string, optional): Defaults to
json.- The format of the output:
json,text,srt,verbose_json, orvtt.
- The format of the output:
Model Specifications
LicenseCc-by-4.0
Last UpdatedAugust 2025
ProviderHuggingFace