ibm-granite-granite-embedding-reranker-english-r2
Version: 2
ibm-granite/granite-embedding-reranker-english-r2 powered by Text Embeddings Inference
Text Embeddings Inference API
Send Request
You can use cURL or any REST client to send a request to the Azure ML endpoint with your Azure ML token.

curl <AZUREML_ENDPOINT_URL> \
    -X POST \
    -H "Authorization: Bearer <AZUREML_TOKEN>" \
    -H "Content-Type: application/json" \
    -d '{"query":"What is Deep Learning?","texts":["Deep Learning is...","Deep Learning is not..."]}'
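Since any REST client works, the same request can be sketched in Python using only the standard library. This is a minimal illustration, not an official client: the helper names `build_rerank_request` and `send_rerank_request` are hypothetical, and the endpoint URL and token are placeholders you must supply from your own Azure ML deployment.

```python
import json
import urllib.request


def build_rerank_request(endpoint_url, token, query, texts):
    """Build (but do not send) a POST request mirroring the cURL call.

    Hypothetical helper: endpoint_url and token stand in for
    <AZUREML_ENDPOINT_URL> and <AZUREML_TOKEN>.
    """
    payload = {"query": query, "texts": list(texts)}
    return urllib.request.Request(
        endpoint_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_rerank_request(request, timeout=30):
    """Send the prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Building the request separately from sending it makes it easy to inspect the headers and body before any network call is made.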
Supported Parameters
- query (string): The query text against which the texts are ranked.
- texts (array): Array of strings to rank.
- raw_scores (bool, optional): Whether to return the raw ranking scores instead of Softmax-normalized values. Defaults to false.
- return_text (bool, optional): Whether to return each text along with its ranking result. Defaults to false.
- truncate (bool, optional): Whether to truncate inputs that are longer than the maximum sequence length supported by the model. Defaults to false.
- truncation_direction (string, optional): Either "Left" or "Right". "Right" removes tokens from the end of the sequence until it fits the maximum supported length, whilst "Left" removes them from the beginning. Defaults to "Right".
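The parameters above can be combined into a single JSON body. The sketch below assembles such a body in Python with the documented defaults and a basic check on `truncation_direction`. The function name `build_rerank_payload` is hypothetical, and the client-side validation is an illustrative assumption, not something the endpoint requires.

```python
import json

# Allowed values as listed for truncation_direction.
VALID_DIRECTIONS = ("Left", "Right")


def build_rerank_payload(query, texts, raw_scores=False, return_text=False,
                         truncate=False, truncation_direction="Right"):
    """Assemble a request body using the documented parameter defaults.

    Hypothetical helper; the checks here are a client-side convenience.
    """
    if not isinstance(query, str):
        raise TypeError("query must be a string")
    texts = list(texts)
    if not all(isinstance(text, str) for text in texts):
        raise TypeError("texts must be an array of strings")
    if truncation_direction not in VALID_DIRECTIONS:
        raise ValueError('truncation_direction must be "Left" or "Right"')
    return {
        "query": query,
        "texts": texts,
        "raw_scores": raw_scores,
        "return_text": return_text,
        "truncate": truncate,
        "truncation_direction": truncation_direction,
    }


# Example: truncate over-long inputs and echo each text in the results.
body = json.dumps(build_rerank_payload(
    "What is Deep Learning?",
    ["Deep Learning is...", "Deep Learning is not..."],
    return_text=True,
    truncate=True,
))
```

The resulting `body` string can be passed as the `-d` argument of the cURL command or as the request data of any other REST client.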
Model Specifications
License: Apache-2.0
Last Updated: December 2025
Provider: Hugging Face