microsoft-chatbench-distilgpt2
microsoft-chatbench-distilgpt2
Version: 3
Hugging FaceLast updated December 2025
microsoft/chatbench-distilgpt2 powered by Hugging Face API

Send Request

You can use cURL or any REST Client to send a request to the Azure ML endpoint with your Azure ML token.
curl <AZUREML_ENDPOINT_URL> \
    -X POST \
    -H "Authorization: Bearer <AZUREML_TOKEN>" \
    -H "Content-Type: application/json" \
    -d '{"inputs":"Write a short story about a robot learning to paint"}'

Supported Parameters

  • inputs (string): The input text prompt to generate text from
  • parameters (object):
    • do_sample (boolean): Whether to use sampling. Set to false for deterministic output.
    • max_new_tokens (integer): Maximum number of tokens to generate in the output.
    • repetition_penalty (float): Penalty for repeating tokens from the input or previous output.
    • return_full_text (boolean): Whether to return the full text including the prompt.
    • seed (integer): Seed for the random number generator to ensure reproducible results.
    • temperature (float): Controls randomness in generation. Lower values make output more deterministic.
    • top_k (integer): Number of highest probability vocabulary tokens to keep for top-k-filtering.
    • top_p (float): Nucleus sampling parameter. Keeps the smallest set of tokens whose cumulative probability exceeds top_p.
    • truncate (integer): Truncate input to this many tokens.
    • typical_p (float): Typical sampling parameter for locally typical sampling.
Check the full API Specification at the Hugging Face Inference API Documentation .

OpenAI Chat Completions API

Additionally, the API also supports the OpenAI Chat Completions API Specification under the endpoint /v1/chat/completions.
Model Specifications
LicenseApache-2.0
Last UpdatedDecember 2025
ProviderHugging Face