GLM 4.7
Version: 1
Fireworks on Foundry
Models available for use with Fireworks on Foundry deliver optimized, best-in-class performance on the Fireworks Inference Cloud. Fireworks on Foundry is a Preview subject to Azure Preview terms and the following supplemental terms: When you use Fireworks on Foundry, data is shared between Microsoft and Fireworks AI, and different compliance and data handling rules will apply. See documentation for details. Customers are responsible for evaluating whether data sharing between Microsoft and Fireworks is appropriate for their organization’s compliance requirements.Key capabilities
About this model
GLM-4.7 is a next-generation general-purpose model optimized for coding, reasoning, and agentic workflows, delivering strong gains in multilingual software engineering, tool use, and complex problem solving. It introduces advanced thinking controls: interleaved, preserved, and turn-level thinking; to improve stability on long-horizon, multi-turn tasks. You can explore these thinking modes using thereasoning_history field.
Key model capabilities
- Advanced thinking controls: interleaved, preserved, and turn-level thinking via the
reasoning_historyfield - Multilingual support (English and Chinese)
- Function calling and tool use (file search, code interpreter)
- Streaming support
- 200k token context window
Use cases
See Responsible AI for additional considerations for responsible use.Key use cases
- Coding and multilingual software engineering
- Reasoning and complex problem solving
- Agentic workflows with tool use
- Long-horizon, multi-turn tasks
Out of scope use cases
The provider has not supplied this information.Pricing
Pricing is based on a number of factors, including deployment type and tokens used. See pricing details here.Technical specs
The provider has not supplied this information.Training cut-off date
The provider has not supplied this information.Training time
The provider has not supplied this information.Input formats
TextOutput formats
TextSupported languages
English, ChineseSample JSON response
The provider has not supplied this information.Model architecture
GLM-4.7 is a general-purpose language model developed by Z.ai. It supports streaming, function calling, and agent workflows.Long context
Context Length: 204,800 tokensOptimizing model performance
The provider has not supplied this information.Additional assets
The provider has not supplied this information.Training disclosure
Training, testing and validation
The provider has not supplied this information.Distribution
Distribution channels
The provider has not supplied this information.More information
The provider has not supplied this information.Responsible AI considerations
This model is sourced from Fireworks AI. It is a Non-Microsoft Productunder the Product Terms, and has not been tested or evaluated by
Microsoft. Customers should ensure that the model is appropriate for
their specific use, including by evaluating any legal or export-control
considerations and conducting their own model risk and safety
evaluations. You can learn about Foundry risk and safety evaluations
here .
Model Specifications
Context Length204800
LicenseOther
Last UpdatedApril 2026
Input TypeText
Output TypeText
ProviderFireworks
Languages2 Languages