GLM 4.7

Version: 1

Fireworks•Last updated April 2026

GLM 4.7 is a general-purpose model optimized for coding, reasoning, and agentic workflows, featuring advanced thinking controls with interleaved, preserved, and turn-level thinking modes.

Coding

Agents

Fireworks on Foundry

Models available for use with Fireworks on Foundry deliver optimized, best-in-class performance on the Fireworks Inference Cloud. Fireworks on Foundry is a Preview subject to Azure Preview terms and the following supplemental terms: When you use Fireworks on Foundry, data is shared between Microsoft and Fireworks AI, and different compliance and data handling rules will apply. See documentation for details. Customers are responsible for evaluating whether data sharing between Microsoft and Fireworks is appropriate for their organization’s compliance requirements.

Key capabilities

About this model

GLM-4.7 is a next-generation general-purpose model optimized for coding, reasoning, and agentic workflows, delivering strong gains in multilingual software engineering, tool use, and complex problem solving. It introduces advanced thinking controls: interleaved, preserved, and turn-level thinking; to improve stability on long-horizon, multi-turn tasks. You can explore these thinking modes using the reasoning_history field.

Key model capabilities

Advanced thinking controls: interleaved, preserved, and turn-level thinking via the reasoning_history field
Multilingual support (English and Chinese)
Function calling and tool use (file search, code interpreter)
Streaming support
200k token context window

Use cases

See Responsible AI for additional considerations for responsible use.

Key use cases

Coding and multilingual software engineering
Reasoning and complex problem solving
Agentic workflows with tool use
Long-horizon, multi-turn tasks

Out of scope use cases

The provider has not supplied this information.

Pricing

Pricing is based on a number of factors, including deployment type and tokens used. See pricing details here.

Technical specs

The provider has not supplied this information.

Training cut-off date

The provider has not supplied this information.

Training time

The provider has not supplied this information.

Input formats

Text

Output formats

Text

Supported languages

English, Chinese

Sample JSON response

The provider has not supplied this information.

Model architecture

GLM-4.7 is a general-purpose language model developed by Z.ai. It supports streaming, function calling, and agent workflows.

Long context

Context Length: 204,800 tokens

Optimizing model performance

The provider has not supplied this information.

Additional assets

The provider has not supplied this information.

Training disclosure

Training, testing and validation

The provider has not supplied this information.

Distribution

Distribution channels

The provider has not supplied this information.

More information

The provider has not supplied this information.

Responsible AI considerations

This model is sourced from Fireworks AI. It is a Non-Microsoft Product
under the Product Terms, and has not been tested or evaluated by
Microsoft. Customers should ensure that the model is appropriate for
their specific use, including by evaluating any legal or export-control
considerations and conducting their own model risk and safety
evaluations. You can learn about Foundry risk and safety evaluations
here .

Model Specifications

Context Length204800

LicenseOther

Last UpdatedApril 2026

Input TypeText

Output TypeText

ProviderFireworks

Languages2 Languages

Quick Start