glm-4.5-air
Unified endpoint for Zhipu GLM series chat models exposed through the baze platform.
Supported model: glm-4.5-air. The same path supports plain text chat as well as image understanding requests containing image_url content parts.
Documented fields: model, messages, stream, thinking, temperature, top_p, max_tokens, stop, tools, tool_choice.
Authorizations
Pass Authorization: Bearer <token> in the request header.
Body
Zhipu Chat Completions request body.
Model name. Supported model: glm-4.5-air.
glm-4.5-air "glm-4.5-air"
Message list. Supports plain text messages and image input via image_url content parts. The request must contain at least one message that is not system or assistant; sending only system or only assistant messages will be rejected by the upstream provider.
1Whether to enable streaming output. When true, the response is returned as text/event-stream.
Thinking mode configuration. Applicable to models that support the thinking parameter.
Sampling temperature, range [0, 1].
0 <= x <= 1Nucleus sampling probability threshold, range [0.01, 1].
0.01 <= x <= 1Maximum number of output tokens.
1 <= x <= 131072Stop sequence list. Currently only a single stop sequence is supported.
1Tool definition list. Currently only function type tools are supported.
128Tool selection strategy. The Zhipu API currently only exposes auto.
auto Response
Success. Non-streaming mode returns JSON; streaming mode returns an SSE event stream.