seed-1-8-251228 Chat Completions
Invoke BytePlus chat completions capability.
Supported models include seed-1-8-251228
Documented fields: model, messages, thinking, stream, stream_options.include_usage, stream_options.chunk_include_usage, max_tokens, max_completion_tokens, stop, reasoning_effort, response_format, frequency_penalty, presence_penalty, temperature, top_p, logprobs, top_logprobs, tools, parallel_tool_calls, tool_choice.messages[].content supports text, image_url, and video_url multimodal parts; assistant messages support reasoning_content and tool_calls.
Authorizations
Pass Authorization: Bearer <token> in the request header.
Body
BytePlus chat completions request body.
Model name. Supported BytePlus chat models include seed-1-8-251228
seed-1-8-251228 "seed-1-8-251228"
Message list. This document covers four roles: system, user, assistant, and tool.messages[].content supports plain text or multimodal parts composed of text, image_url, and video_url.
1Controls whether the model enables deep thinking mode.
Whether to enable streaming output. When true, the response content type is text/event-stream.
Additional options for streaming responses. Only effective when stream=true.
Maximum length of the model response (excluding chain-of-thought), in tokens.
Maximum total output length (including response and chain-of-thought), in tokens. When set, max_tokens is ignored.
Stop sequence. Can be a single string or an array of strings.
Limit reasoning effort. minimal is fastest, high is deepest.
minimal, low, medium, high Controls the response format (Beta).
Frequency penalty. Higher values suppress repetitive expressions.
-2 <= x <= 2Presence penalty. Higher values encourage the model to introduce new topics.
-2 <= x <= 2Sampling temperature. Lower values produce more deterministic output; higher values produce more diverse output.
0 <= x <= 2Nucleus sampling parameter. Typically tuned as an alternative to temperature.
0 <= x <= 1Whether to return log probabilities of output tokens.
When logprobs=true, specifies how many candidate token log probabilities to return at each position.
0 <= x <= 20List of function tool definitions available for the model to call.
Whether to allow the model to issue multiple tool calls in parallel.
Tool calling strategy. Can be a string mode or an object specifying a particular function.
none, auto, required Response
Success. Returns JSON in non-streaming mode; returns an SSE event stream in streaming mode.