对话补全 - Mint Starter Kit

curl --request POST \ --url https://api.powertokens.ai/v1/chat/completions \ --header 'Authorization: Bearer <token>' \ --header 'Content-Type: application/json' \ --data ' { "model": "seed-2-0-pro-260328", "messages": [ { "role": "system", "content": "你是一个简洁、准确的中文助手。" }, { "role": "user", "content": "请用三句话总结一下 RAG 的核心流程。" } ], "thinking": { "type": "enabled" }, "reasoning_effort": "medium", "temperature": 0.3, "max_completion_tokens": 1024, "stream": false } '

{ "id": "chatcmpl_bp_123", "object": "chat.completion", "created": 1742342400, "model": "seed-2-0-pro-260328", "choices": [ { "index": 0, "message": { "role": "assistant", "content": "RAG 的核心流程通常分为检索、增强和生成三步。系统先把用户问题转换为向量并检索相关知识片段，再把检索结果与原问题一起拼接进提示词。最后模型基于增强后的上下文生成答案，从而降低幻觉并提升事实性。" }, "finish_reason": "stop" } ], "usage": { "prompt_tokens": 123, "completion_tokens": 98, "total_tokens": 221 } }

授权

Authorization

string

header

必填

在请求头中传入 Authorization: Bearer <token>。

请求体

application/json

BytePlus chat completions 请求体。

model

enum<string>

必填

模型名称。支持的 BytePlus 对话模型包括 seed-2-0-pro-260328、seed-2-0-lite-260228、seed-2-0-mini-260215、seed-1-8-251228、seed-1-6-250915、seed-1-6-flash-250715、deepseek-v3-2-251201、gpt-oss-120b-250805。

可用选项:

seed-2-0-pro-260328,

seed-2-0-lite-260228,

seed-2-0-mini-260215,

seed-1-8-251228,

seed-1-6-250915,

seed-1-6-flash-250715,

deepseek-v3-2-251201,

gpt-oss-120b-250805

示例:

"seed-2-0-pro-260328"

messages

object[]

必填

消息列表。当前文档公开 system、user、assistant、tool 四种角色。messages[].content 支持纯文本，也支持由 text、image_url、video_url 组成的多模态分片。

Minimum array length: 1

Show child attributes

thinking

object

控制模型是否开启深度思考模式。

Show child attributes

stream

boolean

默认值:false

是否启用流式输出。为 true 时响应内容类型为 text/event-stream。

stream_options

object

流式响应附加选项。仅在 stream=true 时生效。

Show child attributes

max_tokens

integer

模型回答最大长度（不含思维链长度），单位 token。

max_completion_tokens

integer

模型输出最大长度（含回答与思维链长度），单位 token。设置后 max_tokens 会失效。

stop

停止序列。可以是单个字符串，也可以是字符串数组。

reasoning_effort

enum<string>

限制思考工作量。minimal 最快，high 最深。

可用选项:

minimal,

low,

medium,

high

response_format

object

控制回答格式（Beta）。

Show child attributes

frequency_penalty

number

频率惩罚。值越高，越抑制重复表达。

必填范围: -2 <= x <= 2

presence_penalty

number

存在惩罚。值越高，越鼓励模型引入新话题。

必填范围: -2 <= x <= 2

temperature

number

采样温度。越低越稳定，越高越发散。

必填范围: 0 <= x <= 2

top_p

number

核采样参数。通常与 temperature 二选一调优。

必填范围: 0 <= x <= 1

logprobs

boolean

是否返回输出 token 的对数概率信息。

top_logprobs

integer

当 logprobs=true 时，指定每个位置返回多少个候选 token 的对数概率。

必填范围: 0 <= x <= 20

tools

object[]

可供模型调用的函数工具定义列表。

Show child attributes

parallel_tool_calls

boolean

是否允许模型并行发起多个工具调用。

tool_choice

工具调用策略。可以是字符串模式，也可以是指定具体函数的对象。

可用选项:

none,

auto,

required

响应

调用成功。非流式模式返回 JSON；流式模式返回 SSE 事件流。

BytePlus chat completions 成功响应。

string

响应 ID。

object

string

对象类型，例如 chat.completion。

created

integer

Unix 时间戳。

model

string

实际使用的模型名称。

choices

object[]

Show child attributes

usage

object

Token 用量统计。

Show child attributes

模型接口

授权

请求体

响应