curl --request POST \
--url https://api.powertokens.ai/v1/audio/speech \
--header 'Authorization: Bearer <token>' \
--header 'Content-Type: application/json' \
--data '
{
"model": "qwen3-tts-instruct-flash",
"input": "Introduce this product with a slightly faster speaking rate.",
"voice": "Cherry",
"instructions": "Speak faster with a brighter tone.",
"optimize_instructions": false,
"language_type": "English"
}
'"<string>"Call Ali qwen3-tts-instruct-flash through the unified /v1/audio/speech endpoint.
Response contract:
302 Found + Location;text/event-stream and forwards Ali SSE payloads.curl --request POST \
--url https://api.powertokens.ai/v1/audio/speech \
--header 'Authorization: Bearer <token>' \
--header 'Content-Type: application/json' \
--data '
{
"model": "qwen3-tts-instruct-flash",
"input": "Introduce this product with a slightly faster speaking rate.",
"voice": "Cherry",
"instructions": "Speak faster with a brighter tone.",
"optimize_instructions": false,
"language_type": "English"
}
'"<string>"Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
qwen3-tts-instruct-flash Input text to synthesize.
Voice name.
Style-control instruction.
Whether to optimize instructions. An explicit false is preserved.
Language type.
Streaming output format. Any non-empty value enables streaming on the unified endpoint; pcm is the common Ali value.
Streaming success. Returns Ali SSE payloads. For internal settlement, the gateway reads usage.characters as the unified input audio character count.
The response is of type string.