Vidu Start-End-to-Video viduq3-turbo

curl --request POST \
  --url https://api.powertokens.ai/vidu/ent/v2/start-end2video \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "viduq3-turbo",
  "images": [
    "https://example.com/start_frame.png",
    "https://example.com/end_frame.png"
  ],
  "prompt": "A person moving from standing to sitting",
  "duration": 5
}
'

{
  "task_id": "your_task_id_here",
  "state": "created",
  "model": "viduq3-turbo",
  "images": [
    "https://prod-ss-images.s3.cn-northwest-1.amazonaws.com.cn/vidu-maas/template/startend2video-1.jpeg",
    "https://prod-ss-images.s3.cn-northwest-1.amazonaws.com.cn/vidu-maas/template/startend2video-2.jpeg"
  ],
  "prompt": "The camera zooms in on the bird, which then flies to the right. The bird flight is smooth and natural, with a red light effect following and surrounding it from behind.",
  "duration": 5,
  "seed": 123456789,
  "resolution": "1080p",
  "audio": true,
  "off_peak": false,
  "credits": 10,
  "created_at": "2025-01-01T15:41:31.968916Z"
}

{
  "code": "bad_request_body",
  "message": "Request body format error or invalid field values",
  "data": null
}

{
  "code": "access_denied",
  "message": "Invalid token",
  "data": null
}

{
  "code": "insufficient_user_quota",
  "message": "Insufficient account quota, please try again later",
  "data": null
}

{
  "code": "internal_server_error",
  "message": "Internal server error, please try again later",
  "data": null
}

POST

vidu

ent

start-end2video

curl --request POST \
  --url https://api.powertokens.ai/vidu/ent/v2/start-end2video \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "viduq3-turbo",
  "images": [
    "https://example.com/start_frame.png",
    "https://example.com/end_frame.png"
  ],
  "prompt": "A person moving from standing to sitting",
  "duration": 5
}
'

{
  "task_id": "your_task_id_here",
  "state": "created",
  "model": "viduq3-turbo",
  "images": [
    "https://prod-ss-images.s3.cn-northwest-1.amazonaws.com.cn/vidu-maas/template/startend2video-1.jpeg",
    "https://prod-ss-images.s3.cn-northwest-1.amazonaws.com.cn/vidu-maas/template/startend2video-2.jpeg"
  ],
  "prompt": "The camera zooms in on the bird, which then flies to the right. The bird flight is smooth and natural, with a red light effect following and surrounding it from behind.",
  "duration": 5,
  "seed": 123456789,
  "resolution": "1080p",
  "audio": true,
  "off_peak": false,
  "credits": 10,
  "created_at": "2025-01-01T15:41:31.968916Z"
}

{
  "code": "bad_request_body",
  "message": "Request body format error or invalid field values",
  "data": null
}

{
  "code": "access_denied",
  "message": "Invalid token",
  "data": null
}

{
  "code": "insufficient_user_quota",
  "message": "Insufficient account quota, please try again later",
  "data": null
}

{
  "code": "internal_server_error",
  "message": "Internal server error, please try again later",
  "data": null
}

Authorizations

Authorization

string

header

required

Pass Authorization: Bearer <token> in the request header.

Body

application/json

model

enum<string>

required

Video generation model name.

viduq3-turbo: Compared to viduq3-pro, the generation speed is faster

Available options:

viduq3-turbo

images

string[]

required

Two images: first is start frame, second is end frame. Notes:

Public URL or Base64 format supported
Aspect ratios must be close: ratio between start/end frame must be in 0.8~1.25
Supported formats: png, jpeg, jpg, webp
Max size per image: 50MB
The post body of the HTTP request should not exceed 20MB, and it must include an appropriate content type string
Base64 format example: data:image/png;base64,{base64_encode}

Required array length: 2 elements

prompt

string

Text prompt for describing video generation content, with a maximum length of 5000 characters. Note: If the is_rec parameter for recommended prompts is used, the model will ignore the manually entered prompt for this parameter.

Maximum string length: 5000

is_rec

boolean

default:false

Whether to use the recommended prompt.

true: Yes, the system will automatically generate and apply a recommended prompt to create the video. The number of recommended prompts = 1
false: No, the video will be generated based on the user-provided prompt Note: When the recommended prompt feature is enabled, each task will consume an additional 10 credits.

duration

integer

default:5

Video duration in seconds. Default values vary by model:

viduq3-pro, viduq3-turbo: default 5, available: 1-16
viduq2-pro-fast, viduq2-pro, viduq2-turbo: default 5, available: 1-8
viduq1, viduq1-classic: default 5, available: 5
vidu2.0: default 4, available: 4, 8

Required range: 1 <= x <= 16

seed

integer

Random seed.

Defaults to a random seed number
Manually set values will override the default random seed

resolution

enum<string>

Resolution (based on model & duration):

viduq3-pro, viduq3-turbo (1-16s): default 720p, options: 540p, 720p, 1080p
viduq2-pro-fast (1-8s): default 720p, options: 720p, 1080p
viduq2-pro, viduq2-turbo (1-8s): default 720p, options: 540p, 720p, 1080p
viduq1, viduq1-classic (5s): default 1080p, options: 1080p
vidu2.0 (4s): default 360p, options: 360p, 720p, 1080p
vidu2.0 (8s): default 720p, options: 720p

Available options:

360p,

540p,

720p,

1080p

audio

boolean

default:true

Whether to use direct audio-video generation capability.

false: No direct audio-video generation, outputs silent video
true: Requires audio-video synchronization, outputs video with sound (including dialogue and sound effects) Note 1: Only the q3 models supports this parameter

payload

string

Transparent transmission parameters. No processing, only data transmission, with a maximum length of 1048576 characters.

Maximum string length: 1048576

off_peak

boolean

default:false

Off peak mode.

true: Off peak generate mode
false: Normal generate mode Notes:

The off_peak mode consumes lower points (see Pricing for details)
Tasks submitted in off peak mode will be generated within 48 hours. Tasks that are not completed will be automatically cancelled and their points will be refunded
We also support cancel off_peak tasks
Except for q3, other direct audio-video generation functions

callback_url

string

Callback When creating a task, you need to actively set the callback_url with a POST request. When the video generation task changes its status, Vidu will send a callback request to this URL, containing the latest status of the task. The structure of the callback request content will be the same as the return body of the GET Generation API. The "status" in the callback response includes the following states:

processing: Task is being processed.
success: Task is completed (if sending fails, it will retry the callback three times).
failed: Task failed (if sending fails, it will retry the callback three times). Vidu uses a callback signature algorithm for verification, check out the details here: Callback Signature

watermark

boolean

default:false

Whether to add a watermark.

true: Add watermark
false: Do not add watermark

Note 1: Currently the watermark content is fixed, default is no watermark Note 2: You can retrieve watermarked video content through the watermarked_url parameter in the query task API

wm_position

enum<integer>

default:3

Watermark position on the image. Options:

1: Top left
2: Top right
3: Bottom right
4: Bottom left

Default: 3

Available options:

1,

2,

3,

4

Required range: 1 <= x <= 4

wm_url

string

Watermark image URL. When not specified, the default watermark is used.

meta_data

string

Metadata identifier, a JSON format string used as a passthrough field. You can customize the format or use the example format below:

{
  "Label": "your_label",
  "ContentProducer": "your_content_producer",
  "ContentPropagator": "your_content_propagator",
  "ProduceID": "your_product_id",
  "PropagateID": "your_propagate_id",
  "ReservedCode1": "your_reserved_code1",
  "ReservedCode2": "your_reserved_code2"
}

When this parameter is empty, the Vidu-generated metadata identifier is used by default.

Response

Submission successful, returns a video task object.

task_id

string

Task ID.

state

enum<string>

Task processing state:

created: Task created successfully
queueing: Task in queue
processing: Task processing
success: Generation successful
failed: Task failed

Available options:

created,

queueing,

processing,

success,

failed

model

string

Model parameter used for this call.

images

string[]

Images used for this call (start frame and end frame).

prompt

string

Text prompt used for this call.

duration

integer

Video duration parameter used for this call.

seed

integer

Random seed parameter used for this call.

resolution

string

Resolution parameter used for this call.

payload

string

Payload parameter used for this call.

off_peak

boolean

Off-peak mode parameter used for this call.

credits

integer

Credits consumed for this call.

created_at

string<date-time>

Task creation time.

watermark

boolean

本次提交任务是否使用水印

Vidu Reference-to-Video viduq3

Vidu Start-End-to-Video viduq3-pro

Chat Models

Image Models

Video Models

Audio Models

asset library

Vidu Start-End-to-Video viduq3-turbo

Authorizations

Body

Response