---
title: Grok Imagine Video | Runware Docs
url: https://runware.ai/docs/models/xai-grok-imagine-video
description: AI video generation with synchronized audio from text and images
---
# Grok Imagine Video

Grok Imagine Video is a multimodal generative video model that produces short video clips with native audio from text descriptions or static images. It supports text-to-video and image-to-video generation with synchronized sound effects and dialogue, enabling developers to animate scenes with motion, camera dynamics, and audio in a single API workflow.

- **ID**: `xai:grok-imagine@video`
- **Status**: live
- **Creator**: xAI
- **Release Date**: January 29, 2026
- **Capabilities**: Text to Video, Image to Video, Video to Video

## Pricing

A 6-second T2V video starts at $0.30; each extra second is $0.05 (+$0.002 if using an image input or +$0.01/s of video input).

- **480p · T2V · 6s**: `$0.30`
- **720p · T2V · 6s**: `$0.42`
- **480p · I2V · 6s**: `$0.302`
- **720p · I2V · 6s**: `$0.422`
- **480p · V2V · 6s**: `$0.36`
- **720p · V2V · 6s**: `$0.48`

## Compatibility & Validation

Provide exactly one of: `inputs.frameImages` or `inputs.video` or `width/height`.

---

`frameImages` cannot be used with `referenceImages` (in `inputs`).

---

`frameImages` cannot be used with `video` (in `inputs`).

---

When `inputs.video` is provided, `width/height` and `resolution` cannot be used.

---

When `inputs.video` and `duration` are provided, `duration` must be between `2` and `10`.

---

`resolution` cannot be used with `width/height`.

---

When `resolution` is provided, `inputs.frameImages` requires at least 1 item.

---

`width` and `height` must be used together.

---

The following dimension combinations are supported:

| Configuration | Dimensions |
| --- | --- |
| `480p (1:1)` | `480x480` |
| `480p (16:9)` | `848x480` |
| `480p (9:16)` | `480x848` |
| `480p (4:3)` | `640x480` |
| `480p (3:4)` | `480x640` |
| `480p (3:2)` | `720x480` |
| `480p (2:3)` | `480x720` |
| `720p (1:1)` | `720x720` |
| `720p (16:9)` | `1280x720` |
| `720p (9:16)` | `720x1280` |
| `720p (4:3)` | `960x720` |
| `720p (3:4)` | `720x960` |
| `720p (3:2)` | `1088x720` |
| `720p (2:3)` | `720x1088` |

## Request Parameters

**API Options**

Platform-level options for task execution and delivery.

### [taskType](https://runware.ai/docs/models/xai-grok-imagine-video#request-tasktype)

- **Type**: `string`
- **Required**: true
- **Value**: `videoInference`

Identifier for the type of task being performed

### [taskUUID](https://runware.ai/docs/models/xai-grok-imagine-video#request-taskuuid)

- **Type**: `string`
- **Required**: true
- **Format**: `UUID v4`

UUID v4 identifier for tracking tasks and matching async responses. Must be unique per task.

### [outputType](https://runware.ai/docs/models/xai-grok-imagine-video#request-outputtype)

- **Type**: `string`
- **Default**: `URL`

Video output type.

**Allowed values**: `URL`

### [outputFormat](https://runware.ai/docs/models/xai-grok-imagine-video#request-outputformat)

- **Type**: `string`
- **Default**: `MP4`

Specifies the file format of the generated output. The available values depend on the task type and the specific model's capabilities.

- \`MP4\`: Widely supported video container (H.264), recommended for general use.
- \`WEBM\`: Optimized for web delivery.
- \`MOV\`: QuickTime format, common in professional workflows (Apple ecosystem).

**Allowed values**: `MP4` `WEBM` `MOV`

### [outputQuality](https://runware.ai/docs/models/xai-grok-imagine-video#request-outputquality)

- **Type**: `integer`
- **Min**: `20`
- **Max**: `99`
- **Default**: `95`

Compression quality of the output. Higher values preserve quality but increase file size.

### [webhookURL](https://runware.ai/docs/models/xai-grok-imagine-video#request-webhookurl)

- **Type**: `string`
- **Format**: `URI`

Specifies a webhook URL where JSON responses will be sent via HTTP POST when generation tasks complete. For batch requests with multiple results, each completed item triggers a separate webhook call as it becomes available.

**Learn more** (1 resource):

- [Webhooks](https://runware.ai/docs/platform/webhooks) (platform)

### [deliveryMethod](https://runware.ai/docs/models/xai-grok-imagine-video#request-deliverymethod)

- **Type**: `string`
- **Default**: `async`

Determines how the API delivers task results.

**Allowed values**:

- `async` Returns an immediate acknowledgment with the task UUID. Poll for results using getResponse. Required for long-running tasks like video generation.

**Learn more** (1 resource):

- [Task Polling](https://runware.ai/docs/platform/task-polling) (platform)

### [uploadEndpoint](https://runware.ai/docs/models/xai-grok-imagine-video#request-uploadendpoint)

- **Type**: `string`
- **Format**: `URI`

Specifies a URL where the generated content will be automatically uploaded using the HTTP PUT method. The raw binary data of the media file is sent directly as the request body. For secure uploads to cloud storage, use presigned URLs that include temporary authentication credentials.

**Common use cases:**

- **Cloud storage**: Upload directly to S3 buckets, Google Cloud Storage, or Azure Blob Storage using presigned URLs.
- **CDN integration**: Upload to content delivery networks for immediate distribution.

```text
// S3 presigned URL for secure upload
https://your-bucket.s3.amazonaws.com/generated/content.mp4?X-Amz-Signature=abc123&X-Amz-Expires=3600

// Google Cloud Storage presigned URL
https://storage.googleapis.com/your-bucket/content.jpg?X-Goog-Signature=xyz789

// Custom storage endpoint
https://storage.example.com/uploads/generated-image.jpg
```

The content data will be sent as the request body to the specified URL when generation is complete.

### [safety](https://runware.ai/docs/models/xai-grok-imagine-video#request-safety)

- **Path**: `safety.checkContent`
- **Type**: `object (2 properties)`

Content safety checking configuration for video generation.

#### [checkContent](https://runware.ai/docs/models/xai-grok-imagine-video#request-safety-checkcontent)

- **Path**: `safety.checkContent`
- **Type**: `boolean`
- **Default**: `false`

Enable or disable content safety checking. When enabled, defaults to `fast` mode.

#### [mode](https://runware.ai/docs/models/xai-grok-imagine-video#request-safety-mode)

- **Path**: `safety.mode`
- **Type**: `string`
- **Default**: `none`

Safety checking mode for video generation.

**Allowed values**:

- `none` Disables checking.
- `fast` Checks key frames.
- `full` Checks all frames.

### [ttl](https://runware.ai/docs/models/xai-grok-imagine-video#request-ttl)

- **Type**: `integer`
- **Min**: `60`

Time-to-live (TTL) in seconds for generated content. Only applies when `outputType` is `URL`.

### [includeCost](https://runware.ai/docs/models/xai-grok-imagine-video#request-includecost)

- **Type**: `boolean`
- **Default**: `false`

Include task cost in the response.

### [numberResults](https://runware.ai/docs/models/xai-grok-imagine-video#request-numberresults)

- **Type**: `integer`
- **Min**: `1`
- **Max**: `4`
- **Default**: `1`

Number of results to generate. Each result uses a different seed, producing variations of the same parameters.

**Inputs**

Input resources for the task (images, audio, etc). These must be nested inside the \`inputs\` object.

### [referenceImages](https://runware.ai/docs/models/xai-grok-imagine-video#request-inputs-referenceimages)

- **Path**: `inputs.referenceImages`
- **Type**: `array of strings`

List of reference images (UUID, URL, Data URI, or Base64).

### [frameImages](https://runware.ai/docs/models/xai-grok-imagine-video#request-inputs-frameimages)

- **Path**: `inputs.frameImages`
- **Type**: `array of strings or objects`

An array of frame-specific image inputs to guide video generation. Each item can be either a plain image input (UUID, URL, Data URI, or Base64) or an object that pairs an image with a target frame position.

The `frameImages` parameter allows you to constrain specific frames within the video sequence, ensuring that particular visual content appears at designated points. This is different from `referenceImages`, which provide overall visual guidance without constraining specific timeline positions.

When the `frame` parameter is omitted, automatic distribution rules apply:

- **1 image**: Used as the first frame.

**Examples**:

**Shorthand format:** When you don't need to specify a frame position, you can pass a plain image input directly.

```json
"frameImages": [
  "aac49721-1964-481a-ae78-8a4e29b91402"
]
```

**Object format:** When you need to specify a frame position, use an object with `image` and `frame`.

```json
"frameImages": [
  {
    "image": "aac49721-1964-481a-ae78-8a4e29b91402",
    "frame": "first"
  }
]
```

**Format 1: string[]**:

- **Type**: `string`

Image input (UUID, URL, Data URI, or Base64).

**Format 2: object[]**:

#### [image](https://runware.ai/docs/models/xai-grok-imagine-video#request-inputs-frameimages-format-2-image)

- **Path**: `inputs.frameImages.image`
- **Type**: `string`
- **Required**: true

Image input (UUID, URL, Data URI, or Base64).

#### [frame](https://runware.ai/docs/models/xai-grok-imagine-video#request-inputs-frameimages-format-2-frame)

- **Path**: `inputs.frameImages.frame`
- **Type**: `object`

Target frame position for the image. This model only supports the first frame.

**Allowed values**:

- `first` First frame of the video.
- `0` Frame index 0 (first frame).

### [video](https://runware.ai/docs/models/xai-grok-imagine-video#request-inputs-video)

- **Path**: `inputs.video`
- **Type**: `string`

Video input (UUID or URL).

**Generation Parameters**

Core parameters for controlling the generated content.

### [model](https://runware.ai/docs/models/xai-grok-imagine-video#request-model)

- **Type**: `string`
- **Required**: true
- **Value**: `xai:grok-imagine@video`

Identifier of the model to use for generation.

**Learn more** (3 resources):

- [Text To Image: Model Selection The Foundation Of Generation](https://runware.ai/docs/guides/text-to-image#model-selection-the-foundation-of-generation) (guide)
- [Image Inpainting: Model Specialized Inpainting Models](https://runware.ai/docs/guides/image-inpainting#model-specialized-inpainting-models) (guide)
- [Image Outpainting: Other Critical Parameters](https://runware.ai/docs/guides/image-outpainting#other-critical-parameters) (guide)

### [positivePrompt](https://runware.ai/docs/models/xai-grok-imagine-video#request-positiveprompt)

- **Type**: `string`
- **Required**: true
- **Min**: `1`

Text prompt describing elements to include in the generated output.

**Learn more** (2 resources):

- [Text To Image: Prompts Guiding The Generation](https://runware.ai/docs/guides/text-to-image#prompts-guiding-the-generation) (guide)
- [Image Outpainting: Other Critical Parameters](https://runware.ai/docs/guides/image-outpainting#other-critical-parameters) (guide)

### [width](https://runware.ai/docs/models/xai-grok-imagine-video#request-width)

- **Type**: `integer`
- **Required**: true
- **Paired with**: height

Width of the generated media in pixels.

**Learn more** (2 resources):

- [Image To Image: Dimensions Changing Aspect Ratio](https://runware.ai/docs/guides/image-to-image#dimensions-changing-aspect-ratio) (guide)
- [Image Outpainting: Dimensions Critical For Outpainting](https://runware.ai/docs/guides/image-outpainting#dimensions-critical-for-outpainting) (guide)

### [height](https://runware.ai/docs/models/xai-grok-imagine-video#request-height)

- **Type**: `integer`
- **Required**: true
- **Paired with**: width

Height of the generated media in pixels.

**Learn more** (2 resources):

- [Image To Image: Dimensions Changing Aspect Ratio](https://runware.ai/docs/guides/image-to-image#dimensions-changing-aspect-ratio) (guide)
- [Image Outpainting: Dimensions Critical For Outpainting](https://runware.ai/docs/guides/image-outpainting#dimensions-critical-for-outpainting) (guide)

### [resolution](https://runware.ai/docs/models/xai-grok-imagine-video#request-resolution)

- **Type**: `string`
- **Default**: `480p`

Resolution preset for the output. When used with input media, automatically matches the aspect ratio from the input.

**Allowed values**: `480p` `720p`

### [duration](https://runware.ai/docs/models/xai-grok-imagine-video#request-duration)

- **Type**: `integer`
- **Min**: `1`
- **Max**: `15`
- **Default**: `6`

Length of the generated video in seconds. The total number of frames produced is determined by duration multiplied by the model's frame rate (fps).

## Response Parameters

### [taskType](https://runware.ai/docs/models/xai-grok-imagine-video#response-tasktype)

- **Type**: `string`
- **Required**: true
- **Value**: `videoInference`

Type of the task.

### [taskUUID](https://runware.ai/docs/models/xai-grok-imagine-video#response-taskuuid)

- **Type**: `string`
- **Required**: true
- **Format**: `UUID v4`

UUID of the task.

### [videoUUID](https://runware.ai/docs/models/xai-grok-imagine-video#response-videouuid)

- **Type**: `string`
- **Required**: true
- **Format**: `UUID v4`

UUID of the output video.

### [videoURL](https://runware.ai/docs/models/xai-grok-imagine-video#response-videourl)

- **Type**: `string`
- **Format**: `URI`

URL of the output video.

### [videoBase64Data](https://runware.ai/docs/models/xai-grok-imagine-video#response-videobase64data)

- **Type**: `string`

Base64-encoded video data.

### [videoDataURI](https://runware.ai/docs/models/xai-grok-imagine-video#response-videodatauri)

- **Type**: `string`
- **Format**: `URI`

Data URI of the output video.

### [seed](https://runware.ai/docs/models/xai-grok-imagine-video#response-seed)

- **Type**: `integer`

The seed used for generation. If none was provided, shows the randomly generated seed.

### [NSFWContent](https://runware.ai/docs/models/xai-grok-imagine-video#response-nsfwcontent)

- **Type**: `boolean`

Flag indicating if NSFW content was detected.

### [cost](https://runware.ai/docs/models/xai-grok-imagine-video#response-cost)

- **Type**: `float`

Task cost in USD. Present when `includeCost` is set to `true` in the request.

## Examples

### Moonlit Clockwork Carnival Alley (Text to Video)

[Watch video](https://assets.runware.ai/examples/xai-grok-imagine-video/67739e9e-409d-4bba-a1eb-a44ead6f72cc.mp4)

**Request**:

```json
{
  "taskType": "videoInference",
  "taskUUID": "6378aa49-9f31-414a-8e00-3076dea79124",
  "model": "xai:grok-imagine@video",
  "positivePrompt": "A moonlit steampunk carnival tucked inside a narrow cobblestone alley, brass automatons performing for a small midnight crowd, glowing paper lanterns swaying overhead, a carousel horse mounted on mechanical rails gliding past the camera, curls of steam venting from copper pipes, sparkling puddles reflecting amber and cyan light. The camera begins with a slow dolly forward through hanging banners, then gently pans to reveal a violin-playing clockwork fox on a velvet crate while a tiny piston-driven drummer taps a lively rhythm nearby. A distant ferris wheel turns above the rooftops, gears clicking softly, carnival chimes ringing, murmuring spectators, fluttering fabric, footsteps on wet stone, occasional hiss of steam, and a warm female ringmaster voice off-screen saying, 'Step closer, the midnight wonders have only just awakened.' Cinematic, highly detailed, magical yet tactile, expressive lighting, atmospheric depth, dynamic motion, synchronized sound design.",
  "width": 1280,
  "height": 720,
  "duration": 8
}
```

**Response**:

```json
{
  "taskType": "videoInference",
  "taskUUID": "6378aa49-9f31-414a-8e00-3076dea79124",
  "videoUUID": "c8b6b301-7a5e-4aa8-a3e5-0f4377da76ea",
  "videoURL": "https://vm.runware.ai/video/os/a16d07/ws/5/vi/c8b6b301-7a5e-4aa8-a3e5-0f4377da76ea.mp4",
  "cost": 0.56
}
```

---

### Bioluminescent Reef Dreamscape (Video to Video)

[Watch video](https://assets.runware.ai/examples/xai-grok-imagine-video/19017c8e-7242-4a8f-8a4c-65ecddf260f3.mp4)

**Request**:

```json
{
  "taskType": "videoInference",
  "taskUUID": "050f2911-487c-49eb-820f-e2e47f39ec6f",
  "model": "xai:grok-imagine@video",
  "positivePrompt": "Transform the reference video into a luminous fantasy undersea vision. The tunnel becomes an ancient submerged coral cathedral with glowing arches, drifting jellyfish lanterns, shimmering schools of silver fish, soft shafts of turquoise light, bioluminescent particles, and vast whale-song atmosphere. Preserve the original pacing and camera movement from the source video, but restylize the walking figure as a mysterious traveler in a flowing deep-sea cloak with reflective details. Add synchronized native audio: distant whale calls, bubbling water resonance, gentle current swells, soft footsteps echoing on wet stone, and a faint whispered line, 'The reef remembers.' Cinematic, dreamlike, richly detailed, immersive, high-end visual effects, coherent motion, natural scene continuity.",
  "inputs": {
    "referenceVideos": [
      "https://assets.runware.ai/assets/inputs/931713e7-94a4-46ff-b6a2-a450a9ab9e53.mp4"
    ]
  }
}
```

**Response**:

```json
{
  "taskType": "videoInference",
  "taskUUID": "050f2911-487c-49eb-820f-e2e47f39ec6f",
  "videoUUID": "ff8ba334-00f7-4158-8296-6dbc1f2fd2d2",
  "videoURL": "https://vm.runware.ai/video/os/a17d13/ws/5/vi/ff8ba334-00f7-4158-8296-6dbc1f2fd2d2.mp4",
  "cost": 0.404
}
```

---

### Stormy Lighthouse Rescue (Text to Video)

[Watch video](https://assets.runware.ai/examples/xai-grok-imagine-video/2ed5ba0b-c38b-4270-b4f8-3f24063f4abd.mp4)

**Request**:

```json
{
  "taskType": "videoInference",
  "taskUUID": "ba22dba3-0b1d-44e1-ad81-b3eb720d91fe",
  "model": "xai:grok-imagine@video",
  "positivePrompt": "A cinematic night storm on a rocky coastline, waves crashing against black cliffs below a tall lighthouse. The camera begins with a wide aerial sweep through rain and sea mist, then pushes toward the lighthouse window glowing warm amber. Inside, a soaked coast guard captain grabs a radio and shouts, 'Hold on, we're coming for you!' Cut to the beam of the lighthouse slicing through sheets of rain as a rescue boat battles the surf below. Thunder rolls, wind howls, rain lashes metal railings, sirens pulse faintly in the distance, and the ocean roars with deep, immersive sound. Realistic water physics, dramatic lightning flashes, high contrast cinematic lighting, tense atmosphere, natural lip sync for the spoken line, polished blockbuster look.",
  "width": 1280,
  "height": 720,
  "duration": 8
}
```

**Response**:

```json
{
  "taskType": "videoInference",
  "taskUUID": "ba22dba3-0b1d-44e1-ad81-b3eb720d91fe",
  "videoUUID": "ed30ad9a-6693-4ff3-8775-796ae3cd8fe7",
  "videoURL": "https://vm.runware.ai/video/os/a03d21/ws/5/vi/ed30ad9a-6693-4ff3-8775-796ae3cd8fe7.mp4",
  "cost": 0.56
}
```