MiniMax Hailuo 2.3

High fidelity AI video generation from text or images

6ffa44ff-4029-4ae5-8792-c2a8d939fcdb

MiniMax Hailuo 2.3 is a cinematic video model for short form production. It accepts text prompts or image inputs and outputs 6 or 10 second clips at 768p or 1080p. It focuses on consistent motion, strong physics, and stable scenes for ads, social content, and creative shots.

MiniMax
Commercial use
text-to-videoimage-to-video
Currently 20% off for February 2026! 20% OFF

February special — save 20% until 1 March 2026

768p · 6s$0.28$0.22
768p · 10s$0.56$0.45
1080p · 6s$0.49$0.39

Examples

031046ba-1d86-4d76-92c8-bb89f7a65095
f4ab0170-5007-4cff-8605-ee156140c193
40e94e19-264d-4503-a8c0-3a78a589f23f
c0dc3454-d1b4-44d1-ae89-788082f08f64
45a852bf-3392-429e-b54a-510ec773eda6
da473e86-feca-4713-bc65-b7bf40527608
2190b391-21be-4786-b1a4-c75b0ada1a39
ea8f70f9-e76b-4f69-80f4-8145fe54b201
fa54ac85-2f03-45ea-aeab-62d2dcb3f247

README

Overview

Hailuo 2.3 is an AI video generation model that turns text prompts and images into video clips. It focuses on keeping motion stable over time and making sure your characters and visual style don’t drift between frames.

Compared to earlier Hailuo models, version 2.3 behaves more predictably during motion and does a better job of keeping characters consistent. It’s built for day-to-day video generation rather than simple experimentation.

How it Works

Prompt Interpretation

Text prompts guide what appears in the video, how the scene looks, and how things move. Prompts that clearly describe actions or the scene tend to create more reliable clips. You can specify the visual style directly in the prompt, too.

Image-Based Video Generation

For I2V, your generated video will keep the structure and appearance of your input image while adding motion over time. You can use this to animate existing assets or renders without rebuilding the scene.

Motion and Continuity

Hailuo 2.3 produces more stable motion across frames compared to earlier versions. Generated clips show less flicker and do a better job of keeping subjects consistent as the video plays.

Stylization and Visual Content

The model supports both realistic and stylised output. When you define a style the prompt or input image, it generally holds up across the clip without drifting.

Key Features

  • Motion and Continuity Improvements
    Produces smoother motion and more consistent results across frames compared to earlier models.

  • Character Consistency
    Maintains character appearance more reliably throughout a clip.

  • Stylisation Support
    Handles both realistic and stylized visuals with stable output.

  • Text and Logo Handling
    Improved stability when rendering text and logos within generated videos.

Technical Specifications

  • Model Name: MiniMax Hailuo 2.3
  • Model Type: Text-to-video and image-to-video
  • Input: Text prompt with optional input image
  • Resolution: 768p or 1080p.
  • Duration: 6 or 10 seconds (1366×768, default: 6), 6 seconds (1920×1080).

How to Use

  1. Use a text prompt and/or upload a static image.
  2. Pick your options, such as duration or resolution.
  3. Run the generation and see your creation.
  4. Adjust the prompt or input if you need to refine it.

Example prompt:
“A medium shot of a person walking through a quiet city street at night. The camera slowly tracks forward as the subject walks. Soft street lighting reflects on wet pavement. The scene has a cinematic, realistic style with natural motion.”

Documentation

You can find full usage details, parameters, and examples here:
https://runware.ai/docs/providers/minimax#hailuo-23