the challenge
HeyGen needed a fast, cost-effective image generation solution that wouldn't drain engineering resources or inflate unit costs at scale.
Images for HeyGen play a critical role in their workflow. High-quality images help users visualize their final output far earlier in the process. A user might start with an initial frame or build out a storyboard before committing to a full render, iterating quickly and confidently along the way.
To make that workflow viable at scale, HeyGen needed image generation that was both low-latency and affordable. The two options on the table were managed providers and self-hosting open-source models like FLUX. Managed providers offered convenience but came with costs that didn't hold up at production scale. Self-hosting offered theoretical savings, but would have required a significant engineering investment to build and maintain the supporting infrastructure. HeyGen didn't want to pull engineering resources away from its core product.
The team needed a third option: managed simplicity at self-hosted economics.

HeyGen's image generation tool lets users design a custom AI avatar before the camera rolls.


