How HeyGen cut image generation costs by 50% without the infrastructure overhead

How a leading AI video platform unlocked new product capabilities by replacing expensive managed providers with a single, cost-effective inference API.

  • 50%+ cost reduction per image
  • 0 infrastructure engineers needed
  • 1 unified API for open-source models
  • New features shipped with confidence

The challenge

HeyGen needed a fast, cost-effective image generation solution that wouldn't drain engineering resources or inflate unit costs at scale.

Images play a critical role in HeyGen's workflow. High-quality images help users visualize their final output far earlier in the process. A user might start with an initial frame or build out a storyboard before committing to a full render, iterating quickly and confidently along the way.

To make that workflow viable at scale, HeyGen needed image generation that was both low-latency and affordable. The two options on the table were managed providers and self-hosting open-source models like FLUX. Managed providers offered convenience but came with costs that didn't hold up at production scale. Self-hosting offered theoretical savings, but would have required a significant engineering investment to build and maintain the supporting infrastructure. HeyGen didn't want to pull engineering resources away from its core product.

The team needed a third option: managed simplicity at self-hosted economics.

HeyGen's image generation tool lets users design a custom AI avatar before the camera rolls.

The solution

Runware gave HeyGen reliable access to top-tier open-source models at competitive unit economics, with no operational overhead.

Runware's API was highly compatible with HeyGen's existing workflows, so the migration was straightforward and required minimal engineering effort.

Runware enabled HeyGen to:

  • Access top-tier open-source models without managing any hosting infrastructure
  • Migrate existing workflows with minimal integration work
  • Scale image generation reliably under real production load
  • Keep unit economics predictable as usage grows

The switch struck the balance HeyGen was looking for: the reliability and ease of a managed service, without the cost structure that had made other providers unworkable at scale.

HeyGen's avatar style selector. Users can pick their look & style, then move straight to AI video generation.

The results

HeyGen reduced its cost per image by more than 50% and gained the confidence to ship new image-based features without hesitation.

The impact was immediate and measurable. Overall cost per image dropped by over 50%, which changed how the team approached product development. Features that had previously felt too expensive to build and release at scale became viable. HeyGen could now ship new image-based capabilities without running the numbers every time.
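To make the unit-economics argument concrete, here is a minimal Python sketch of the arithmetic behind a cost-per-image reduction. The prices, budget, and volume are hypothetical placeholders chosen for illustration only; they are not HeyGen's or Runware's actual figures, and the function names are likewise assumptions.

```python
def monthly_image_cost(images_per_month: int, cost_per_image: float) -> float:
    """Total monthly spend for a given volume and unit price."""
    return images_per_month * cost_per_image


def cost_reduction(old_cost: float, new_cost: float) -> float:
    """Fractional reduction in unit cost (0.5 means a 50% reduction)."""
    if old_cost <= 0:
        raise ValueError("old_cost must be positive")
    return (old_cost - new_cost) / old_cost


def images_within_budget(budget: float, cost_per_image: float) -> int:
    """How many images a fixed budget buys at a given unit price."""
    if cost_per_image <= 0:
        raise ValueError("cost_per_image must be positive")
    return int(budget // cost_per_image)


# Hypothetical inputs, for illustration only.
OLD_COST_PER_IMAGE = 0.04    # assumed price with a previous provider (USD)
NEW_COST_PER_IMAGE = 0.018   # assumed price after switching (USD)
MONTHLY_VOLUME = 1_000_000   # assumed images generated per month

old_spend = monthly_image_cost(MONTHLY_VOLUME, OLD_COST_PER_IMAGE)
new_spend = monthly_image_cost(MONTHLY_VOLUME, NEW_COST_PER_IMAGE)
reduction = cost_reduction(OLD_COST_PER_IMAGE, NEW_COST_PER_IMAGE)
affordable = images_within_budget(old_spend, NEW_COST_PER_IMAGE)

print(f"Old monthly spend: ${old_spend:,.2f}")
print(f"New monthly spend: ${new_spend:,.2f}")
print(f"Unit cost reduction: {reduction:.0%}")
print(f"Images affordable on the old budget: {affordable:,}")
```

Under these assumed numbers, a reduction of more than 50% means the same budget covers more than twice as many images, which is one way to see why features that once looked too expensive can become viable.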

Key outcomes:

  • 50%+ reduction in cost per image
  • New image-based features shipped with confidence
  • Zero infrastructure management overhead
  • Strong reliability under production traffic with no major incidents

"Runware halved our image generation costs. It sounds simple, but that kind of saving completely changes your product roadmap."

Kevin Raheja, Strategic Partnerships

Beyond the cost savings, Runware has proven consistently reliable under real production conditions. HeyGen has experienced no major issues since switching, and Runware is now a core part of its infrastructure and its primary solution for open-source image models.

Why did HeyGen choose Runware?

HeyGen chose Runware to cut image generation costs, eliminate the engineering burden of self-hosting, and unlock product velocity through a single, reliable inference API.