Qwen-Image
Qwen-Image high fidelity text aware image generation model

Qwen-Image is a 20B parameter vision language model from Alibaba Cloud. It focuses on precise text conditioned image generation and supports complex Chinese or English typography. It also enables accurate image editing workflows that need layout control and strong prompt following.
Commercial use
Each image generation costs $0.0058 at 1024x1024.
1024x1024 · 20 steps$0.0058
text-to-imageimage-editingimage-to-image
Examples

















