Model ID: ideogram:4@0

Status: live

Ideogram 4.0

By Ideogram, June 3, 2026

Ideogram 4.0 is Ideogram's most capable text-to-image model for design-heavy image generation. It is built for frontier text rendering across languages, structured prompt control through natural language or JSON, bounding-box layout control, transparent background generation, and high-fidelity 2K output. It is well suited to posters, branded graphics, packaging, product visuals, typography-led compositions, and other workflows where design precision matters as much as visual quality.

API Options

Platform-level options for task execution and delivery.

taskType (string, required; value: imageInference): Identifier for the type of task being performed.

taskUUID (string, required; UUID v4): UUID v4 identifier for tracking tasks and matching async responses. Must be unique per task.

outputType (string, default: URL): Image output type.
Allowed values: 3 values

outputFormat (string, default: JPG)

Specifies the file format of the generated output. The available values depend on the task type and the specific model's capabilities.

`JPG`: Best for photorealistic images with smaller file sizes (no transparency).
`PNG`: Lossless compression, supports high quality and transparency (alpha channel).
`WEBP`: Modern format providing superior compression and transparency support.

**Transparency**: If you are using features like background removal or LayerDiffuse that require transparency, you must select a format that supports an alpha channel (e.g., `PNG`, `WEBP`, `TIFF`). `JPG` does not support transparency.

Allowed values: `JPG`, `PNG`, `WEBP`

outputQuality (integer, min: 20, max: 99, default: 95): Compression quality of the output. Higher values preserve quality but increase file size.

webhookURL (string, URI)

Specifies a webhook URL where JSON responses will be sent via HTTP POST when generation tasks complete. For batch requests with multiple results, each completed item triggers a separate webhook call as it becomes available.

Learn more: Webhooks (Platform)

deliveryMethod (string, default: sync)

Determines how the API delivers task results.

Allowed values:

`sync`: Returns complete results directly in the API response.
`async`: Returns an immediate acknowledgment with the task UUID. Poll for results using getResponse.

Learn more: Task Polling (Platform)

uploadEndpoint (string, URI)

Specifies a URL where the generated content will be automatically uploaded using the HTTP PUT method. The raw binary data of the media file is sent directly as the request body. For secure uploads to cloud storage, use presigned URLs that include temporary authentication credentials.

Common use cases:

Cloud storage: Upload directly to S3 buckets, Google Cloud Storage, or Azure Blob Storage using presigned URLs.
CDN integration: Upload to content delivery networks for immediate distribution.

// S3 presigned URL for secure upload
https://your-bucket.s3.amazonaws.com/generated/content.mp4?X-Amz-Signature=abc123&X-Amz-Expires=3600

// Google Cloud Storage presigned URL
https://storage.googleapis.com/your-bucket/content.jpg?X-Goog-Signature=xyz789

// Custom storage endpoint
https://storage.example.com/uploads/generated-image.jpg

The content data will be sent as the request body to the specified URL when generation is complete.

safety (object)

Content safety checking configuration for image generation.

Properties:

safety » checkContent (boolean): Enable or disable content safety checking.

ttl (integer, min: 60): Time-to-live (TTL) in seconds for generated content. Only applies when outputType is URL.

includeCost (boolean): Include task cost in the response.

numberResults (integer, min: 1, max: 20, default: 1): Number of results to generate. Each result uses a different seed, producing variations of the same parameters.

Core Parameters

Primary parameters that define the task output.

model (string, required; value: ideogram:4@0)

Identifier of the model to use for generation.

positivePrompt (string, min: 2)

Text prompt describing elements to include in the generated output. Automatically expanded into a structured prompt before generation.

Learn more: Prompts (Learn)

width (integer, required, paired with height)

Width of the generated media in pixels.

Learn more: Dimensions (Learn); Image Outpainting: Dimensions Critical For Outpainting (Learn)

height (integer, required, paired with width)

Height of the generated media in pixels.

Learn more: Dimensions (Learn); Image Outpainting: Dimensions Critical For Outpainting (Learn)
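As a minimal sketch of how the required API options and core parameters fit together, the Python below assembles one task object. The helper name `build_task` is hypothetical, and treating `min: 2` as a minimum character count is an assumption; only the field names and fixed values come from this page. Sending the request (endpoint, authentication) is deliberately left out.

```python
import json
import uuid


def build_task(prompt: str, width: int, height: int) -> dict:
    """Assemble a single imageInference task for ideogram:4@0 (hypothetical helper)."""
    # Assumption: "min: 2" on positivePrompt is a minimum length in characters.
    if len(prompt) < 2:
        raise ValueError("positivePrompt needs at least 2 characters")
    return {
        "taskType": "imageInference",
        "taskUUID": str(uuid.uuid4()),  # UUID v4, unique per task
        "model": "ideogram:4@0",
        "positivePrompt": prompt,
        "width": width,    # width and height are supplied together
        "height": height,
    }


task = build_task("A vintage travel poster of a coastal town at sunset.", 2048, 2048)
print(json.dumps(task, indent=2))
```

Generating the UUID inside the helper keeps each task unique without the caller having to track identifiers.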

Settings

Technical parameters to fine-tune the inference process. These must be nested inside the settings object.

settings » copyrightDetection (boolean): Opt into post-generation copyright detection using Hive likeness and logo checks.

settings » renderingSpeed (string, default: DEFAULT)

Generation speed/quality tradeoff.

Allowed values:

`TURBO`: Fastest generation.
`DEFAULT`: Balanced speed and quality.
`QUALITY`: Best quality.

settings » structuredPrompt (object)

settings.structuredPrompt is a structured prompt passed to the model directly. A natural-language positivePrompt is automatically expanded into this structured form before generation, so use settings.structuredPrompt when you want explicit control over the result. It must include compositional_deconstruction. Other fields are optional, and extra fields beyond the documented ones are accepted.

A few rules the field definitions don't capture:

Include exactly one of art_style or photo in style_description, depending on whether the image is illustrated or photographic.
Key order matters — keep fields in the order the model expects, as shown in the example. The style_description order also differs between photo and non-photo images.
bbox is row-first — [ymin, xmin, ymax, xmax] on a 0–1000 top-left canvas, so build it as [top, left, bottom, right].
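Because the row-first order is easy to get backwards, a small conversion helper can be useful. The sketch below is not part of the API: `to_bbox` is a hypothetical name, and it assumes you start from a pixel rectangle on a canvas of known size. It normalizes to the 0-1000 scale and emits the values as [top, left, bottom, right].

```python
def to_bbox(left: float, top: float, right: float, bottom: float,
            width: int, height: int) -> list[int]:
    """Convert a pixel rectangle to [ymin, xmin, ymax, xmax] on a 0-1000 canvas."""
    def scale(value: float, size: int) -> int:
        # Normalize to 0-1000 and clamp so rounding never leaves the canvas.
        return max(0, min(1000, round(value * 1000 / size)))

    # Row-first: vertical coordinates come before horizontal ones.
    return [scale(top, height), scale(left, width),
            scale(bottom, height), scale(right, width)]


# A headline band across the top of a 2048x2048 canvas.
print(to_bbox(left=205, top=123, right=1843, bottom=410, width=2048, height=2048))
# prints [60, 100, 200, 900]
```

The printed value is the same box used for the headline text element in the example on this page.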

Example:

"settings": {
  "structuredPrompt": {
    "high_level_description": "A vintage travel poster of a coastal town at sunset.",
    "style_description": {
      "aesthetics": "Retro mid-century travel poster",
      "lighting": "Warm golden-hour glow",
      "medium": "Screen-printed poster",
      "art_style": "Flat vector illustration",
      "color_palette": ["#E8A33D", "#2E5E78", "#F2E9D8"]
    },
    "compositional_deconstruction": {
      "background": "Mediterranean harbor under a warm sunset sky with soft golden light.",
      "elements": [
        {"type": "obj", "bbox": [400, 120, 760, 880], "desc": "White sailboats moored along the stone pier"},
        {"type": "text", "bbox": [60, 100, 200, 900], "text": "VISIT THE COAST", "desc": "Bold retro headline across the top"}
      ]
    }
  }
}

Properties:

settings » structuredPrompt » compositional_deconstruction (object, required)

Scene breakdown into background and individual elements.

Properties:

settings » structuredPrompt » compositional_deconstruction » background (string): Environment, framing, and lighting context.

settings » structuredPrompt » compositional_deconstruction » elements (array of objects, required)

Objects and text to place in the scene.

Properties:

settings » structuredPrompt » compositional_deconstruction » elements » type (string): Element type. Use 'obj' for a visual object and 'text' for rendered text.
Allowed values: 'obj', 'text'

settings » structuredPrompt » compositional_deconstruction » elements » bbox (array of integers, min: 0, max: 1000, items: 4): Element placement as [ymin, xmin, ymax, xmax].

settings » structuredPrompt » compositional_deconstruction » elements » color_palette (array of strings, max items: 5): Hex colors for this element.

settings » structuredPrompt » compositional_deconstruction » elements » desc (string): Detailed description of the element.

settings » structuredPrompt » compositional_deconstruction » elements » text (string): Text content to render. Only applies when type is 'text'.

settings » structuredPrompt » high_level_description (string, required): One or two sentences describing the full image, including subject and style.

settings » structuredPrompt » style_description (object)

Styling guidance for the image.

Properties:

settings » structuredPrompt » style_description » aesthetics (string): Overall aesthetic direction.

settings » structuredPrompt » style_description » art_style (string): Art style for non-photographic images.

settings » structuredPrompt » style_description » color_palette (array of strings, max items: 16): Hex colors guiding the overall palette.

settings » structuredPrompt » style_description » lighting (string): Lighting setup and mood.

settings » structuredPrompt » style_description » medium (string): Artistic medium or capture format.

settings » structuredPrompt » style_description » photo (string): Photographic style for photo-realistic images.
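The rules and field constraints above lend themselves to a client-side check before a request is sent. The following is a rough sketch under stated assumptions: `check_structured_prompt` is a hypothetical helper, it covers only a few of the documented rules (it does not verify key order, for instance), and the API's own validation may differ.

```python
def check_structured_prompt(sp: dict) -> list[str]:
    """Return the problems found; an empty list means none of the checked rules failed."""
    cd = sp.get("compositional_deconstruction")
    if not isinstance(cd, dict):
        return ["compositional_deconstruction is required"]
    problems = []
    if "elements" not in cd:
        problems.append("compositional_deconstruction.elements is required")
    style = sp.get("style_description")
    # Exactly one of art_style or photo may appear in style_description.
    if style is not None and ("art_style" in style) == ("photo" in style):
        problems.append("style_description needs exactly one of art_style or photo")
    for i, el in enumerate(cd.get("elements", [])):
        if el.get("type") not in ("obj", "text"):
            problems.append(f"elements[{i}].type must be 'obj' or 'text'")
        bbox = el.get("bbox")
        if bbox is not None:
            in_range = len(bbox) == 4 and all(
                isinstance(v, int) and 0 <= v <= 1000 for v in bbox)
            if not in_range:
                problems.append(f"elements[{i}].bbox must be 4 integers in 0-1000")
            elif bbox[0] > bbox[2] or bbox[1] > bbox[3]:
                # Likely an [xmin, ymin, ...] mix-up or swapped corners.
                problems.append(f"elements[{i}].bbox is not [ymin, xmin, ymax, xmax]")
        if "text" in el and el.get("type") != "text":
            problems.append(f"elements[{i}].text only applies when type is 'text'")
    return problems


poster = {
    "high_level_description": "A vintage travel poster of a coastal town at sunset.",
    "style_description": {"art_style": "Flat vector illustration"},
    "compositional_deconstruction": {
        "background": "Mediterranean harbor under a warm sunset sky.",
        "elements": [
            {"type": "text", "bbox": [60, 100, 200, 900], "text": "VISIT THE COAST"},
        ],
    },
}
print(check_structured_prompt(poster))  # prints []
```

Returning a list rather than raising on the first failure lets a caller report every issue in one pass.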

Features

Standalone addons and post-processing features.

advancedFeatures » watermark (object)

Configuration object for adding watermarks to generated images. Watermarks can be applied using either text or image content with customizable positioning and appearance. You must provide either text or image content for the watermark, but not both.

Text watermark

"advancedFeatures": {
  "watermark": {
    "text": "© 2025 Company",
    "displayPosition": "bottom-right",
    "opacity": 0.6,
    "fontColor": "#ffffff",
    "bgColor": "#000000"
  }
}

Image watermark

"advancedFeatures": {
  "watermark": {
    "image": "c64351d5-4c59-42f7-95e1-eace013eddab",
    "displayPosition": "top-left",
    "opacity": 0.6
  }
}

Tiled watermark

"advancedFeatures": {
  "watermark": {
    "text": "PREVIEW",
    "tiled": true,
    "opacity": 0.4,
    "fontColor": "#cccccc"
  }
}

Properties:

advancedFeatures » watermark » text (string, min: 2, max: 32): Watermark text.

advancedFeatures » watermark » image (string): Watermark image (UUID, URL, Data URI, or Base64).

advancedFeatures » watermark » displayPosition (string): Watermark position.
Allowed values: 10 values

advancedFeatures » watermark » opacity (float, min: 0.1, max: 1, step: 0.01): Watermark opacity.

advancedFeatures » watermark » fontColor (string): Text color in hex format.

advancedFeatures » watermark » bgColor (string): Background color in hex format.
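The watermark constraints can be checked locally before submitting a task. This is only an illustrative sketch: `check_watermark` is a hypothetical helper, and it assumes the `min`/`max` on `text` refer to its length in characters.

```python
def check_watermark(wm: dict) -> list[str]:
    """Return the problems found in a watermark object; empty means none detected."""
    problems = []
    # Text and image content are mutually exclusive, and one of them is needed.
    if ("text" in wm) == ("image" in wm):
        problems.append("provide either text or image, but not both")
    text = wm.get("text")
    # Assumption: min 2 / max 32 are character counts.
    if text is not None and not 2 <= len(text) <= 32:
        problems.append("text must be 2 to 32 characters long")
    opacity = wm.get("opacity")
    if opacity is not None and not 0.1 <= opacity <= 1:
        problems.append("opacity must be between 0.1 and 1")
    return problems


tiled = {"text": "PREVIEW", "tiled": True, "opacity": 0.4, "fontColor": "#cccccc"}
print(check_watermark(tiled))  # prints []
```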

Notes

Ideogram 4.0 is driven by a structured JSON prompt. You can drive it two ways, and each request must use exactly one:

Natural language via positivePrompt. It's automatically expanded into the structured form before generation.
Structured JSON via settings.structuredPrompt. Passed to the model directly, skipping that expansion, for explicit control over composition, text, and layout.

See the settings.structuredPrompt parameter for the field reference, the Structured prompts guide for when to use each mode, and Text and design output for typography and layout.
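One way to enforce the "exactly one mode" rule in client code is to route both inputs through a single function. The sketch below is an assumption about how a caller might organize this, not part of the API; `apply_prompt` is a hypothetical name.

```python
from typing import Optional


def apply_prompt(task: dict,
                 positive_prompt: Optional[str] = None,
                 structured_prompt: Optional[dict] = None) -> dict:
    """Return a copy of task carrying exactly one of the two prompt modes."""
    if (positive_prompt is None) == (structured_prompt is None):
        raise ValueError(
            "use exactly one of positivePrompt or settings.structuredPrompt")
    result = dict(task)
    if positive_prompt is not None:
        # Natural language: expanded into the structured form before generation.
        result["positivePrompt"] = positive_prompt
    else:
        # Structured JSON: nested inside the settings object, other settings kept.
        settings = dict(result.get("settings", {}))
        settings["structuredPrompt"] = structured_prompt
        result["settings"] = settings
    return result


base = {"taskType": "imageInference", "model": "ideogram:4@0",
        "settings": {"renderingSpeed": "DEFAULT"}}
nl_task = apply_prompt(base, positive_prompt="A retro poster of a harbor at sunset")
print(sorted(nl_task))
```

Copying the task and its settings keeps the base dictionary reusable across several requests.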

Parameter Dependencies

Dimensions

The following dimension combinations are supported:

Configuration	Dimensions
`2K (1:1)`	`2048x2048`
`2K (1:2)`	`1440x2880`
`2K (2:1)`	`2880x1440`
`2K (2:3)`	`1664x2496`
`2K (3:2)`	`2496x1664`
`2K (4:5)`	`1792x2240`
`2K (5:4)`	`2240x1792`
`2K (9:16)`	`1440x2560`
`2K (16:9)`	`2560x1440`
`2K (5:8)`	`1600x2560`
`2K (8:5)`	`2560x1600`
`2K (3:4)`	`1728x2304`
`2K (4:3)`	`2304x1728`
`2K (9:22)`	`1296x3168`
`2K (22:9)`	`3168x1296`
`2K (9:23)`	`1152x2944`
`2K (23:9)`	`2944x1152`
`2K (3:8)`	`1248x3328`
`2K (8:3)`	`3328x1248`
`2K (5:12)`	`1280x3072`
`2K (12:5)`	`3072x1280`
`2K (1:3)`	`1024x3072`
`2K (3:1)`	`3072x1024`
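Since only these width and height pairs are listed, a lookup keyed by aspect ratio can help avoid unsupported sizes. The sketch below copies the table into Python; the names `SUPPORTED_DIMENSIONS` and `is_supported` are hypothetical, and each pair is read as (width, height).

```python
# Aspect ratio -> (width, height), copied from the table above.
SUPPORTED_DIMENSIONS = {
    "1:1": (2048, 2048),
    "1:2": (1440, 2880), "2:1": (2880, 1440),
    "2:3": (1664, 2496), "3:2": (2496, 1664),
    "4:5": (1792, 2240), "5:4": (2240, 1792),
    "9:16": (1440, 2560), "16:9": (2560, 1440),
    "5:8": (1600, 2560), "8:5": (2560, 1600),
    "3:4": (1728, 2304), "4:3": (2304, 1728),
    "9:22": (1296, 3168), "22:9": (3168, 1296),
    "9:23": (1152, 2944), "23:9": (2944, 1152),
    "3:8": (1248, 3328), "8:3": (3328, 1248),
    "5:12": (1280, 3072), "12:5": (3072, 1280),
    "1:3": (1024, 3072), "3:1": (3072, 1024),
}


def is_supported(width: int, height: int) -> bool:
    """True if the width/height pair appears in the supported table."""
    return (width, height) in SUPPORTED_DIMENSIONS.values()


width, height = SUPPORTED_DIMENSIONS["16:9"]
print(width, height, is_supported(width, height))  # prints 2560 1440 True
print(is_supported(1024, 1024))  # prints False
```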

Pricing

Pricing is per image:

Turbo: $0.03
Default: $0.06
Quality: $0.10
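For budgeting, the per-image prices can be turned into a rough estimate. The sketch below assumes that every result requested through numberResults is billed as one image, which this page does not state explicitly; `estimate_cost` is a hypothetical helper, and the includeCost option is the way to get the actual task cost in the response.

```python
from decimal import Decimal

# Per-image prices in USD, as listed above.
PRICE_PER_IMAGE = {
    "Turbo": Decimal("0.03"),
    "Default": Decimal("0.06"),
    "Quality": Decimal("0.10"),
}


def estimate_cost(tier: str, number_results: int = 1) -> Decimal:
    """Estimate the cost of one task, assuming each result is billed as one image."""
    if not 1 <= number_results <= 20:
        raise ValueError("numberResults must be between 1 and 20")
    return PRICE_PER_IMAGE[tier] * number_results


print(estimate_cost("Quality", 4))  # prints 0.40
```

Decimal is used instead of float so that small currency amounts do not pick up binary rounding errors.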