Text-to-Image Processor

Overview

The Text-to-Image Processor (text_to_image) generates a brand-new PNG image from a text prompt. It supports:


Example Usage

- processor:
    processor: "text_to_image"
    params:
      prompt: "A samurai standing under cherry blossoms"
      negative_prompt: "blurry"
      model: "sdxl"
      seed: 42

      width: 800
      height: 600

      options_json: |
        {
          "num_inference_steps": 28,
          "guidance_scale": 3.5,
          "aspect_ratio": "4:3"
        }
    outputs:
      target: "mem/samurai.png"

Inputs

This processor does not require a source image.

Key Required Description
N/A

Params

Key Type Required Default Description
prompt string Yes Positive text describing the desired image.
negative_prompt string No "" Text to steer the model away from unwanted content.
model string No "sdxl" Model code (e.g. "flux", "flux2", "sdxl", "recraft", "imagen", "imagen-fast", "imagen-ultra").
seed int No (rand) Random seed for reproducibility.
width int Yes Output width in pixels.
height int Yes Output height in pixels.
options_json string No "{}" JSON string of additional model-specific inputs; allowed keys vary by model and are safely filtered.

Outputs

Key Required Description
target Yes Path where the PNG is written.