Riverflow 2.0 Pro (Riverflow Pro)

Riverflow 2.0 Pro is a high-quality, agentic image generation and editing model designed for reliability, strong prompt-following, and autonomous self-correction. It is optimized for quality over speed and may take longer to produce the best result.

Recommended timeout: up to 10 minutes per request (longer reasoning generally yields better outputs)
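Because a request may run for several minutes, the client timeout should be raised well above typical HTTP defaults. The sketch below shows one way to do that with Python's standard library. The helper name `post_json`, the JSON request format, and any URL passed to it are assumptions for illustration; this page does not describe the actual endpoint or payload shape.

```python
import json
import urllib.request

# Recommended upper bound from this page: 10 minutes per request.
REQUEST_TIMEOUT_SECONDS = 10 * 60


def post_json(url, payload, timeout=REQUEST_TIMEOUT_SECONDS):
    """POST a JSON payload and return the decoded JSON response.

    Hypothetical helper: the endpoint URL and payload shape are not
    specified by the model description and must be supplied by the caller.
    """
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    # The timeout applies to each blocking socket operation (connect, read),
    # so a slow-but-alive response is not cut off prematurely.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

If the HTTP client in use has a separate connect timeout, it can stay short; only the read timeout needs the long budget.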

Model capabilities

1) Text-to-Image (T2I)

Generate images from a text instruction, with support for:

a) Multiple resolutions (1K, 2K, 4K)
b) 10+ aspect ratios (including auto)
c) Optional transparent backgrounds
d) Optional prompt enhancement
e) Multi-iteration agentic refinement for higher reliability
f) Font Control for accurate, legible text rendering

2) Image-to-Image (I2I)

Edit or transform one or more input images using a text instruction. Common use cases:

a) Style transfer and re-rendering
b) Background changes
c) Object edits (add/remove/replace)
d) Layout/scene changes
e) Brand-safe variations
f) Font Control for editing or adding text in-image
g) Optional transparency
h) Reference-Based Super-Resolution (detail fixing with reference images)

3) Font Control (Text accuracy in images)

Font Control improves accuracy and legibility of text rendered inside generated or edited images.

Key points

Works for both text-to-image and image-to-image
Supports up to 2 fonts
Each font includes:
  a font file URL (.ttf, .otf, .woff, .woff2)
  the exact text to be rendered (up to ~300 characters per font)
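
The limits above can be checked on the client before a request is sent. The following is a minimal sketch, not the service's own validation: the key names `fontUrl` and `text` are hypothetical (only the `fontInputs` name appears on this page), and the 300-character limit is treated as a hard cap even though the page gives it as approximate.

```python
from urllib.parse import urlparse

MAX_FONTS = 2
MAX_TEXT_CHARS = 300  # the page says "up to ~300", so this cap is approximate
ALLOWED_EXTENSIONS = (".ttf", ".otf", ".woff", ".woff2")


def validate_font_inputs(font_inputs):
    """Return a list of problems; an empty list means the inputs look fine."""
    problems = []
    if len(font_inputs) > MAX_FONTS:
        problems.append(f"too many fonts: {len(font_inputs)} > {MAX_FONTS}")
    for i, font in enumerate(font_inputs):
        # "fontUrl" and "text" are hypothetical key names for illustration.
        path = urlparse(font.get("fontUrl", "")).path.lower()
        if not path.endswith(ALLOWED_EXTENSIONS):
            problems.append(f"font {i}: unsupported font file type")
        text = font.get("text", "")
        if not text:
            problems.append(f"font {i}: missing text to render")
        elif len(text) > MAX_TEXT_CHARS:
            problems.append(f"font {i}: text is {len(text)} characters")
    return problems
```

Returning a list of problems rather than raising on the first one lets a caller report every issue at once.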

Best practices

In your instruction, explicitly specify:

the exact text (verbatim)
casing (e.g. ALL CAPS)
placement (e.g. centered, top-left)
legibility requirements (e.g. “must be perfectly legible”)
the font name

In fontInputs, repeat the same text verbatim to improve accuracy.
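
One way to keep the instruction and fontInputs in sync is to build both from a single text value. The sketch below assumes a request shaped as a dictionary with an `instruction` key and a `fontInputs` list whose entries use `fontUrl` and `text`; all of these key names except `fontInputs` are hypothetical, as are the function name and the example font and URL.

```python
def build_font_request(text, font_name, font_url, placement, casing):
    """Compose an instruction and matching fontInputs from one text value."""
    # The instruction states the verbatim text, casing, placement,
    # a legibility requirement, and the font name.
    instruction = (
        f'Render the text "{text}" exactly as written, in {casing}, '
        f"{placement}, using the font {font_name}. "
        "The text must be perfectly legible."
    )
    return {
        "instruction": instruction,  # hypothetical key name
        # The same text is repeated here; inner key names are hypothetical.
        "fontInputs": [{"fontUrl": font_url, "text": text}],
    }


request = build_font_request(
    text="SUMMER SALE",
    font_name="Inter Bold",
    font_url="https://example.com/fonts/Inter-Bold.ttf",
    placement="centered",
    casing="ALL CAPS",
)
```

Deriving both fields from one variable avoids the instruction and the font entry drifting apart when the copy is edited.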

Model created 5 months, 3 weeks ago