
Guide: Image Generation Base Models

All the differences between our Image Generation Models explained


When generating images on LetzAI, you can choose between different "Base Models".
While LetzAI supports public models from various partners, we also offer our own in-house base models. Each Base Model offers a different balance of speed, quality, and detail, and they come with different capabilities.

Here's what makes each of them special.



How to pick a base model

Right below your prompt bar, next to the generation button, you can expand the image settings. There you can choose which base model is used to generate your images.




Instant Models

Instant Models are our newest generation of served models. They work with reference images at generation time rather than requiring trained models, offering instant results with strong likeness preservation.

  • Nano Banana Pro — Our recommended model for most use cases
  • Seedream — Excellent for atmospheric and narrative-driven imagery


Nano Banana Pro (by Google) ⭐

The best all-around model — Recommended
Available on LetzAI since November 2025
Nano Banana Pro is currently our most powerful and recommended base model. Developed by Google DeepMind, it offers exceptional image quality with enhanced visual accuracy, better text rendering, and superior detail preservation. Unlike our Flux-based models, Nano Banana Pro works as an "Instant Model" — it uses reference images at generation time rather than requiring a trained model, meaning you get instant results even with new subjects.

The model excels at maintaining consistency across multiple subjects, supports high resolutions up to 4K, and provides studio-quality controls for lighting, focus, and color grading. It's particularly strong at preserving likeness when using the tagging system with your personal models.
Best For:
  • General-purpose image generation with maximum quality
  • Portrait and likeness preservation with tagged models
  • Product photography and commercial work
  • Complex scenes with multiple subjects
  • Text rendering within images
Trade-offs:
  • Results depend on reference image quality when using tags
  • Less stylized than some of our artistic Flux-based modes


Seedream

Exceptional for atmospheric and narrative imagery
Available on LetzAI since December 2025
Seedream 4.5, developed by ByteDance, is an advanced multimodal image generation model known for its exceptional natural language understanding and expressive light and shadow rendering. Unlike models that rely on mechanical tag stacking, Seedream truly comprehends 'stories' and 'atmosphere,' allowing it to seamlessly switch between hyper-realistic photography and surreal art styles.

Like Nano Banana Pro, Seedream is an Instant Model that doesn't require trained models — it works directly with reference images at generation time. It supports high-resolution 4K outputs and excels at complex multimodal tasks including style transfer and multi-image composition.
Best For:
  • Narrative and story-driven imagery
  • Atmospheric scenes with complex lighting
  • Switching between photorealistic and artistic styles
  • Style transfer and multi-image compositions
  • Creative projects requiring emotional depth
Trade-offs:
  • May add artistic interpretation to straightforward prompts
  • Slightly slower than Nano Banana Pro


Trained models "LetzAI V3" (Flux.dev-based)

These 5 original LetzAI Base Models are based on different fine-tuned versions of Flux.dev. They work best with trained models for consistent likeness preservation.



Cinematic

Doubling down on cinematic drama
Released in August 2025



If you want something that looks like it's straight out of a film frame, this is your pick. The Cinematic base model pushes lighting, framing, mood, color grading, depth, and atmosphere to the max. You'll see richer shadows, stronger contrast, and cinematic dynamics baked into every image.
Best For:
  • Film-inspired imagery and storytelling visuals
  • Dramatic portraits with moody lighting
  • Scenes requiring strong atmospheric depth
  • Projects where visual impact matters more than speed
Trade-offs:
  • Cinematic lighting and film grain are very prominent
  • Higher risk of flaws in hands and other small details


Creative

Let imagination run wild
Released in July 2025



This model is essentially the Sigma model on cinematic steroids. Creative mode leans into realism and a cinematic feel, but sharp details, hyperrealism, and neutral lighting take a back seat to mood, metaphor, and expressive flair. It's a hybrid between Default and Cinematic, ideal when you want something artistic but grounded.
Best For:
  • Artistic and expressive imagery with personality
  • Creative projects valuing mood over precision
  • Conceptual work and metaphorical visuals
  • Balanced approach between realism and artistry
Trade-offs:
  • Has a visible film grain and unique color tint
  • Output variety can feel limited when you use similar prompts


Turbo

Quick & dirty
Released in May 2025



As the name suggests, Turbo prioritizes speed over polish. You'll lose a bit of fidelity but gain a lot of generation speed compared to the other modes. Since the overall preservation of likeness and detail remains decent, Turbo is perfect when you're exploring many different prompts or directions at once.
Best For:
  • Rapid ideation and exploring multiple concepts
  • Quick drafts and testing different prompts
  • High-volume generation needs
  • When time matters more than perfection
Trade-offs:
  • Slightly lower quality, especially in complex lighting or fine texture areas
  • Reduced likeness of people and objects

Sigma

Faster, lighter, with surprising finesse
Released in March 2025



Sigma is our lean but still powerful mode. It's engineered to be roughly 2× faster than Default and tends to produce busier, rougher results. It's especially strong at generating close-ups and portraits. The grainy and noisy nature of this model has a higher chance of creating detailed skin and surface textures, making results feel less AI-like and more organic. Because it's lighter, Sigma is also great for quick iteration and testing.
Best For:
  • Portraits and close-up shots with realistic textures
  • Organic and natural-looking results
  • Fast iteration without sacrificing too much quality
  • Images where some grain adds character
Trade-offs:
  • Images can become too noisy and chaotic in extreme cases
  • Slightly lower base resolution compared to Default


Default

The neutral baseline and all-rounder
Released in September 2024



This is our reliable workhorse. It leans into consistency, quality, and predictable results. Use Default when you want a balance of good detail, likeness preservation, and overall stability. It's a dependable option for product images or portraits that need to maintain structure and realism, and it can also cover a wide variety of art, comic, and anime styles.
Best For:
  • Product photography and commercial work
  • Professional portraits with maintained likeness
  • Projects requiring predictable and consistent results
  • Wide range of styles including art, comics, and anime
Trade-offs:
  • Slower than the other modes we offer
  • Has the most "Flux-like" AI look when generating people

No matter the base model, remember: all image generations cost 5 credits in V4, so try a few modes for the same prompt and compare. You’ll quickly get a feel for which model best serves your idea.
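
If you want to budget a side-by-side comparison before you start, the arithmetic is simple. The snippet below is only a rough sketch: it assumes every generated image counts as one generation at the flat 5-credit price mentioned above, and the function name and the `images_per_model` parameter are made up for illustration, not part of any LetzAI tool or API.

```python
# Flat price taken from the guide: every image generation costs 5 credits in V4.
CREDITS_PER_IMAGE = 5


def comparison_cost(base_models, images_per_model=1):
    """Return the total credits needed to run one prompt on each listed base model.

    Assumption: each image is billed as one generation at the flat price.
    """
    return CREDITS_PER_IMAGE * len(base_models) * images_per_model


# Hypothetical comparison across the five Flux-based modes, one image each.
models = ["Default", "Sigma", "Turbo", "Creative", "Cinematic"]
print(comparison_cost(models))  # 25
```

Under these assumptions, trying the same prompt once in all five Flux-based modes would cost 25 credits.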