December 27, 2024

Guide: Image Generation Base Models

All the differences between our Image Generation Models explained

When generating images on LetzAI, you can choose between different "Base Models".
While LetzAI supports public models from various partners, we also offer our own in-house base models. Each Base Model offers a different balance of speed, quality, and detail, and they come with different capabilities.

Here's what makes each of them special.

How to pick a base model

Right below your prompt bar, next to the generation button, you can expand image settings. Here you can choose the base model in which your images should be generated.

switch modes

Instant Models

Instant Models are our newest generation of served models. They work with reference images at generation time rather than requiring trained models, offering instant results with strong likeness preservation.

Nano Banana Pro — Our recommended model for most use cases
Seedream — Excellent for atmospheric and narrative-driven imagery

Nano Banana Pro (by Google) ⭐

The best all-around model — Recommended

Available on LetzAI since November, 2025

Nano Banana Pro is currently our most powerful and recommended base model. Developed by Google DeepMind, it offers exceptional image quality with enhanced visual accuracy, better text rendering, and superior detail preservation. Unlike our Flux-based models, Nano Banana Pro works as an "Instant Model" — it uses reference images at generation time rather than requiring a trained model, meaning you get instant results even with new subjects.

The model excels at maintaining consistency across multiple subjects, supports high resolutions up to 4K, and provides studio-quality controls for lighting, focus, and color grading. It's particularly strong at preserving likeness when using the tagging system with your personal models.

Speed

★★★★★

Aesthetics

★★★★★

Consistency

★★★★★

Best For:

General-purpose image generation with maximum quality
Portrait and likeness preservation with tagged models
Product photography and commercial work
Complex scenes with multiple subjects
Text rendering within images

Trade-offs:

Results depend on reference image quality when using tags
Less stylized than some of our artistic Flux-based modes

Seedream

Exceptional for atmospheric and narrative imagery

Available on LetzAI since December, 2025

Seedream 4.5, developed by ByteDance, is an advanced multimodal image generation model known for its exceptional natural language understanding and expressive light and shadow rendering. Unlike models that rely on mechanical tag stacking, Seedream truly comprehends 'stories' and 'atmosphere,' allowing it to seamlessly switch between hyper-realistic photography and surreal art styles.

Like Nano Banana Pro, Seedream is an Instant Model that doesn't require trained models — it works directly with reference images at generation time. It supports high-resolution 4K outputs and excels at complex multimodal tasks including style transfer and multi-image composition.

Speed

★★★★★

Aesthetics

★★★★★

Consistency

★★★★★

Best For:

Narrative and story-driven imagery
Atmospheric scenes with complex lighting
Switching between photorealistic and artistic styles
Style transfer and multi-image compositions
Creative projects requiring emotional depth

Trade-offs:

May add artistic interpretation to straightforward prompts
Slightly slower than Nano Banana Pro

Trained models "LetzAI V3" (Flux.dev-based)

These 5 original LetzAI Base Models are based on different fine-tuned versions of Flux.dev. They work best with trained models for consistent likeness preservation.

Cinematic

Doubling down on cinematic drama

Released in August, 2025

Cinematic Mode

If you want something that looks like it's straight out of a film frame, this is your pick. The Cinematic base model pushes lighting, framing, mood, color grading, depth, and atmosphere to the max. You'll see richer shadows, stronger contrast, and cinematic dynamics baked into every image.

Speed

★★★★★

Aesthetics

★★★★★

Consistency

★★★★★

Best For:

Film-inspired imagery and storytelling visuals
Dramatic portraits with moody lighting
Scenes requiring strong atmospheric depth
Projects where visual impact matters more than speed

Trade-offs:

Cinematic lighting and film grain are very prominent
Higher risk of depicting small hands and similar details flawed

Creative

Let imagination run wild

Released in July, 2025

Creative Mode

This model is basically Sigma model on cinematic steroids. Creative mode leans into realism and cinematic feel, but sharp details, hyperrealism, and neutral lighting take a back seat to mood, metaphor, and expressive flair. It's a hybrid between Default and Cinematic — ideal when you want something artistic but grounded.

Speed

★★★★★

Aesthetics

★★★★★

Consistency

★★★★★

Best For:

Artistic and expressive imagery with personality
Creative projects valuing mood over precision
Conceptual work and metaphorical visuals
Balanced approach between realism and artistry

Trade-offs:

Has a visible film grain and unique color tint
Can feel a little limited in its variety when using similar prompts

Turbo

Quick & dirty

Released in May 2025

Turbo Mode

As the name suggests, Turbo prioritizes speed over polish. You'll lose a bit of fidelity but gain a lot of generation speed compared to the other modes. Since the overall preservation of likeness and detail remains decent, Turbo is perfect when you're exploring many different prompts or directions at once.

Speed

★★★★★

Aesthetics

★★★★★

Consistency

★★★★★

Best For:

Rapid ideation and exploring multiple concepts
Quick drafts and testing different prompts
High-volume generation needs
When time matters more than perfection

What you give up:

Slightly lower quality, especially in complex lighting or fine texture areas
Reduced likeness of people and objects

Sigma

Faster, lighter, with surprising finesse

Released in March, 2025

Sigma Mode

Sigma is our lean but still powerful mode. It's engineered to be roughly 2× faster than Default and tends to produce busier, rougher results. It's especially strong at generating close-ups and portraits. The grainy and noisy nature of this model has a higher chance of creating detailed skin and surface textures, making results feel less AI-like and more organic. Because it's lighter, Sigma is also great for quick iteration and testing.

Speed

★★★★★

Aesthetics

★★★★★

Consistency

★★★★★

Best For:

Portraits and close-up shots with realistic textures
Organic and natural-looking results
Fast iteration without sacrificing too much quality
Images where some grain adds character

Trade-offs:

Images can become too noisy and chaotic in extreme cases
Slightly lower base resolution compared to Default

Default

The neutral baseline and allrounder

Released in September, 2024

Default Mode

This is our reliable workhorse. It leans into consistency, quality, and predictable results. Use Default when you want a balance of good detail, likeness preservation, and overall stability. It's a reliable option whenever you want to produce product images or portraits that maintain structure and realism. But it's also able to cover a wide variety of art, comic, and anime styles.

Speed

★★★★★

Aesthetics

★★★★★

Consistency

★★★★★

Best For:

Product photography and commercial work
Professional portraits with maintained likeness
Projects requiring predictable and consistent results
Wide range of styles including art, comics, and anime

Trade-offs:

Slower than the other modes we offer
Has the most "Flux-Like" AI feel to it when generating people

No matter the base model, remember: all image generations cost 5 credits in V4, so try a few modes for the same prompt and compare. You’ll quickly get a feel for which model best serves your idea.

Continue Reading

← Previous

Collaborate in Community Boards

Image Editing