soy_aria_cruz: SOUL 2 vs Nano Banana Pro AI Art

SOUL 2 Vs. Nano Banana Pro 💥 Higgsfield has launched its new image generator, SOUL 2 ⚡ You can upload up to 80 reference images of your character to keep it more consistent 👀 To compare it properly, I tested it alongside Nano Banana Pro, which so far is my favorite image generator 💕 Honestly, some of the SOUL 2 results surprised me quite a bit... It's not bad at all, but I still prefer Nano Banana for most occasions 😅 Here are some of the images I generated, and I hope to read your opinions in the comments 💌 And if you want the prompts for all the images, comment "ARIA" and I'll send them to you by message!

Aria Cruz | Influencer AI

@soy_aria_cruz · Digital creator

INSTAGRAM · 2026-02-26

1.3K likes

920 comments

Prompt

[Subject] A side-by-side comparison image showing the same young woman holding a large stemmed wine glass directly toward the camera in a dim restaurant or bar setting. She appears in her early 20s with fair skin, green-hazel eyes, large round silver wire-frame glasses, hoop earrings, dark brown to black hair tied into a high ponytail, and a black sleeveless top. The left panel shows a softer, subtler smile and a more subdued mood, while the right panel shows a brighter open smile and more vivid expression. In both panels, the wine glass dominates the foreground and partially distorts or overlaps the face due to glass curvature and reflections.

[Environment] Two equal vertical portrait panels in a dark indoor nightlife environment. The background is mostly black or very dim, with a few warm amber bokeh lights and hints of restaurant or bar interior on the right panel. The left panel feels more flash-heavy and synthetic, with the glass flattening the scene. The right panel feels more real and dimensional, with better depth, cleaner hand rendering, and more natural low-light atmosphere. Bottom labels identify the sides as “Higgsfield SOUL 2” on the left and “NANO-BANANA PRO” on the right, each with its respective small icon.

[Composition/Camera] Tight vertical close-up portrait in split-screen A/B format. In both panels the woman is centered and holds the wine glass close to the lens so the bowl of the glass fills a large portion of the lower-middle foreground. The top rim of the glass crosses the face around nose or eye level, producing reflections, magnification, and distortion. Keep the framing intimate and social-media-like, as if taken during a night out with direct flash or strong phone light.

[Lighting] Flash or strong frontal phone light creates bright specular highlights on the wine glass, liquid surface, eyeglasses, and skin. The left side should feel slightly harsher and more artificial, with the glass flare flattening the face. The right side should maintain stronger realism, with more believable skin tones, glass thickness, liquid refraction, and background depth while still keeping the bright nightlife flash feel. Preserve white reflections and small point highlights on both the glass and eyeglass lenses.

[Style/Rendering] Hyperreal dual-panel nightlife benchmark image focused on glass realism, facial consistency, and flash-light behavior. Left panel: more synthetic and flatter. Right panel: more believable social-nightlife portrait with stronger physical material response. The tone should remain intimate, casual, and creator-like rather than editorially polished.

[Detail constraints] Keep exactly one woman in each panel and make them clearly the same person with the same glasses, ponytail, earrings, and black top. Preserve the dominant wine glass in the foreground, the dim bar ambiance, and the split-screen comparison layout. Do not add extra people, extra drinks, table clutter, or heavy color effects.

Negative prompt: single panel, no glass in foreground, cocktail garnish overload, champagne flute instead of rounded wine glass, outdoor scene, bright daylight, extra people, nightclub lasers, no glasses, no earrings, smiling differently beyond recognition, text removed, watermark, distorted fingers, missing stem, fantasy sparkles, luxury editorial studio shot.

Suggested parameters: aspect ratio 4:5, two-panel split, close portrait lens or smartphone flash portrait feel, shallow to moderate depth of field, 28-36 steps, CFG/style strength 6-7, sampler DPM++ 2M Karras or equivalent, seed suggestion 269145883.
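The suggested parameters above can be kept in one place so every run of the benchmark starts from the same settings. The following is a minimal sketch, assuming a generator that exposes steps, CFG, sampler, and seed; the class and function names (`SuggestedSettings`, `size_for_width`, `midpoint`) are hypothetical, and the 1024-pixel width is only an illustrative choice.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SuggestedSettings:
    """Hypothetical container for the suggested parameters listed above."""
    aspect_ratio: tuple = (4, 5)        # width : height
    steps: tuple = (28, 36)             # suggested range
    cfg_scale: tuple = (6.0, 7.0)       # "CFG/style strength" range
    sampler: str = "DPM++ 2M Karras"    # "or equivalent"
    seed: int = 269145883               # seed suggestion from the text


def size_for_width(settings, width):
    """Return (width, height) in pixels that match the aspect ratio."""
    ratio_w, ratio_h = settings.aspect_ratio
    return width, width * ratio_h // ratio_w


def midpoint(value_range):
    """Pick the middle of a suggested range as a neutral starting value."""
    low, high = value_range
    return (low + high) / 2


settings = SuggestedSettings()
print(size_for_width(settings, 1024))  # (1024, 1280)
print(midpoint(settings.steps))        # 32.0
print(midpoint(settings.cfg_scale))    # 6.5
```

Starting from the midpoint of each range is an assumption, not part of the original suggestion; any value inside the range fits the text equally well.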

Delta prompt strategy:
1. If the wine glass loses dominance: append "large rounded stemmed wine glass held very close to the lens, occupying much of the foreground".
2. If identity drifts between panels: append "the exact same woman appears in both panels with matching glasses, ponytail, earrings, and face".
3. If the left side is not flatter: append "left panel feels harsher, flashier, and more synthetic with flatter glass response".
4. If the right side is not more realistic: append "right panel shows cleaner skin, more believable glass refraction, and stronger low-light depth".
5. If the bar atmosphere disappears: append "dark restaurant or bar background with a few warm amber lights and minimal visible interior detail".
6. If the glass rim is misplaced: append "the wine glass rim cuts across the face, overlapping the nose and lower eye area".
7. If reflections vanish: append "bright specular reflections on the wine glass, liquid surface, and eyeglass lenses".
8. If the black top changes: append "simple black sleeveless top visible beneath the neck and shoulders".
9. If labels vanish: append "bottom labels read Higgsfield SOUL 2 on the left and NANO-BANANA PRO on the right".
10. If the mood becomes luxury-editorial: append "casual flash-lit nightlife snapshot, intimate and spontaneous, not a polished campaign shoot".
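The ten rules above amount to a lookup from an observed failure to a phrase that gets appended to the prompt. A minimal sketch of that lookup follows; the failure keys (such as `glass_not_dominant`) and the function name `apply_deltas` are hypothetical labels introduced for illustration, while the appended phrases are copied from the list.

```python
# Failure label (hypothetical key) -> phrase to append, taken from the list above.
DELTA_PROMPTS = {
    "glass_not_dominant": "large rounded stemmed wine glass held very close to the lens, occupying much of the foreground",
    "identity_drift": "the exact same woman appears in both panels with matching glasses, ponytail, earrings, and face",
    "left_not_flatter": "left panel feels harsher, flashier, and more synthetic with flatter glass response",
    "right_not_realistic": "right panel shows cleaner skin, more believable glass refraction, and stronger low-light depth",
    "bar_atmosphere_lost": "dark restaurant or bar background with a few warm amber lights and minimal visible interior detail",
    "rim_misplaced": "the wine glass rim cuts across the face, overlapping the nose and lower eye area",
    "reflections_missing": "bright specular reflections on the wine glass, liquid surface, and eyeglass lenses",
    "top_changed": "simple black sleeveless top visible beneath the neck and shoulders",
    "labels_missing": "bottom labels read Higgsfield SOUL 2 on the left and NANO-BANANA PRO on the right",
    "too_editorial": "casual flash-lit nightlife snapshot, intimate and spontaneous, not a polished campaign shoot",
}


def apply_deltas(base_prompt, failures):
    """Append one corrective phrase per observed failure, in the order given."""
    additions = [DELTA_PROMPTS[name] for name in failures]
    if not additions:
        return base_prompt
    return base_prompt + ", " + ", ".join(additions)


revised = apply_deltas(
    "split-screen nightlife portrait with a wine glass",
    ["identity_drift", "labels_missing"],
)
print(revised)
```

Appending only the phrases for failures actually seen in the last output keeps the prompt short and makes it easier to tell which addition fixed which problem.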

This image is a much smarter benchmark than it first appears. On the surface, it looks like a nightlife selfie with a wine glass. In practice, it is testing several hard rendering problems at once: transparent glass, liquid refraction, reflections on glasses, skin under direct flash, and hand anatomy holding a delicate object close to the lens. That is exactly why this kind of frame is so useful for creators comparing image models. One simple prop, used correctly, creates a high-information test.

The scene also works because it stays socially native. It feels like a real moment from dinner or drinks, not a synthetic benchmark diagram. That matters. Creator content about image quality spreads further when the frame still looks like something a person would actually post. This image does that by keeping the setup intimate, direct, and casually glamorous.

How soy_aria_cruz Compared SOUL 2 vs Nano Banana Pro and What to Recreate

Transparent objects are difficult for image generators because they require multiple layers of believable physics at the same time: shape, reflection, refraction, liquid level, and background distortion. When that transparent object is also held in front of a face, the challenge becomes even sharper. The audience can immediately see whether the face still looks coherent through the curved glass and whether the reflections feel earned or pasted on.

Signal	Evidence (from this image)	Mechanism	Replication Action
Transparent-object stress	The wine glass sits directly in front of the face and lens.	Glass distortion and refraction expose model weaknesses quickly.	Use one large transparent foreground object instead of many small props.
Flash realism test	Bright specular highlights appear on skin, eyeglasses, and the drink.	Strong direct light reveals whether materials respond in a physically plausible way.	Benchmark under direct flash or phone-light conditions, not only under soft studio light.
Identity retention under distortion	The same woman must remain recognizable despite overlaps from the glass and reflections.	Good models preserve facial coherence even when optics become messy.	Keep the hero face consistent while letting the glass distort only what should be distorted.

Best-fit uses and transfer logic

This format is ideal for realism comparisons, nightlife portrait prompt packs, object-interaction benchmarks, and creator content that compares models through lifestyle situations rather than lab-like test scenes. It also transfers well to coffee mugs, cocktail glasses, perfume bottles, mirrors, and other reflective or transparent objects.

Best fit: model-vs-model realism tests. Why it fits: glass and flash expose quality gaps fast. What to change: keep the same face, crop, and object position while changing only the renderer.
Best fit: prompt education about transparency. Why it fits: the image provides an immediate visual lesson in how hard glass is to render well. What to change: vary only the object type or ambient light.
Best fit: nightlife portrait content. Why it fits: the scene is naturally social and attractive even outside a benchmark context. What to change: adjust the venue mood, but keep the foreground object as the main optical challenge.
Not ideal: outdoor daytime lifestyle content. Reason: bright ambient light removes much of the flash-and-reflection tension that makes this image valuable.
Not ideal: product catalog shots. Reason: the image is about interaction and mood, not clean isolated merchandising.

Transfer recipe one: keep the close flash portrait and transparent object overlap; change the wine glass to a martini glass; slot template: {same face} {flash nightlife mood} {transparent foreground drink} {A/B realism test}.
Transfer recipe two: keep the same portrait logic; change the object to an espresso cup with reflective spoon; slot template: {same subject} {intimate cafe scene} {foreground object} {controlled comparison}.
Transfer recipe three: keep the hand-and-object challenge; change the object to a perfume bottle; slot template: {close portrait} {reflective transparent item} {same hero identity} {left vs right render}.
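A slot template like the ones above can be filled mechanically. The sketch below shows recipe one with Python's built-in `str.format`; the slot names are rewritten with underscores so they work as keyword arguments, and both the function name `fill_recipe` and the slot wording are illustrative assumptions rather than prompts from the original post.

```python
# Recipe one from the text, with slot names adapted to valid Python identifiers.
RECIPE_ONE = (
    "{same_face}, {flash_nightlife_mood}, "
    "{transparent_foreground_drink}, {ab_realism_test}"
)


def fill_recipe(template, **slots):
    """Fill every slot; a missing slot raises KeyError instead of leaving a gap."""
    return template.format(**slots)


prompt = fill_recipe(
    RECIPE_ONE,
    same_face="the exact same woman in both panels",
    flash_nightlife_mood="direct phone flash in a dim bar",
    transparent_foreground_drink="martini glass held very close to the lens",
    ab_realism_test="split-screen left vs right comparison",
)
print(prompt)
```

Because only the drink slot changes between the wine-glass original and the martini variant, any difference in output quality can be attributed to the new object rather than to the rest of the prompt.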

Aesthetic read

The strongest aesthetic choice is proximity. Everything is very close: the glass to the lens, the face to the frame, the flash to the subject. That closeness makes the scene feel personal and immediate. It also makes every rendering mistake easier to spot, which is why the frame functions well as both content and comparison.

The second strong choice is the limited palette. The image mostly lives in black, skin tone, silver, and warm highlights from the drink and background lights. That restraint prevents the scene from becoming visually noisy. In a benchmark image, that matters. Too many colors can hide flaws. A cleaner palette reveals them.

Observed	Why it matters for recreation
Large wine glass dominating the lower-middle foreground	This is the main optical challenge and the key visual hook.
Round eyeglasses showing their own reflections behind the wine glass	Multiple reflective layers make the image an excellent realism test.
Direct flash or strong frontal light	Hard highlights make surface behavior easy to evaluate.
Minimal dark background with warm points of light	The background stays quiet so the comparison focuses on the subject and glass.
Same woman rendered in both panels	Identity lock keeps the evaluation fair and readable.

Prompt technique breakdown

To recreate this image well, start from the object interaction, not from the portrait. If you only prompt “woman with wine glass,” you will get a generic restaurant photo. The real structure is: split-screen benchmark, same subject, large glass near lens, direct flash, dim venue, and then left-right realism differentiation.

Prompt chunk	What it controls	Swap ideas (EN, 2–3 options)
two-panel close-up nightlife comparison	Benchmark architecture and social readability	A/B nightlife portrait test; split-screen realism showdown; dual-panel close flash comparison
same woman with glasses holding a large wine glass close to the camera	Identity lock plus optical challenge	same face behind a glass; repeated portrait with foreground drink; identical heroine with transparent prop
dark bar background with warm bokeh	Venue mood and subject isolation	dim restaurant interior; moody lounge background; warm nightlife ambience
flash highlights on the glass, liquid, and eyeglass lenses	Material behavior and lighting challenge	hard flash reflections; bright specular nightlife lighting; phone-flash realism
left flatter, right more dimensional and realistic	The core A/B quality difference	synthetic vs photoreal; harsher render vs natural render; flatter output vs richer realism
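The chunk table above can be treated as an ordered list in which one chunk at a time is swapped for an alternative. The following is a small sketch under that reading; the chunk labels (`architecture`, `subject`, and so on) and the function name `build_prompt` are hypothetical, while the chunk text and the swap option come from the table.

```python
# Ordered prompt chunks from the table; the labels on the left are hypothetical.
BASE_CHUNKS = [
    ("architecture", "two-panel close-up nightlife comparison"),
    ("subject", "same woman with glasses holding a large wine glass close to the camera"),
    ("venue", "dark bar background with warm bokeh"),
    ("lighting", "flash highlights on the glass, liquid, and eyeglass lenses"),
    ("ab_split", "left flatter, right more dimensional and realistic"),
]


def build_prompt(swaps=None):
    """Join the chunks in order, replacing any chunk named in `swaps`."""
    swaps = swaps or {}
    unknown = set(swaps) - {label for label, _ in BASE_CHUNKS}
    if unknown:
        raise ValueError(f"unknown chunk labels: {sorted(unknown)}")
    return ", ".join(swaps.get(label, text) for label, text in BASE_CHUNKS)


print(build_prompt())
print(build_prompt({"venue": "dim restaurant interior"}))
```

Rejecting unknown labels guards against a typo silently producing the unmodified base prompt.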

Execution playbook

Lock the split-screen, the same woman, and the foreground wine glass position before touching anything else. Those are your invariants. First run: get the crop and glass overlap correct. Second run: refine only the reflections and liquid surface. Third run: refine only the hand and stem realism. Fourth run: refine only the difference between the left and right panel treatment.

Baseline: lock close crop, same subject, same glass placement, and dark venue.
Iteration 2: change only glass reflections, refraction, and liquid clarity.
Iteration 3: change only fingers, grip, and stem handling.
Iteration 4: change only the synthetic-vs-real separation between left and right.

This workflow matters because transparent-object tests stop being useful when pose, lighting, and crop are changing at the same time. Stability keeps the benchmark honest.
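One way to enforce the one-group-at-a-time rule is to record each run's settings and check which fields changed between consecutive runs. The sketch below assumes each run is described by a plain dictionary; the field names and the helpers `changed_fields` and `violations` are hypothetical.

```python
# Fields each step is allowed to change, following the playbook above.
PLAN = [
    ("baseline", {"crop", "subject", "glass_placement", "venue"}),
    ("iteration 2", {"glass_reflections", "refraction", "liquid_clarity"}),
    ("iteration 3", {"fingers", "grip", "stem"}),
    ("iteration 4", {"left_right_separation"}),
]


def changed_fields(previous, current):
    """Return the set of keys whose values differ between two runs."""
    keys = previous.keys() | current.keys()
    return {key for key in keys if previous.get(key) != current.get(key)}


def violations(previous, current, allowed):
    """Return fields that changed even though this step should not touch them."""
    return changed_fields(previous, current) - allowed


run_1 = {"crop": "tight vertical", "venue": "dark bar", "refraction": "default"}
run_2 = {"crop": "tight vertical", "venue": "dark bar", "refraction": "stronger"}
run_bad = {"crop": "wide", "venue": "dark bar", "refraction": "stronger"}

allowed_in_iteration_2 = PLAN[1][1]
print(violations(run_1, run_2, allowed_in_iteration_2))    # set()
print(violations(run_1, run_bad, allowed_in_iteration_2))  # {'crop'}
```

An empty result means the run stayed inside its step; anything else names the invariant that drifted.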
