AI Generate Image Anime

AI generate image anime is a bridge query for people who want an anime style image but are not sure whether to start from text or a reference photo. This page helps them choose between text to anime and transform from image workflows, then compare resolution, PNG export, and transparent background options.

soy_aria_cruz: Nano-Banana Pro vs Nano-Banana Realism vs Stylized Comparison
[Subject] Side-by-side comparison image with two vertical portrait panels of the same tattooed young woman interpreted in two different model styles. Left panel: realistic version, early 20s, feminine presentation, light olive skin, long straight black hair in a high ponytail with small orange flowers placed through the hair, thin round metal glasses, small septum ring, large yellow drop earrings, layered silver necklaces, black tank top, floral chest and shoulder tattoos, and puckered kiss-face lips. Right panel: stylized or beautified version of the same woman, still wearing thin round glasses, yellow earrings, layered necklaces, and visible tattoos, but with turquoise-blue hair styled into two high buns with long front sections falling down. She has a softer smile, smoother face, slightly more illustrative or beautified finish, gray sleeveless top, and the same general identity translated into a more stylized rendering. Bottom labels identify the left as "NANO-BANANA PRO" and the right as "NANO-BANANA".
[Environment] Minimal studio-style portrait background with soft beige or warm neutral backdrop, no environmental props. This image is a direct comparison poster illustrating the difference between a more realistic output and a more stylized output while preserving the same character identity markers.
[Composition/Camera] Vertical 3:4 canvas divided into two equal rounded-corner columns with a narrow divider. Both portraits are chest-up, centered, front-facing, and tightly cropped. Subject fills most of each panel. The left panel emphasizes realism and sharper photographic fidelity. The right panel emphasizes a cleaner, more beautified, somewhat illustrative aesthetic. Composition must remain highly symmetrical and consistent so viewers can compare style drift, identity retention, and detail fidelity.
[Lighting] Soft frontal portrait lighting with balanced illumination on both faces, gentle catchlights in the eyes, subtle reflections on glasses, and minimal shadow. Light should be neutral and flattering, allowing differences in texture realism and rendering style to show naturally. Skin and tattoos must remain readable in both panels.
[Style/Rendering] Comparison poster for AI portrait generation quality. Left panel should feel photorealistic, detailed, and grounded. Right panel should feel smoother, more stylized, and slightly digital or illustrated while retaining realism-adjacent portrait structure. Both images should remain polished and attractive, but the contrast between realism and stylization should be obvious. No extra poster graphics beyond the bottom labels.
[Detail constraints] Keep exactly two portrait panels, preserve key identity markers across both sides: glasses, yellow earrings, layered necklaces, tattoos, youthful female face, and centered framing. Maintain bottom labels "NANO-BANANA PRO" on the left and "NANO-BANANA" on the right. Do not add side props, cluttered backgrounds, extra people, or text beyond the labels. The comparison is about fidelity versus stylization, not about different scenes.

Negative prompt: single panel only, missing tattoos, missing glasses, no yellow earrings, no labels, cluttered background, wildly different identities between panels, cartoon exaggeration, anime eyes, painterly texture, distorted face, low-detail jewelry, no septum ring, extra objects, watermark.

Suggested parameters: aspect ratio 3:4, 70-85mm portrait feel, shallow depth of field, 28-36 steps, CFG/style strength 6.5-8, sampler DPM++ 2M Karras or equivalent, seed around 465328.

Delta prompt strategy:
1. If the split-screen disappears: add "two equal vertical comparison panels with a narrow divider and matched crop".
2. If identity diverges too much: add "same woman, same facial structure, same jewelry, same tattoos across both panels".
3. If the left panel is not realistic enough: add "left panel photorealistic, crisp skin detail, natural hair texture, grounded realism".
4. If the right panel is not stylized enough: add "right panel smoother, more beautified, slightly illustrative finish with turquoise twin buns".
5. If the tattoos fade: add "clear floral tattoos across chest, neck, and shoulders visible in both panels".
6. If the earrings disappear: add "large yellow drop earrings clearly visible on both sides".
7. If the orange flowers on the left vanish: add "small orange flowers tucked into the black ponytail on the left panel".
8. If the labels disappear: add "bottom left label NANO-BANANA PRO and bottom right label NANO-BANANA in bold white text".
9. If the background gets busy: add "plain warm neutral portrait backdrop with no props or decor".
10. If the glasses distort: add "thin round metal eyeglasses with natural reflections and correct lens proportions".
soy_aria_cruz: Winter Pink Puffer Comparison AI Image
[Subject] A side-by-side winter portrait comparison showing the same young woman in two closely matched variations. She appears in her 20s with fair skin, large blue-green eyes, soft pink cheeks, dark eyebrows, delicate facial features, and a calm friendly expression. She has long black hair gathered back with a fluffy white bow headband or plush winter hairband. She wears oversized round silver wire-frame glasses and medium hoop earrings. Her winter outfit consists of a light pink puffer jacket with a plush white faux-fur collar, a cream ribbed knit sweater, and beige knit fingerless gloves or mitten-style hand warmers. In the left panel, she faces the camera more directly with one hand lifted near the collar; in the right panel, she holds the jacket edges or collar with both hands and has a slightly more polished, symmetrical pose. Both panels should depict the same person and same cold-weather styling.

[Environment] Snowy outdoor mountain setting with softly blurred pale blue-white mountains and winter sky in the background. Light snowflakes drift across the frame. The composition is presented as a social-media comparison board: two tall rounded rectangular portrait panels side by side with a thin dark teal divider between them and a dark teal outer border or background. Keep the background clean, bright, and wintry, with no buildings or city elements. Do not include any bottom labels or text overlays.

[Composition/Camera] Vertical 4:5 diptych layout. Each panel is a tight medium portrait from chest level to above the head, centered on the subject. Eye-level camera, straightforward beauty composition, shallow depth of field, and balanced spacing around the headband. Subject fills most of each panel, with the fluffy white collar occupying the lower-middle area and the snowy mountains softly visible behind. The left panel feels slightly more candid and open; the right panel is more refined and centered. Maintain strong consistency so the image reads as a clear model comparison.

[Lighting] Soft overcast winter light with cool ambient brightness and very gentle contrast. Even illumination across the face, no harsh shadows, and subtle pink warmth in the skin from the cold environment. Snowy daylight creates clean catchlights in the eyes and slight shine on the glasses rims. Overall the light should feel bright, diffuse, and flattering, with a crisp cold-weather atmosphere but no blue cast so strong that skin looks lifeless.

[Style/Rendering] Photorealistic AI beauty portrait with cozy winter fashion styling. Clean skin rendering, fine knit and puffer textures, believable faux-fur softness, and lightly stylized social-media polish. The goal is a premium generator-comparison image that still feels warm and aspirational. Preserve natural facial identity across both panels and avoid cartoonish perfection. No text, no watermark, no logos.

[Detail constraints] Show exactly one matching woman in two side-by-side winter portrait variants. Keep the fluffy white bow headband, round glasses, pink puffer jacket, white faux-fur collar, cream sweater, and beige knit gloves consistent in both panels. Maintain snowy mountain background, falling snow, rounded panel corners, and the dark divider/background frame. Exclude the source image’s bottom model-name labels. Do not add scarves, hats, extra jewelry, or extra people. The subject must remain soft, approachable, and cold-weather polished.

Negative prompt: text overlay, "NANO-BANANA PRO", "NANO-BANANA", watermark, logo, app UI, extra people, city street, indoor background, no snow, harsh sun shadows, different hair color, no glasses, wrong jacket color, no fur collar, bulky scarf, ski goggles, heavy makeup, anime look, plastic skin, duplicate person inside one panel, distorted hands, asymmetrical eyes, cropped headband, overly blue skin tone.

Suggested parameters for reproducibility: aspect ratio 4:5; portrait lens feel 70mm; aperture f/2.8 look; 28-36 steps; CFG/style strength 5.5-7; sampler DPM++ 2M Karras or equivalent; seed suggestion 592714308.

Delta prompt strategy:
1. If the image stops reading as a comparison: append "two tall rounded portrait panels side by side showing the same woman in matching winter styling".
2. If the headband changes: append "fluffy white bow headband framing the top of the dark hair in both panels".
3. If the outfit drifts: append "light pink puffer jacket with a thick white faux-fur collar over a cream ribbed sweater".
4. If the gloves disappear: append "beige knit fingerless gloves or hand warmers visible near the collar and jacket opening".
5. If the glasses are lost: append "large round silver wire-frame eyeglasses with soft daylight reflections".
6. If the mountain setting becomes vague: append "snowy alpine mountain background, softly blurred, with light falling snow".
7. If bottom labels reappear: append "image-only comparison board, no words, no captions, no model names".
8. If the lighting becomes too dramatic: append "soft overcast winter daylight, even facial illumination, no harsh shadows".
9. If the skin becomes too cold or lifeless: append "natural rosy cheeks and soft warm skin tones despite the snowy environment".
10. If both panels become identical in pose: append "left panel slightly more candid with one hand near the collar, right panel more centered with both hands holding the jacket".
soy_aria_cruz: SOUL 2 vs Nano Banana Pro AI Art

[Subject] A split-screen comparison image with two vertical fashion-portrait panels separated by a dark teal divider. Both panels show the same young woman with fair skin, long black hair tied in a high ponytail, large silver hoop earrings, thin round eyeglasses, and refined neutral makeup. She wears a soft taupe-beige dress. In the left panel, she is shown in side profile by a sheer curtain, wearing a strapless or off-shoulder draped dress with one hand touching the curtain and the other resting at her waist. In the right panel, she faces more toward camera in a three-quarter pose, wearing a long-sleeve wrap-style version of the same taupe dress, one hand on the curtain and the other on her hip. Her expression on the right is more direct and editorial.

[Environment] Soft indoor portrait scene beside tall cream sheer curtains with warm filtered daylight passing through. Background should remain minimal, elegant, and softly shadowed, with the curtain texture acting as the main environmental element. The setting should feel like a luxury editorial interior without visible clutter.

[Composition/Camera] Two-panel vertical comparison cover for social media. Left panel is a softer side-profile portrait emphasizing translucency, skin softness, and subtle pose. Right panel is a more defined three-quarter fashion portrait emphasizing dress structure, body proportions, and direct gaze. Keep both panels consistent in subject identity and color palette while showing slightly different pose and garment interpretation. Add bottom labels: “Higgsfield SOUL 2” on the left with its neon-green icon and “NANO-BANANA PRO” on the right with a colorful star icon.

[Lighting] Diffused window light through sheer curtains, very soft shadows, warm neutral tones, flattering skin highlights, gentle contouring, no harsh sunlight or flash. The left side should look lighter and more ethereal, while the right side should feel more contrasty and sculpted but still soft.

[Style/Rendering] Hyper-real high-end fashion comparison image, elegant window-light editorial aesthetic, realistic skin, realistic cloth folds, refined portrait anatomy, luxurious but understated styling, clean benchmark graphic for AI-image-model comparison.

[Detail constraints] Preserve the split-screen layout, same woman across both panels, taupe-beige dress, eyeglasses, hoop earrings, ponytail, curtain interaction, and bottom model labels with icons. Do not add furniture, extra people, bold accessories, or dramatic background decor.

Negative prompt: single-panel portrait, outdoor scene, harsh sunlight, heavy jewelry, busy interior, missing glasses, different hair color, bright saturated dress, fantasy glow, neon room, extra people, text errors, distorted hands, malformed dress folds, watermark.

Suggested parameters: aspect ratio 4:5, split-screen composite, portrait lens around 50-85mm, shallow-to-medium depth of field, 28-38 steps, CFG 6.5-8, sampler DPM++ 2M Karras or similar, style strength low to medium, seed suggestion 906134228.

Delta prompt strategy:
1. If the split layout disappears: “two vertical comparison portrait panels with a center divider”.
2. If curtain lighting weakens: “soft sheer curtains with diffused daylight filtering through”.
3. If the left panel loses side profile: “left panel shows a calm side-profile pose touching the curtain”.
4. If the right panel loses direct editorial energy: “right panel shows a three-quarter fashion pose with one hand on hip”.
5. If the taupe dress changes color: “muted taupe-beige draped dress with elegant folds”.
6. If glasses disappear: “thin round eyeglasses visible in both panels”.
7. If icons or labels vanish: “bottom labels: Higgsfield SOUL 2 left, NANO-BANANA PRO right, each with a small icon”.
8. If the background becomes cluttered: “minimal interior with curtains as the only major environmental element”.
9. If skin looks overprocessed: “natural editorial skin texture with soft window-light realism”.
10. If the woman identity drifts between panels: “same woman, same face, same hairstyle, same accessories across both outputs”.
Video
GLOBAL LOCK: Subject is Natalia Dyer, an American actress with an oval face, high cheekbones, large expressive brown eyes, and fair skin with natural warmth. Her hair is dark brown, long, and wavy, styled into two thick, loose braids falling over her shoulders. She wears a dark, high-collared cloak/coat. Her expression is neutral, serene, and slightly melancholic, looking directly at the camera. The camera is a static Medium Close-Up (MCU) with a cinematic 35mm lens feel. High-fidelity skin textures and realistic lighting are mandatory.

[00:00–00:01]
Subject is centered in a grand, atmospheric gothic cathedral. Background features intricate stone arches and stained glass windows. Lighting: Misty, volumetric light beams (God rays) filter through the windows, creating a teal and orange contrast. Subject's face is softly lit by the ambient glow. Motion: Subtle dust motes dancing in the light beams.

[00:01–00:02]
Subject is centered in a vast golden hour meadow. Background features tall, dry grass and a distant horizon under a setting sun. Lighting: Warm, intense amber backlighting creating a soft rim light on her hair and cloak. A subtle lens flare peeks from the corner. Motion: Very slight swaying of the grass in the background.

[00:02–00:03]
Subject is centered in a dense autumn forest. Background is filled with vibrant orange and red maple leaves. Lighting: Dappled sunlight filtering through the canopy, creating soft patches of light on her face. Shallow depth of field with a creamy bokeh effect on the leaves. Motion: A few leaves slowly falling in the background.

NEGATIVE PROMPT: 
Facial distortion, changing eye color, changing hair style, inconsistent facial features, cartoonish look, plastic skin, extra limbs, blurry face, text, watermark, logo, flickering lighting, sudden jumps in subject position, robotic movement, oversaturated colors, low resolution.
soy_aria_cruz: SOUL 2 vs Nano Banana Water Realism AI
[Subject]
A split-screen comparison image featuring the same young adult woman in a swimming pool at night. She has fair skin, large round metal glasses, hoop earrings, dark hair pulled back tightly or in high tied sections, and a bright expressive face. In the left panel, only the upper face and glasses rise above the waterline, creating a dramatic half-submerged close-up. In the right panel, more of the body is visible above the water, showing a dark one-piece swimsuit and a friendly smile.

[Environment]
Nighttime pool scene with a nearly black background, strong flash-lit subject, reflective water surface, visible ripples, underwater light scatter, and floating particles or droplets. The pool water should feel deep and glassy with realistic reflections and refraction. Keep the setting minimal so the viewer focuses on water realism and facial detail.

[Composition/Camera]
Vertical two-panel comparison layout with rounded corners and a thin divider between panels. Left panel is a tighter close-up portrait dominated by face and waterline. Right panel is a medium portrait showing upper torso in water. Both panels should align visually as a model-comparison graphic. Bottom labels identify the models: "Higgsfield SOUL 2" on the left with a small neon-green icon, and "NANO-BANANA PRO" on the right with a multicolor diamond icon.

[Lighting]
Direct flash or strong frontal night lighting that creates crisp highlights on skin, bright reflections on the glasses, luminous specular highlights on the waterline, and a glossy wet-skin look. Preserve strong contrast against the dark background while keeping the face readable.

[Style/Rendering]
Photoreal AI comparison graphic focused on water realism, skin texture, flash portrait aesthetics, and high-contrast nocturnal mood. The result should feel like a creator benchmark post comparing image generators on a difficult refraction-heavy scene.

[Detail constraints]
do not remove the split-screen comparison, preserve the same woman in both panels, keep the glasses, waterline crossing the face or torso, dark night background, one-piece swimsuit on the right, and the model-name text labels at the bottom. Water realism and reflective distortion must remain central.

[Negative prompt]
daylight pool, beach scene, multiple people, no glasses, dry skin, no waterline, missing reflections, underwater camera only, cartoon water, anime, broken facial symmetry, distorted eyes behind glasses, random pool toys, text missing, over-smooth skin, different woman in each panel

[Suggested parameters]
- aspect ratio: 4:5 vertical overall
- lens/focal length: 50mm portrait with flash feel
- depth of field: shallow-medium
- steps: 32-40
- CFG/style strength: 5.5-7.0
- sampler: DPM++ 2M Karras or Euler a
- seed suggestion: 90734126

[Delta prompt strategy]
1. If the waterline looks fake, append: "realistic water refraction crossing the face, crisp reflective surface tension"
2. If the split comparison disappears, append: "two vertical benchmark panels side by side with rounded corners"
3. If the night mood weakens, append: "dark nighttime pool background with flash-lit subject"
4. If the glasses deform, append: "large round metal glasses with realistic reflections and correct lens shape"
5. If the left panel becomes too open, append: "tight close-up with only eyes, glasses, and upper face emerging above water"
6. If the right panel loses body context, append: "upper torso visible in water, dark one-piece swimsuit, smiling toward camera"
7. If the water becomes too clean, append: "tiny suspended particles, ripples, and specular highlights in the pool"
8. If labels vanish, append: "bottom comparison labels for Higgsfield SOUL 2 and NANO-BANANA PRO"
9. If the image becomes too glamorous, append: "benchmark-style realism test, flash portrait honesty, not luxury resort imagery"
10. If likeness drifts between panels, append: "the same woman appears in both panels with matching face and glasses"
Kiki Inspired Flying Selfie AI Image Prompt
[Subject] One young woman in a hyperreal flying selfie scene inspired by a whimsical witch-anime aesthetic. She appears early 20s, feminine presentation, slim build, light olive skin, large green-hazel eyes, long dark brown to black hair pulled back with loose strands blowing strongly in the wind, thin round glasses, medium gold hoop earrings, bright open smile showing teeth, rosy cheeks, and a joyful adventurous expression. She wears a dark navy dress or top. On her head is a very large bright red bow headband with white polka dots, tied dramatically above the crown. In her left arm she holds a small fluffy black kitten with yellow-gold eyes, white patch on the chest, and soft fur. Behind her left shoulder a straw broom is visible, angled backward in flight.
[Environment] High above a snow-covered mountain range under a vivid blue sky with soft white clouds. The ground far below is a textured expanse of icy peaks and ridges. The whole scene suggests fast airy motion through open sky, but remains bright and cheerful rather than dangerous. In the bottom-right corner of the image there is a small inset reference picture showing a more cartoon/anime-styled version of the same composition, accompanied by a curved red arrow pointing toward the main hyperreal image, indicating transformation from reference to realistic output.
[Composition/Camera] Vertical 3:4 composition with dynamic extreme selfie perspective, camera held high and close, subject face large and centered slightly right, arm extending toward the lens from the lower-right edge. The kitten sits in the lower-left foreground, close to the camera. The broom enters diagonally from the left-rear area. Hair and bow stream backward to emphasize movement. Bottom-right inset image occupies a small rectangular area and must remain clearly visible as a secondary element. Use a wide selfie lens feel around 20-24mm equivalent, but maintain attractive facial proportions.
[Lighting] Bright natural daylight from above and slightly front-left, with even illumination across the face, soft highlights on cheeks and glasses, and clear visibility of the kitten fur and bow texture. Sky and snow provide cool ambient bounce, while skin tones remain warm and lively. No harsh shadows; the mood should be crisp, optimistic, and airy.
[Style/Rendering] Photorealistic yet playful social-media comparison image, designed to show a cartoon-inspired concept translated into hyperreal photography. Clean, high-detail skin texture, realistic fabric, natural wind motion in hair, sharply rendered kitten fur, believable broom straw, saturated but controlled sky blues, and cheerful adventure energy. The inset should look noticeably more illustrated/anime-like, while the main image remains convincingly real.
[Detail constraints] Keep exactly one smiling flying subject, one black kitten, one straw broom, one oversized red polka-dot bow, and one small reference inset at bottom-right with a red arrow indicating transformation. Preserve the snowy mountain background and bright sky. Do not add extra characters, city elements, witches’ hats, magical sparkles, or multiple animals. This is a whimsical flying selfie with a realistic finish, not a fantasy battle scene.

Negative prompt: extra people, missing kitten, missing bow, missing broom, no inset reference image, no red arrow, witch hat, magical particles, dark storm sky, painterly main image, cartoon main image, distorted selfie face, warped cat anatomy, low-detail fur, generic clouds only with no mountains, text overlay, watermark.

Suggested parameters: aspect ratio 3:4, 20-24mm selfie lens feel, moderate depth of field, 28-38 steps, CFG/style strength 6.5-8, sampler DPM++ 2M Karras or equivalent, seed around 273644.

Delta prompt strategy:
1. If the cartoon-to-real comparison cue disappears: add "small anime-style reference inset at bottom-right with a curved red arrow pointing to the realistic main image".
2. If the bow becomes too small: add "oversized bright red bow with white polka dots dominating the top of the hairstyle".
3. If the kitten is missing or wrong: add "small fluffy black kitten with golden eyes and a tiny white chest patch held in one arm".
4. If the broom disappears: add "straw broom trailing diagonally behind the subject during flight".
5. If the scene loses motion: add "wind-swept hair and bow streaming backward, dynamic airborne selfie angle".
6. If the setting becomes generic sky: add "snow-covered mountain range far below, crisp icy ridges visible under the subject".
7. If the subject loses glasses: add "thin round eyeglasses clearly visible on the smiling face".
8. If the main image drifts cartoonish: add "main scene photorealistic, only the inset image remains anime-styled".
9. If facial proportions distort from wide angle: add "wide selfie lens with natural flattering facial proportions".
10. If lighting turns moody: add "bright cheerful daylight with clean sky and soft even facial illumination".
Video
GLOBAL LOCK: A young man in his early 20s, Mediterranean/Southern European appearance, olive skin tone, curly dark brown hair, well-groomed mustache and goatee. He wears a black cotton t-shirt with a vintage-style graphic print. The environment is a modern home office with soft, natural indoor lighting and a blurred background containing shelves and posters. Cinematic color grading with high dynamic range and soft highlight rolloff. Speech is energetic, clear, and direct-to-camera.

[00:00–00:02]
Subject: The man in a maroon and navy blue soccer jersey with "PEOPLESTYLE 07" on the front.
Environment: A grey asphalt street with white crosswalk markings.
Action: Standing still, looking directly at the camera with a neutral expression.
Framing: Medium shot, eye level.
Lighting: Warm, sepia-toned, mimicking the aged oil painting texture of the Mona Lisa shown in the top half of the split screen.
Motion: Subtle handheld camera micro-shake.
Speech: No speech, upbeat background music starts.

[00:02–00:03]
Subject: The man in a dark charcoal suit, white shirt, and striped tie.
Environment: A high-rise office with a large window overlooking a city skyline.
Action: Holding a vintage black desk phone to his ear, looking slightly off-camera.
Framing: Medium shot, eye level.
Lighting: High contrast, deep blues and vibrant yellows, mimicking Van Gogh's "Starry Night" shown in the top half.
Motion: Static camera.

[00:03–00:05]
Subject: The man in a plain black t-shirt.
Environment: An outdoor desert landscape at dusk.
Action: Profile view, looking over his shoulder toward the camera.
Framing: Medium close-up, side angle.
Lighting: Monochromatic warm orange glow, soft backlighting, mimicking the geometric 3D art above.
Motion: Slow camera pan around the subject.

[00:05–00:11]
Subject: The man in the global lock black graphic tee.
Environment: Home office desk with a laptop in the foreground.
Action: Talking to the camera, using expressive hand gestures (palms up, moving outward).
Framing: Medium close-up, eye level.
Lighting: Natural window light from the side, shallow depth of field.
Speech: "to your... with absolutely no prompts... that's why I started using..." (Energetic, persuasive tone).
Sync: High lip-sync strictness; cuts land on phrase endings.

[00:11–00:20]
Visual: Screen recording of the Higgsfield Hex interface. A dark mode dashboard. A cursor moves to click a "Color transfer" button. An abstract red, black, and white painting is uploaded. The UI extracts a color palette (red, pink, tan).
Action: Digital UI interaction.
Lighting: Clean digital screen glow.
Speech: Narrating the process (implied).

[00:20–00:37]
Subject: Back to the man in the home office.
Environment: Same as [00:05-00:11].
Action: Continuing to talk and gesture. Floating UI cards appear in front of him showing various images (a white goat, a vintage car, a blonde woman) all styled with the same color palette.
Framing: Medium close-up.
Text Overlays: "ARTISTIC VISION NOW DECODED", "#hex", "Comment 'SOUL'".
Speech: "and that's it... choose... artistic vision now decoded... if you want to try this out, comment 'SOUL' and I'll send you..."
Sync: High lip-sync strictness. Final cut on the CTA.

NEGATIVE PROMPT: Robotic speech, flat delivery, blurry face, inconsistent facial hair, flickering lighting, distorted UI text, messy background, unnatural hand movements, low-resolution textures, over-saturated colors, lip-sync lag.

SPEECH PACK:
[00:05–00:11]
Transcript: "...to your videos with absolutely no prompts. That's why I started using..."
TAKE_A: (Fast, excited) "...to your videos with absolutely NO prompts! That's why I started using..."
TAKE_B: (Confident, steady) "...to your videos with absolutely no prompts. [pause] That's why I started using..."

[00:20–00:37]
Transcript: "And that's it. Choose... artistic vision now decoded. If you want to try this out, comment 'SOUL' and I'll send you the link."
TAKE_A: (Inviting) "And that's it! Just choose... artistic vision now decoded. If you want to try this out, comment 'SOUL' [emphasis] and I'll send you the link!"
TAKE_B: (Direct) "And that's it. Choose your style. Artistic vision decoded. Comment 'SOUL' now and I'll send it over."
Video
GLOBAL LOCK: A vertical promotional AI video tile designed like a social-media prompt pack cover. Keep the composition consistent: a black decorative border with tiny star sparkles, large handwritten-style text at the bottom reading “+100 Prompts”, and a central portrait area showing a blonde young woman whose look shifts between stylized cartoon beauty and photoreal beauty. Keep the subject identity consistent across all frames: fair-skinned young woman, short blonde bob haircut, soft green or hazel eyes, black off-shoulder top with thin straps, black choker, delicate pretty expression. The visual concept is a smooth transformation or comparison between two aesthetics: a doll-like illustrated version and a realistic camera-ready portrait version. Background stays minimal and soft. Motion is subtle, focused on transition and light pose variation rather than action. No dialogue, no extra subtitles, no logos beyond the baked-in “+100 Prompts” design.

[00:00-00:01] Open on the stylized version of the blonde woman inside the black framed promo card. The face is slightly doll-like, with softened illustrated features, while the “+100 Prompts” text and sparkly border are already visible.

[00:01-00:02] The central portrait begins shifting into a more photoreal interpretation. Keep the bob haircut, choker, and off-shoulder black top fixed so the viewer reads this as a style transformation, not a different person.

[00:02-00:03] The realistic version becomes dominant: cleaner skin detail, natural lighting, and a more photographic face. The border, stars, and handwritten title remain static and legible.

[00:03-00:04] The portrait subtly drifts back toward the softer stylized look, as if comparing two prompt outcomes within the same branded card layout. Preserve the same gentle head angle and calm expression.

[00:04-00:05] End with the stylized portrait or a halfway blend that still clearly communicates the before-and-after concept. The final frame should feel like a course promo visual for a large prompt pack focused on portrait styles.

NEGATIVE PROMPT: missing border, missing stars, missing “+100 Prompts” text, unrelated background, hair color drift, changing clothing, extra accessories, warped bob haircut, asymmetrical face, heavy camera movement, subtitles, logos, watermark clutter, broken style transition, distorted eyes, unstable choker, aggressive morphing, uncanny blend artifacts.

SHOT PROMPTS:
SHOT 1 DELTA: establish stylized blonde portrait inside sparkly black promo frame.
SHOT 2 DELTA: begin transition toward realistic portrait while identity stays locked.
SHOT 3 DELTA: realistic beauty version fully readable, promo layout unchanged.
SHOT 4 DELTA: soften back toward stylized look for direct prompt-comparison feel.
SHOT 5 DELTA: finish on a clear branded style-comparison hero frame with “+100 Prompts”.

SPEECH PACK:
Timecoded transcript: no dialogue is present in the reference clip.
TAKE_A [00:00-00:05]: silent promo-card transformation, no speech.
TAKE_B [00:00-00:05]: no spoken words, portrait-style comparison only.
TAKE_C [00:00-00:05]: quiet prompt-pack cover animation showing stylized versus realistic portrait output.
Closest audible version: no intelligible spoken content detected.
Safe paraphrase version: a blonde portrait shifts between cartoon-like and realistic styles inside a branded “+100 Prompts” card.
curiousrefuge: Medieval Knight Mountain Peak AI Portrait
[Subject] A three-panel AI transformation showcase featuring an older man with short salt-and-pepper hair and a serious expression, presented first as a multi-angle indoor reference sheet and then reimagined as a medieval knight in dark steel armor with a fur-trimmed cape on a snowy mountain peak.

[Environment] Clean social-post layout on a pale gradient background. Top card: a 3x3 reference collage captured inside a softly lit modern room with doors, walls, and houseplants. Middle card: cinematic fantasy result on an alpine ridge with snow, rock faces, and cold blue sky. Bottom card: a dark green prompt box displaying the instruction text used to generate the fantasy version.

[Composition/Camera] Vertical infographic composition with rounded rectangular panels and subtle teal outlines. The reference grid uses varied medium shots, side profiles, and close-ups to establish likeness. The generated knight image uses a centered waist-up portrait with the subject facing camera on a mountain slope. The prompt panel is flat, readable, and aligned beneath the output image.

[Lighting] Soft natural indoor window light in the reference sheet; crisp daylight with cool high-altitude contrast in the knight result; even graphic lighting for the text panel.

[Style/Rendering] Photoreal AI workflow board, before-and-after comparison graphic, identity-preserving character transfer demo, polished creator education asset, crisp editorial UI framing, realistic metal textures, cinematic fantasy styling.

[Detail constraints] Preserve the man''s facial structure, age, nose shape, jawline, eyebrow shape, and salt-and-pepper hair color between reference and result. Emphasize the transformation from casual navy sweater to layered medieval armor without changing identity. Keep visible labels for REFERENCE IMAGE and NANO BANANA 2. Maintain a premium tutorial-post feel rather than a meme layout.

Negative prompt: extra characters, young face swap, different hair color, beard added, fantasy helmet covering the face, messy typography, distorted hands, duplicate panels, unreadable text, low-detail armor, cartoon rendering, oversaturated lighting.

Suggested parameters: image strength 0.55, stylization 220, contrast medium, sharpness medium-high, layout guidance strong, identity preservation very high.

Delta prompt strategy:
1. If likeness drifts, restate identical facial structure and hair color from the reference sheet.
2. If armor feels generic, specify dark steel breastplate, layered pauldrons, fur collar, and heavy blue cape.
3. If the board loses its tutorial format, reinforce three stacked cards with reference, output, and prompt sections.
4. If the mountain setting becomes vague, call for snowy ridge, jagged rocks, and clear alpine sky.
5. If the model ages the subject incorrectly, specify mature middle-aged male with consistent facial lines.
6. If the prompt card disappears, require a dark green text panel with visible instruction copy.
7. If the indoor references become inconsistent, ask for a multi-angle 3x3 room collage in a navy sweater.
8. If the result becomes too stylized, request photoreal fantasy costuming with believable metal texture.
9. If labels are missing, explicitly preserve REFERENCE IMAGE and NANO BANANA 2 text overlays.
10. If composition becomes cluttered, ask for clean spacing, rounded panels, and a premium creator-post layout.
Video
GLOBAL LOCK: Subject is Major Motoko Kusanagi (Scarlett Johansson), pale porcelain skin, sharp facial structure, short dark razor bob hairstyle, hair is wet and plastered with raindrops. Her right eye has a glowing cyan-colored cybernetic ring. She wears a glossy black form-fitting bodysuit. Environment is a futuristic cyberpunk city, Kurokawa Spiral Interchange, wet concrete, metallic pillars, neon signage in cyan and magenta. Weather is heavy rain with cold neon haze and mist. Lighting is high-contrast, hard strobes, motivated by neon sources. Cinematic film style, 35mm lens feel, high fidelity.

[00:00–00:02]
Tight profile close-up on Major Motoko Kusanagi. Her face is turned toward the right, gaze directed off-frame. Raindrops are visible on her skin and wet hair. Her right eye glows with a bright cyan cybernetic ring, casting a cool light on her cheekbone. Camera does a micro push-in. Lighting is cold and moody with blue highlights.

[00:02–00:05]
Transition to a wide action shot in the Kurokawa Spiral Interchange underpass. Major hooks her arm on a dripping metallic handrail, whips around a concrete pillar with high kinetic energy. She performs a mid-air dismount and kicks an enforcement rider off a moving motorcycle. Camera follows the movement with a dynamic tracking shot. Flashing white strobes from the motorcycle headlights.

[00:05–00:08]
Major slams into a neon vending kiosk. The impact causes the kiosk's glass to shatter, spraying sparks and magenta light onto the wet, reflective ground. Snap-pan camera movement following the impact. The scene is filled with rain mist and vibrant magenta signage glow. High-speed motion blur on the impact.

NEGATIVE PROMPT: blurry, low resolution, distorted face, inconsistent eye glow, dry hair, sunny weather, cartoonish, 3D render style, floating limbs, robotic movement, flickering lights, text, watermark, logo, messy background, flat lighting.

SPEECH PACK:
(No speech present in the video. Audio is focused on heavy synth-wave music and environmental foley.)
- Foley: Heavy rain ambience, metallic clink of the handrail, electrical buzz of the neon kiosk, shattering glass, high-voltage sparks.
- Music: Dark synth-wave, driving bassline, cinematic orchestral swells.
Video
GLOBAL LOCK: A vertical cinematic fashion tutorial video that begins with a direct hook and transitions into a moody nighttime beach portrait sequence. The subject is an East Asian young woman with short black hair and light skin, wearing a white satin slip dress, with visible tattoo sleeves and shoulder tattoos on one arm. The visual identity combines raw direct-flash photography, grainy night texture, dark ocean horizon, bright moonlight in the sky, wet sand reflections, and a dreamy editorial tone. Keep the same woman, dress, tattoos, beach-at-night setting, flash-lit skin highlights, and minimalist sensual styling throughout. The audio style is creator-led tutorial / prompt-sharing narration with concise social-video pacing, dry close mic sound, and an intimate but confident tone.

[00:00–00:04] Extreme close-up of the woman’s eye and cheek under hard direct flash, with a metallic star sticker on her face and strands of black hair crossing the frame. Large bold text appears in sequence: “STEAL,” “STEAL MY,” “STEAL MY AI,” “STEAL MY AI PROMPTS.” The camera is nearly static, intimate, and confrontational, using a macro beauty framing with shallow focus and stark flash highlights against deep shadow.

[00:00–00:04] The hook is spoken or implied as a fast creator-style opening line inviting viewers to take the prompts. Speech cadence is clipped and attention-grabbing, landing in sync with each text change. Lips are only partially visible, so sync matters less than timing and mood.

[00:04–00:09] Cut to a full-body night beach portrait. The woman stands barefoot at the shoreline in the white slip dress, lit by harsh on-camera flash while the moon glows above the horizon. Yellow subtitle-style text begins presenting prompt-writing advice across the lower portion of the frame. The camera alternates between profile and back views as she faces the sea, then touches her hair and turns slightly toward camera. Keep the wet sand glistening and the sky nearly black-blue.

[00:09–00:14] Continue the beach sequence with slower editorial posing. The woman steps through shallow water, then faces away from camera so the back of the dress and her damp hair are visible. Use a mix of medium full-body and lower-body shots that emphasize bare feet in the surf, dress hem in water, and direct-flash specular highlights on skin and fabric. The voiceover/tutorial text explains how the prompt should describe camera treatment and mood, while the images function as the visual result.

[00:14–00:19] The woman sits or kneels near the shoreline and opens her arms outward, then shifts into seated portrait poses looking toward the horizon and back to camera. The composition becomes softer and more romantic while still retaining the raw flash look. Yellow caption blocks continue in the lower frame with practical prompt tips. Motion is minimal, with small posture changes and gentle ocean movement carrying the scene.

[00:19–00:23] Move into medium close-up portraits of the seated woman in the surf. Her tattoos, shoulder line, cheekbones, and the satin texture of the dress become more prominent. She glances downward, then sideways, then leans toward the camera. Maintain the tension between harsh direct flash and soft emotional expression. The tutorial text suggests concrete structure for recreating the look rather than vague aesthetic language.

[00:23–00:25] End on standing and close-up beach portraits with the woman facing camera head-on and then slightly off-axis. The dress clings softly with dampness, the tattooed arm remains a clear identity anchor, and the flash creates a glossy editorial finish. The final beat feels like a complete visual example of the prompt style being taught: raw, romantic, direct-flash night photography translated into AI-video form.
soy_aria_cruz: Winter Pink Puffer Comparison AI Image
[Subject] A side-by-side winter portrait comparison featuring the same young woman shown twice in nearly matching styling and framing. She has fair skin, large blue-green eyes, long black hair tied into a high ponytail, oversized round wire-frame glasses, silver hoop earrings, soft pink lips, and a gentle friendly expression. She wears a fluffy white plush headband with a large bow on top, a pale pink puffer jacket with oversized white faux-fur collar, and a cream knit sweater underneath. On the left panel, her expression is slightly more neutral and direct, with one hand touching the collar near the lower right. On the right panel, she has a softer smile and slightly different hand placement near the coat opening. Keep the woman’s styling nearly identical in both panels while allowing minor natural variation.

[Environment] Snowy outdoor mountain setting in winter, blurred and pale in the background. The backdrop should show soft white snow, faint gray-blue mountain shapes, and floating snowflakes. The environment is simple and cold, but the subject remains warmly styled. This is not a single natural photograph: it is a split-screen comparison cover with two portrait panels placed side by side on a dark teal background. Each panel is framed as a rounded-rectangle card. White text overlays appear at the bottom of each panel: “NANO-BANANA PRO” on the left and “FLUX 2” on the right. Keep the full comparison layout because it is visibly part of the provided image.

[Composition/Camera] Vertical social-media comparison design, overall frame near 4:5. Two portrait cards fill most of the canvas, separated by a slim dark teal divider. Both portraits are medium close-ups from upper chest to slightly above the headband bow, centered and symmetrical enough to invite direct visual comparison. The left image is slightly tighter and cooler in facial expression, while the right image is a touch softer and more polished. Both subjects look directly at the camera. Preserve the clean side-by-side benchmarking layout and the card-like framing with rounded corners.

[Lighting] Soft overcast winter daylight with even frontal illumination on both faces. No harsh shadows. The light should feel diffuse, flattering, and cold-weather appropriate, keeping skin clear and smooth while preserving realistic facial depth. Snowflakes and pale mountain background remain softly lit. Overall color temperature is cool-neutral, but the pink jacket and cream knit add warmth. Maintain consistent lighting across both panels for fair comparison.

[Style/Rendering] Hyper-real winter selfie portrait with a social-media comparison aesthetic. The main emphasis is realism in skin, glasses, knit texture, faux-fur softness, and puffer-jacket material. The left panel should feel slightly sharper and more photographic, while the right panel can feel a little softer or more beautified, but both must remain plausible and high quality. The overall composition should read as a generator-versus-generator cover image, not a random collage and not a fashion magazine spread.

[Detail constraints] Do not remove the split layout. Keep exactly two vertical portrait cards of the same styled woman, with dark teal borders/divider and the white labels “NANO-BANANA PRO” and “FLUX 2” at the bottom of each respective panel. Preserve the pink puffer jacket, fluffy white collar, white bow headband, glasses, hoop earrings, ponytail, cream sweater, snowflakes, and snowy mountain backdrop. Do not convert the image into one single portrait or change the winter styling.

Negative prompt: single image only, missing split-screen, different people in each panel, blonde hair, no glasses, no bow headband, indoor background, Christmas room, ski goggles, heavy makeup, harsh sunlight, dark dramatic shadows, extra text, warped eyes, asymmetrical glasses, melted fur, deformed hands, anime illustration, painterly style, watermark clutter.

Suggested parameters: aspect ratio 4:5 vertical overall; lens 50-70mm equivalent portrait feel; aperture look f/2.8 to f/4; steps 30-40; CFG/style guidance 6.5-8; sampler DPM++ 2M Karras or photoreal portrait sampler; seed suggestion 286411570.

Delta prompt strategy:
1. If the split-screen disappears: "two rounded-rectangle portrait panels side by side on a dark teal background with a slim divider"
2. If the styling changes between panels: "same young woman in both images with matching pink puffer jacket, white bow headband, glasses, and cream sweater"
3. If the winter mood weakens: "snowy mountain background with floating snowflakes and cool diffuse daylight"
4. If the bow headband is wrong: "large plush white bow headband centered over a high black ponytail"
5. If the fur collar loses softness: "oversized fluffy white faux-fur collar around the neck and shoulders"
6. If the panels look too identical and artificial: "same subject, similar styling, slight natural variation in expression and hand placement between panels"
7. If text labels disappear: "white lower text labels reading NANO-BANANA PRO on the left and FLUX 2 on the right"
8. If it becomes a fashion editorial instead of a comparison: "social-media generator comparison cover, clean benchmarking layout"
9. If skin becomes over-retouched: "realistic skin texture, subtle winter softness, no beauty-filter plastic skin"
10. If the background gets busy: "minimal pale snowy mountains softly blurred behind the subject"
Underwater Pink Dress Comparison AI Image Prompt
[Subject] A side-by-side underwater portrait comparison of the same young woman shown in two similar but not identical images. She has fair skin, long dark brown to black hair floating freely in the water, large silver hoop earrings, bright blue-green eyes, and soft youthful facial features. She wears a pale pink outfit that combines a delicate pastel pink crop top or bralette and a voluminous layered pink tulle skirt resembling a ballerina or fairy-tale dress. In the left panel, her expression is calm and serene with lips closed, head slightly angled, and a more editorial stillness. In the right panel, she smiles more brightly, eyes more animated, and the pose feels slightly more buoyant and playful. Keep the woman’s identity and styling consistent across both panels.

[Environment] Both panels take place underwater in a clear turquoise-blue pool or water tank. The water surface is visible near the top, with shimmering wave distortions. Decorative chandelier elements hang from above and appear submerged or partially visible near the upper corners, adding a surreal luxury-fantasy mood. Sun-like caustic light patterns ripple across the woman’s skin and clothing. A few soft pink flower petals float near the frame edges. Small air bubbles are visible in the right panel. The overall image is presented as a vertical social-media comparison cover with two rounded portrait cards side by side, separated by a dark teal divider. White lower labels read “NANO-BANANA PRO” on the left and “NANO-BANANA” on the right.

[Composition/Camera] Overall layout is a two-panel split comparison. Each panel is a vertical rounded-rectangle portrait card. The woman is centered in both, framed from around waist to slightly above the head, floating upright underwater. The left image is slightly tighter and more composed, while the right image is a touch wider or more playful, revealing more of the tulle volume and water motion. The top waterline and chandelier fragments should remain visible. Keep the side-by-side benchmark structure intact and balanced.

[Lighting] Bright diffused underwater lighting with strong caustic reflections dancing over the face, shoulders, and pink fabric. The water should glow aqua and clean, while the subject remains well separated from the background. Light is soft but directionally patterned by the water surface. Maintain a dreamy, luminous underwater atmosphere without becoming murky. The right panel can feel slightly more vibrant, but both should remain believable in the same overall setup.

[Style/Rendering] Hyper-real fantasy-lifestyle underwater photography with a social comparison aesthetic. Highly detailed wet hair strands, realistic skin under water refraction, soft tulle textures, subtle bubbles, and elegant chandelier crystals. The image should feel like a benchmark between two high-end image generations, not a painting and not a mermaid fantasy illustration. Preserve a polished but still photoreal underwater look.

[Detail constraints] Do not collapse the comparison into one image. Keep the two-panel rounded-card layout with dark teal divider and the lower white labels “NANO-BANANA PRO” on the left and “NANO-BANANA” on the right. Preserve the same woman in both images, the pink top and fluffy tulle skirt, floating dark hair, hoop earrings, waterline at the top, chandelier crystal details, and underwater caustic lighting. Do not add fish, full mermaid tail, scuba gear, or poolside elements.

Negative prompt: single portrait only, different women in each panel, mermaid tail, fish swarm, scuba mask, snorkel, pool ladder, murky water, muddy green tones, missing chandelier, missing tulle skirt, no earrings, blonde hair, harsh flash, anime mermaid style, oil painting, extra text, watermark clutter, warped hands, distorted eyes, melted dress.

Suggested parameters: aspect ratio 4:5 overall comparison; lens 35-50mm equivalent underwater portrait feel; aperture look f/4 with broad subject clarity; steps 30-40; CFG/style guidance 6.5-8; sampler DPM++ 2M Karras or photoreal fantasy sampler; seed suggestion 662149305.

Delta prompt strategy:
1. If the split-screen disappears: "two vertical rounded portrait cards side by side with a dark teal divider and white generator labels at the bottom"
2. If the underwater mood weakens: "clear turquoise underwater scene with visible waterline, bubbles, and light caustics on skin and fabric"
3. If the dress loses volume: "pale pink layered tulle skirt blooming underwater around the waist and hips"
4. If the subject identity changes between panels: "same young woman in both panels with long dark floating hair and silver hoop earrings"
5. If the chandelier detail disappears: "submerged chandelier crystal elements hanging near the upper corners under the water surface"
6. If the panels feel identical and flat: "left panel calm editorial expression, right panel brighter smile and more playful buoyancy"
7. If it becomes too fantasy-like: "photoreal underwater portrait, not mermaid art, not illustration"
8. If caustic lighting is missing: "shimmering water-light reflections across face, shoulders, and pink clothing"
9. If text labels vanish: "bottom white labels reading NANO-BANANA PRO on the left and NANO-BANANA on the right"
10. If the water turns dark or dirty: "clean luminous aqua water with soft pastel atmosphere"
soy_aria_cruz: Flux vs Nano Banana Selfie AI Art

[Subject] A side-by-side split-screen comparison of the same young adult woman recorded outdoors in bright daytime. She has a slim build, light skin, dark hair tied into a high ponytail flying outward with motion, large round wire-frame glasses, hoop earrings, and a fitted black sleeveless athletic tank top. Left panel: she looks slightly downward with a soft smile, eyes partly lowered, in a candid sunny walking or jogging moment. Right panel: she faces the camera directly in a close selfie with a friendly open smile. In both panels her face is naturally lit by strong sunlight and her ponytail arcs dramatically behind her.

[Environment] Sunny city street with trees overhead, bright sky filtering through leaves, and soft urban buildings and traffic signals in the blurred background. The left panel feels more shaded by foliage with sun flares coming through the trees; the right panel is more direct and open, with a clearer urban daytime street behind the subject. Both panels share the same outdoor city-walk or light-exercise context.

[Composition/Camera] Vertical two-panel split layout separated by a narrow divider. Both sides are close smartphone portraits from chest-up framing. Left side is slightly more top-lit and downward-gazing, while the right side is a classic arm-extended selfie with direct eye contact. Bottom comparison labels identify the models: “FLUX 2 Klein” on the left and “NANO-BANANA PRO” on the right, with a colorful sparkle icon above the right-side text.

[Lighting] Strong natural daylight with warm highlights, sun filtering through tree leaves, and soft bright bokeh in the background. The left panel includes more dappled backlight and flare; the right panel has more direct front-side sunlight on the face. Contrast is lively but still flattering, typical of outdoor summer selfie conditions.

[Style/Rendering] Photoreal creator-style outdoor selfie comparison, bright social-media realism, everyday fitness/lifestyle content, natural skin texture, phone-camera framing, slight motion energy in hair, clean vibrant urban daylight look.

[Detail constraints] Preserve the side-by-side comparison, the black tank top, round glasses, hoop earrings, high ponytail, bright tree-lined city street, and the bottom model labels. Keep the two expressions distinct: candid downward smile on the left and direct happy selfie on the right. Do not add extra foreground people, hats, or heavy workout gear.

Negative prompt: indoor gym, sports bra only, sunglasses, no glasses, static studio portrait, cloudy moody weather, extra people beside the subject, no split layout, cartoon style, harsh over-retouching, dramatic fashion makeup, text overlays beyond the comparison labels.

Suggested parameters: aspect ratio 4:5 vertical; lens 28mm to 35mm smartphone selfie feel; medium depth of field; 22-32 steps; CFG/style strength 5.5-7; sampler DPM++ 2M Karras or equivalent; seed suggestion around 617284531.

Delta prompt strategy:
1. Split-screen disappears -> append: side-by-side two-panel smartphone selfie comparison with narrow divider and bottom labels.
2. Ponytail loses motion -> append: high ponytail lifted and swinging outward in bright outdoor movement.
3. Glasses vanish -> append: large round wire-frame glasses visible in both panels.
4. Outfit changes -> append: fitted black sleeveless athletic tank top, minimal styling.
5. City environment becomes generic park -> append: sunny tree-lined city street with soft buildings and traffic lights in the background.
6. Left panel loses its candid angle -> append: subject glancing slightly downward with a gentle smile in bright dappled sunlight.
7. Right panel stops reading as selfie -> append: direct arm-extended selfie with friendly smile and eye contact.
8. Lighting becomes flat -> append: strong natural daylight with leaf-filtered highlights and bright bokeh.
9. Image becomes polished ad campaign -> append: casual creator-style social media selfie realism, natural and approachable.
10. Labels disappear -> append: FLUX 2 Klein text on left and NANO-BANANA PRO text on right at the bottom.
soy_aria_cruz: Winter AI Portrait Comparison AI
[Subject] A side-by-side comparison image featuring the same young adult woman in a cozy winter portrait look, presented as two nearly matching close-up panels. She has light-to-medium skin, large round clear eyeglasses, dark brown to black hair with center part and loose face-framing strands, hoop earrings, soft natural makeup, and a gentle closed-mouth smile. She wears a fluffy animal-ear hood or plush winter hood framing the head, layered over a cream knit sweater and pale pink winter jacket. Small snow flecks or frost-like particles appear on the hood and hair. The subject remains front-facing and centered in both panels.

[Panel structure] Left panel is labeled "CHAT GPT 1.5" near the bottom in white text. Right panel is labeled "NANO-BANANA PRO" near the bottom in white text. Both panels show the same subject and outfit but with slightly different rendering quality, skin finish, and detail handling. The comparison layout is essential and should be preserved as a two-column split image with a thin divider.

[Wardrobe and materials] Plush cream or off-white hood with rounded ear shapes, pale pink puffer-style winter jacket, cream knit sweater underneath, layered delicate gold necklaces, and clear glasses. The hood and jacket should look soft and cozy. Skin should appear natural and lightly cold-flushed. On the right panel, subtle white frost freckles or snow specks on the cheeks and nose may appear more stylized and refined.

[Props/Objects] No handheld props. The only graphic elements are the two labels at the bottom of each panel. Backgrounds are soft, wintry, and minimally visible, with a teal outer frame or border around the full comparison card. No clutter, no scenery emphasis, and no extra people.

[Environment] Soft winter portrait environment, likely outdoors or lightly snow-dusted, but the background is blurred and secondary. The image functions more as an AI comparison card than as a narrative scene. The mood should stay clean, calm, and beauty-focused.

[Composition/Camera] Vertical split-screen comparison card with two equal portrait panels. Tight close-up or chest-up framing in both panels, face centered, eyes toward the camera. The subject fills most of each panel. The overall card has rounded-corner panels and a cool-toned surrounding background. Preserve the direct visual comparison format.

[Lighting] Soft flattering portrait light, bright enough to show skin texture and glasses clearly. Lighting remains even and gentle across both panels, with a cozy winter glow. No harsh flash. Slight warmth in the skin tones balanced with cool winter ambience.

[Color palette] Pale pink, cream, off-white, soft skin tones, black hair, subtle winter blue-green background accents, and white text labels. The palette should feel soft, clean, and feminine with a cold-weather beauty aesthetic.

[Style/Rendering] Realistic AI-comparison social graphic, polished portrait imagery designed for feed comparison, no cartoon styling. The output should clearly resemble a split-card comparison between two generators, with minor qualitative differences across panels but the same core prompt subject.

[Detail constraints] Keep exactly one woman repeated in two side-by-side panels with the same styling. Preserve the glasses, plush ear hood, pink jacket, cream sweater, and bottom text labels "CHAT GPT 1.5" and "NANO-BANANA PRO". Do not turn it into a single portrait or remove the comparison-card structure.

Negative prompt: single image only, different people in each panel, no glasses, no hood ears, no labels, busy background, full-body shot, studio backdrop, harsh flash, anime render, cartoon illustration, text in wrong positions, extra accessories dominating face, sunglasses, neon colors, watermark, brand logos other than the comparison labels, distorted split layout.

Suggested parameters: aspect ratio 4:5 vertical with two equal portrait columns, portrait lens feel around 50mm to 85mm equivalent, shallow-to-moderate depth of field, 28 to 36 steps, CFG/style strength 5.5 to 6.5, sampler DPM++ 2M Karras or similar natural-photo sampler, seed suggestion 361994 for stable twin-panel composition and subject consistency.

Delta prompt strategy:
1. If the split layout disappears: "two equal side-by-side portrait panels for AI comparison"
2. If labels disappear: "bottom text labels: CHAT GPT 1.5 on left, NANO-BANANA PRO on right"
3. If subject identity changes: "same woman with round glasses, center-part hair, and gentle smile in both panels"
4. If winter styling weakens: "plush cream animal-ear hood, pale pink jacket, cream knit sweater"
5. If glasses disappear: "large round clear eyeglasses clearly visible in both panels"
6. If background gets busy: "soft minimal winter blur, secondary to the face"
7. If the comparison stops reading as a tool demo: "clean AI comparison card graphic with rounded portrait panels"
8. If right panel loses extra refinement: "slightly cleaner winter detailing and subtle frost freckles on the right panel"
9. If color shifts too strong: "soft pale pink, cream, and cool winter tones"
10. If image becomes too editorial: "feed-friendly AI comparison post, straightforward portrait comparison"
Winter City Selfie Comparison AI Image Prompt
[Subject] A side-by-side winter city selfie comparison showing the same young woman in both panels. She has fair skin, long dark brown to black hair tied in a high ponytail, large round wire-frame glasses, silver hoop earrings, blue-green eyes, and soft natural makeup. She wears a medium-gray wool coat over a cream knit sweater and a thick light beige scarf wrapped around the neck. In the left panel, her smile is brighter and more casual, giving a friendly spontaneous selfie feeling. In the right panel, her expression is calmer and more refined, with a softer half-smile and slightly more polished look. Keep identity, clothing, and overall framing nearly identical across both panels, with only small natural variation.

[Environment] Outdoor city street at night in winter. Snowflakes drift across the scene. The background is softly blurred with warm orange streetlights, car bokeh, and tall dark skyscraper shapes rising behind. The atmosphere is cold, urban, and softly cinematic. The overall image is a social-media comparison cover made of two rounded portrait cards side by side on a dark teal background with a narrow divider between them. White text labels appear near the bottom of each card: “NANO-BANANA PRO” on the left and “NANO-BANANA” on the right. Keep this full comparison layout because it is visible in the original image.

[Composition/Camera] Vertical 4:5 overall comparison layout. Each panel is a medium close-up selfie crop from upper chest to slightly above the head. The woman is centered in both cards, with shallow background blur and snowy city lights behind. Left panel feels a touch looser and more candid, right panel slightly more centered and polished. Preserve the rounded-rectangle card framing, the dark divider, and the symmetrical side-by-side benchmark structure.

[Lighting] Soft cold evening ambient light on the face balanced with warm city bokeh in the background. The left panel can feel slightly more natural and on-location, while the right panel is a bit cleaner and more evenly flattering. Snowflakes catch small highlights. The subject’s face remains bright and readable without flash harshness. Keep realistic winter-night exposure and gentle contrast.

[Style/Rendering] Hyper-real winter selfie portrait with a social comparison aesthetic. Detailed glasses rims, natural skin texture, visible scarf knit and wool coat texture, subtle snow accumulation on hair and coat, and believable urban bokeh. Both panels should look high quality and plausible, with the left leaning slightly more candid and the right slightly more beautified. The frame should read as a generator benchmark cover, not a single standalone portrait and not a magazine editorial.

[Detail constraints] Do not remove the split layout. Keep the same woman in both panels with matching gray coat, cream sweater, beige scarf, glasses, hoop earrings, and high ponytail. Preserve the snowy city-night background, blurred streetlights, skyscraper silhouettes, and lower labels “NANO-BANANA PRO” and “NANO-BANANA”. Do not change the season, turn it into daylight, or replace the city with a studio backdrop.

Negative prompt: single image only, different women in each panel, indoor setting, no snow, daylight, forest background, no glasses, blonde hair, puffer jacket, dramatic makeup, heavy retouching, plastic skin, warped eyes, asymmetrical glasses, cluttered text, anime style, oil painting, watermark clutter, merged scarf.

Suggested parameters: aspect ratio 4:5 overall comparison; lens 50mm equivalent selfie-portrait feel; aperture look f/2 to f/2.8; steps 30-40; CFG/style guidance 6.5-8; sampler DPM++ 2M Karras or realistic portrait sampler; seed suggestion 480237161.

Delta prompt strategy:
1. If the split-screen disappears: "two rounded portrait cards side by side on a dark teal background with a slim divider"
2. If identity changes between panels: "same young woman in both panels with high dark ponytail, round glasses, hoop earrings, gray coat, and beige scarf"
3. If the winter city mood weakens: "night city street in snowfall with warm streetlight bokeh and distant skyscraper silhouettes"
4. If the scarf changes: "thick light beige scarf wrapped around the neck over a cream knit sweater"
5. If the left-right difference is lost: "left panel brighter candid smile, right panel calmer polished expression"
6. If snow disappears: "visible falling snowflakes and light snow dusting on hair and coat"
7. If it becomes too editorial: "social-media generator comparison cover, realistic winter selfie benchmark"
8. If labels vanish: "bottom white labels reading NANO-BANANA PRO on the left and NANO-BANANA on the right"
9. If the background gets too sharp: "soft urban bokeh with glowing car lights and blurred towers"
10. If skin becomes over-smoothed: "natural skin texture, believable winter-light portrait realism"

AI Generate Image Anime

Why this query sits between two workflows

Some users want to generate a new anime image from a prompt. Others want to start from a reference photo or sketch and turn that into anime. This page should handle both paths because the query sits between text generation and photo conversion.

A good tool level page does not force a user to understand model jargon. It should show the input choice clearly, then help them decide whether they are starting from scratch or from an existing image.

Key Insight: The best anime image page is a decision point, not a model lecture. Users need to know which input path fits their starting point.

Takeaway: Make the choice obvious first, then compare output quality and export options second.

Decision tree for the user

Starting from scratch: Use text prompt to anime image tools when you only have an idea or a scene description.

Have a reference image: Use transform from image tools when you already have a selfie, sketch, or composition you want to stylize.

Need clean export: Check whether the tool supports high resolution output, PNG export, or transparent background for design use.

Need fast iteration: Compare how quickly each tool lets you preview a result and adjust the look before you download.

What to compare in each tool

Input flexibility: The page should show whether the tool handles text, reference images, or both.

Output control: Resolution and file format matter when users want to reuse the image in design work.

Style clarity: The tool should keep the anime look readable without forcing the user into model specific settings.

Accessibility: Keep the explanation tool level so users can understand it without knowing about checkpoints or technical pipelines.

FAQ

Is this page about text to anime or photo to anime?

It covers both. The goal is to help users choose the right starting point.

Why mention PNG and transparent background?

Because many users want anime images for design use, not just for viewing.

Should the page go deep on model settings?

No. Keep it focused on the user decision and the output options that matter.

What is the most important comparison signal?

The clearest signal is whether the tool matches the input the user already has.