Kiki Inspired Flying Selfie AI Image Prompt

😍 Copy any image with Nano Banana Pro. Before, with Nano Banana, you could copy images, but it didn't work well with all of them (you had to burn through a huge number of credits and failed generations) to get a so-so result. 😅 Now, with Nano Banana Pro, everything is much easier!! I put it to the test again, but this time trying to copy or replicate a cartoon image to get a hyperrealistic result. 🥹 Honestly, I wasn't expecting results this good… Sometimes it has failed on me, but I'm almost sure that, by adapting the prompt a little, it would get it perfect. 🙊 💌 If you want me to send you the prompts I used and the reference images, comment "ARIA" and I'll send them to you by DM 💕

Aria Cruz | Influencer AI

@soy_aria_cruz · Digital creator

INSTAGRAM · 2025-11-28

387 likes

740 comments

Prompt

[Subject] One young woman in a hyperreal flying selfie scene inspired by a whimsical witch-anime aesthetic. She appears early 20s, feminine presentation, slim build, light olive skin, large green-hazel eyes, long dark brown to black hair pulled back with loose strands blowing strongly in the wind, thin round glasses, medium gold hoop earrings, bright open smile showing teeth, rosy cheeks, and a joyful adventurous expression. She wears a dark navy dress or top. On her head is a very large bright red bow headband with white polka dots, tied dramatically above the crown. In her left arm she holds a small fluffy black kitten with yellow-gold eyes, white patch on the chest, and soft fur. Behind her left shoulder a straw broom is visible, angled backward in flight.
[Environment] High above a snow-covered mountain range under a vivid blue sky with soft white clouds. The ground far below is a textured expanse of icy peaks and ridges. The whole scene suggests fast airy motion through open sky, but remains bright and cheerful rather than dangerous. In the bottom-right corner of the image there is a small inset reference picture showing a more cartoon/anime-styled version of the same composition, accompanied by a curved red arrow pointing toward the main hyperreal image, indicating transformation from reference to realistic output.
[Composition/Camera] Vertical 3:4 composition with dynamic extreme selfie perspective, camera held high and close, subject face large and centered slightly right, arm extending toward the lens from the lower-right edge. The kitten sits in the lower-left foreground, close to the camera. The broom enters diagonally from the left-rear area. Hair and bow stream backward to emphasize movement. Bottom-right inset image occupies a small rectangular area and must remain clearly visible as a secondary element. Use a wide selfie lens feel around 20-24mm equivalent, but maintain attractive facial proportions.
[Lighting] Bright natural daylight from above and slightly front-left, with even illumination across the face, soft highlights on cheeks and glasses, and clear visibility of the kitten fur and bow texture. Sky and snow provide cool ambient bounce, while skin tones remain warm and lively. No harsh shadows; the mood should be crisp, optimistic, and airy.
[Style/Rendering] Photorealistic yet playful social-media comparison image, designed to show a cartoon-inspired concept translated into hyperreal photography. Clean, high-detail skin texture, realistic fabric, natural wind motion in hair, sharply rendered kitten fur, believable broom straw, saturated but controlled sky blues, and cheerful adventure energy. The inset should look noticeably more illustrated/anime-like, while the main image remains convincingly real.
[Detail constraints] Keep exactly one smiling flying subject, one black kitten, one straw broom, one oversized red polka-dot bow, and one small reference inset at bottom-right with a red arrow indicating transformation. Preserve the snowy mountain background and bright sky. Do not add extra characters, city elements, witches’ hats, magical sparkles, or multiple animals. This is a whimsical flying selfie with a realistic finish, not a fantasy battle scene.

Negative prompt: extra people, missing kitten, missing bow, missing broom, missing inset reference image, missing red arrow, witch hat, magical particles, dark storm sky, painterly main image, cartoon main image, distorted selfie face, warped cat anatomy, low-detail fur, generic clouds only with no mountains, text overlay, watermark.

Suggested parameters: aspect ratio 3:4, 20-24mm selfie lens feel, moderate depth of field, 28-38 steps, CFG/style strength 6.5-8, sampler DPM++ 2M Karras or equivalent, seed 273644 as a starting point.
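
The suggested parameters can be kept in one place so that each run is reproducible. The following is a minimal sketch, assuming a generator that exposes steps, CFG, sampler, and seed; the `GenerationSettings` class and its field names are hypothetical and do not correspond to any specific tool's API. The defaults take the midpoint of each suggested range.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class GenerationSettings:
    """Hypothetical settings container; field names are illustrative only."""
    aspect_ratio: str = "3:4"
    steps: int = 33            # midpoint of the suggested 28-38 range
    cfg_scale: float = 7.25    # midpoint of the suggested 6.5-8 range
    sampler: str = "DPM++ 2M Karras"
    seed: int = 273644


def validate(settings: GenerationSettings) -> list[str]:
    """Return warnings for values that fall outside the suggested ranges."""
    warnings = []
    if not 28 <= settings.steps <= 38:
        warnings.append(f"steps={settings.steps} is outside 28-38")
    if not 6.5 <= settings.cfg_scale <= 8:
        warnings.append(f"cfg_scale={settings.cfg_scale} is outside 6.5-8")
    return warnings


settings = GenerationSettings()
print(asdict(settings))
print(validate(settings))
```

Keeping the seed fixed while changing one other value at a time makes it easier to see which setting caused a difference between runs.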

Delta prompt strategy:
1. If the cartoon-to-real comparison cue disappears: add "small anime-style reference inset at bottom-right with a curved red arrow pointing to the realistic main image".
2. If the bow becomes too small: add "oversized bright red bow with white polka dots dominating the top of the hairstyle".
3. If the kitten is missing or wrong: add "small fluffy black kitten with golden eyes and a tiny white chest patch held in one arm".
4. If the broom disappears: add "straw broom trailing diagonally behind the subject during flight".
5. If the scene loses motion: add "wind-swept hair and bow streaming backward, dynamic airborne selfie angle".
6. If the setting becomes generic sky: add "snow-covered mountain range far below, crisp icy ridges visible under the subject".
7. If the subject loses glasses: add "thin round eyeglasses clearly visible on the smiling face".
8. If the main image drifts cartoonish: add "main scene photorealistic, only the inset image remains anime-styled".
9. If facial proportions distort from wide angle: add "wide selfie lens with natural flattering facial proportions".
10. If lighting turns moody: add "bright cheerful daylight with clean sky and soft even facial illumination".
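
The delta strategy above amounts to a lookup from an observed failure to a corrective fragment. A minimal sketch of that idea follows; the failure labels (`missing_inset`, `bow_too_small`, and so on) are hypothetical keys invented for this example, while the fragments are copied from the list.

```python
# Failure labels are hypothetical; fragments come from the delta list above.
DELTA_FRAGMENTS = {
    "missing_inset": "small anime-style reference inset at bottom-right with a curved red arrow pointing to the realistic main image",
    "bow_too_small": "oversized bright red bow with white polka dots dominating the top of the hairstyle",
    "kitten_wrong": "small fluffy black kitten with golden eyes and a tiny white chest patch held in one arm",
    "missing_broom": "straw broom trailing diagonally behind the subject during flight",
    "no_motion": "wind-swept hair and bow streaming backward, dynamic airborne selfie angle",
    "generic_sky": "snow-covered mountain range far below, crisp icy ridges visible under the subject",
    "missing_glasses": "thin round eyeglasses clearly visible on the smiling face",
    "cartoonish_main": "main scene photorealistic, only the inset image remains anime-styled",
    "distorted_face": "wide selfie lens with natural flattering facial proportions",
    "moody_lighting": "bright cheerful daylight with clean sky and soft even facial illumination",
}


def apply_deltas(base_prompt: str, failures: list[str]) -> str:
    """Append one corrective fragment per observed failure, skipping duplicates."""
    additions = []
    for failure in failures:
        fragment = DELTA_FRAGMENTS[failure]
        if fragment not in base_prompt and fragment not in additions:
            additions.append(fragment)
    if not additions:
        return base_prompt
    return base_prompt.rstrip() + " " + ", ".join(additions) + "."


print(apply_deltas("Photoreal flying selfie.", ["missing_glasses", "generic_sky"]))
```

Adding only the fragments for failures actually observed keeps the prompt short, which makes it easier to tell which addition fixed which problem.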

How to Create a Kiki Inspired Flying Selfie AI Image

This image is effective because it does two jobs at once. It gives the audience a charming fantasy scene they already understand at a glance, and it also demonstrates a tool capability in a very visual way. The tiny reference image in the corner matters a lot. Without it, the post would just be a cute whimsical portrait. With it, the image becomes proof of transformation, and proof-based content tends to travel better than beauty alone.

For creators, this is a useful lesson in AI comparison design. If you want people to believe that a model can translate style, do not explain it only in the caption. Build the argument directly into the image. Here, the reference inset and red arrow immediately tell the viewer what they are supposed to notice: this is not only a fantasy render, it is a successful conversion from illustration language into hyperreal output.

Why the image pulls people in so fast

The first hook is recognizability. Even without naming a franchise, the red bow, the broom, the black kitten, and the airborne perspective instantly evoke a beloved witch-anime mood. That lowers friction. The viewer knows what emotional category the image belongs to almost immediately.

The second hook is motion. The wide selfie angle, flying hair, and mountain drop below create a sense of velocity without making the frame chaotic. Then the third hook arrives: the inset reference. That small corner device turns the post into a mini before-and-after story. It gives the audience a reason to inspect the details instead of scrolling past after one second.

Signal	Evidence (from this image)	Mechanism	Replication Action
Instant transformation proof	Reference inset and red arrow visibly compare cartoon source to realistic result	Viewers understand the claim without reading the caption	Add a small source-style inset when showcasing style-transfer or replication capability
High-recognition fantasy coding	Red polka-dot bow, broom, black kitten, airborne pose	Familiar visual signals speed up engagement and emotional connection	Lock the most iconic 3-4 markers before refining realism
Playful motion clarity	Hair streams backward while the face remains close, bright, and readable	The image feels energetic but still feed-friendly at thumbnail size	Use dynamic camera angle plus one stable smiling face for high-motion prompts

Where this format fits best

This structure is ideal for AI tool comparison posts, prompt educators teaching style transfer, fantasy remix pages, and creators who want to prove that a model can reinterpret illustration references into believable photographic images. It is especially useful when the audience needs visual evidence more than technical explanation.

It is less effective for luxury or minimal aesthetic pages, because the inset reference and bright whimsical props intentionally make the composition more instructional and playful. That is not a weakness here. It is the point. But it means the format is strongest when the goal is demonstration plus delight.

Best fit: AI comparison creators. Why it fits: the inset makes the performance claim immediately visible. What to change: vary the source style and target realism level.
Best fit: prompt tutorial accounts. Why it fits: the image teaches reference translation, motion, and prop preservation in one frame. What to change: annotate which details must stay constant between source and result.
Best fit: whimsical fantasy pages. Why it fits: the core scene is charming even before the technical layer is understood. What to change: swap franchise-coded props while preserving the same transformation structure.
Not ideal: minimalist fashion pages. Reason: the image is intentionally playful and comparison-driven.
Not ideal: strict realism accounts. Reason: the core concept still depends on fantastical flying imagery.

Transfer recipes

Keep: reference inset, red arrow, one iconic character setup, and photoreal target. Change: witch-flying scene to mermaid, fairy, or sci-fi pilot translation. Slot template: "{cartoon source inset} transformed into {photoreal scene} with {signature props}"
Keep: wide selfie energy and one companion animal or object. Change: bow, outfit, and environment while preserving the transformation cue. Slot template: "{subject archetype} in a dynamic selfie above {environment} holding {companion detail}"
Keep: cheerful face plus high-recognition prop bundle. Change: the source art style and realism intensity only. Slot template: "{illustrated reference} converted into {realistic output} with {locked iconic markers}"
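
The slot templates above can be filled programmatically. This is a minimal sketch using Python's built-in `str.format`; the template keys (`transformation`, `selfie`, `conversion`) and the `fill` helper are hypothetical names, and the slot names are rewritten with underscores so they can be passed as keyword arguments.

```python
# Template keys and the fill() helper are illustrative names, not part of any tool.
TEMPLATES = {
    "transformation": "{cartoon_source_inset} transformed into {photoreal_scene} with {signature_props}",
    "selfie": "{subject_archetype} in a dynamic selfie above {environment} holding {companion_detail}",
    "conversion": "{illustrated_reference} converted into {realistic_output} with {locked_iconic_markers}",
}


def fill(template_name: str, **slots: str) -> str:
    """Fill one slot template; raises KeyError if a required slot is missing."""
    return TEMPLATES[template_name].format(**slots)


prompt = fill(
    "selfie",
    subject_archetype="young fairy",
    environment="a cloud sea",
    companion_detail="a small owl",
)
print(prompt)
# prints "young fairy in a dynamic selfie above a cloud sea holding a small owl"
```

Because a missing slot raises an error rather than producing a half-filled prompt, the locked elements of each recipe cannot be dropped by accident.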

What makes the image aesthetically persuasive

The image succeeds because it keeps whimsy readable. The bow is oversized, the cat is dark and distinct, and the broom remains visible as a shape rather than a vague accessory. Those are strong silhouette decisions. In fantasy-style AI work, silhouette is often more important than microscopic detail because it is what preserves recognizability at feed speed.

The bright daylight also helps. It stops the scene from becoming muddy or overdramatic. The snowy mountains below provide scale, but the face stays dominant and friendly. This balance matters for social performance. If the environment were too epic, the image would risk becoming a landscape. If the face were too large, the fantasy would disappear. Here, the ratio is handled well.

Observed	Why it matters for recreation
Oversized red polka-dot bow	Acts as the fastest recognition cue in the entire frame
Black kitten held close to camera	Adds charm, contrast, and a second memorable subject
Wide flying selfie angle	Makes the transformation result feel energetic and modern
Small source-reference inset with arrow	Turns the image into proof of capability rather than simple fantasy art
Snowy mountains under bright blue sky	Provide scale and adventure without darkening the mood

Prompt chunks worth locking first

If you want this kind of result, do not begin with “cute witch flying in sky.” That is too generic. Start with the transformation mechanic, then lock the iconic props, then define the camera behavior. That order protects both the concept and the proof angle.

Prompt chunk	What it controls	Swap ideas
photoreal flying selfie with anime-style reference inset	Comparison logic and post structure	before-after inset, source-preview corner card, reference-to-result composition
oversized red polka-dot bow and round glasses	Character recognition and face styling	yellow scarf and hat, ribbon headband, iconic hair clip set
black kitten held in one arm	Companion charm and visual contrast	small dog, owl, plush familiar creature
straw broom trailing behind in flight	Motion storytelling and genre cue	hoverboard, magic umbrella, flying scooter
snowy mountain range far below	Adventure scale and clean background structure	cloud sea, coastal cliffs, autumn valley
bright cheerful daylight with wind-swept hair	Readable optimism and believable movement	sunrise flight, golden cloud light, crisp midday sky
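
One way to explore the swap ideas in the table without losing the concept is to change exactly one chunk per variant while the comparison chunk stays locked. The sketch below assumes that approach; `LOCKED`, `CHUNKS`, and `one_swap_variants` are hypothetical names, and only three rows of the table are included for brevity.

```python
# The first option in each list is the baseline chunk; the rest are swap ideas.
LOCKED = "photoreal flying selfie with anime-style reference inset"

CHUNKS = {
    "companion": ["black kitten held in one arm", "small dog", "owl", "plush familiar creature"],
    "vehicle": ["straw broom trailing behind in flight", "hoverboard", "magic umbrella", "flying scooter"],
    "background": ["snowy mountain range far below", "cloud sea", "coastal cliffs", "autumn valley"],
}


def one_swap_variants(chunks: dict[str, list[str]]):
    """Yield prompts that differ from the baseline in exactly one chunk."""
    baseline = {name: options[0] for name, options in chunks.items()}
    for name, options in chunks.items():
        for alternative in options[1:]:
            # Updating an existing key keeps the original chunk order.
            variant = dict(baseline, **{name: alternative})
            yield ", ".join([LOCKED, *variant.values()])


for prompt in one_swap_variants(CHUNKS):
    print(prompt)
```

Changing one chunk at a time means any drop in recognizability can be traced to a single swap.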

An iteration path that keeps the image clean

Lock these three things first: the comparison inset, the bow-kitten-broom prop set, and the dynamic selfie angle. Those are the non-negotiables. Then refine realism, fur texture, and mountain clarity one pass at a time.

Run 1: stabilize the face, bow size, and inset-reference composition.
Run 2: improve cat anatomy, broom visibility, and wind motion in the hair.
Run 3: refine photoreal skin texture and the snowy mountain depth below.
Run 4: remix the source franchise or character while preserving the same transformation structure.

If the output feels like generic fantasy art, add the reference inset. If it feels too technical and less charming, strengthen the subject smile and the companion-animal presence. The best version balances proof with delight.