Make Photo Dance With AI

If you want to make a photo dance with AI, you usually need one clear still image, one simple motion idea, and a workflow that does not feel like real animation software. The examples around this topic include dance-copy demos and photo-to-motion clips, with nearby results reaching 38,619 likes, which is enough proof that a single image can become a shareable short video when the pose and background are easy to read. Use this page to understand what kind of photo works best, what motion style to pick, and how to keep the result social-ready instead of overcomplicated.

Video
Create a vertical AI motion-transfer demo using WAN 2.2 Animate. The subject is a young Asian woman with a high half-ponytail, wearing an oversized black graphic T-shirt, loose black cargo pants, and casual sneakers. Place her outdoors in front of bright white stone arches and columns under clean daylight so the background feels architectural, minimal, and easy to read.

Use a fixed full-body camera and animate her with a sequence of viral dance-inspired arm patterns and light footwork copied from a reference clip. The choreography should focus on upper-body rhythm: crossed forearms, downward hand sweeps, open-palmed gestures near the face, small shoulder bounces, a side glance with body turn, and a final pose angled away from camera. Preserve facial identity, hair shape, T-shirt folds, and body proportions across all movements.

Present the result like a creator experiment. Add a narrow side strip with the source images and a visible plus sign to show the identity-plus-motion setup, and keep a small "WAN 2.2 Animate" label at the lower edge. The overall feel should be that of a practical benchmark for copying internet dance motions onto a static AI character while holding visual consistency in bright daylight.
Video
GLOBAL LOCK: A photoreal vertical 4:5 AI dance-copy test video showing a glamorous female character in the foreground performing a reference dance while a narrow left-side strip displays the source image pair used for the animation. Keep the main subject as a young woman with pale skin, round glasses, long dark hair, a fluffy oversized blue fur hat, and a dark navy velvet dress with a high slit. She performs in a snowy, starry night-style backdrop with floating white particles and cool blue lighting. On the left edge, keep a vertical reference strip featuring the source portrait, the dance reference thumbnail, a plus sign, yellow arrow graphic, and the label "WAN 22 Animate." The movement should stay mostly in medium close-up range with upper-body sways, arm crosses, and torso twists, because front-facing near-camera dances preserve facial consistency best. No subtitles, no narration, no extra overlays beyond the built-in reference strip.

[00:00-00:03.00] Start with the woman centered against the snowy night background, one hand lifting and shoulders swaying gently. Her blue fur hat, glasses, and velvet dress should all remain crisp. The left reference strip must clearly show the portrait-plus-motion setup.

[00:03.00-00:06.00] Move into crossed-arm and torso-twist dance gestures. Keep the movement close to camera and mostly upper-body driven. The face should remain more stable than a full-body distant dance would, though small inconsistencies are acceptable.

[00:06.00-00:09.52] End with a few sharper arm accents and a final flowing pose while the hair moves lightly and the slit dress shifts with the hips. The clip should read as a successful WAN 2.2 dance-copy test within current model limits, not as a flawless music-video take.

NEGATIVE PROMPT: wide far-away dancer, empty left strip, missing reference thumbnails, no blue fur hat, unstable glasses, broken arms, full-body footwork complexity, muddy snow background, random extra dancers, lip-sync talking, text overload, missing WAN 22 Animate label, low-detail velvet dress, distorted face, heavy camera shake, stage concert lights, cartoon styling.

SHOT PROMPTS:
SHOT 1 DELTA: Medium-close dance setup with strong facial clarity, blue fur hat, and visible reference strip.
SHOT 2 DELTA: Crossed-arm and torso-twist choreography tests how well WAN 2.2 copies near-camera motion.
SHOT 3 DELTA: Final accent movement with hair and dress motion while preserving the identity better than distant dance shots.

SPEECH PACK:
[00:00-00:09.52]
- speech_present: none required
- speakers: one visible female dancer
- transcript_segments: []
- audio_direction: optional dance beat or ambient track; no dialogue needed
- sync_notes: the benchmark is dance motion transfer, especially upper-body consistency and face preservation at close range
Video
GLOBAL LOCK: The subject is a young woman of Hispanic descent, approximately 22 years old, with olive skin, dark brown eyes, and long, straight black hair styled in a sleek, high ponytail. She has an athletic, toned build. She is wearing a matching black ribbed sports bra and high-waisted black mini shorts, paired with classic black stiletto high heels. The environment is a spacious, modern dance studio with white walls, large industrial-style windows, and a light grey, slightly reflective professional dance floor. Wall-to-wall mirrors are visible in the background, along with wooden ballet barres. The lighting is bright, natural, and high-key, coming from the windows. The color grade is clean and neutral with high clarity. No speech is present; the video is synced to upbeat dance music.

[00:00–00:02]
The subject walks confidently toward the camera from the center of the dance studio. She has a slight smile and looks directly at the lens. The camera is at eye level, capturing a full-body shot. The movement is smooth and rhythmic.

[00:02–00:04]
The subject begins the dance routine. She performs a quick series of arm gestures, crossing her hands in front of her chest and then throwing them outward. She performs a small, energetic jump with both feet leaving the floor. Her ponytail swings dynamically with the movement.

[00:04–00:06]
The subject transitions into a deep side lunge to her right, extending her left leg. She reaches her arms out toward the floor. The camera maintains a wide shot to capture the full range of motion. Reflections of her movements are visible on the polished floor.

[00:06–00:08]
She jumps back to a standing position and immediately places both hands behind her head, elbows out. She performs a rhythmic bounce/hop in place. The ponytail continues to show realistic physics, whipping behind her.

[00:08–00:10]
The subject performs a series of alternating side lunges. She extends her arms wide to the sides with each step. Her expression is focused and energetic. The lighting remains consistent, highlighting the muscle definition in her legs.

[00:10–00:12]
The subject completes the dance sequence with a final rhythmic step and then turns to her right, walking toward the side of the frame in a profile view. The camera follows her movement slightly. The video ends as she maintains her posture and walks out of the primary dance area.

NEGATIVE PROMPT: visual artifacts, flickering, distorted limbs, extra fingers, blurry face, inconsistent hair length, floating clothing, jittery background, robotic movement, unnatural joint angles, low resolution, watermarks, text overlays on the subject, mismatched reflections.

SPEECH PACK:
speech_present: false
music_style: Upbeat pop/dance, female vocals, high energy.
sync_notes: All major jumps and arm extensions must align with the rhythmic beats of the background track.
Video
GLOBAL LOCK: A consistent young woman of Hispanic/Latina descent, mid-20s, with long dark hair, wearing black-framed glasses and a black beanie. She wears an oversized black-and-white graphic hoodie with street-art style prints, olive green cargo pants, and chunky white sneakers. The environment is a dimly lit industrial warehouse with exposed brick walls, colorful graffiti, and large factory windows. Lighting is a mix of warm overhead industrial lamps and cool natural light. Cinematic color grade, high contrast, sharp textures.

[00:00–00:03]
The subject stands in the center of the warehouse, facing the camera. She begins a rhythmic, low-energy bounce, swaying her hips slightly. The camera is a static medium-full shot. Lighting emphasizes the folds in her oversized hoodie.

[00:03–00:06]
The subject performs a fluid arm "wave" motion, crossing her arms in front of her chest and then extending them outward. She has a slight, confident smile. The motion is smooth and perfectly timed to a rhythmic beat.

[00:06–00:09]
The subject transitions into footwork, shifting her weight from side to side in a "shuffle" style. Her hands move rhythmically near her waist. The graffiti background remains sharp and stable.

[00:09–00:11]
The subject performs a chest-pop and a quick arm flourish, pointing towards the camera. Her glasses and beanie remain perfectly in place. The lighting creates a rim-light effect on her shoulders.

[00:11–00:13]
The subject finishes the dance with a final energetic pose, looking directly into the lens with a friendly expression. The video ends on a high-energy beat.

NEGATIVE PROMPT: Texture flickering, boiling clothes, face warping, extra limbs, blurry graffiti, robotic motion, sliding feet, inconsistent lighting, low resolution, watermark, text overlays on character, distorted glasses, hair clipping through beanie.

SPEECH PACK:
(No speech present in this video. The focus is entirely on rhythmic motion and music synchronization.)
TAKE_A: [Rhythmic breathing sounds synced to dance movements]
TAKE_B: [Silence, focus on ambient warehouse room tone]
TAKE_C: [Slight fabric rustle sounds during arm movements]
Video
GLOBAL LOCK: A vertical 4:5 AI dance-swap demo layout. Left side is a dark teal instructional sidebar showing two stacked reference images connected by a yellow curved arrow, with white/yellow text reading “WAN 2.2 swap.” Right/main side shows the generated output: a young woman AI influencer standing outdoors on a rocky riverbed/field edge with green grass and tall trees behind her. Keep the woman’s identity consistent: long black hair, glasses, hoop earrings, light skin, slim build, fitted sleeveless black romper, soft smile, and casual dance energy. The clip demonstrates motion transfer from a dance reference onto a static AI influencer image.

[00:00-00:02] Start with the woman standing front-facing in the outdoor location, body mostly still, arms relaxed near her sides. She looks into camera with a calm pleasant expression. The left tutorial sidebar remains visible with the two input images and the yellow curved arrow pointing down toward the “WAN 2.2 swap” label.

[00:02-00:04] The dance begins subtly. She lifts one arm outward and starts a small side-to-side upper-body sway. Her head tilts slightly, glasses remain aligned, and long hair stays smooth over the shoulders and back. The outdoor background stays bright and slightly soft, emphasizing the character rather than the scenery.

[00:04-00:06] The motion transfer becomes clearer: her shoulders and elbows move in a simple rhythmic dance, and one knee or hip angle shifts lightly as if following a reference choreography. The movement stays close to camera and mostly upper-body dominant, which helps preserve facial consistency. The expression brightens into a wider smile.

[00:06-00:08] Continue the playful dance with hand gestures closer to the torso and slight alternating arm positions. Her body remains mostly centered, with only small weight shifts. Keep the black romper fitted and stable, avoid fabric glitches, and preserve the clean face identity and glasses.

[00:08-00:11] End on the clearest dance-swap payoff: she smiles directly at camera while doing small finger-heart or pinched-finger style gestures with both hands near chest height, hips slightly angled. The result should feel charming and social-media friendly rather than technically perfect, with emphasis on identity preservation during simple choreography. The left-side instructional column and “WAN 2.2 swap” label remain on screen to underline the workflow.

NEGATIVE PROMPT: broken fingers, warped elbows, melted face, drifting glasses, identity swap, floating feet, broken knees, impossible hip twist, random camera zoom, missing sidebar, unreadable text, extra people, messy hair deformation, outfit flicker, body wobble, low-res landscape, overblown highlights, dance motion too large, face losing consistency.

SHOT PROMPT DELTA: tutorial demo layout, left reference sidebar, right generated influencer dancing outdoors, simple social dance, soft smile, black sleeveless romper, glasses and long hair stable, motion transfer test for WAN 2.2.
Video
Create a vertical AI video test that demonstrates copying a viral dance performance from a reference clip onto a static AI influencer image using WAN 2.2 Animate. The subject is a young brunette woman with her hair tied up in a casual bun, wearing thin glasses, a light blue floral mini dress with ruffled hem, and white knee-high boots. Place her outdoors on a tiled patio at dusk, framed by tall hedges, a wooden railing, and a large planter glowing with warm light.

Keep the camera locked in a full-body medium-wide view so the dance motion is easy to judge. The performance should feel like a social dance test rather than a polished music video: quick arm swings, side-to-side hip movement, small foot pivots, one pose with both arms extended, one with a hand touching her head, one with a hand on her hip, and one energetic bounce that lifts her hair upward from motion. Preserve the same face, glasses, dress pattern, and body proportions across every move. Prioritize consistency in facial identity while translating the reference choreography.

Visually present it like a creator demo reel. Add a slim vertical strip at the left that shows the two source images used for the transfer, connected by a plus sign, and place a small "WAN 2.2 Animate" label near the bottom so viewers understand which model generated the motion. The final effect should communicate that close-to-camera dance references can be copied onto a static AI character with decent consistency, while still feeling like a real benchmark of motion fidelity.
Video
GLOBAL LOCK: A photoreal vertical dance-transfer demo video using a fixed left-side instructional strip labeled “WAN 2.2 Swap.” Keep the composition consistent across all frames: a narrow left panel showing two stacked reference images with a yellow arrow and the text “WAN 2.2 Swap,” plus the main dance area on the right taking most of the frame. Keep the dancer consistent: young East Asian woman, fair skin, slim fit build, long dark hair down, round glasses, calm playful expression, full black fitted unitard or tight black one-piece outfit, barefoot. Keep the environment locked: simple empty indoor room with beige walls, light floor, soft natural light, minimal clutter. Motion is a copied viral dance with side steps, cross-steps, arm flicks, small hip shifts, and playful bounce timing. The face should remain stable even during body movement. No dialogue, no extra subtitles beyond the built-in left-side demo strip.

[00:00-00:03] Open with the dancer already stepping lightly across the floor while the WAN 2.2 Swap reference strip is visible on the left. She performs a smooth cross-step and small hand flick, making it clear this is a dance-transfer proof clip, not a cinematic scene.

[00:03-00:06] The dance gains confidence with a relaxed smile and more readable footwork. She shifts weight from one leg to the other, bringing one arm up in a playful gesture. Keep the room empty and visually quiet so the motion stays easy to read.

[00:06-00:09] She rotates her torso slightly and steps wider, adding a soft bounce and shoulder rhythm. Hair should move naturally without breaking facial identity. The black one-piece outfit must remain clean and form-fitting.

[00:09-00:12] The choreography becomes a little more expressive, with arms lifting and a side sway. The clip should still feel like a casual dance test generated from a reference rather than a polished music video.

[00:12-00:15] Final beat settles into a forward-facing pose after a last cross-step. End with the dancer centered and readable, proving that the identity swap or motion-transfer held through the full dance phrase.

NEGATIVE PROMPT: missing left reference strip, unreadable WAN 2.2 Swap text, duplicated limbs, broken feet, mutated hands, face drift, outfit color change, shoes appearing, dramatic camera zooms, cluttered room, subtitles, logos, watermarks beyond the intended strip, low-detail hair, unnatural dance timing, robotic stiffness, background changes.

SHOT PROMPTS:
SHOT 1 DELTA: establish WAN 2.2 Swap demo layout with dancer entering a cross-step pattern.
SHOT 2 DELTA: playful hand flick and relaxed smile, barefoot dance readability emphasized.
SHOT 3 DELTA: torso turn and wider side-step, hair moves naturally while face stays stable.
SHOT 4 DELTA: more expressive arm lift and bounce rhythm in the empty room.
SHOT 5 DELTA: final forward-facing pose after last cross-step, clean motion-transfer payoff.

SPEECH PACK:
Timecoded transcript: no spoken dialogue is present in the reference clip.
TAKE_A [00:00-00:15]: silent dance-transfer demo, no speech.
TAKE_B [00:00-00:15]: no spoken words, motion-copy showcase only.
TAKE_C [00:00-00:15]: quiet WAN 2.2 Swap demonstration of a viral dance in a plain room.
Closest audible version: no intelligible dialogue detected.
Safe paraphrase version: a woman in a black fitted outfit performs a copied viral dance while a left-side WAN 2.2 Swap reference strip shows the source setup.
Video
GLOBAL LOCK: A photoreal vertical split-layout demo video showing AI motion-transfer from a reference dance clip onto a consistent influencer character. Preserve the full format across all frames: a narrow left-side instructional panel with two small stacked reference images and bold text reading “WAN 2.2 Animate”, plus the main right-side performance area filling most of the frame. Keep the dancing subject consistent: young East Asian woman, fair skin, slim athletic build, long black hair in a high ponytail, expressive face, natural makeup, energetic but controlled smile. Wardrobe is locked: shiny red satin camisole or corset-style top with thin straps, fitted black high-waisted shorts. Environment is locked: bright minimal apartment or empty room with gray floor, white walls, open doorway, and a freestanding mirror in the back. Lighting is soft natural daylight from the front-left, realistic indoor brightness, no nightclub effects. Motion should clearly resemble a copied viral dance routine, with hands crossing, pointing, shoulder pops, and a gradual turn toward profile and back view. Keep the face identity stable even during arm motion. No dialogue, no subtitles beyond the built-in left-side label, no logos except the visible “WAN 2.2 Animate” text panel already present in the composition.

[00:00-00:02] Open with the dancer facing camera in the room while the left-side reference panel is already visible. She starts the dance in a relaxed stance, hips shifting lightly, one hand low and the other beginning to rise, establishing that this is a motion-copy demonstration rather than a cinematic music video.

[00:02-00:04] She brings both hands into the choreography with playful upper-body rhythm. The red satin top should catch soft daylight and stay glossy. Preserve the clean room, doorway, and mirror in the background without changing furniture or layout.

[00:04-00:06] The dance becomes more readable as she crosses one arm over the torso and points or sweeps the other hand outward. Her expression turns brighter and slightly cheeky, as if following a popular social-media dance challenge.

[00:06-00:08] She rotates into a three-quarter profile while continuing the same routine. Keep the ponytail swinging naturally but do not let the face or outfit mutate. The left-side panel with the source/reference images must remain fixed and legible throughout.

[00:08-00:10] Final beat transitions toward a back-facing pose with one hand lifting toward the hair. End like a tutorial proof-of-concept: the viewer should understand that the AI successfully transferred a reference dance onto the character while holding identity and outfit consistency.

NEGATIVE PROMPT: missing left panel, random UI overlays, broken text, mutated hands, duplicated arms, face drift, age changes, different outfit color, missing shorts, warped hips, extra dancers, crowded studio, nightclub lighting, dramatic cinematic camera movement, zoom crashes, smeared ponytail, broken mirror, furniture appearing suddenly, lip-sync speech, subtitles, watermarks beyond the intended layout, low-detail anatomy, jerky stop-motion motion.

SHOT PROMPTS:
SHOT 1 DELTA: front-facing dance start with visible WAN 2.2 Animate reference strip on the left.
SHOT 2 DELTA: playful hand choreography, red satin top catching daylight.
SHOT 3 DELTA: cross-body dance move, smile brightens, tutorial-demo energy.
SHOT 4 DELTA: rotate to three-quarter profile while preserving face consistency.
SHOT 5 DELTA: finish toward back pose with hair touch, clear motion-transfer payoff.

SPEECH PACK:
Timecoded transcript: no spoken dialogue is present in the reference clip.
TAKE_A [00:00-00:10]: silent dance-demo clip, no speech.
TAKE_B [00:00-00:10]: no spoken words, movement-transfer showcase only.
TAKE_C [00:00-00:10]: silent tutorial-style proof clip with visual dance performance.
Closest audible version: no intelligible dialogue detected.
Safe paraphrase version: a woman in a red satin top performs a copied viral dance in a bright room while a left-side panel shows the reference and WAN 2.2 Animate label.
Video

GLOBAL LOCK: Create a vertical Christmas-themed AI motion-control dance reel in which iconic painted female portrait characters come alive one after another and perform synchronized choreography. Keep each main subject fully integrated into her original painterly art style, preserving brush texture, costume design, facial structure, and background aesthetics. Throughout the whole video, maintain a small inset reference dancer on the right side of the frame: a modern young woman in a short sleeveless red dress demonstrating the choreography. Use static framing, festive color palettes, clean scene cuts, and no dialogue or lip sync.

[00:00-00:03.8] A richly dressed woman inspired by a golden decorative portrait painting stands beside a Christmas tree in an ornate gold-toned interior. She animates into subtle rhythmic dance moves, moving her arms and shoulders in sync with the inset dancer on the right. Preserve the painterly golden textures, elegant posture, and festive holiday decoration.

[00:03.8-00:07.5] Cut to a new painted woman in a flowing green dress with a glamorous vintage portrait style. She performs the same choreography pattern, lifting and crossing her arms with graceful body movement while maintaining the look of a living painting. Keep the small red-dress reference dancer visible on the right edge of frame.

[00:07.5-00:11.2] Cut to a minimalist portrait woman in a black dress and broad hat against a warm brown artistic background. She sways and moves her arms in time with the same choreography, her painted face and body becoming animated while the art style remains intact. Keep the motion smooth and restrained, like a classic artwork gently dancing.

[00:11.2-00:15.0] Cut to a Frida Kahlo-inspired festive portrait figure in traditional colorful clothing, flowers in her hair, and a Christmas-decorated backdrop. She performs the final choreography beat with expressive arm gestures and body sway while the inset dancer continues as the motion guide. End with a celebratory holiday mood, painterly detail, and clean static composition.
Video
A vertical comparison reel showing AI video generation results from a single cinematic reference image. Each segment uses a split-screen stack: the top frame is labeled “REFERENCE IMAGE,” while the bottom frame shows the output from a specific model such as “SEEDANCE 2.0 OMNI” or “HEYGEN.” The example scenes are grounded, live-action-style dramatic setups rather than flashy VFX. One sequence shows a young man in a beanie talking with a woman on a wooden pier at dusk, with string lights and a lighthouse in the background. Another shows two men standing in front of a decaying Victorian haunted house under a grey overcast sky. A later scene places a smiling couple seated together in a subway car with cinematic teal-orange grading. The reel keeps the composition nearly identical between source and generated result to highlight motion fidelity, facial consistency, and realism across tools.
Video
GLOBAL LOCK:
The video features a white male creator in his mid-30s with medium-length, wavy brown hair and a groomed beard, wearing a clean white t-shirt. He is positioned in a bright home office with a professional black condenser microphone on a boom arm in the foreground. The video uses a split-screen or multi-panel layout to compare "Source Video" (the creator) with "AI Generated Results" (various celebrities and characters). The AI characters must perfectly mirror the creator's head tilt, facial expressions, lip-sync, and hand gestures. The lighting is soft, natural window light from the side. The color grade is clean and realistic.

[00:00–00:03]
The screen is split into three vertical panels. Top panel: The creator waves both hands excitedly and points to his right. Middle panel: Sabrina Carpenter in a pink feathered dress mimics the exact hand wave and pointing. Bottom panel: Billie Eilish in a black outfit and sunglasses mimics the same gestures. High-fidelity lip-sync as they all say "Hear me out."

[00:03–00:07]
The layout shifts. Top panel: Creator continues talking with expansive hand gestures. Middle panel: Taylor Swift in a red dress mimics the gestures. Bottom panel: Kim Kardashian in a black tank top mimics the gestures. The transitions between characters are sharp cuts.

[00:07–00:10]
Split screen: Creator (top) vs. Queen Elizabeth II (bottom). The creator looks to his left and then back to the camera with a skeptical expression. The Queen, wearing a crown and sash, mirrors the look perfectly.

[00:10–00:13]
Split screen: Creator (top) vs. Edna Mode from The Incredibles (bottom). The creator scratches the top of his head with his right hand. Edna Mode, with her signature bob and glasses, scratches her head in perfect sync.

[00:13–00:20]
A screen recording of a software interface (Enhancor). A cursor selects the "Wan2.2" model from a dropdown menu. The UI shows a "Source Video" of the creator and a "Character Image" of a woman. The cursor toggles "Pro Mode" on and adjusts resolution to 720p.

[00:20–00:23]
Split screen: Creator (top) vs. a woman with long brown hair in a floral dress (bottom). They are both in the same room. The creator raises his hands in a "stop" gesture; the woman mirrors him perfectly.

[00:23–00:27]
The UI returns, showing the "Photo Animate" tab being selected. A different reference photo of the same woman is used. The cursor clicks "Generate Video."

[00:27–00:35]
Final comparison. Split screen: Creator (top) vs. the woman (bottom). The creator looks around the room and then smiles at the camera while touching his hair. The woman mirrors the hair-touching and the smile, but her background is now a different indoor setting matching her reference photo. The text "AI" appears centered on the screen.

NEGATIVE PROMPT:
Visual: flickering faces, distorted limbs, extra fingers, blurry textures, face-swapping artifacts, unnatural skin smoothing, background warping, robotic movements, low resolution, watermarks.
Speech: robotic voice, mismatched lip-sync, muffled audio, background noise, unnatural pauses, clipping audio.

SPEECH PACK:
[00:00–00:07]
Transcript: "Hear me out, all of your favorite movies and animations are going to be completely acted out by someone else in the next two years."
TAKE_A: Energetic, fast-paced, direct-to-camera.
TAKE_B: Mysterious, slightly slower, emphasizing "completely."
TAKE_C: Casual, conversational, like a friend sharing a secret.

[00:07–00:13]
Transcript: "So I'm going to teach you everything you need to know about this in the next 20 seconds so that you can do this for yourself and stay ahead of the curve."
TAKE_A: Authoritative, instructional, rhythmic.
TAKE_B: Helpful, warm, encouraging.
TAKE_C: Urgent, fast-talking to fit the "20 seconds" claim.

[00:13–00:35]
Transcript: "So right now you have two options with this new AI video model called Wan 2.2. The first option is Character Swap... The second option is Photo Animate... This is absolutely mind-blowing. Comment AI for the link."
TAKE_A: Professional narrator style, clear enunciation.
TAKE_B: Enthusiastic, high energy on "mind-blowing."
TAKE_C: Calm, tech-reviewer tone, clear CTA at the end.
Video

GLOBAL LOCK: cinematic 1980s-style street-dance performance inspired by a fedora-wearing pop icon; central male dancer in black hat, black sequined or sharp black jacket, white shirt, black tie, dark trousers, and white gloves; moody industrial stage or subway-like set with backup dancers in dark suits; synchronized footwork, sharp arm hits, spins, and confrontational dance staging; cool blue-gray lighting with warm practical highlights; no text overlays, no logos, no fantasy elements, no modern casual outfits.

00:00-00:04
Open on the central fedora-wearing male dancer commanding the frame while backup dancers form a loose semicircle behind him. The performance space feels industrial and theatrical, with dramatic overhead lighting and strong contrast.

00:04-00:08
The choreography tightens into iconic pop-dance gestures: hat-brim emphasis, crisp upper-body hits, quick pivots, and face-forward attitude. Supporting dancers mirror and challenge the lead, creating a confrontational performance rhythm.

00:08-00:12
The scene expands into a larger group formation. The lead dancer drives the center while surrounding performers move in synchronized bursts, with kicks, slides, and sharp directional changes across the floor.

00:12-00:15
The routine resolves on the lead figure reclaiming center stage, framed by fallen or staggered dancers and strong pose-based finishing beats that preserve the music-video intensity.

NEGATIVE PROMPT:
bright daylight, empty studio, casual hoodies, neon cyberpunk effects, fantasy powers, readable text, UI panels, broken anatomy, low-energy movement, cartoon rendering, soft pastel palette, extra props, random crowd spectators
Video
GLOBAL LOCK: The presenter is a Caucasian female with long, straight light-brown hair, wearing a blue and grey gradient mock neck long-sleeved top. She is in a futuristic studio with glowing blue horizontal light bars. The pixel art style is consistent 16-bit, vibrant, with a Japanese aesthetic. The Ghibli-style boy has messy black hair and wears a yellow t-shirt.

[00:00–00:05]
Split screen. Top: A pixel art Japanese street at night with a blue car parked in front of a grocery store; a small pixel character with a red jacket and blue jeans stands on the sidewalk. Bottom: The female presenter speaks directly to the camera, gesturing with her hands. Text overlay: "Turn Retro Pixel Games to AI animations".

[00:06–00:10]
Full screen of the presenter. She gestures as text "Step 1: pick your source image" appears. The background is the tech studio.

[00:11–00:15]
Screen recording of the "Seedance 2.0" website. A volleyball is shown in a dynamic video on the site. Then, a split screen shows the pixel street and the pixel character being uploaded.

[00:16–00:27]
Pixel art animation sequence. [cut] The blue car parks. [cut] The character gets out and walks into the grocery store. [cut] Inside the store, he picks up an orange from a pyramid; a "+10" green text pops up. [cut] He runs through the aisle with a basket. The lighting is warm and indoor-commercial.

[00:28–00:35]
Presenter returns to screen, explaining Step 2. A graphic shows "Nano Banana Pro" with an image of a boy at an arcade machine. The screen of the arcade machine is being replaced by the previous pixel animation.

[00:36–00:42]
Cinematic Ghibli-style animation. A boy in a yellow shirt is seen from behind, playing an arcade game in a dimly lit, nostalgic arcade. [cut] He raises his arms in excitement as "LEVEL COMPLETE" appears on the screen. The lighting is warm, with a soft glow from the arcade monitors.

[00:43–00:51]
Presenter in the tech studio. She gestures toward a screen recording of the CapCut editing interface, showing multiple video and audio tracks. Text overlay: "Comment AI and I'll send it to you".

NEGATIVE PROMPT: Visual artifacts, distorted faces, blurry pixel edges, inconsistent character clothing, 3D realistic style in pixel sections, robotic presenter movement, text flickering, mismatched lip-sync, harsh lighting in Ghibli scene.

SPEECH PACK:
[00:00–00:05] "Did you know that you can turn retro pixel games into AI animations in just two simple steps?"
[00:06–00:10] "Step one is to pick your source image. I chose a pixelated environment and a character."
[00:11–00:15] "Step two: Head to Seedance 2.0 and upload these two images along with a prompt."
[00:16–00:27] "And you will get something like this. Now to continue, extract a still shot of the character fully visible in the store environment."
[00:28–00:42] "Now for the full control of the final shot, you can create an arcade scene of a boy playing and replace the game screen with the last frame of the pixel animation in Nano Banana Pro."
[00:43–00:51] "After that, choose a retro video game track that fits the vibe and combine everything in an editing tool. If you want the full tutorial, comment AI and I'll send it to you."

TAKE_A: Energetic, fast-paced, high-pitched emphasis on "two simple steps".
TAKE_B: Calm, instructional, steady cadence, clear pauses between steps.
TAKE_C: Friendly, conversational, slight smile while speaking, emphasis on the tool names.
Video
GLOBAL LOCK: Preserve the exact vertical promo-edit structure of a dark combat showcase branded as SEEDANCE 2.0. Keep the persistent upper graphic strip containing multiple small preview thumbnails and the bold text label “SEEDANCE 2.0” across the top portion of the frame. In the main lower action panel, maintain two Mortal Kombat-style masked ninja fighters in a smoky, high-contrast gray-black environment, facing off in side-view and three-quarter combat compositions. One fighter should read as a gray-blue Sub-Zero-like character, while the opposing fighter reads as a darker black-clad Scorpion-like rival. The action should cycle through weapon clashes, dodges, lunges, jumps, and dramatic distance changes, all with a game-trailer feel rather than a naturalistic movie scene. Do not remove the promo UI overlay, and do not turn the characters into generic modern soldiers.

0.00-4.00s: Open with the SEEDANCE 2.0 title and thumbnail strip already visible at the top, while the main panel below shows the two masked ninjas in close-range confrontation. Their bodies are angled aggressively, weapons and arms raised, with fog and backlight shaping the silhouettes in a dark arena-like void.

4.00-8.00s: Expand into wider combat beats where the fighters separate, circle, and re-engage. Show quick lateral movement, one fighter springing or stepping backward while the other advances through the smoky space. The monochrome visual palette should stay gritty and game-cinematic.

8.00-12.50s: Continue the duel with sharper impact moments: side-on exchanges, airborne kicks or leaps, and strong silhouette poses that resemble a fighting-game special move showcase. The upper thumbnail bar and title remain static, reinforcing that this is a promo package rather than diegetic footage.

12.50-17.00s: Intercut more aggressive stances and dramatic pause moments. The Sub-Zero-like and Scorpion-like figures should feel evenly matched, with alternating offensive and defensive beats. Smoke plumes, energy-like blur, and heavy contrast keep the environment abstract.

17.00-20.92s: End with the strongest dramatic action frames, including near-collision poses and a final hero-like composition where the fighters are still locked in conflict beneath the SEEDANCE 2.0 branding. The closing mood should be intense, dark, stylized, and unmistakably game-promo driven.

NEGATIVE PROMPT: realistic soldiers, bright daylight battlefield, fantasy elves, historical warriors, indoor living room, removed UI overlay, no title text, cartoon cel shading, comedic fight, blood gore close-up, crowd spectators, colorful sci-fi lasers, sports arena, soft pastel palette, modern firearms, casual clothing, photoreal actor faces.

SHOT PROMPTS:
1. SEEDANCE 2.0 promo layout with top thumbnail strip and lower main battle panel featuring two masked ninja fighters.
2. Dark smoky fighting-game confrontation between gray-blue and black-clad rivals in side-view action poses.
3. Alternating lunges, dodges, jumps, and clash silhouettes under a monochrome cinematic game trailer grade.
4. Final branded combat hero frame that still feels like a mod or feature showcase rather than a film scene.

SPEECH PACK:
- This reads like a vertical gameplay promo more than a standalone cinematic.
- The top strip and title keep reminding you that the fight is being showcased as a feature set.
- The main appeal is the dark ninja-versus-ninja choreography underneath the branding.
- It feels like Mortal Kombat-style action repackaged into a short social trailer.
Video
GLOBAL LOCK: Vertical premium poster-style AI commercial video with a split composition. The upper two-thirds show a cinematic ruined-city transformation sequence: a muscular adult man in a torn gray t-shirt crawls through rubble, reaches a black neon-green energy can, and transforms into a towering black armored winged entity with glowing toxic-green energy. The lower third remains a fixed product-card layout containing a clean can packshot, a silhouette or character reference, dark infographic panels, and small technical-text style blocks. Keep the environment cold, smoky, and post-apocalyptic, while the energy effects stay vivid green and the product section remains crisp, graphic, and consistently anchored at the bottom of every frame.

[00:00-00:03] Start with the ruined city street in the upper section: fires burn in the distance, debris covers the road, and the man crawls toward the glowing energy can. The lower panel already displays the product-card layout with the can image, a humanoid silhouette reference, and dark UI-style text blocks.

[00:03-00:06] The man gets closer to the can and reaches toward it. Keep the lower third static and readable like a futuristic ad board while the upper action remains cinematic and dusty. The contrast between moving narrative above and fixed promotional panel below should stay clean.

[00:06-00:08] As he grabs the can, green light erupts from his body. The upper scene becomes intensely illuminated with toxic neon energy, while the lower panel continues showing the can, character reference, and compact product-spec graphic treatment.

[00:08-00:11] The transformation escalates into black armor with green energy lines and expanding wing structures. Maintain the poster-like composition: the upper section carries the spectacle, the lower section continues functioning as a branded concept sheet with high contrast and dark minimal layout.

[00:11-00:13] The transformed creature stands fully revealed in the upper frame, dominant over the rubble with wings spread wide. Keep the bottom panel unchanged, reinforcing the idea that this is an ad or creative concept board tied to the transformation.

[00:13-00:15] End with the winged figure lifting or surging upward into the sky as a bright green beam or glow cuts through the clouds. The lower product-card section remains locked in place until the final frame, preserving the hybrid between cinematic trailer and infographic-style commercial poster.

NEGATIVE PROMPT: full-screen cinematic without lower panel, bright cheerful colors, cartoon style, blurry product section, unreadable can, modern clean city, extra characters, blue energy, broken wings, anime proportions, watermark, chaotic typography, low-detail infographic, humor tone, misplaced layout elements
Video
GLOBAL LOCK: vertical studio-style art parody video presenting famous paintings reimagined as children. Keep a clean portrait setup, front-facing medium shot framing, simple dark or painterly backgrounds, soft museum-like lighting, and a playful but polished editorial tone. Subjects should look like child versions of iconic painting characters, with costumes and color palettes clearly referencing classical artworks: a Girl with a Pearl Earring style child in a blue headwrap and muted ochre dress, an Adele Bloch-Bauer inspired child in a richly patterned gold dress against a gilded background, a mini Mona Lisa in dark Renaissance clothing against a painted landscape, and a small Van Gogh-inspired boy in a blue coat against a painted blue backdrop. Motions are light, cute, and rhythmic: small hand gestures, sways, claps, shoulder bounces, and smiling pose changes. No dialogue, no text overlays, no logos, no extra props.

[00:00-00:05] Show a child version of Girl with a Pearl Earring centered in frame, warm brown background, soft frontal lighting, pearl earring visible, blue-and-cream headwrap, long muted brown dress. She performs tiny hand gestures and shy rhythmic sways, alternating between folded hands and small outward arm motions. Keep camera static and portrait-like.

[00:05-00:09] Cut to a child version of Adele Bloch-Bauer against a shimmering gold ornamental background. She wears a gold patterned dress with decorative motifs and dark curly hair styled neatly. Her motions are cute and confident, with little hand flourishes, cheek touches, and bouncy pose changes. Keep the gilded palette dense and luxurious.

[00:09-00:14] Cut to a mini Mona Lisa in dark Renaissance dress before a soft painted landscape. She does small cheerful arm movements and playful shoulder motions while maintaining the iconic calm facial structure associated with the original portrait. Lighting stays soft and even, with a gentle painterly finish.

[00:14-00:19.5] Cut to a child Van Gogh-inspired boy in a blue coat and necktie against a textured blue painted backdrop. He performs tiny dance-like motions with hands at chest level, subtle sways, and charming head tilts. Preserve the hand-painted portrait feel while keeping the child performance adorable and light.

NEGATIVE PROMPT: adult proportions, creepy facial distortion, broken hands, flicker between costumes, inaccurate art references, modern streetwear, extra accessories, over-stylized cartoon rendering, muddy gold background, deformed pearl earring, duplicated limbs, lip movement implying speech, random text, logos, watermark, camera shake, low-resolution skin detail, temporal jitter.

Make Photo Dance With AI

How to Turn a Still Photo Into a Dance Clip That Actually Works

The biggest mistake here is choosing the wrong starting image. A photo dance clip works best when the subject is clearly separated from the background, facing the camera in a readable pose, and not blocked by heavy props or cropped limbs. That is why so many successful examples around this page use a centered character and clean framing. Nearby dance-transfer and photo-to-motion clips have already shown that simple setups can travel, including one adjacent example around 38,619 likes. The result does not need studio polish. It just needs a clean source image and a dance loop viewers can understand immediately.

If you are doing this for the first time, keep the process basic. Upload one sharp portrait or full-body image, pick a short dance reference or motion style, and choose a tool path that is built for fast generation rather than manual keyframing. The strongest outputs usually come from photos with visible arms, legs, and outfit edges because the AI has clearer body landmarks to animate. It also helps to avoid crowded backgrounds and dramatic perspective. The simpler the original image, the cleaner the dance result will feel once the motion starts.

Key insight: photo dance clips work best when the original image is clean, front-readable, and easy for the model to animate without guessing hidden limbs.

Takeaway: start with one sharp image, one short dance loop, and one uncluttered frame, then let the motion do the work instead of overloading the setup.

What kind of photo works best for AI dance videos?

A clear full-body or waist-up photo with visible limbs and a simple background usually works best. The examples on this page point toward clean framing because it gives the model a much better chance of producing stable dance motion.

Can I make a selfie dance with AI?

Yes, if the face is clear and the body pose is readable. A selfie with heavy cropping is harder, but a portrait with space around the shoulders or torso is usually enough for simple dance motion.

Do I need video footage first?

No. This page is for still-photo workflows, so the goal is to start from one image and animate it into a short clip. The easier routes use upload-first tools and preset motion styles rather than raw video input.

What makes the final result look better?

Short dance loops, centered framing, visible limbs, and simple backgrounds usually improve the output. Study the examples on this page before choosing your image so you can avoid the most common starting-photo problems.

Make a Photo Dance With AI: Easy Video Ideas | Alici.AI