Chinese Trend Montage

Chinese trend montages work when the sequence feels culturally specific, emotionally clear, and visually quick to read. This page helps creators study how style shifts, lyric-led pacing, urban imagery, romance cues, and memory fragments get combined into short videos that feel current instead of generic. Use it to understand how montage structure, music timing, and scene selection can turn a simple concept into a remixable social format.

Video
Stevie Mac
GLOBAL LOCK: cinematic fantasy battle short, one Mongolian warrior woman with long black braided hair, leather-and-fur steppe armor, short skirt armor panels, boots, bracers, single steel sword, fighting one massive white fellbeast with ram-like horns, shaggy fur, heavy shoulders, long arms, and tusked predatory face; dry Mongolian grassland surrounded by blue-gray mountains, dusty wind, later shifting into a narrow marshy stream with muddy water and reeds; keep the same woman, the same beast, the same costume silhouettes, the same terrain palette, and the same overcast midday light through the entire video. Native audio only: sword slices, body impacts, dust movement, splashing water, beast roars, short female exertion shouts, no modern UI, no subtitles, no extra soldiers, no second monster.

00:00-00:06  Wide steppe faceoff. The warrior advances through waist-high dry grass with sword raised while the white fellbeast charges low and forward from the opposite side. Use alternating medium-wide and wide shots with slight handheld shake. Dust lifts from the ground, mountains stay soft in the background, and the scene immediately establishes scale difference between the human fighter and the hulking creature.

00:06-00:13  First collision and throw. The beast closes distance, scoops or swats the warrior upward, and her body flips across frame. Show one impact from a medium-wide profile angle and one overhead-leaning angle as she tumbles above the beast's back. Emphasize hair whip, cloth movement, and the beast's forward momentum. Keep action readable, brutal, and physical rather than magical.

00:13-00:20  Recovery and renewed engagement. The warrior lands, scrambles up, resets her footing, and re-enters with sword slashes and evasive side steps while the fellbeast lunges with claws and shoulder checks. Camera stays near chest height, cutting between front three-quarter views of the monster and lateral tracking shots of the warrior running through open grass. Dust clouds and trampled grass should trail every charge.

00:20-00:28  Open-field chase and redirection. The woman breaks into a sprint across the plain, glances back, then pivots on a rock or uneven ground to redirect the beast's line. Use wider running shots that show empty ochre field and distant mountain ridges, then cut tighter as the beast rushes past and turns back toward her. Maintain natural motion blur from speed, but keep both bodies legible.

00:28-00:34  Dusty grapple transition. The beast crashes down or slides, the woman closes in, and both disappear into a burst of dust and grass before the fight spills into water. Camera briefly lowers close to the ground to catch hooves-or-claw-like footfalls, sword arm motion, and airborne debris. Sound design becomes heavier with impact thuds and breath strain.

00:34-00:41  Marsh stream finish. The location shifts into a shallow muddy stream cutting through the grassland. The beast collapses partially in the water while the warrior stands knee-deep, sword drawn, circling and pressing the blade toward the creature's head and neck. Use low medium shots from water level, then end on a tense standoff image: soaked warrior in foreground, white horned head bowed in the stream, muddy ripples spreading outward. The ending should feel like a hard-won pause, not a triumphant celebration.

CAMERA: medium-wide fantasy action coverage, occasional overhead for the flip, low waterline shots in the final stream sequence, mostly handheld or shoulder-mounted energy, moderate shutter realism, no impossible drone moves, no overly stylized lens effects.

LIGHTING: overcast natural daylight, cool mountain haze in the distance, soft shadow edges, dry beige field tones in the first two-thirds, wet muted browns and green reeds in the stream finale.

GRADE: grounded cinematic fantasy, desaturated earth palette, creamy whites in the beast fur, restrained contrast, no neon color, no glossy superhero finish.

MOTION: realistic body weight, aggressive creature lunges, clear sword arm arcs, visible landing force, dust-driven impacts, water drag and splash resistance in the finale.

SPEECH PACK: sparse native vocalization only. The warrior can emit short effort grunts, breath catches, and one or two sharp battle cries. The fellbeast should snarl, roar, and exhale heavily. No monologue, no narration, no modern dialogue.

NEGATIVE PROMPT: extra warriors, horses, armies, bows, fire magic, glowing spells, dragons, wings, modern clothing, polished superhero armor, city ruins, night lighting, snow, desert dunes, comedy tone, anime faces, rubber creature motion, excessive gore, dismemberment, subtitles, logos, UI overlays, text on screen, camera glitches, duplicate beast, duplicate sword, celebratory ending.

SHOT PROMPTS:
1. Mongolian warrior woman and giant white horned fellbeast squaring off in dry mountain grassland
2. beast collision throwing the warrior into the air, dust explosion, kinetic fantasy combat
3. warrior recovering, slashing, sidestepping, and sprinting across open steppe while monster pursues
4. dusty grapple transition into a shallow marsh stream
5. knee-deep water standoff with sword aimed at the beast's submerged horned head
Video
GLOBAL LOCK: widescreen anime-style cat battle edit featuring heroic and villainous cats in a dramatic apocalyptic world, stormy grayscale battlefield with orange fire accents, fast speed-ramp transitions, debris, dust, meteor-like streaks, close-up feline hero shots, swirling red and blue energy portals, exaggerated shonen power-up transformation, cinematic 16:9 framing, no logos, no text overlays required.

0.0s-3.0s: opening wide battlefield shot with cats bracing against a violent storm under a gray sky, fiery orange streaks tear through the atmosphere like meteors, debris and dust blast across the ground.

3.0s-6.0s: a heroic orange cat and smaller companion figures appear in medium-wide action frames, the environment remains windswept and destructive, the edit cuts rapidly between charge-up moments and impact-ready poses.

6.0s-9.0s: the montage pushes into closer cat hero shots with stronger facial detail, one cat appears to wield or confront a weapon-like object, motion blur and diagonal attack streaks heighten the sense of combat.

9.0s-12.0s: abstract swirling red-black vortex transitions spin around a cat face, the edit shifts from physical battle to supernatural energy struggle, the visual language becomes more stylized and unstable.

12.0s-15.0s: blue and red power effects explode around a central feline form, one cat transforms into a more monstrous or godlike battle state with glowing aura and amplified aggression, the background fractures into high-energy shards.

15.0s-16.8s: final climax on a fully powered cat figure radiating blue and red energy while other cats face the blast from the battlefield below, ending on a peak confrontation tableau.
Video
Naturesms
GLOBAL LOCK: A vertical cinematic hot-spring or thermal pool ambience shot at sunset, combining glowing turquoise water, warm deck lighting, and a dramatic red-cloud sky. The setting is an outdoor spa or geothermal pool bordered by railings, deck edges, and warm string or architectural lights. The water is the hero element: bright aqua-to-turquoise, slightly steaming or shimmering, with visible current and ripples catching green-blue highlights. Above it, the sky is filled with warm sunset light and dense crimson-orange clouds, with the sun low near the horizon. Camera begins with a brief high-angle or edge-of-pool movement, then settles into a stable scenic composition centered on the glowing water, pool edge, and sky. No people appear. No dialogue. Audio should be treated as ambient only: soft water movement, distant outdoor air, and quiet resort atmosphere if present.

[00:00-00:01] Open from a high or edge-of-pool angle looking down over the railing and pool boundary. The glowing blue-green water fills most of the frame while a narrow strip of deck edge, stone border, and lit railing detail appears along the bottom or side.

[00:00-00:01] Speech/audio: no dialogue, only soft water ambience and open-air hush.

[00:01-00:02] The camera lifts or settles into a wider scenic view, revealing more of the pool surface, warm lights along the far edge, and the dramatic sunset sky overhead. The color contrast between turquoise water and red-orange clouds becomes immediately readable.

[00:01-00:02] Speech/audio: no dialogue, same low environmental bed.

[00:02-00:04] Hold on the strongest composition where the glowing pool dominates the lower frame and the fiery cloud bank and sun glow above the horizon line. Keep the water surface active with visible ripples and subtle steam-like softness.

[00:02-00:04] Speech/audio: no dialogue, gentle water texture only.

[00:04-00:06] Continue the calm scenic hold as the pool edge, railings, and warm lights form a horizontal band between the water and sky. The clip should feel like a luxury geothermal retreat or resort view at golden hour.

[00:04-00:06] Speech/audio: no dialogue, soft ambient spa-like hush.

[00:06-00:08.1] End on the same stabilized sunset-pool framing with the brightest water glow and richest red clouds still visible. Preserve the quiet mood and the visual contrast of cool illuminated water against warm dramatic sky for a smooth loopable finish.

[00:06-00:08.1] Speech/audio: no dialogue, ambient water and air only.

NEGATIVE PROMPT: swimmers or people in frame, indoor pool, city skyline, flat midday sky, muddy water, no reflections, harsh neon nightclub lighting, dirty spa area, pool toys, lifeguard chairs, rainstorm, shaky camera, washed-out sunset, low-detail water texture, talking, music performance, temporal flicker, plastic resort clutter.

SHOT PROMPTS:
SHOT_01 [00:00-00:01]: Edge-of-pool high-angle reveal with bright turquoise water and railing detail.
SHOT_02 [00:01-00:02]: Settle into a wider hot-spring sunset view.
SHOT_03 [00:02-00:04]: Best balanced composition of glowing water, warm deck lights, and dramatic sunset clouds.
SHOT_04 [00:04-00:06]: Calm hold on the resort-like geothermal pool atmosphere.
SHOT_05 [00:06-00:08.1]: Final loopable scenic sunset-pool frame.

SPEECH PACK
Timecoded transcript: no spoken dialogue.
[00:00-00:01] Audio cue: soft water and open-air ambience.
TAKE_A: gentle pool ripple.
TAKE_B: quiet outdoor spa tone.
TAKE_C: soft geothermal hush.

[00:01-00:02] Audio cue: reveal into wider scenic hold.
TAKE_A: low ambient lift.
TAKE_B: calm watery atmosphere.
TAKE_C: subtle outdoor resort ambience.

[00:02-00:04] Audio cue: stable sunset-pool tableau.
TAKE_A: sustained water movement.
TAKE_B: quiet steam-like ambience.
TAKE_C: soft evening air and ripple bed.

[00:04-00:06] Audio cue: calm scenic continuation.
TAKE_A: gentle spa ambience.
TAKE_B: low outdoor hush.
TAKE_C: serene thermal-pool bed.

[00:06-00:08.1] Audio cue: peaceful ending.
TAKE_A: ambient water tail.
TAKE_B: soft evening fade.
TAKE_C: calm geothermal finish.

Safe paraphrase version:
[00:00-00:08.1] A glowing turquoise outdoor hot spring or thermal pool settles into a scenic sunset composition, contrasting luminous blue-green water and warm deck lights with a dramatic red-cloud evening sky.
Video
GLOBAL LOCK: Multiple young East Asian female characters, each with distinct traditional-modern fusion outfits. Environment is a consistent dark wooden temple corridor with symmetrical pillars and glowing orange paper lanterns. Camera is consistently low-angle, using a wide-angle lens (14mm-24mm) with slight fisheye distortion. Lighting is warm, cinematic, and moody, with high-contrast elemental VFX. High-quality skin textures, flowing fabric, and dynamic motion blur. Speech present: false.

[00:00–00:02]
Subject: Young East Asian woman in red and white martial arts attire, ponytail.
Action: Crouching low on a wet, reflective dark floor, then jumping explosively upward.
VFX: Golden sparks and glowing embers swirling around her feet on the floor.
Camera: Low-angle wide shot, tracking her upward movement.
Lighting: Warm lantern light from the sides, golden glow from the floor.

[00:02–00:04]
Subject: Young East Asian woman with short blonde bob, wearing an orange jumpsuit.
Action: Crouching in a "superhero landing" pose.
VFX: Intense yellow lightning bolts and electricity arcing around her body and the pillars.
Camera: Static low-angle wide shot, center composition.
Lighting: Bright yellow flashes from the lightning illuminating the dark corridor.

[00:04–00:06]
Subject: Young East Asian woman with twin tails, wearing a pink floral kimono.
Action: Crouching and looking down, then slowly rising.
VFX: Soft pink cherry blossom petals swirling in a magical aura around her.
Camera: Medium-wide shot, slight Dutch angle.
Lighting: Soft pink ambient glow mixed with warm lantern light.

[00:06–00:08]
Subject: Young East Asian woman in a flowing blue and white traditional robe.
Action: Spinning rapidly with arms outstretched.
VFX: Vibrant blue flames and ethereal energy trails following her movement.
Camera: Dynamic tracking shot, rotating slightly with the subject.
Lighting: Cool blue light reflecting off the wooden pillars and floor.

[00:08–00:11]
Subject: Young East Asian woman in a bright yellow kimono with a white sash.
Action: Running at full speed directly toward the camera, smiling.
VFX: Motion blur and subtle golden light trails.
Camera: Fast-moving tracking shot, low to the ground, moving backward.
Lighting: High-key warm light from the lanterns ahead.

[00:11–00:13]
Subject: A silhouette of a female figure in the center of the corridor.
Action: Standing still as a massive explosion of orange fire erupts from her body.
VFX: Realistic, high-detail fire explosion filling the frame, smoke and embers.
Camera: Wide shot, symmetrical composition.
Lighting: Extreme high-contrast orange firelight casting long shadows.

[00:13–00:14]
Subject: Young East Asian woman in a black ninja outfit with a face mask.
Action: Striking a martial arts palm-strike pose toward the camera.
VFX: Subtle distortion waves in the air around her hand.
Camera: Close-up punch-in on the hand and eyes.
Lighting: Low-key, moody, side-lit by lanterns.

[00:14–00:16]
Subject: Young East Asian woman in a white kimono with large red floral patterns.
Action: Floating or jumping backward through the air, fabric billowing.
VFX: Soft white light emanating from behind her.
Camera: Low-angle tracking shot following her flight.
Lighting: Backlit, creating a rim-light effect on her hair and clothes.

[00:16–00:19]
Subject: Young East Asian woman in a white jacket and black cargo pants, modern street style.
Action: Crouching, then standing as a large purple energy ring expands from her.
VFX: Glowing neon purple circular energy wave with particle effects.
Camera: Wide shot, centered.
Lighting: Vibrant purple glow reflecting on the floor and pillars.

[00:19–00:25]
Subject: All previous characters appearing in rapid succession.
Action: Jumping, spinning, and posing in a chaotic, high-energy montage.
VFX: A mix of all previous elemental effects (fire, lightning, petals, blue flames).
Camera: Rapid cuts, spinning fisheye lens movements, extreme wide angles.
Lighting: Saturated, flickering, high-energy mix of warm and colored lights.

NEGATIVE PROMPT: Robotic movement, distorted faces, extra limbs, blurry textures, low resolution, modern furniture, plastic materials, flat lighting, static camera, text, watermarks, inconsistent clothing, cartoonish VFX, jittery motion.
Video
GLOBAL LOCK: keep one young Order of the Flame monk consistent across the entire video, male, lean build, calm expression, shaved or closely cropped hair, saffron-orange monk robes with layered fabric folds, riding posture upright and balanced. Keep one rare grey dragon-horse consistent across the entire video, pale grey to white body, draconic scaled head, long neck, horned crest, powerful horse body, wet reflective hide, high-speed water-running gait. Preserve the same mountain lake environment across all shots: broad shallow water plain, distant rocky mountains, bright overcast daylight with dramatic clouds, reflective water surface, white spray trails, cinematic fantasy realism. Motion language must emphasize power and agility, not battle. No weapon swings, no enemy units, no urban elements, no extra riders, no costume swaps, no anatomy drift, no breed changes, no color drift, no cutaway to unrelated locations.

MASTER INTENT: a powerful fantasy travel sequence showing a flame monk charging across shallow water on a rare grey dragon-horse, moving from splash-heavy medium shots into wider landscape hero shots while maintaining elegance, speed, and control.

00:00-00:04 — opening impact shot
Subject: the monk is already mounted, centered slightly above the dragon-horse shoulders, robes pressed back by speed.
Environment: shallow lake water explodes outward under the mount, distant mountains softened by atmosphere, bright cloud cover overhead.
Action: the dragon-horse charges straight through water with forceful front-leg extension and heavy spray bursting around its chest and jawline.
Camera: medium telephoto side-front tracking shot running parallel to the mount, low enough to let spray hit foreground.
Lighting: crisp daylight with diffuse cloud-softened highlights, white reflections dancing across water droplets.
Grade: high-detail cinematic fantasy grade, cool water tones contrasted with warm saffron fabric.
Motion: fast, grounded, muscular stride cadence, sharp splash arcs, stabilized tracking with subtle impact vibration.
Speech: none.

00:04-00:09 — controlled speed and profile clarity
Subject: the monk remains calm and upright, hands steady, gaze fixed forward; dragon-horse head profile becomes more readable.
Environment: open water plain widens, mountain ridge line gains definition, reflective surface mirrors the mount in broken fragments.
Action: the mount maintains a sustained high-speed run, water peeling away from each hoof strike in layered sheets.
Camera: clean side profile tracking shot with slightly wider framing to reveal full neck, shoulders, and rider silhouette.
Lighting: daylight stays even and natural, highlights glint across scales and wet mane ridges.
Grade: polished adventure-fantasy look with clean contrast and realistic texture retention.
Motion: smooth lateral glide, elegant speed, no frantic shaking, motion blur contained to splash and hoof impact.
Speech: none.

00:09-00:15 — full-body power reveal
Subject: the pair now read as one unified silhouette, monk poised and disciplined, dragon-horse visibly long-striding and athletic.
Environment: more of the lake floor and horizon line enter frame, giving scale to the crossing.
Action: the dragon-horse skims and pounds through water with long, efficient strides while the monk absorbs motion through the hips and torso.
Camera: wider tracking shot with the full mount visible from nose to tail, horizon stabilized for heroic readability.
Lighting: luminous daylight with broad sky bounce, subtle rim on spray edges.
Grade: epic outdoor fantasy grade, slightly elevated clarity in the white spray and pale hide.
Motion: repeating rhythmic hoof impacts, consistent forward velocity, long streaks of water trailing behind.
Speech: none.

00:15-00:22 — landscape expansion
Subject: monk and mount shrink slightly within frame but remain the emotional focal point through silhouette and color contrast.
Environment: wide reflective shallows, mountain basin, open sky, sweeping sense of distance and freedom.
Action: the charge continues uninterrupted, reading less like a sprint attack and more like an unstoppable glide across the waterline.
Camera: wide lateral hero shot with slight crane-like lift, allowing the environment to frame the pair in motion.
Lighting: bright daylight with silver-white reflections on the water surface.
Grade: expansive cinematic fantasy palette with restrained saturation and premium naturalistic detail.
Motion: smooth macro movement across the basin, splash density lowers slightly as the framing widens.
Speech: none.

00:22-00:29.766 — final grandeur pass
Subject: the monk remains composed to the end, dragon-horse still forceful and agile, no fatigue or loss of form.
Environment: biggest view of the terrain, distant peaks and reflective water reading as a sacred travel corridor.
Action: the pair continue charging through the shallows toward open space, sustaining the same mythic momentum through the final frame.
Camera: very wide hero travel shot, lingering and stable, designed for awe and scale.
Lighting: daylight remains consistent, with soft cloud-filtered highlights preserving detail in white spray and pale hide.
Grade: premium fantasy-cinema finish, clean whites, deep environmental depth, subtle atmospheric haze.
Motion: forward glide remains strong and legible until cut, splash trails taper into a polished exit rhythm.
Speech: none.

SPEECH PACK:
- Dialogue: none
- Voiceover: none
- Mouth movement: minimal and non-emphatic, rider remains focused
- Audio direction: cinematic music-led presentation with hoof impacts, water spray, and low fantasy-adventure energy

NEGATIVE PROMPT:
low detail, muddy anatomy, extra limbs, rider duplication, second mount, wrong creature design, horse head replacing dragon head, dragon wings appearing suddenly, armor changes, robe color changes, facial drift, cartoon styling, low-resolution spray, desert terrain, forest terrain, night lighting, fire battle, enemy troops, weapon combat, static pose, floating hooves, broken reflections, oversaturated fantasy glow, text overlays, subtitles, logos, watermark, jump cuts to unrelated scenes
Video
Claire
GLOBAL LOCK: A vertical cinematic winter-wonder short about a young woman from a tropical climate experiencing her first snowfall in a traditional East Asian courtyard lane at blue dusk. The lead subject is a young East Asian woman with fair warm skin, black hair pinned into a soft updo, and a joyful expressive face, wearing a long flowing red silk robe or hanfu-inspired coat. She carries a glowing warm paper lantern that illuminates her hands and face against the cool blue snowlight. The setting should remain consistent: tiled rooftops, stone paths, snowy courtyard walls, gently falling snowflakes, and a peaceful old-town architectural atmosphere. Camera language moves from playful medium running shots to elegant close-ups of robe hem, lantern, and delighted face, ending in luminous smiling portraits. Lighting is strongly contrasted between cool snowy dusk and the lantern’s amber glow. The tone is tender, magical, and personal rather than epic. Audio should feel like soft winter ambience with delighted breaths or laughter, communicating wonder at seeing snow for the first time.

[00:00-00:01] Rear medium shot of the woman running lightly through a snow-covered courtyard lane, her red robe flowing behind her and a warm lantern swinging in one hand. Snowflakes drift through cool twilight air while old tiled roofs frame the path. The mood is immediately playful and awestruck.

[00:01-00:02] Side medium shot as she slows and turns slightly, lantern lifted, looking upward at the falling snow with fresh disbelief and joy. The warm lantern glow brushes her cheek and sleeve while the blue evening light fills the background.

[00:02-00:03] Close insert of the hem of her red robe and white shoes stepping into fresh snow, kicking up soft powder. The movement should feel tactile and childlike, as though she cannot resist testing the texture.

[00:03-00:04] Another close insert of the lantern near the snow, its amber light blooming against the white ground. This shot should make the warm-cold color contrast central to the piece.

[00:04-00:05] Medium frontal shot as she opens her arms slightly and smiles into the snowfall, fully giving herself over to the moment. The robe belt, collar, and flowing sleeves should remain elegant and period-inspired.

[00:05-00:06] Over-shoulder or profile shot of her turning in place with the lantern, letting snow collect on hair and shoulders. Her face reads wonder rather than theatrical performance.

[00:06-00:07] Wide shot from behind as she walks deeper into the courtyard lane, now calmer, absorbing the environment. The rooftops and walls create a sheltered cocoon around the first-snow experience.

[00:07-00:08] High overhead view of the snowy corridor with the red-clad woman as a small vivid figure below, lantern glowing like a moving ember. The world should feel quiet, pristine, and intimate.

[00:08-00:09] Medium close-up of her lifting a hand toward the falling snow or adjusting the lantern, smiling as if she cannot believe the cold flakes are real. Snow lands on her hair and lashes.

[00:09-00:11] Tight beauty close-ups of her face lit by the lantern, eyes bright, cheeks softly flushed, and smile widening with genuine delight. The red robe and amber light should make her look protected and warm inside the winter blue.

[00:11-00:13] Alternate between the lantern near her face and a direct smiling look toward camera, as if sharing the wonder with the viewer. The emotional arc peaks here in pure gratitude and surprise.

[00:13-00:15] Final soft portrait of her holding the lantern close, standing still in the snowfall with a calm, radiant smile. Let the scene close on serenity rather than motion, preserving the memory of a first snowfall as a tender life moment.

NEGATIVE PROMPT: modern city street, comedic slapstick, heavy blizzard danger, neon cyberpunk setting, subtitles, logos, low-detail costume, crowded festival scene, horror tone, muddy snow, bright midday light, aggressive windstorm, multiple characters speaking, modern winter jacket.

SPEECH PACK: Minimal spoken audio. Use snowfall ambience, robe movement, soft footsteps in snow, delighted breath or light laughter, and warm quietness from the lantern-lit environment. If any voice is present, keep it intimate, amazed, and heartfelt.
Video
Claire
GLOBAL LOCK: A romantic first-snow vignette set in a traditional East Asian courtyard alley at blue hour. The subject is a young East Asian woman with fair skin and dark hair in an elegant updo, wearing a flowing crimson hanfu-style robe with pale inner layers. She carries a warm glowing lantern while walking and lightly running through fresh snow between tiled roofs and old courtyard walls. The atmosphere should feel magical and intimate: large soft snowflakes falling, cool blue evening ambient light, warm lantern glow on her face, and a quiet historical-architecture setting with layered rooftops in the background. The mood is delighted, tender, and slightly wonderstruck, as if someone from a tropical climate is experiencing snowfall for the first time. Camera language should alternate between medium walking shots, close beauty shots, detail inserts of robe hem and snow, and one elevated courtyard reveal. No dialogue is needed.

[00:00-00:03] Open behind and beside the woman as she moves through a snowy courtyard lane in her flowing red robe, lantern in hand. Snow falls steadily under blue evening light while the warm lantern creates a golden pool of light around her sleeves and face.

[00:03-00:05] Cut to tactile close-ups: the robe hem brushing through fresh snow, the lantern swinging close to the ground, and powder kicking up around her feet. These inserts make the snowfall feel physical and new.

[00:05-00:08] Shift into medium and close portrait shots as she turns toward camera smiling, then opens her arms slightly in delight. The historic rooftops and courtyard walls behind her remain softly visible through the falling snow, grounding the fantasy in a specific architectural world.

[00:08-00:10] An elevated view reveals her as a small red figure moving through the white courtyard below, emphasizing the contrast between the vivid robe and the snow-covered rooftops. The shot makes the scene feel like a winter postcard.

[00:10-00:14] End with warm close-ups of her face beside the lantern as she smiles back over her shoulder and then directly toward camera. Snow gathers lightly in her hair while the lantern glow softens her expression, closing on a feeling of joyful first-snow wonder.
Video
Stevie Mac
GLOBAL LOCK: brutal fantasy creature fight in a cold mountain river at night; one lean grey-white werewolf with long tail, glowing eyes, wet fur, fast catlike agility; one towering muscular minotaur with black fur mane, curved horns, bare upper torso, heavy arms, and bull-like head; shallow rapids, black rocks, spray, distant mountains, moonlit mist, no humans, no weapons, no extra monsters, no fire magic, no subtitles, no dialogue performance.

00:00-00:06
Open in a dark river gorge with both creatures already in frame. The werewolf stays low and agile on rocks and in shallow water while the minotaur plants itself heavily in the current. Establish scale contrast immediately: fast predator versus massive brute.

00:00-00:06
The first exchange is explosive. The werewolf leaps from a rock or from the waterline toward the minotaur's upper body while the minotaur twists and absorbs impact. Use splashing water, short bursts of contact, and strong silhouette separation against misty backlight.

00:00:06-00:16
Move through repeated close-quarters grappling in the river. The werewolf circles, crouches, and snaps forward with bite-like attacks while the minotaur bends, turns, and tries to swat or seize it. Keep the fight readable, wet, and heavy rather than acrobatically clean.

00:00:16-00:28
Shift to a more tactical beat around large rocks. The werewolf climbs or perches briefly on boulders, then launches again. The minotaur wades and pivots through the current, constantly reorienting its shoulders and horns. Emphasize spray, slipping footing, and pressure from the water.

00:00:28-00:40
Escalate into direct upper-body collisions. The werewolf lands on the minotaur's chest, shoulders, and neck area in several rapid attempts while the minotaur lurches, lifts, or throws its mass sideways. The scene remains one continuous brutal encounter, not a heroic duel with pauses.

00:00:40-00:52
Resolve by taking the struggle into the water. The fight destabilizes, both bodies crash lower into the current, and the final images move partly underwater with distorted motion, bubbles, limbs, and silhouettes losing clarity in the torrent. End with chaos and unresolved survival.

CAMERA: aggressive creature-combat cinematography with medium-wide river coverage, low waterline angles, impact close-ups, rock-to-river perspective shifts, and final submerged shots; keep action readable, not hyper-cut into abstraction.

LIGHTING: moonlit night ambience with cool blue-grey highlights on wet fur and muscle, bright specular hits on spray, mist-softened backlight, dark surrounding valley shapes.

GRADE: cold steel-blue fantasy night palette, high wet contrast, dark river blacks, controlled highlight bloom on water splash, readable creature texture even in low light.

MOTION: lunges, swats, grapples, clawing contact, bite attempts, horn-led turns, body slams into water, scrambling on slick rocks, underwater thrash at the end.

SPEECH PACK: native audio creature combat only; snarls, roars, impact breaths, water crashes, splashes, submerged turbulence, no spoken language, no narration.

NEGATIVE PROMPT: human warriors, swords, fire, magic bolts, daytime lighting, arena audience, heroic speeches, extra beasts, clean dry costumes, comedy tone, gore fountains, subtitles, text overlays, cartoon creature design, blurry unreadable action.

SHOT PROMPTS:
1. lean werewolf and massive minotaur facing off in moonlit river with black rocks and spray
2. werewolf leaping at the minotaur through whitewater while the brute twists in the current
3. repeated close grappling and bite attempts around boulders in cold night mist
4. final crashing struggle dropping underwater with distorted limbs and bubbles
Video
Katrina
A minimalist fashion-beauty concept video set against a smooth pink studio backdrop. An elegant East Asian woman with straight black hair wears a glossy ivory sleeveless body-skimming dress that reflects light like polished latex or liquid satin. The film moves through a sequence of clean editorial poses: first she presents a transparent geometric acrylic box filled with floating silver pieces like a luxury object or art prop; then the styling shifts into a darker, more sculptural moment with long black opera gloves and close-up hand choreography; finally she lifts a large white lampshade-like object over her head, transforming it into a surreal couture headpiece. The overall mood is refined, symmetrical, and high-concept, combining beauty-campaign precision, gallery-object styling, and sculpture-inspired fashion movement. Soft studio lighting, immaculate skin detail, strong negative space, and pastel-commercial color design give the clip a premium editorial feel.
Video
GLOBAL LOCK: Subject is Lisa Manoban, a young woman of Thai-Korean ethnicity with light brown hair and straight bangs. She has a slim build and a glamorous, high-fashion makeup look with winged eyeliner and glossy pink lips. The environment is a minimalist studio with soft, ethereal lighting, often using pink, lavender, and white tints. The visual style is photorealistic, cinematic, and high-fashion editorial. The color grade is vibrant yet soft, with high-key highlights and deep, clean blacks. Pacing is fast, with cuts synced to a high-energy pop/rap beat. Speech is not present, but the video is synced to the song "Rockstar" by Lisa.

[00:00–00:01]
Subject: Lisa wearing an elaborate gold phoenix crown with pink and yellow fabric flowers and dangling gold chains. She wears a pink sheer qipao with 3D floral embroidery.
Environment: Soft-focus white lilies in the background.
Action: Subject looks directly at the camera with a neutral, confident expression.
Framing: Medium Shot (MS), eye-level.
Camera: Static.
Lighting: Warm, soft key light from the front.

[00:01–00:03]
Subject: A more doll-like, porcelain-skinned version of Lisa. She wears a headpiece made of translucent purple and pink glass-like flowers.
Environment: Solid lavender background.
Action: Subject's eyes are closed, then slowly open to look at the camera.
Framing: Close-up (CU).
Camera: Slight zoom-in.
Lighting: Cool, diffused purple light.

[00:03–00:05]
Subject: Lisa with pink rosy cheeks, holding a large white lily in front of her face. She wears a crown of mixed lilies and orchids.
Environment: Dark background with blurred white flowers.
Action: Subject moves the lily slightly to reveal her eye; her hand has long, stylized nails.
Framing: Close-up (CU).
Camera: Subtle handheld shake for a raw editorial feel.
Lighting: Bright, high-contrast white light.

[00:05–00:07]
Subject: Same as [00:00], Lisa in the phoenix crown.
Action: Subject performs a stylized hand gesture, bringing her hand toward the camera in a "rockstar" pose.
Framing: Medium Shot (MS).
Camera: Static.
Lighting: Warm, soft pink glow.

[00:07–00:09]
Subject: Lisa with blonde hair pulled back, wearing a side-mounted floral headpiece with long dangling pearl chains.
Environment: Solid light pink background.
Action: Subject turns her head from side-profile to face the camera, singing along to the lyrics.
Framing: Medium Close-up (MCU).
Camera: Static.
Lighting: Soft, even studio lighting.

[00:09–00:11]
Subject: Lisa with long blonde wavy hair, wearing a silver sequin-covered bodysuit and large pink floral wristbands.
Environment: Sparkling, out-of-focus background.
Action: Subject touches her face with silver-gloved hands, eyes looking down then up.
Framing: Medium Shot (MS).
Camera: Slight tilt up.
Lighting: High-key, shimmering light reflecting off sequins.

[00:11–00:13]
Subject: Lisa with blonde hair, wearing a high-collared sheer top with pink floral details.
Action: Subject has a small pink flower in her mouth, then lets it fall.
Framing: Close-up (CU).
Camera: Static.
Lighting: Soft, warm light.

[00:13–00:15]
Subject: Lisa's face partially obscured by a dense cluster of pink cherry blossoms.
Environment: Neutral beige background.
Action: Subject looks through the flowers with a piercing gaze.
Framing: Extreme Close-up (ECU).
Camera: Static.
Lighting: Bright, natural daylight feel.

[00:15–00:17]
Subject: Lisa with short blonde hair in a black suit, looking at a holographic UI screen showing her own face in different floral looks.
Environment: Futuristic, clean architectural space.
Action: Subject swipes her hand across the holographic screen.
Framing: Medium Shot (MS).
Camera: Slight pan right.
Lighting: Cool, blue-tinted light from the screen.

[00:17–00:20]
Subject: Lisa in a voluminous, iridescent pink "bubble" dress that looks like liquid glass, adorned with flowers.
Environment: Gradient pink and purple background.
Action: Subject stands elegantly, looking slightly away from the camera.
Framing: Medium Shot (MS).
Camera: Static.
Lighting: Dreamy, diffused pink light.

[00:20–00:24]
Subject: Rapid succession of Lisa in various floral-themed high-fashion looks: a purple crystalline dress, a white lace veil with flowers, a gold-trimmed qipao.
Action: Fast cuts, subject changes poses and expressions rapidly.
Framing: MCU and CU.
Camera: Fast cuts.
Lighting: Shifting between warm and cool tones.

[00:24–00:27]
Subject: Back view of Lisa in a sheer, backless dress with purple 3D flowers trailing down her spine.
Environment: Soft pink background.
Action: Subject turns her head to look over her shoulder at the camera.
Framing: Medium Shot (MS).
Camera: Slow dolly-in.
Lighting: Soft rim lighting to highlight the dress texture.

[00:27–00:31]
Subject: Lisa in a dense crown of white and pink lilies, looking intensely at the camera.
Action: Subject's face is framed by her hands; she blinks slowly.
Framing: Close-up (CU).
Camera: Static.
Lighting: High-key, flattering beauty light.

[00:31–00:33]
Subject: Extreme close-up of a human eye with a hazel iris and long, dark, defined lashes.
Action: The eye blinks once.
Framing: Extreme Close-up (ECU), macro.
Camera: Static.
Lighting: Sharp, focused light on the iris.

[00:33–00:37]
Subject: A white, smooth mannequin head being manipulated by several white robotic hands with visible wires and joints.
Environment: Dark, void-like background.
Action: The robot hands delicately touch the mannequin's face and head, pulling thin silver wires.
Framing: Close-up (CU).
Camera: Subtle zoom-out.
Lighting: Dramatic, high-contrast side lighting.

NEGATIVE PROMPT: blurry, low resolution, distorted facial features, inconsistent hair color, messy background, extra limbs, unnatural skin texture, flickering light, jittery motion, text, logos, watermarks, low quality, cartoonish, 2D, flat lighting.

SPEECH PACK:
[00:00–00:37]
Transcript: (Music only - "Rockstar" by Lisa)
Delivery Direction: No speech. The video is a visual montage synced to the high-energy, rhythmic beat of the song. The subject's mouth movements in some shots should match the lyrics of "Rockstar" (e.g., at 00:08 and 00:22).
Sync Requirements: High lip-sync strictness for the "Rockstar" lyrics during the blonde hair segments. All visual cuts must land exactly on the musical beats.
Video

A 35-second vertical fantasy character edit set against the Shanghai skyline at night, styled like an interactive power-control sequence for a handsome anime-inspired male lead similar to Sylus from Love and Deepspace. The video presents the same dark-haired male character in elegant black-and-silver formalwear, composited at giant scale over the city as if he is a supernatural guardian towering above the skyline. On-screen interface elements and command text divide the video into different abilities such as "Control Water," "Get Cloud," and "Control Thunder." In the first section, hand gestures and UI overlays appear to manipulate water across the skyline, creating rising waves and atmospheric mist. The second section shows the character summoning a small cloud in his hand, with soft smoke-like transitions. The final section shifts into lightning powers, with glowing icons, energy gathering in the palm, and dramatic thunderbolts striking through the city backdrop. The style is glossy, romantic, and fandom-driven, mixing mobile-game husband aesthetics, superpower UI design, and cinematic city compositing.
Video
GLOBAL LOCK: elderly East Asian man with long white eyebrows, long white moustache and beard, white hair tied in a topknot, smiling expressive face, wearing layered traditional robes in off-white and black with wide sleeves; seated or standing in front of broad stone steps outdoors or in a temple courtyard setting; the shot is a short portrait-style gesture test focused on both raised hands and finger visibility, no other characters, no combat, no camera cutaways, no prop changes, no text overlays, no logos, no UI.

VIDEO FORMAT: 5.22 seconds, single-subject gesture demonstration, fixed portrait framing with slight natural camera movement, emphasis on hand articulation and finger count visibility.

SHOT SEGMENTS:

0.00-1.20s: The elderly robed man begins in a medium portrait in front of stone steps, smiling and lifting one hand near his face as if beginning a playful demonstration. His beard and eyebrow length are immediately readable.

1.20-2.40s: He opens his gesture wider, bringing both hands higher into frame. Wide sleeves pull back enough to expose fingers clearly. The expression remains amused and welcoming rather than solemn.

2.40-3.80s: The gesture expands fully. Both hands are raised outward and upward, palms or fingers visible, making the clip suitable as a finger-count test. Facial performance stays light and animated.

3.80-5.22s: He holds the final open-handed pose with both arms spread, maintaining a grin. End on a stable frame where the fingers remain the most inspectable part of the shot alongside the elder's robe silhouette and stone-step backdrop.

CAMERA: steady medium portrait framing, slight handheld micro-movement only, no zooms, no pans away, no angle changes.

LIGHTING: soft daylight, even outdoor illumination, gentle shadow under brows and beard, clear visibility on hands and sleeves, no harsh colored light.

GRADE: natural cinematic portrait with warm-neutral stone tones, clean whites in the robe, preserved beard texture, restrained contrast to keep finger details readable.

MOTION: slow expressive arm lift, sleeve movement, slight torso sway, smiling face, no fast action.

SPEECH PACK: no visible speaking requirement, no narration required. Audio may be light ambience only; no lip-sync constraints.

NEGATIVE PROMPT: extra people, martial arts fight, magical energy, fantasy battle, indoor palace, modern clothing, fast hand blur, cropped fingers, hidden hands, duplicated arms, broken anatomy, extra hands outside the intended test, text captions, watermark, interface, exaggerated camera shake.
Video
GLOBAL LOCK: one pale vampire-styled guitarist as the primary character, white face makeup, dark lips, visible fangs during monster expression, black and red hat, dark robe-like clothing, acoustic or classical guitar, warm indoor room with exposed brick and cool blue window light, no text, no watermark, no modern stage, same costume throughout, clone-band effect appears only in the final section.

TIMECODED SHOT SEGMENTS:

00:00-00:04: Open on a close medium shot of the pale vampire guitarist playing the acoustic guitar calmly. He looks downward toward the strings while warm interior light shapes the face and the guitar body glows warmly against the cooler room background.

00:04-00:07: Shift into a playful horror performance beat as the character raises both hands toward camera, opens the mouth, and reveals fangs. Keep the same room and same costume, with an exaggerated mock-vampire expression rather than realistic horror violence.

00:07-00:09: Return briefly to the guitar-playing setup so the edit feels cyclical and musical, reinforcing the image-to-video effects experiment around one recognizable character.

00:09-00:11.4: Expand into a clone-band finale where multiple copies of the same vampire-styled guitarist stand side by side, each holding a guitar in matching costume. Preserve the same interior location and playful fake-horror tone.

SUBJECT: pale vampire-inspired guitarist performer with white makeup, fangs, dark hat, and acoustic guitar.

ENVIRONMENT: warm indoor room, exposed brick columns, cool blue light from windows or openings, intimate home-performance setting.

ACTION: guitar strumming, facial transformation into a fang reveal, raised claw-hands gesture, final clone-band multiplication.

CAMERA: stable music-performance framing with close and medium coverage, ending on a wider composition to reveal the duplicate band effect.

LIGHTING: mixed warm interior key light on face and guitar, cooler ambient fill from background openings, clear contrast that keeps the pale makeup readable.

GRADE: clean indoor performance grade, warm skin-light contrast against cool background, playful gothic palette without extreme darkness.

MOTION: natural hand strumming, expressive facial performance, controlled character gestures, then deliberate duplication in the last beat.

SPEECH: no dialogue, no subtitles, no narration.

SPEECH PACK: silent image-to-video performance clip, no lip sync, no spoken lines, no presenter overlay, only musical or ambient energy implied.

NEGATIVE PROMPT: realistic gore, blood spray, extra random people, modern concert stage, microphone stand, broken guitars, distorted hands, smeared face makeup, missing hat, text overlay, watermark, logo, heavy camera shake, dark unreadable lighting, uncontrolled monster transformation.
Video
MASTER PROMPT
GLOBAL LOCK: Create a vertical 9:16 heavenly battlefield fantasy video in bright high-altitude daylight with blue sky, mist, jagged mountain peaks, and a vast white ground or cloud plain crowded with white-robed haloed angelic figures. The main subject is one dark black-red winged warrior with massive feathered-black wings, a bright circular halo, a body covered in thorn-like armor or organic spikes, and a glowing red-orange spear or staff weapon. Keep the contrast between the dark central warrior and the luminous white heavenly army consistent across the whole timeline. Preserve the mountain backdrop, drifting mist, white crowd density, halo motif, black wing silhouette, and fiery spear glow. Camera progression should begin with a distant frontal approach, then move into crowd confrontation, weapon close-ups, forward lunges through angel ranks, and end on a radiant pure angel figure descending or hovering against the sky. Speech style: no spoken dialogue, no captions, no lip-sync requirements; audio should be epic holy-war sound design with wind, wing beats, crowd tension, fiery spear hum, impact bursts, and orchestral battle rise.

[00:00–00:03] Open wide on a vast white battlefield framed by steep mountains and drifting mist under bright blue daylight. The dark winged warrior appears far ahead, centered with halo visible, advancing across the pale ground while long lines of angelic figures stand on both sides. Use a long-lens frontal approach with soft mist layers and strong scale contrast. Audio: no speech, only wind, distant wing ambience, and slow epic score build.

[00:03–00:06] Move into a direct frontal reveal among the white crowd. The warrior's black wings open wider, halo burns brighter, and dust or mist kicks up from the ground. White-robed figures with halos create a corridor around the subject. Camera stays centered and pushes forward steadily. Sound adds heavier wing pressure and battle anticipation.

[00:06–00:09] Cut to intimate body and armor details: black feather mass, thorn-like armor spikes, and the glowing red-orange spear shaft entering frame. Use medium close and close macro-like inserts that show heat, metal-organic texture, and the contrast between dark armor and brilliant environment. Audio remains nonverbal with a low weapon hum and score escalation.

[00:09–00:12] Show the spear tip and shaft fully, glowing with molten red-orange energy. The warrior lowers the weapon and aims through the white ranks. Keep the halo and wing edges readable behind the weapon. Use a 50mm combat close-up with shallow depth of field, then accelerate into a forward thrust. Audio: energy hum, whoosh, no dialogue.

[00:12–00:17] Drive into ground-level action as the warrior charges or lunges through the white-robed figures, spear leading. Camera follows the weapon path forward through the crowd with motion blur and bright impact flashes. White robes scatter and the spear glow becomes the visual center. The setting stays bright and sacred rather than dark. Sound should feature fast movement, impact bursts, and rising orchestral intensity.

[00:17–00:23] Continue with side and three-quarter battle passes. The warrior leaps or sweeps across the white crowd, black wings arcing over haloed heads, while orange-gold spray, sparks, or energy bursts streak from the spear. Keep the heavenly crowd dense and mountains visible in the distance. Use fast tracking shots with controlled blur and high contrast. Audio remains purely cinematic battle design.

[00:23–00:28.3] End on a tonal reversal: cut away from the dark warrior to a luminous pure white angel figure rising or hovering against open sky, with bright halo, white wings, and a vertical beam of light descending through the body. Hold this final frame as a clean celestial counterpoint to the earlier chaos. Audio: no speech, just a resolved holy chord, airy wind, and a bright radiant tail.

NEGATIVE PROMPT: dark hellscape, night sky, muddy battlefield, missing mountains, missing halo, generic knight armor, soft undefined wings, small empty crowd, no weapon glow, low-detail spear, text overlays, captions, logos, bland gray palette, broken anatomy, duplicated wings, limp crowd staging, weak motion blur, cartoon angel style, spoken narration, lyrics, clipped impacts, cheap laser sounds.

SHOT PROMPTS
SHOT 01 DELTA: wide heavenly battlefield with mountains, mist, white crowd, distant black winged halo warrior approaching.
SHOT 02 DELTA: centered frontal reveal in white corridor of haloed figures, wings spread, dust and mist rising.
SHOT 03 DELTA: close armor and wing thorn details, glowing spear enters frame.
SHOT 04 DELTA: molten spear close-up aimed through angel ranks.
SHOT 05 DELTA: forward charge through white crowd with fiery spear motion blur.
SHOT 06 DELTA: lateral battle sweep, black wings over white figures, orange energy spray.
SHOT 07 DELTA: final radiant white angel in sky with vertical light beam.

SPEECH PACK
TIMECODED TRANSCRIPT
[00:00–00:28.3] No confirmed spoken dialogue. No visible text. Audio should be epic holy-war atmosphere with wind, wing beats, energy hum, battle impacts, and a final radiant orchestral release.

DELIVERY TAKES
TAKE_A: Start wide and majestic, intensify the spear hum and battle motion in the middle, then resolve into a bright celestial ending chord.
TAKE_B: Keep the first half heavy on wing pressure and crowd tension, with the loudest accents on the spear thrusts and side sweep attacks.
TAKE_C: Use a cleaner heroic mix with airy mountain wind, controlled weapon burn, and a luminous final sky reveal.

PROSODY / AUDIO PERFORMANCE NOTES
- No speaking and no lip-sync needs.
- Emotional arc: omen, confrontation, charge, battlefield sweep, divine final counter-image.
- Biggest sound accents should land on the spear ignition, the forward crowd charge, and the closing beam-of-light reveal.
Video
GLOBAL LOCK: A playful but grounded Mongol warrior dance reel performed by a small group of armored steppe fighters on an open grassland plain. Keep six to eight male-presenting Mongol warriors in layered brown, black, and tan leather-and-lamellar outfits with fur trims, belts, helmets or hoods, boots, and period accessories. Preserve the broad flat steppe, distant low mountains, overcast daylight, earthy muted palette, and one frontal medium-wide camera setup that keeps the whole group visible. The action is synchronized folk-dance-like stepping, arm swings, bouncing footwork, and side-to-side groove rather than combat or marching. No dialogue, no narration, no lip-sync. Audio is music-led only, with rhythmic beat emphasis and maybe light environmental wind.

[00:00-00:05] Open on the full group already moving toward the camera in a loose front-facing formation across the grassland. Their arms swing outward and inward while their steps bounce in time, with the center dancers leading the rhythm. Keep the framing wide enough to read the whole formation and the mountainous horizon behind them.

[00:05-00:10] The group settles into a more planted dance pattern. Several dancers step laterally while raising bent elbows and rocking their shoulders. Costumes sway, belts and fur details move subtly, and the formation stays shallow and readable. No one breaks into combat gestures; the feeling is celebratory and joking, not aggressive.

[00:10-00:16] Push the groove further with alternating foot taps, half-turns, and broader arm accents from the front row while the back row mirrors in looser timing. Keep the camera frontal and stable with only slight motion, letting the choreography and outfit silhouettes carry the clip. Preserve the open field and muted sky as a constant backdrop.

[00:16-00:21] Finish with the dancers still in formation, stepping and swinging through the final beat while facing camera. The clip should end as an ongoing dance reel rather than a completed story moment. Maintain the same grassland setting, warrior costumes, and fun group energy with no cutaway to battle or horses.

NEGATIVE PROMPT: actual battle, charging horses, sword fighting, arrows, war cries, extra crowd, modern dancewear, neon stage lights, nightclub environment, acrobatic breakdance moves, comedy costumes, broken anatomy, duplicate dancers, warped armor, text overlays, logos, subtitles, dialogue, chanting, narration, lip-sync artifacts, environment changes away from the open steppe.

SHOT PROMPTS:
SHOT 1: Mongol warrior group dance advancing toward camera across a flat grassland under gray sky.
SHOT 2: front-facing synchronized side steps and arm swings in layered leather-and-fur warrior outfits.
SHOT 3: rhythmic group bounce and shoulder accents with the steppe horizon held steady behind them.
SHOT 4: final formation groove continuing playfully with no transition into combat.

SPEECH PACK:
[00:00-00:21]
Closest audible version: no intelligible speech, no verbal performance, music-only reel energy.
Safe paraphrase version: maintain a non-verbal rhythmic track with clear beat, light ambient openness, and no dialogue or chanting.
TAKE_A: upbeat percussive dance-beat mix with subtle wind beneath.
TAKE_B: slightly heavier low-end groove while remaining fully non-verbal.
TAKE_C: lighter folk-rhythm feel with more open space and softer percussion.
Prosody / sync notes: no visible speaking, no transcript required, no lip-sync constraints, choreographic accents should align to musical beats and step changes.
Video
GLOBAL LOCK: two adult East Asian male martial artists only, one lean shirtless fighter in black kung-fu pants and black shoes, one fighter in a loose white traditional kung-fu outfit with white shoes, old stone courtyard with gray walls and tiled-roof architecture, daylight overcast exterior, no weapons, no crowd, no spectators, no modern objects, no text overlays, no subtitles, no scene changes, no costume changes, no extra fighters, keep both faces, body builds, outfits, fighting positions, and courtyard layout fully consistent across the whole clip.

Create a 64-second old-school kung-fu duel video inspired by classic Hong Kong martial arts cinema. The entire video stays in one traditional courtyard as two iconic archetypes fight at close range with very fast hand exchanges, evasive footwork, jump kicks, comedic reaction beats, and clean full-body martial arts readability. The tone is fast, playful, and highly physical rather than brutal. Keep the choreography crisp and legible, with both fighters always clearly tracked in frame.

00:00-00:10 — Open on a tight faceoff. The shirtless fighter and the fighter in white square up in the courtyard, hands raised, testing distance with quick probing motions and tiny feints. Camera alternates between medium two-shots and slightly tighter angle coverage.

00:10-00:22 — The exchange accelerates into rapid trapping hands, parries, and short-range straight punches. The white-clad fighter leans in aggressively, and the shirtless fighter snaps back with compact counters. Emphasize speed, timing, and clean eye-line contact between both men.

00:22-00:32 — Add a comic reaction beat where the white-clad fighter briefly scrunches his face and reacts while still staying inside the fight rhythm, then cut back to fast two-person exchanges. Maintain the same courtyard background and daylight tone.

00:32-00:46 — Expand into wider full-body coverage. Both fighters circle, lunge, and trade longer combinations with kicks, lunging strikes, side steps, and jump-in attacks. Keep motion sharp and grounded, with visible footwork on the stone ground.

00:46-00:56 — Push the choreography harder: spinning entries, jumping kicks, rapid arm blocks, and close infighting near the courtyard walls. Use wider action framing so both bodies remain readable during the fastest moments.

00:56-01:04 — Finish with a final sequence of fast hand trapping and snapping strikes. Let the white-clad fighter land a playful reaction-inducing beat, but do not resolve into a knockout. End on both men still engaged in the duel, preserving the fun, competitive energy.

CAMERA: classic Hong Kong action coverage, mostly medium two-shots and full-body wides, occasional tighter inserts for reaction faces, static or lightly panning camera, no modern handheld chaos, no drone shots, no impossible camera moves.

LIGHTING: natural overcast daylight, soft exterior contrast, realistic courtyard shadows, no colored lights, no theatrical stage lighting.

GRADE: vintage martial-arts film realism, muted earth tones, natural skin color, slightly soft 80s/90s cinema texture, clean motion with no heavy digital sharpening.

MOTION: fast kung-fu hand exchanges, block-counter rhythm, stance transitions, shuffle footwork, jump kicks, spins, body turns, slight comic facial reactions, but no wire-fu floating, no superpowers, no exaggerated slow motion.

SPEECH PACK: no dialogue-driven scene, only fight exertion, foot impacts, clothing swishes, quick breath bursts, courtyard ambience, and energetic action sound design; no narration, no off-screen commentary, no crowd cheering.

NEGATIVE PROMPT: extra fighters, weapons, blood, gore, broken bones, modern buildings, audience, subtitles, logos, watermarks, camera shake, anime styling, CGI energy blasts, fantasy powers, slow motion abuse, indoor dojo, costume drift, face swaps, incorrect hand counts, blurred anatomy, plastic skin, unresolved background morphing.

Chinese Trend Montage

Why Chinese trend montages travel so well

Montage formats tied to Chinese social trends often spread because they compress a lot of feeling into a short run time. A few frames of city lights, a close-up reaction, a lyric beat, and a sudden scene switch can create a strong mood without needing a heavy storyline. That makes the format easy to remix across romance edits, nostalgia posts, friendship videos, and stylized character clips.

The strongest versions do not rely on speed alone. They usually balance quick transitions with a very clear emotional thread, whether that is longing, softness, confidence, or a late-night cinematic mood. When the thread is clear, even mixed-source footage can feel coherent.

What creators usually get right

Good Chinese trend montages tend to choose one visual language and stay loyal to it. That might mean neon night scenes, warm indoor romance, school-memory styling, or polished street-fashion imagery. Once that lane is chosen, cuts, overlays, and captions feel purposeful rather than random.

A montage feels current when the mood changes fast but the emotional direction stays stable from the first frame to the last.

For AI-generated versions, this matters even more. The prompt should anchor the faces, setting, and tone early, then let the sequence vary through camera distance, lighting, and detail shots. Too many conflicting references usually weaken the trend feel instead of making it richer.

How to structure one

Most creators start with a hook image that immediately sells the mood. After that, the montage typically alternates between subject shots and environment shots so the viewer gets both emotion and context. Lyrics or text overlays often work best when they highlight the feeling instead of fully narrating it.

If the montage is built around music, cut to vocal entrances or phrase endings rather than every percussion hit. That gives the edit a more romantic and less mechanical flow, which is often what makes these videos replayable.

FAQ

What scenes fit this style best?

Night streets, train stations, apartment interiors, rainy windows, and school or campus spaces all work well because they already carry emotional texture.

Should captions or lyrics dominate the frame?

No. They usually work best as a supporting layer. The image sequence should still communicate the feeling even if the text is removed.

What should an AI prompt emphasize?

Lock the tone, setting, and subject relationship first. Then add lighting and camera details so the montage feels cohesive instead of mixed together.