A) MISE EN PLACE
2) Segment the video into scenes/shots:
- [00:00-00:03] Shot 1: ECU face, talking.
- [00:03-00:05] Shot 2: CU face, holding product.
- [00:06-00:09] Shot 3: MS, head turn, dramatic shadow.
- [00:10-00:12] Shot 4: CU, applying product.
- [00:13-00:15] Shot 5: WS, sitting on floor.
- [00:16-00:18] Shot 6: CU, touching neck.
- [00:19-00:21] Shot 7: MS, sitting on stool, talking.
- [00:22-00:24] Shot 8: MS, holding hair up.
- [00:25-00:27] Shot 9: CU, wind in hair.
3) Extract visual evidence:
- Keyframes: 00:01 (talking face), 00:04 (holding product), 00:07 (shadow face), 00:11 (applying product), 00:14 (full body), 00:17 (touching neck), 00:20 (sitting talking), 00:23 (holding hair), 00:26 (wind in hair).
4) Extract speech evidence:
- Speaker: 1 female voice (Speaker A).
- Transcript:
[00:00-00:03] "What if I told you I'm not even real."
[00:03-00:05] "But the product I'm holding is Hailey Bieber's Rhode lip balm."
[00:06-00:09] "Everything you're seeing was created with AI, no camera, no studio."
[00:10-00:12] "Just one image and a few prompts."
[00:13-00:15] "Every reflection, every highlight, every detail was generated in seconds."
[00:16-00:18] "Real product, unreal possibilities."
[00:19-00:21] "You don't need a full setup anymore."
[00:22-00:24] "Just imagination."
[00:25-00:27] "Comment guide to learn how."
- Lip visibility: Full visibility in shots 1 and 7. Partial/implied in others.
- Sync strictness: High for shots 1 and 7.
5) Invariants list (LOCK THESE):
- Visuals: Asian woman, mid-20s, flawless glowing skin, dark brown hair, fitted white ribbed sleeveless turtleneck tank top, small silver hoop earrings. Cinematic studio lighting, 85mm lens feel, photorealistic texture.
- Speech: Female voice, warm, confident, commercial beauty tone, close-mic studio sound, dry room.
6) Variables list (TWEAK THESE):
- Visuals: Lighting direction (soft beauty vs. hard directional), hair state (tied back vs. loose), background color (black, grey, white), pose, camera framing (ECU to WS).
- Speech: Pacing, emphasis on key words ("real", "AI", "seconds").
B) SHOTLIST
[00:00-00:03]
- framing: ECU, eye level.
- lens: 85mm, shallow DoF.
- camera movement: Static.
- subject: Looking directly at lens, speaking.
- environment: Dark studio background.
- lighting: Soft beauty lighting, high contrast.
- speech: Speaker A, on-camera. "What if I told you I'm not even real." High lip-sync strictness.
[00:03-00:05]
- framing: CU, eye level.
- lens: 85mm, shallow DoF.
- camera movement: Slight drift.
- subject: Holding a pink lip balm tube near her cheek, looking at camera.
- environment: Neutral studio background.
- lighting: Soft diffused lighting.
- speech: Speaker A, VO. "But the product I'm holding is Hailey Bieber's Rhode lip balm."
[00:06-00:09]
- framing: MS, eye level.
- lens: 50mm.
- camera movement: Slow pan following head turn.
- subject: Turns head from profile to face camera.
- environment: Dark studio background.
- lighting: Dramatic hard directional light, sharp diagonal shadow across face.
- speech: Speaker A, VO. "Everything you're seeing was created with AI, no camera, no studio."
[00:10-00:12]
- framing: CU, tight on mouth.
- lens: 100mm macro feel.
- camera movement: Static.
- subject: Applying pink lip balm to lips, eyes looking slightly down.
- environment: Neutral background.
- lighting: Bright, even beauty lighting.
- speech: Speaker A, VO. "Just one image and a few prompts."
[00:13-00:15]
- framing: WS, full body.
- lens: 35mm.
- camera movement: Static.
- subject: Sitting on floor, one leg bent, wearing black trousers with the white tank top.
- environment: Grey studio floor and wall.
- lighting: Soft overhead lighting.
- speech: Speaker A, VO. "Every reflection, every highlight, every detail was generated in seconds."
[00:16-00:18]
- framing: CU.
- lens: 85mm.
- camera movement: Slight push-in.
- subject: Touching neck and jawline with both hands.
- environment: Dark background.
- lighting: Warm rim light, deep shadows.
- speech: Speaker A, VO. "Real product, unreal possibilities."
[00:19-00:21]
- framing: MS.
- lens: 50mm.
- camera movement: Static.
- subject: Sitting on a metal stool, leaning forward, speaking to camera.
- environment: Neutral studio background.
- lighting: Neutral studio lighting, slight vignette.
- speech: Speaker A, on-camera. "You don't need a full setup anymore." High lip-sync strictness.
[00:22-00:24]
- framing: MS, slight low angle.
- lens: 50mm.
- camera movement: Static.
- subject: Arms raised, holding hair up in a high ponytail.
- environment: White studio background.
- lighting: Bright, high-key lighting.
- speech: Speaker A, VO. "Just imagination."
[00:25-00:27]
- framing: CU.
- lens: 85mm.
- camera movement: Static.
- subject: Looking intensely at camera, hair blowing.
- environment: Dark background.
- lighting: Soft dramatic lighting.
- motion cues: Wind blowing hair.
- speech: Speaker A, VO. "Comment guide to learn how."
C) STYLE BIBLE
- visual_style: Photorealistic cinematic commercial beauty portrait.
- camera_signature: 85mm portrait lens dominance, shallow depth of field, mostly static or slow, deliberate movements.
- lighting_signature: Highly variable but always professional studio quality, ranging from soft high-key beauty to dramatic low-key hard shadows.
- grade_signature: High contrast, natural skin tones, deep blacks, clean whites.
- texture_signature: Flawless skin detail, sharp focus on eyes and product.
- pacing_signature: Fast-paced cuts every 2-3 seconds.
- speech_style: Commercial beauty VO, confident, direct-to-camera hybrid.
- speaker_profile: Female, warm, articulate, modern vocal fry.
- mic_mix_profile: Close-mic, dry studio, high clarity, compressed for social media.
D) PROMPT SYNTHESIS
1. MASTER PROMPT
GLOBAL LOCK: Photorealistic cinematic commercial style. Subject: Asian woman, mid-20s, flawless glowing skin, dark brown hair, wearing a fitted white ribbed sleeveless turtleneck tank top, small silver hoop earrings. Environment: Minimalist studio setting with solid neutral backgrounds (white/grey/black). Lighting: High-end beauty lighting, varying from soft diffused to dramatic hard shadows. Camera: 85mm lens, shallow depth of field. Speech: Single female speaker, warm commercial tone, close-mic studio sound.
[00:00-00:03] ECU of the woman's face against a dark background. Soft beauty lighting. She is looking directly at the lens, speaking. Lips are moving in sync with speech.
[00:03-00:05] CU. The woman holds a pink lip balm tube next to her cheek. Soft diffused lighting. She looks at the camera. Slight camera drift.
[00:06-00:09] MS. The woman is turned slightly away in profile, then turns her head towards the camera. Dramatic lighting with a harsh diagonal shadow cutting across her face. Slow pan following the head turn.
[00:10-00:12] CU tight on the mouth. The woman is applying the pink lip balm to her lips. Eyes looking slightly down. Bright, even beauty lighting highlighting skin texture.
[00:13-00:15] WS. The woman is sitting on the floor, wearing black trousers with the white tank top. One leg bent. Grey studio background. Soft overhead lighting. Static camera.
[00:16-00:18] CU. The woman touches her neck and jawline with both hands. Warm, glowing rim light, deep shadows on the opposite side. Slight camera push-in.
[00:19-00:21] MS. The woman is sitting on a metal stool, leaning forward slightly, speaking directly to the camera. Lips moving in sync. Neutral studio lighting, slight vignette. Static camera.
[00:22-00:24] MS, slight low angle. The woman has her arms raised, holding her hair up in a high ponytail. Bright, high-key lighting, white background. Static camera.
[00:25-00:27] CU. The woman's hair is blowing in the wind. She looks intensely at the camera. Soft dramatic lighting, dark background. Static camera.
2. NEGATIVE PROMPT
Visuals: cartoon, illustration, anime, 3d render, deformed anatomy, extra fingers, mutated hands, unnatural skin texture, plastic skin, temporal jitter, flickering lighting, morphing objects, text, watermarks, logos, low resolution, blurry, out of focus.
Audio: robotic voice, unnatural cadence, harsh sibilance, plosives, clipping, background noise, room echo, lip-sync mismatch, slurred words.
4. SPEECH PACK
Speaker: Female, 20s, warm, confident, commercial beauty tone.
[00:00-00:03] "What if I told you... I'm not even real." (Pause for dramatic effect, direct eye contact).
[00:03-00:05] "But the product I'm holding... is Hailey Bieber's Rhode lip balm." (Slight emphasis on 'Rhode').
[00:06-00:09] "Everything you're seeing was created with AI... no camera... no studio." (Paced, emphasizing the negatives).
[00:10-00:12] "Just one image... and a few prompts." (Smooth, instructional tone).
[00:13-00:15] "Every reflection... every highlight... every detail... was generated in seconds." (Staccato emphasis on 'every').
[00:16-00:18] "Real product... unreal possibilities." (Contrast emphasis).
[00:19-00:21] "You don't need a full setup anymore." (Direct, conversational).
[00:22-00:24] "Just imagination." (Soft, aspirational).
[00:25-00:27] "Comment guide... to learn how." (Clear CTA, energetic).