AI face generator pages are most useful when they stay focused on synthetic faces that do not belong to any real person. Designers, developers, and creators usually want believable faces for mockups, stock-style imagery, anonymous personas, or character planning, without the legal or identity problems that come with using a real likeness. This page helps you compare face ideas that are usable, diverse, and safe for practical visual work.

s1mple.ai: Surreal Liquid Head Police Officer AI Art
[Subject] A surreal uniformed officer standing in a sunlit institutional hallway, with a dark navy police-style shirt, metallic chest badges, and a calm body posture, while the head transforms into a flowing pale liquid ribbon that stretches sideways and resolves into two connected face forms. [Environment] A blue-toned corridor with tiled lower walls, long ceiling light fixtures, large windows on the right, warm daylight entering from outside, and a metal railing cutting across the foreground, evoking a school, hospital, or civic-building hallway. [Composition/Camera] Vertical mid-body portrait, camera positioned slightly below face level, subject centered but with the liquid head distortion extending dramatically to the right, foreground railing adding depth, corridor perspective lines guiding the eye toward the surreal deformation. [Lighting] Soft daylight mixed with cool interior ambient light, gentle reflections on the floor and badges, even illumination across the uniform, pale luminous highlights on the liquid face ribbon, warm window light balancing the cool hallway palette. [Style/Rendering] Surreal concept illustration with painterly anime-influenced realism, dreamlike institutional atmosphere, clean line structure, soft watercolor-like shading, uncanny visual metaphor, polished poster presentation. [Detail constraints] Preserve the officer uniform as grounded and readable, keep the liquid head transformation smooth and ribbon-like rather than gory, maintain the corridor perspective and windows as contextual anchors, ensure the surreal effect feels uncanny but elegant, and avoid horror clutter or excessive visual noise.

Negative prompt: gore, blood, horror splatter, zombie effect, messy distortion, extra limbs, broken anatomy, low detail hallway, cluttered background, harsh shadows, warped badges, muddy colors, low-resolution painterly blur, duplicated faces without flow connection

Suggested parameters: aspect ratio 2:3, stylize medium, high detail, surreal realism, painterly editorial mood, clean uncanny atmosphere

Delta prompt strategy:
1. If the surreal effect feels weak, increase the pale liquid ribbon stretching the head into two connected face forms.
2. If the image turns horror-heavy, remove gore and keep the transformation smooth, clean, and dreamlike.
3. If the hallway loses clarity, reinforce blue walls, windows, ceiling fixtures, and linear perspective.
4. If the uniform becomes generic, sharpen the police-style shirt, chest badges, and duty-belt details.
5. If the composition feels static, preserve the forward torso while letting the head distortion sweep laterally across the frame.
6. If colors become muddy, separate cool blue interior tones from warm daylight near the windows.
7. If the surreal ribbon lacks elegance, smooth the edges and create graceful flowing curvature between the faces.
8. If the image reads too realistic, add subtle painterly softness while preserving strong structural drawing.
9. If the foreground feels empty, keep the metal railing to anchor depth and realism.
10. If the mood becomes too literal, emphasize uncanny metaphor and poetic visual distortion over narrative explanation.
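
The bracketed sections and the delta fixes above can be kept as data and assembled on demand. The following is a minimal sketch, not a feature of any particular generator: the names `build_prompt` and `apply_delta`, the dictionary layout, and the shortened section texts are illustrative assumptions.

```python
# Hypothetical helpers: assemble a sectioned prompt like the one above.
# Section labels mirror the page; the texts are shortened for illustration.
SECTION_ORDER = [
    "Subject",
    "Environment",
    "Composition/Camera",
    "Lighting",
    "Style/Rendering",
    "Detail constraints",
]


def build_prompt(sections: dict[str, str]) -> str:
    """Join labeled sections in a fixed order, skipping any that are missing."""
    parts = []
    for label in SECTION_ORDER:
        text = sections.get(label, "").strip()
        if text:
            parts.append(f"[{label}] {text}")
    return " ".join(parts)


def apply_delta(sections: dict[str, str], label: str, extra: str) -> dict[str, str]:
    """Return a copy with one section extended; the input is left unchanged."""
    updated = dict(sections)
    base_text = updated.get(label, "").rstrip()
    updated[label] = f"{base_text} {extra}".strip()
    return updated


base = {
    "Subject": "A surreal uniformed officer whose head becomes a pale liquid ribbon.",
    "Environment": "A blue-toned institutional corridor with large windows.",
    "Lighting": "Soft daylight mixed with cool interior ambient light.",
}

# Delta 3 from the list: reinforce the hallway when it loses clarity.
fixed = apply_delta(
    base,
    "Environment",
    "Reinforce blue walls, windows, ceiling fixtures, and linear perspective.",
)

print(build_prompt(fixed))
```

Keeping each delta as a small addition to one section makes it easier to see which change caused a difference between two generations.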
Video
GLOBAL LOCK: Retro sci-fi action-drama illustrated as painterly cinematic concept art with consistent late-20th-century dystopian thriller energy. Keep the main human lead as a white-presenting adult male in his 30s to early 40s with fair skin, strong jawline, short brown hair styled upward, athletic build, and a stern, protective posture. Keep the teenage boy slim, youthful, and slightly awkward, the red-haired mother tough and alert, the chrome humanoid machine perfectly metallic and expressionless, and the police officer as a surreal shape-shifting impostor whose face splits into a white liquid duplicate floating beside his head. Preserve the American Southwest setting with bars, alleys, concrete flood channels, desert highways, industrial firelight, institutional blue hallways, and motel-town streets. Maintain warm sunlit exterior tones, cool blue interior fluorescents, bright orange fire glow, silver chrome reflections, moderate painted texture, clean action readability, dramatic close-ups, and a mix of 35mm, 50mm, and 85mm cinematic framing. Speech style is sparse and trailer-like, with one or two short lines per beat, tense cadence, dry close-mic sound, and visible lip sync only where characters are front-facing in close-up.

[00:00-00:04] Night battlefield under a moonlit sky, a chrome endoskeleton warrior stands amid explosions and smoke, full-body wide shot with a low camera angle, burning debris behind it, hard orange backlight from fire, cold blue moon fill, drifting smoke and embers, no speech, only apocalyptic tension.

[00:04-00:08] Interior roadside bar or garage with warm practical lights and Pepsi signage, shirtless muscular man faces a woman at close conversational distance, medium close-up with shallow depth of field, tense eye contact, muted amber color grade, slight camera push-in, no speech or a low murmur that feels interrupted.

[00:08-00:12] Surprised close-up of the same male lead, 85mm portrait lens, eyes wide, a white liquid shape creeps into frame from the right edge, skin rendered with smooth painterly highlights, lips barely part as if about to say something, high lip-sync strictness if any whispered word is included.

[00:12-00:16] Wooden doorway confrontation, a heavy bearded man points a shotgun outward from inside a dim rustic room, reverse-shot structure from the visitor’s perspective, warm tungsten light inside, cooler dusk outside, angry expression, fast emotional escalation, cut sharply on the aiming gesture.

[00:16-00:20] Police officer stands behind a metal railing in a blue institutional corridor, daylight windows on the right, his face splits into a white liquid double that floats off to one side, medium shot and then close-up, uncanny identity distortion locked to the character description, dry fluorescent lighting, no comedy, pure body-horror unease.

[00:20-00:24] The police impostor faces a silver chrome humanoid in a workshop or station-like interior, alternating close two-shots and profile shots, the officer studies the machine while the machine remains unreadable, polished reflections on metal surfaces, low conversational tension, spoken line if used should be controlled, cold, and clipped.

[00:24-00:28] Exterior small-town alley with an ATM and pale morning sunlight, a teenage boy stands with another teen beside a red dirt bike, medium-wide framing, concrete walls and utility lines create depth, naturalistic golden daylight, hesitant body language, short casual dialogue possible with loose teenage cadence.

[00:28-00:32] The boy meets the red-haired mother, then they launch on the bike through a yard and into open road, camera alternates between side tracking and rear chase framing, dust and wind motion emphasized, warm sun, hopeful but urgent energy, no visible speech once the ride begins.

[00:32-00:36] Domestic kitchen interior with the red-haired mother alone, plaid sleeveless shirt, checking space around her with alert suspicion, then transition to a long concrete flood channel where a motorcycle races toward camera. Use medium shots indoors, wide vanishing-point exterior frames outside, bright noon light, pacing accelerates.

[00:36-00:40] A chrome humanoid emerges from a wall of fire in a blazing industrial doorway, centered heroic composition, flames licking around perfect metal anatomy, then cut to the main group lined up together like a defensive unit. Keep the contrast high, metal glossy, and fire glow intense, no speech, only mythic escalation.

[00:40-00:44] Return to the blue hallway where the police impostor’s face peels into a vertical white liquid split, then cut to a car interior crossing desert country with the teenage boy in the back seat and the stern protector driving. Tight close-up on the melting face, then side-profile car shots with golden late-afternoon light, minimal dialogue with deliberate pauses.

[00:44-00:48] Reveal a robotic hand behind glass, a Black male observer studies the mechanical fingers, then another man exposes his own cybernetic arm in a bright interior. Macro mechanical details, articulated joints, cables, metal knuckles, cool clinical light, slow deliberate hand motion, no speech or a single stunned reaction word.

[00:48-00:52] Mechanical hand flexes in close-up, then the police figure charges forward in front of fire, furious and no longer convincingly human. Close, aggressive framing, rapid motion, hot industrial glow, smoke, clenched teeth, voice if present should be forceful and urgent with hard consonants.

[00:52-00:56] Leather-jacketed hero loads a shotgun in a workshop, chest-up framing and insert shots of the weapon, then he appears in full figure, bandolier across his body, locked in battle stance. Use crisp action inserts, fiery orange backlight, metallic set dressing, and a hard determined facial expression.

[00:56-00:58] Final desert tag: a black-clad woman in sunglasses holds a rifle near a rugged vehicle and Joshua trees, medium-wide hero shot in harsh dry sunlight, wind moving clothes slightly, no speech, end on a survivalist future-war note.

NEGATIVE PROMPT: low-detail faces, inconsistent identities, duplicated limbs, broken fingers, warped firearms, unreadable props, incorrect police uniform details, cartoon slapstick tone, muddy chrome reflections, flicker between shots, temporal jitter, random text or logos, floating objects without narrative purpose, soft mushy anatomy, deformed motorcycles, broken perspective, accidental modern smartphones, robotic lip movement, off-timing mouth shapes, slurred dialogue, metallic synthetic voice, harsh sibilance, clipped peaks, pumping compression, over-denoised speech, and mismatched room tone between cuts.

SPEECH PACK:
[00:08-00:12] Speaker A, closest audible: "What the hell is that?" Safe paraphrase: "He sees something impossible entering frame." TAKE_A: shocked, breath catches before "hell". TAKE_B: lower, more controlled disbelief. TAKE_C: whispered panic. Lips visible: yes, high sync.
[00:20-00:24] Speaker B, closest audible: "You are not him." Safe paraphrase: "The officer realizes the machine is not human." TAKE_A: flat and clinical. TAKE_B: suspicious and tense. TAKE_C: almost whispered. Lips visible: partial, medium sync.
[00:24-00:28] Speaker C, closest audible: "Come on, let's go." Safe paraphrase: "The teens move toward the bike." TAKE_A: rushed. TAKE_B: nervous. TAKE_C: urgent whisper. Lips visible: partial, medium sync.
[00:32-00:36] Speaker D, closest audible: "Get inside." Safe paraphrase: "A protective instruction before the chase escalates." TAKE_A: firm. TAKE_B: louder warning. TAKE_C: clipped command. Lips visible: partial, low to medium sync.
[00:40-00:44] Speaker A, closest audible: "He's still behind us." Safe paraphrase: "They realize the threat remains active during the drive." TAKE_A: tense low voice. TAKE_B: controlled urgency. TAKE_C: breathy fear. Lips visible: partial, medium sync.
[00:48-00:52] Speaker B, closest audible: "Run!" Safe paraphrase: "Immediate danger forces escape." TAKE_A: shouted. TAKE_B: raw panic. TAKE_C: hoarse command. Lips visible: yes, high sync.
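
A timed shot list like the one above is easier to maintain if the beat ranges are checked for gaps and every speech range is matched to a beat. This is a rough sketch under stated assumptions: the function names (`parse_range`, `find_gaps`, `unmatched_speech`) are hypothetical, and the sample data only mirrors the time tags used above.

```python
import re

# Matches tags such as [00:08-00:12]; accepts a hyphen or an en dash.
RANGE_RE = re.compile(r"\[(\d{2}):(\d{2})[-\u2013](\d{2}):(\d{2})\]")


def parse_range(tag: str) -> tuple[int, int]:
    """Convert a '[MM:SS-MM:SS]' tag to (start, end) in seconds."""
    match = RANGE_RE.fullmatch(tag.strip())
    if match is None:
        raise ValueError(f"not a time range: {tag!r}")
    m1, s1, m2, s2 = (int(g) for g in match.groups())
    return m1 * 60 + s1, m2 * 60 + s2


def find_gaps(beats: list[tuple[int, int]]) -> list[tuple[int, int]]:
    """Return (end, next_start) pairs where consecutive beats do not touch."""
    ordered = sorted(beats)
    return [
        (prev_end, next_start)
        for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:])
        if prev_end != next_start
    ]


def unmatched_speech(
    beats: list[tuple[int, int]], speech: list[tuple[int, int]]
) -> list[tuple[int, int]]:
    """Return speech ranges that do not coincide with any beat."""
    known = set(beats)
    return [r for r in speech if r not in known]


# Beats: 00:00 to 00:56 in four-second steps, then the 00:56-00:58 tag.
beats = [(start, start + 4) for start in range(0, 56, 4)]
beats.append(parse_range("[00:56-00:58]"))
speech = [parse_range(t) for t in ("[00:08-00:12]", "[00:20-00:24]", "[00:48-00:52]")]

print(find_gaps(beats))                 # []
print(unmatched_speech(beats, speech))  # []
```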
Video
GLOBAL LOCK: vertical 3:4 Adobe Firefly Boards style promo card, static held frame, red brand treatment over a gloomy downtown city block. Main image shows a tall monolithic concrete tower tinted deep Firefly red, torn open by two vertical cracks, with a masked cyberpunk antihero figure emerging from the fissure. Character design: short white hair, white or silver face mask with dark eye slits, dark tactical armor or jacket, menacing upright posture. Preserve Firefly square 'Fi' logo at top left, bold white headline stacked center reading 'From Idea to Branded Mockup' with a red capsule beneath reading 'in minutes', smaller white subhead explaining how AI-first Firefly Boards help visualize concepts without leaving the flow, lower-left hashtags for Adobe Firefly ambassadors and Firefly Boards, and a small swipe cue at lower right. Rainy traffic, buses, taxis, and pedestrians anchor scale at street level.
[00:00-00:11] Hold on the same branded hero frame throughout with only subtle export shimmer. The red building, cracked facade, cyberpunk figure, overcast clouds, and downtown traffic remain static while the large white headline and red capsule emphasize the message that Firefly Boards turns an idea into a branded mockup in minutes.
Video
A vertical creator tutorial video about achieving AI character consistency across generations and workflows. A female presenter speaks directly to the camera against a clean lavender-purple background while holding a handheld microphone and explaining a multi-step process labeled with numbered sections like #1, #2, #3, and #4. As she talks, large overlays appear showing reference portraits, facial expressions, hat variations, prompt text, interface screenshots, parameter panels, model settings, and examples from different AI tools. The video walks through how to build a consistent character, refine realism, preserve facial identity, manage textures, and combine different generation tools into one repeatable system. The mood is educational, structured, creator-friendly, and optimized for short-form AI workflow teaching.
Video
GLOBAL LOCK:
Subject: A Caucasian woman in her late 20s, blonde hair tied in a neat ponytail, wearing a leopard-print (cheetah pattern) blouse.
Environment: A cozy home studio/office background with dark grey walls, wooden bookshelves filled with books, green indoor plants, and soft dual-tone lighting (warm orange light from one side, cool blue light from the other).
Camera: MCU (Medium Close-Up) framing, eye-level, 35mm lens feel with shallow depth of field.
Style: Professional UGC creator aesthetic, high-quality video, crisp audio.
Speech: Direct-to-camera delivery, energetic and authoritative tone.

[00:00–00:05]
Visual: Rapid montage of extreme macro close-ups (ECU). First, a human eye with visible iris patterns and eyelashes. Second, an ear with a gold hoop earring showing skin texture. Third, a wrist with a simple black line tattoo showing skin pores and fine hairs.
Action: Static macro shots.
Lighting: Bright, natural daylight feel for the macros.
Text Overlay: "most AI" -> "look fake" -> "because" -> "is trained".
Speech: "Most AI images look fake for one reason. Because AI is trained to remove flaws."

[00:05–00:11]
Visual: The woman (Subject) in the MCU studio setting, gesturing with her hands. Floating icons of AI tools (ChatGPT, Freepik, Ideogram, Nano Banana) appear around her.
Action: Subject talks directly to the camera, moving hands to emphasize points.
Lighting: Studio setup (Orange/Blue).
Text Overlay: "need" -> "AI tools" -> "to prompt".
Speech: "But we don't need better AI tools. We just need to prompt the model to create images that actually look real."

[00:11–00:21]
Visual: Transition to a black screen with white text titled "Master Prompt". The text scrolls or highlights specific sections. Then, a split screen showing the woman talking in a small window and the prompt text in a larger window.
Action: Subject continues talking while the prompt text is displayed.
Lighting: Studio setup for the talking head.
Text Overlay: "to create" -> "that actually" -> "look real".
Speech: "The key to realistic AI images is using a prompt with a specific structure. This prompt should force skin detail, including visible pores, uneven tone, and natural imperfections."

[00:21–00:30]
Visual: Montage of AI-generated faces with high realism. A man's face with stubble and pores, a woman's face with freckles and slight redness. Then, a screen recording of the Freepik interface showing a gallery of realistic portraits.
Action: Fast cuts between the portraits and the UI.
Lighting: Varied, matching the generated images.
Text Overlay: "most people start" -> "make" -> "image".
Speech: "Most people start their prompt with 'make a realistic image of'. I start by telling the model how the camera behaves."

[00:30–00:42]
Visual: Screen recording of a prompt being typed into a text box. Keywords like "iPhone 14 Pro", "handheld framing", and "imperfect composition" are highlighted in yellow.
Action: Scrolling through the prompt text.
Lighting: Digital UI.
Text Overlay: "model that" -> "camera behaves" -> "casual hand" -> "imperfect composition".
Speech: "Casual handheld framing, slightly imperfect composition, and a smartphone camera perspective. This alone already breaks the AI look."

[00:42–00:52]
Visual: The woman back in the MCU studio setting. She gestures toward floating app icons for "Enhancor" and "Higgsfield". A screen recording shows a "Skin Enhancer" tool being used on a photo of a woman with goggles.
Action: Subject explains the final step.
Lighting: Studio setup.
Text Overlay: "But Most People Stop There" -> "Final Step" -> "Most Creators Are Gatekeeping".
Speech: "But most people stop there. I use a final step that most creators are gatekeeping. I run each image through a final skin enhancement step using Enhancor or Higgsfield."

[00:52–01:00]
Visual: The woman in MCU, pointing down toward a text box that says "Comment GUIDE". A final zoom-out effect or a slight blur transition.
Action: Subject smiles and points.
Lighting: Studio setup.
Text Overlay: "Prompt Structure" -> "Workflow" -> "Comment GUIDE".
Speech: "If you want my exact prompt structure and the full workflow, just comment GUIDE and I'll send it over."

NEGATIVE PROMPT:
Smooth skin, plastic texture, perfect symmetry, airbrushed look, 6 fingers, distorted eyes, watermark, logo, blurry background (unless specified), robotic voice, lip-sync lag, harsh sibilance, flickering lights, low resolution.

SPEECH PACK:
[00:00-00:05] "Most AI images look fake for one reason. Because AI is trained to remove flaws."
[00:05-00:11] "But we don't need better AI tools. We just need to prompt the model to create images that actually look real."
[00:11-00:21] "The key to realistic AI images is using a prompt with a specific structure. This prompt should force skin detail, including visible pores, uneven tone, and natural imperfections."
[00:21-00:30] "Most people start their prompt with 'make a realistic image of'. I start by telling the model how the camera behaves."
[00:30-00:42] "Casual handheld framing, slightly imperfect composition, and a smartphone camera perspective. This alone already breaks the AI look."
[00:42-00:52] "But most people stop there. I use a final step that most creators are gatekeeping. I run each image through a final skin enhancement step using Enhancor or Higgsfield."
[00:52-01:00] "If you want my exact prompt structure and the full workflow, just comment GUIDE and I'll send it over."
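
Because the SPEECH PACK repeats the per-beat Speech lines, the two can drift apart during editing. The sketch below shows one way to flag drift. The helper names and the dictionary layout are assumptions, and the second pack line is shortened on purpose to trigger a mismatch.

```python
def normalize(text: str) -> str:
    """Collapse whitespace and case so only wording differences remain."""
    return " ".join(text.split()).casefold()


def speech_mismatches(beats: dict[str, str], pack: dict[str, str]) -> list[str]:
    """Return time tags whose beat line and pack line differ or are missing."""
    tags = sorted(set(beats) | set(pack))
    return [
        tag
        for tag in tags
        if normalize(beats.get(tag, "")) != normalize(pack.get(tag, ""))
    ]


beat_lines = {
    "00:00-00:05": "Most AI images look fake for one reason. "
                   "Because AI is trained to remove flaws.",
    "00:52-01:00": "If you want my exact prompt structure and the full workflow, "
                   "just comment GUIDE and I'll send it over.",
}

# Illustrative pack: the second line is deliberately shortened.
pack_lines = {
    "00:00-00:05": "Most AI images look fake for one reason. "
                   "Because AI is trained to remove flaws.",
    "00:52-01:00": "If you want my exact prompt structure, just comment GUIDE.",
}

print(speech_mismatches(beat_lines, pack_lines))  # ['00:52-01:00']
```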
Video
GLOBAL LOCK: 
Subject identity must be consistent within each character segment. 
Character 1: Young Caucasian male, early 20s, messy brown hair, light blue knitted beanie, white cotton t-shirt. 
Character 2: Caucasian male, early 30s, short brown hair, light stubble/beard, green baseball cap with "A's" logo, green knit sweater over a white collared shirt. 
Character 3: Mixed-race female, short blonde buzzcut, freckles across nose and cheeks, gold hoop earrings, grey wool sleeveless turtleneck. 
Environment: Minimalist studio, neutral off-white or light grey background. 
Lighting: High-end editorial studio lighting, soft shadows, natural skin highlights. 
Color Grade: Clean, neutral, high-contrast editorial look, slight film grain. 
Camera: High-resolution digital cinema camera, shallow depth of field, sharp focus on skin textures. 
Speech: No speech, rhythmic percussive background music.

[00:00–00:01]
Character 1 (Beanie) in a Medium Close-up (MCU). He looks directly into the lens with a neutral, slightly bored expression. The lighting is soft and even. Static shot. Text overlay "I can spot AI from a mile away" appears centered in white.

[00:01–00:02]
Character 1 (Beanie) in an Extreme Close-up (ECU) focusing on the nose and mouth. Visible skin pores, natural lip texture, and slight imperfections. Subtle micro-movement of the lips. Text overlay remains.

[00:02–00:03]
Character 1 (Beanie) in an ECU focusing on the cheek and jawline. Clear view of small moles and fine peach fuzz hair. Soft side lighting emphasizes the skin's 3D texture. Text overlay remains.

[00:03–00:04]
Character 2 (Green Cap) in a Medium Close-up (MCU). He has a slight, confident smirk, looking at the camera. He is wearing a silver watch and a ring. Static shot. Text overlay remains.

[00:04–00:05]
Character 2 (Green Cap) in an ECU focusing on the left eye and temple. The iris has intricate, realistic patterns. Individual eyelashes and eyebrow hairs are sharp. Skin texture around the eye shows natural fine lines. Text overlay remains.

[00:05–00:06]
Character 2 (Green Cap) in an ECU focusing on the mouth and beard. Individual beard hairs are distinct with varying colors (brown/blonde). Natural lip lines and skin moisture. Text overlay remains.

[00:06–00:07]
Character 3 (Grey Turtleneck) in a Medium Close-up (MCU). She has her hands placed gently on her chest, showcasing gold rings. She looks directly at the camera with a serene expression. Soft light from the side creates a gentle glow on her skin. Text overlay remains.

[00:07–00:09]
Character 3 (Grey Turtleneck) in an ECU focusing on the eye and freckled cheek. High density of natural-looking freckles. Sharp focus on the eye's reflection. The skin looks hydrated and real. Text overlay remains until the end.

NEGATIVE PROMPT: 
Smooth plastic skin, "AI glow," distorted features, blurry textures, over-saturation, cartoonish look, extra fingers, floating jewelry, inconsistent lighting, flickering, low resolution, watermark, text (other than the specified overlay), robotic movement, perfect symmetry.

SPEECH PACK:
(No speech present in this video. The audio is a rhythmic, percussive beat.)
- Audio Style: Minimalist, bass-heavy, percussive "stomp" track.
- Sync: Visual cuts occur exactly on the primary downbeats.
- Room Tone: Clean, silent studio environment.
Video
by.shlabu
GLOBAL LOCK: vertical 9:16 creator-workflow reel about AI face swap and identity recasting, fast subtitle-led pace, selfie examples, side-by-side source-vs-swapped faces, dark tool UI screens, and cinematic dialogue-shot outputs featuring different actors in the same scene setup. The central promise is that a base clip plus a new face can create realistic recast content with audio, perspective, and motion preserved. Tone is practical, slightly provocative, and positioned around creative freedom without reshoots.

[00:00-00:05] Open on a bright selfie clip of a blonde woman, then quickly swap to a brunette version in the same framing and lighting. Large subtitle text says AI face swap has become so simple it almost feels unfair. The side-by-side or rapid alternation must make the identity replacement instantly obvious while preserving the same base scene.

[00:05-00:10] Continue with more side-by-side selfie comparisons and bold text cards naming a tool like WAN 2.2 Animate with audio. The reel should communicate that this is not just a static face edit, but a workflow that keeps motion and voice alignment intact. The energy is “drop it in and let it work.”

[00:10-00:16] Move into dark mobile-style UI screens where the user uploads a base clip and then a face image. Show buttons and panels suggesting automatic processing, then emphasize the word MAGIC or a similar claim that the system handles the swap internally. The UI should look accessible, like a plug-and-play creator tool.

[00:16-00:21] Transition to cinematic dialogue-scene examples featuring different men seated in the same warm-lit bar or restaurant setup. The point is that the perspective, acting, and shot grammar remain, while the identity changes. Subtitle text highlights that the perspective is still right even after the recast.

[00:21-00:26] Continue with a close-up man lying in bed at night, lit by phone glow, while the reel explains there are no reshoots and no budget stress, only more creative freedom. The same core message should be clear: once the base clip exists, the face swap unlocks many versions without rebuilding the scene.

[00:26-00:26] End on bold red CTA cards telling viewers to comment “SWAP” for a quick walkthrough. The final beat should feel like a direct lead magnet for a face-swap workflow, not a general AI rant.

NEGATIVE PROMPT: broken facial alignment, identity drift, uncanny eyes, warped mouth sync, mismatched skin tone, bad perspective after swap, flicker, low-detail UI, unreadable app screens, distorted hands, audio-lip mismatch, cheap deepfake artifacts, watermark, temporal jitter.

SPEECH PACK:
- Hook: AI face swap is so simple now it almost feels unfair.
- Beat 1: Drop in your base clip, add the face, and let the tool handle the rest.
- Beat 2: You keep the scene, the perspective, and the performance without needing reshoots.
- Beat 3: That means no budget stress, just more creative freedom.
- CTA: Comment SWAP and I’ll send you a quick walkthrough.
Video
GLOBAL LOCK:
Subject is a Caucasian male in his early 30s, dark wavy hair, well-groomed medium-length beard, expressive brown eyes. He maintains a consistent facial structure across all shots. The visual style is a mix of high-end editorial photography and UGC tutorial footage. Lighting is cinematic with soft key lights and motivated rim lighting. Color grade is professional with deep blacks and vibrant but natural skin tones. Speech is clear, energetic, and instructional, delivered with a warm, authoritative tone.

[00:00–00:01]
Subject: MCU of the man wearing a dark suit, white dress shirt, black tie, and a white baseball cap with a green brim.
Action: Talking directly to the camera. A vertical white rectangular mask moves across his face, revealing a slightly different version of the same scene.
Camera: Static MCU, eye-level.
Lighting: Soft studio lighting, neutral background.
Speech: "This is how you can create..."

[00:01–00:04]
Subject: Rapid montage of AI-generated images. 
1. Man in a dark suit and sunglasses driving a green car at night, "AI MAG" text overlay.
2. Man in a checkered blazer and paisley tie in front of a brick wall.
3. Man in a white short-sleeve shirt with multiple pens in his pocket, standing in a white studio.
Action: Static editorial poses.
Camera: Various (MS, MCU).
Lighting: Cinematic, high contrast, nighttime car lighting, studio softbox.
Grade: Magazine editorial style.

[00:05–00:08]
Subject: A 3x4 grid of 12 different AI portraits of the same man in various outfits (boxing gloves, red car, street style, suit).
Action: Static images.
Overlay: Large bold text "UNLIMITED GENERATIONS" in orange and blue.
Camera: Flat grid layout.
Lighting: Varied per image.

[00:09–00:14]
Environment: Screen recording of the Higgsfield.ai website interface. A cursor moves to click "Image" then "Soul ID Character".
Action: UI navigation.
Speech: "On Higgsfield.ai, go to image and select Soul ID Character..."

[00:15–00:20]
Subject: Picture-in-picture of the man talking (wearing a tan cap and beige shirt) over a screen recording of the "Make Your Own Character" page.
Action: Explaining the process while gesturing.
Speech: "...where you can actually create your own custom character of yourself by uploading a bunch of photos."

[00:21–00:24]
Subject: Montage of AI images with text prompts.
1. Man in a suit drinking from a glass (trippy lens effect).
2. Man in a tan suit with a "Micky Mouse Bag" in a city street.
3. Man in a white tank top and jeans in front of a "Tokyo Red Car".
Action: Posing.
Camera: Full body and MS.
Lighting: Bright daylight, stylized urban lighting.

[00:25–00:34]
Environment: Screen recording of the "Lipsync Studio" interface. Subject's PIP continues.
Action: Selecting "Video", then "Lipsync Studio", uploading an image of himself at the beach, and dragging an audio file named "voiceover.wav".
Speech: "Now you can go to video at the top of the page and select the Lipsync Studio where you can upload your photo and audio..."

[00:35–00:38]
Subject: CU of the man at a tropical beach. He is shirtless, wearing black swimming goggles on his head.
Action: He is lip-syncing perfectly to the audio, smiling slightly.
Environment: Bright blue ocean water with small waves in the background.
Camera: CU, static.
Lighting: Bright, direct sunlight with natural shadows.
Speech: "...and it will combine those two together with the best lip-sync models."

NEGATIVE PROMPT:
Visual: robotic movement, distorted facial features, inconsistent beard growth, blurry textures, flickering background, extra fingers, warped UI elements, low resolution, watermarks.
Speech: robotic monotone, lip-sync delay, muffled audio, background hiss, unnatural pauses, slurred consonants, popping sounds.

SPEECH PACK:
[00:00-00:08]
Transcript: "This is how you can create 25 magazine-ready images of yourself using AI and then you can even lip-sync on top of them with this brand new feature."
TAKE_A: (Energetic, fast-paced) "This is how you can create TWENTY-FIVE magazine-ready images of yourself using AI... and then you can even LIP-SYNC on top of them with this brand new feature!"

[00:09-00:20]
Transcript: "On Higgsfield.ai, go to image and select Soul ID Character where you can actually create your own custom character of yourself by uploading a bunch of photos."
TAKE_A: (Instructional, clear) "On Higgsfield dot A-I, go to image and select Soul I-D Character... where you can actually create your own custom character of yourself... by uploading a bunch of photos."

[00:25-00:38]
Transcript: "Now you can go to video at the top of the page and select the Lipsync Studio where you can upload your photo and audio and it will combine those two together with the best lip-sync models."
TAKE_A: (Helpful, concluding) "Now you can go to video at the top of the page and select the Lipsync Studio... where you can upload your photo and audio... and it will combine those two together with the best lip-sync models."
Video
GLOBAL LOCK:
The video features a white male creator in his mid-30s with medium-length, wavy brown hair and a groomed beard, wearing a clean white t-shirt. He is positioned in a bright home office with a professional black condenser microphone on a boom arm in the foreground. The video uses a split-screen or multi-panel layout to compare "Source Video" (the creator) with "AI Generated Results" (various celebrities and characters). The AI characters must perfectly mirror the creator's head tilt, facial expressions, lip-sync, and hand gestures. The lighting is soft, natural window light from the side. The color grade is clean and realistic.

[00:00–00:03]
The screen is split into three vertical panels. Top panel: The creator waves both hands excitedly and points to his right. Middle panel: Sabrina Carpenter in a pink feathered dress mimics the exact hand wave and pointing. Bottom panel: Billie Eilish in a black outfit and sunglasses mimics the same gestures. High-fidelity lip-sync as they all say "Hear me out."

[00:03–00:07]
The layout shifts. Top panel: Creator continues talking with expansive hand gestures. Middle panel: Taylor Swift in a red dress mimics the gestures. Bottom panel: Kim Kardashian in a black tank top mimics the gestures. The transitions between characters are sharp cuts.

[00:07–00:10]
Split screen: Creator (top) vs. Queen Elizabeth II (bottom). The creator looks to his left and then back to the camera with a skeptical expression. The Queen, wearing a crown and sash, mirrors the look perfectly.

[00:10–00:13]
Split screen: Creator (top) vs. Edna Mode from The Incredibles (bottom). The creator scratches the top of his head with his right hand. Edna Mode, with her signature bob and glasses, scratches her head in perfect sync.

[00:13–00:20]
A screen recording of a software interface (Enhancor). A cursor selects the "Wan2.2" model from a dropdown menu. The UI shows a "Source Video" of the creator and a "Character Image" of a woman. The cursor toggles "Pro Mode" on and adjusts resolution to 720p.

[00:20–00:23]
Split screen: Creator (top) vs. a woman with long brown hair in a floral dress (bottom). They are both in the same room. The creator raises his hands in a "stop" gesture; the woman mirrors him perfectly.

[00:23–00:27]
The UI returns, showing the "Photo Animate" tab being selected. A different reference photo of the same woman is used. The cursor clicks "Generate Video."

[00:27–00:35]
Final comparison. Split screen: Creator (top) vs. the woman (bottom). The creator looks around the room and then smiles at the camera while touching his hair. The woman mirrors the hair-touching and the smile, but her background is now a different indoor setting matching her reference photo. The text "AI" appears centered on the screen.

NEGATIVE PROMPT:
Visual: flickering faces, distorted limbs, extra fingers, blurry textures, face-swapping artifacts, unnatural skin smoothing, background warping, robotic movements, low resolution, watermarks.
Speech: robotic voice, mismatched lip-sync, muffled audio, background noise, unnatural pauses, clipping audio.

SPEECH PACK:
[00:00–00:07]
Transcript: "Hear me out, all of your favorite movies and animations are going to be completely acted out by someone else in the next two years."
TAKE_A: Energetic, fast-paced, direct-to-camera.
TAKE_B: Mysterious, slightly slower, emphasizing "completely."
TAKE_C: Casual, conversational, like a friend sharing a secret.

[00:07–00:13]
Transcript: "So I'm going to teach you everything you need to know about this in the next 20 seconds so that you can do this for yourself and stay ahead of the curve."
TAKE_A: Authoritative, instructional, rhythmic.
TAKE_B: Helpful, warm, encouraging.
TAKE_C: Urgent, fast-talking to fit the "20 seconds" claim.

[00:13–00:35]
Transcript: "So right now you have two options with this new AI video model called Wan 2.2. The first option is Character Swap... The second option is Photo Animate... This is absolutely mind-blowing. Comment AI for the link."
TAKE_A: Professional narrator style, clear enunciation.
TAKE_B: Enthusiastic, high energy on "mind-blowing."
TAKE_C: Calm, tech-reviewer tone, clear CTA at the end.
Video
A premium vertical beauty editorial focused on hyper-real human skin detail and facial micro-texture, structured as a rapid sequence of ultra-close-up portrait shots. Multiple young adult models of varied appearances are shown in soft studio light: an East Asian woman with freckles and bare skin, a young white man with curly brown hair, and additional female beauty portraits with natural lips, pores, peach fuzz, and detailed irises. The visual goal is to prove authentic skin realism, with macro framing on eyes, lips, nose bridges, cheek texture, and freckles. Lighting is clean, soft, and frontal with subtle shadow falloff, neutral color grade, crisp lens detail, and no heavy retouching. White center text reads variations of “I can spot AI from a mile away” across the sequence. The edit rhythm is smooth but quick, moving between full-face beauty portraits and extreme close details to emphasize authenticity, imperfection, and realistic skin texture.
Video
Kallaway

Vertical creator explainer video about the future of marketing in the AI image era, focused on image models and controllable creative workflows. A male presenter wearing a black baseball cap and black shirt talks directly to camera in a dark indoor environment with soft warm lights blurred behind him. The video cuts between close talking-head shots with large kinetic word captions and app-style demo screens showing image generation, face swaps, style transfers, product mockups, ad creatives, and model comparisons. Tools and examples shown include Nano Banana, Google Veo3, Freepik, Higgsfield, and ChatGPT-related image workflows. Sample visuals include movie character edits, Billie Eilish-inspired clothing/object swaps, people holding drink cans, branded beverage product shots, floating bananas, tabletop ad scenes, portrait transformations, and side-by-side comparisons of image and video outputs. Social-media AI marketing tutorial format, creator economy tone, practical ad-generation workflow, polished software-demo pacing, educational direct-to-camera presentation.
Video
GLOBAL LOCK: High-end editorial beauty photography style. Hyper-realistic skin textures including visible pores, fine hairs (peach fuzz), skin moisture, and natural imperfections. Soft, high-key studio lighting with large softbox sources. Neutral, clean background (off-white or light grey). Cinematic color grade with natural skin tones and soft highlight rolloff. 60fps feel with subtle, organic micro-movements. Subject identity must remain consistent within each segment.

[00:00–00:01] 
Subject: Caucasian woman, late 20s, blonde hair slicked back, green eyes, light makeup. 
Framing: Medium Close-Up (MCU), side profile, eyes turned toward the camera. 
Action: Neutral, confident expression, very slight breathing motion. 
Lighting: Soft rim light on the profile, bright catchlight in the eye.

[00:01–00:02] 
Subject: Extreme Close-Up (ECU) of the blonde woman's green eye. 
Action: The eye performs a slow, natural blink. Visible eyelashes with mascara, detailed iris texture. 
Camera: Macro lens, extremely shallow depth of field.

[00:02–00:03] 
Subject: ECU of the blonde woman's lips. 
Action: Lips are slightly parted, covered in clear, high-shine gloss. Subtle twitch of the lip corner. 
Texture: Visible lip lines and moisture reflections.

[00:03–00:04] 
Subject: ECU of the blonde woman's nose and cheek area. 
Action: Static macro shot. 
Texture: Extreme detail of skin pores, tiny freckles, and fine blonde hairs on the cheek.

[00:04–00:05] 
Subject: Black woman, early 20s, dark hair pulled back, prominent freckles across nose and cheeks. 
Framing: MCU, 3/4 view, her hand with dark burgundy nails is partially covering her forehead. 
Action: Direct gaze into the lens, calm and steady.

[00:05–00:06] 
Subject: ECU of the Black woman's brown eye. 
Action: Static macro shot, focus on the sharp detail of the eyelashes and the freckles on the eyelid. 
Lighting: Soft light reflecting in the pupil.

[00:06–00:07] 
Subject: ECU of the Black woman's nose and upper lip. 
Action: Subtle flare of the nostrils. 
Texture: Dense freckle patterns, natural skin sheen, visible skin grain.

[00:07–00:08] 
Subject: Mixed-race man, early 30s, short dark curly hair, light stubble. 
Framing: MCU, looking slightly off-camera to the left. 
Action: Slight head tilt, neutral masculine expression. 
Lighting: Side-lit to emphasize facial structure and stubble texture.

[00:08–00:09] 
Subject: ECU of the man's chin and lower lip. 
Action: Static macro shot. 
Texture: Individual hair follicles of the stubble, dry texture of the lips, skin pores.

[00:09–00:10] 
Subject: ECU of the man's eye and temple. 
Action: Subtle squinting motion. 
Texture: Visible crow's feet, fine lines, and skin texture around the eye.

NEGATIVE PROMPT: 
Smooth plastic skin, "uncanny valley" look, blurred textures, distorted eyes, extra limbs, cartoonish features, heavy makeup, unnatural blinking, flickering light, low resolution, watermarks, text, logos, shaky camera, over-saturated colors.

SPEECH PACK:
(No speech present in video, only rhythmic percussive audio.)
Audio Note: Sync cuts to a 120 BPM percussive "thump" or heartbeat sound. Each ECU cut should land exactly on a beat.
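
The audio note reduces to simple arithmetic: at 120 BPM a beat falls every 0.5 seconds, so the one-second segment boundaries in this prompt land on every second beat. Below is a minimal Python sketch of that check. It assumes the first beat falls at 00:00, and the helper names `beat_times` and `cuts_on_beat` are hypothetical, not part of any tool named on this page.

```python
def beat_times(bpm: float, duration_s: float) -> list[float]:
    """Return beat timestamps in seconds, assuming the first beat is at t=0."""
    interval = 60.0 / bpm  # seconds per beat
    count = int(duration_s / interval) + 1
    return [round(i * interval, 3) for i in range(count)]


def cuts_on_beat(cut_times: list[float], bpm: float, tolerance_s: float = 0.001) -> bool:
    """Check that every cut lands within tolerance of the nearest beat."""
    interval = 60.0 / bpm
    for t in cut_times:
        nearest = round(t / interval) * interval
        if abs(t - nearest) > tolerance_s:
            return False
    return True


# Segment boundaries taken from the timecodes above: one cut per second.
cuts = [float(s) for s in range(0, 11)]
beats = beat_times(120, 10)
all_aligned = cuts_on_beat(cuts, 120)
```

If a segment boundary is later moved off the one-second grid, the same check shows whether the cut still falls on a beat.
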
Video
GLOBAL LOCK: A consistent female subject, Caucasian, early 20s, shoulder-length messy blonde/light-brown hair, natural makeup, wearing a simple black tank top. The environment is a minimalist studio with a dark grey, out-of-focus background. Lighting is soft-box studio style, creating gentle highlights on the face. The video is a split-screen comparison with a vertical white slider line moving across the frame.

[00:00–00:03]
The subject is framed in a medium close-up, centered. The vertical slider sits at the far left of the frame, so the whole face shows the "before" state: skin that is slightly too smooth and reads as "AI-generated." The subject remains static with a neutral, calm expression, looking directly at the camera.

[00:03–00:07]
The vertical white slider line moves steadily from the left edge of the frame to the right edge. As it passes over the subject's face, the "smooth" skin ahead of the line (to its right) is replaced by "hyper-textured" skin, with visible pores and natural texture, behind the line (to its left). The transition is sharp and follows the slider line exactly. The subject's hair and clothing remain perfectly consistent across the transition.

[00:07–00:10]
The slider reaches the right side of the frame, revealing the fully enhanced, realistic face. The subject maintains her neutral gaze. The lighting remains constant, emphasizing the newly revealed skin texture, fine lines, and realistic highlights on the nose and forehead. The video loops seamlessly back to the start.
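
The slider timing in these three segments can be written as a small function of time. This is a minimal Python sketch that assumes "moves steadily" means linear motion. The function name `slider_x` and the normalized 0.0 to 1.0 coordinate are illustrative assumptions.

```python
def slider_x(t: float, start_s: float = 3.0, end_s: float = 7.0) -> float:
    """Normalized slider position: 0.0 = left edge of frame, 1.0 = right edge.

    The slider holds at the left edge until start_s, moves linearly,
    then holds at the right edge from end_s onward.
    """
    if t <= start_s:
        return 0.0
    if t >= end_s:
        return 1.0
    return (t - start_s) / (end_s - start_s)


# Sample the position once per second across the 10-second clip.
positions = [slider_x(float(t)) for t in range(0, 11)]
```

Multiplying the result by the frame width in pixels gives the column where the white line is drawn for a given timestamp.
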

NEGATIVE PROMPT: blurry, distorted facial features, inconsistent hair movement, flickering lighting, plastic-looking skin on the "after" side, unnatural eye reflections, jittery slider movement, low resolution, watermarks, text artifacts on the subject.

SPEECH PACK:
(No speech present in the original video; it relies on text overlays and background music.)
TRANSCRIPT: [Background Music Only]
TAKE_A: N/A
TAKE_B: N/A
TAKE_C: N/A
PROSODY: N/A
SYNC: N/A
Video
GLOBAL LOCK: 
Subject: A Black man in his late 20s, athletic build, warm brown skin tone with visible texture (pores, slight stubble). 
Hair: Medium-length dark dreadlocks, some strands slightly frizzy. 
Wardrobe: Dark charcoal grey knitted crew-neck sweater with a visible weave pattern. 
Environment: Minimalist indoor setting, soft cream-colored curtains in the background. 
Lighting: Warm, cinematic directional lighting (Rembrandt style), soft shadows, high-end editorial feel. 
Color Grade: Warm earthy tones, slightly desaturated, rich contrast in skin highlights. 
Camera: 35mm and 85mm lens feel, shallow depth of field, sharp focus on subject. 
Speech: Male voice, calm, authoritative, medium-low pitch, professional cadence.

[00:00–00:03]
Subject: Medium close-up of the man. He has his right hand raised, fingers gently threading through his dreadlocks near his temple. He looks directly into the camera with a neutral, intense expression.
Action: Subtle movement of the hand in the hair.
Camera: Static MCU, eye-level.
Lighting: Soft light from the left, highlighting the side of his face and hand.
Speech: "This face is 100% AI." (Lips visible, high sync strictness).

[00:04–00:07]
Subject: Extreme macro close-up of the lower right cheek and jawline.
Action: Static shot showing the fine detail of skin pores, a few micro-scars, and a patchy, short-cropped beard with individual hairs visible.
Camera: ECU (Macro), static.
Lighting: Side-lit to emphasize the 3D texture of the skin.
Speech: "and brands still pay for it. You can see every clogged pore,"

[00:08–00:11]
Subject: Transition from a close-up of his dark brown eye (showing reflections) to an extreme macro of the sweater's shoulder.
Action: A tiny white flake of lint is visible on the dark knit of the sweater.
Camera: ECU, subtle shift in focus from eye to shoulder.
Lighting: Soft, revealing the texture of the wool.
Speech: "the patchy beard that looks like it's been growing since lockdown. You can see every strand in the hair. Even that little white flake on the shoulder"

[00:12–00:16]
Subject: Return to a medium close-up. The man is still looking at the camera, his hand is now down. He blinks once, naturally.
Action: A slow, almost imperceptible zoom-in.
Camera: MCU, slow dolly-in.
Speech: "could be t-shirt lint, could be a croissant crumb from breakfast. Either way, your brain buys it." (Lips visible, high sync).

[00:17–00:23]
Subject: A rapid montage of extreme macro shots: 1) Forehead skin with micro-scars. 2) Close-up of the eye and eyebrow. 3) Side of the neck with fine hairs and skin folds. 4) Macro of the cheek texture again.
Action: Fast cuts, minimal subject motion.
Camera: ECU, static shots.
Lighting: Consistent warm, directional light.
Speech: "I call this Genesis Engineering. Stacking pores, micro scars, lens dirt and bad pixels until it passes the client zoom test."

[00:24–00:26]
Subject: Medium close-up of the man, centered. He maintains a steady, confident gaze.
Action: Static, final pose.
Camera: MCU, static.
Speech: "Comment Genesis and the prompt is yours." (Lips visible, high sync).

NEGATIVE PROMPT: 
Visual: Smooth plastic skin, "beauty filter" look, perfectly symmetrical beard, blurry textures, cartoonish dreadlocks, glowing eyes, distorted fingers, flickering light, floating hair, AI-generated text artifacts.
Speech: Robotic monotone, overly excited tone, slurred syllables, mouth movements not matching "Genesis" or "AI", background noise, echo, harsh "S" sounds.

SPEECH PACK:
[00:00-00:03] "This face is 100% AI."
TAKE_A: (Direct, factual) This face... is one hundred percent... AI.
TAKE_B: (Intriguing) This face? It's 100% AI.

[00:04-00:11] "and brands still pay for it. You can see every clogged pore, the patchy beard that looks like it's been growing since lockdown. You can see every strand in the hair. Even that little white flake on the shoulder"
TAKE_A: (Detailed, observational) ...and brands still pay for it. [pause] You can see every... clogged... pore. The patchy beard... every strand... even that flake.

[00:12-00:16] "could be t-shirt lint, could be a croissant crumb from breakfast. Either way, your brain buys it."
TAKE_A: (Conversational) Could be lint... could be a crumb. Either way? Your brain buys it.

[00:17-00:26] "I call this Genesis Engineering. Stacking pores, micro scars, lens dirt and bad pixels until it passes the client zoom test. Comment Genesis and the prompt is yours."
TAKE_A: (Professional/Closing) I call this... Genesis Engineering. [fast] Stacking pores, scars, dirt... until it passes the zoom test. Comment 'Genesis'... and the prompt is yours.
Video
GLOBAL LOCK: 
Subject identity must transition from high-profile celebrities to a consistent female creator. 
Celebrity segment: Chris Hemsworth (Caucasian male, short blonde hair, beard, black suit), Sydney Sweeney (Caucasian female, blonde hair, red lipstick, black dress), Timothée Chalamet (Caucasian male, curly brown hair, black blazer), Zendaya (African-American/Caucasian female, slicked back hair, silver choker). 
Creator segment: Caucasian female, mid-20s, wavy light brown hair, wearing a beige/cream blazer over a ribbed tan top. 
Environment: High-end studio backgrounds (dark green, white, grey) for celebrities; modern, bright office/indoor setting for creator. 
Lighting: Professional studio lighting with soft key and rim lights. 
Color Grade: High saturation and contrast for hooks; neutral, warm tones for tutorial. 
Speech: Energetic, clear female voiceover, direct-to-camera delivery.

[00:00–00:02]
Extreme close-up of Chris Hemsworth, sharp focus on eyes, studio lighting against a dark green textured background. Rapid cut to Sydney Sweeney, front-facing, bright red lipstick, white background. Camera is static. High-contrast color grading.

[00:02–00:04]
Extreme close-up of Timothée Chalamet, neutral expression, curly hair detail visible. Rapid cut to Zendaya, split-screen effect showing a "before and after" lighting change on her face. Text overlay "Photographers are officially cooked" in yellow and white.

[00:04–00:07]
A 4-way grid appears featuring the previous four celebrity portraits. The grid is static, then zooms in slightly. The text overlay remains at the bottom.

[00:07–00:10]
Screen recording of the Google Gemini mobile interface. A thumb taps the "+" icon, selects a selfie of a woman in a beige blazer, and types the text "selfie of yourself". The UI is in dark mode. The movement is smooth and functional.

[00:10–00:12]
The screen recording shows the AI processing and then reveals a stunning, professional headshot of the woman. The woman has wavy brown hair and is wearing a professional beige suit in a blurred modern office background.

[00:12–00:14]
Cut to the real-life creator (matching the AI headshot's identity). She is in a medium close-up, gesturing with her hands, speaking directly to the camera. Her expression is enthusiastic. Background is a bright, out-of-focus indoor space.

[00:14–00:16]
The creator continues speaking. A large text overlay appears: Comment "Photo". She points towards the camera/text. The cut is clean. The audio is crisp with a slight room resonance.

NEGATIVE PROMPT: 
Blurry faces, distorted eyes, inconsistent hair textures between cuts, robotic voice, laggy screen recording, messy background, low lighting, oversaturated skin tones, visible AI artifacts on hands or clothing, text flickering.

SPEECH PACK:
[00:05–00:16]
"Photographers are officially cooked, because you can go to Google's Gemini, upload any basic selfie of yourself and get this stunning professional headshot. It's that simple. If you want to try this, comment 'Photo' and I'll send you the prompt."

TAKE_A (Energetic/Fast): "Photographers are officially COOKED! Go to Google Gemini, upload a selfie, and boom—professional headshot. Simple. Comment 'Photo' for the prompt!"
TAKE_B (Informative/Steady): "Photographers are officially cooked. You can now use Google Gemini to turn any basic selfie into a stunning professional headshot. If you want to try it, just comment 'Photo' and I'll send the prompt."
TAKE_C (Casual/Friendly): "So, photographers are officially cooked. Just go to Gemini, upload your selfie, and get a pro headshot instantly. Want the prompt? Comment 'Photo' below!"

Prosody: Emphasis on "COOKED", "Gemini", and "PHOTO". Short pause after "headshot".
Sync: High lip-sync strictness for the final 4 seconds. Phrase boundaries aligned to cuts at 00:12.
Video

A) MISE EN PLACE

Reference summary
- Duration: 00:57.79
- Format: vertical 9:16, 720x1280, 24 fps
- Structure: talking-head tutorial reel demonstrating HeyGen AI Agent for UGC-style content creation
- Audio: direct-to-camera creator narration; exact words inferred best-effort from caption, visible UI, and pacing

Scene / shot segmentation
1. 00:00.00-00:10.00
   Hook section with phone-shot UGC example footage on screen, presenter lower center. A female creator-style vertical clip is shown as the practical target output while the host frames the feature as a new way to make UGC content.
2. 00:10.00-00:22.00
   More UGC examples and social-style before/after proof, including a hand pointing at the screen to emphasize generated results and mobile-native output.
3. 00:22.00-00:38.00
   HeyGen product interface section. Dark dashboard and setup screens take over, showing AI Agent-related controls, workflow panels, and configuration blocks while presenter keeps explaining.
4. 00:38.00-00:49.00
   Deeper editor / media management section. Grid-based asset views and back-office screens appear, suggesting avatar, scene, or media orchestration.
5. 00:49.00-00:57.79
   Presenter-forward close with strong CTA energy, likely asking viewers to comment “AI” for the link.

Visual evidence keyframes
- 00:00.00: UGC-style female selfie/creator shot framed on a phone screen, presenter lower center
- 00:08.00: finger pointing at screen, emphasizing mobile-native proof
- 00:16.00: second UGC-style clip with presenter continuing explanation
- 00:24.00: dark HeyGen interface with AI Agent-style workflow card and controls
- 00:32.00: dashboard-like panels and configuration widgets
- 00:40.00: media grid / project management view
- 00:52.00: presenter larger in frame with CTA close energy

Speech evidence (best-effort)
- speaker_count: 1
- speaker A: male-presenting creator speaking on-camera throughout
- speech style: upbeat tutorial narration, positioning the new HeyGen AI Agent feature as a way to produce UGC-style ad/social content
- likely content themes in order:
  1) how to create UGC-style content using HeyGen’s new AI Agent feature
  2) quick proof that the format works for social-style output
  3) walkthrough of the HeyGen setup / dashboard / workflow
  4) explanation of how the tool helps generate content faster
  5) comment “AI” for the link
- lip visibility: full for most presenter segments
- lip_sync_strictness: medium

Invariants list (LOCK THESE)
- presenter identity: male creator in casual cap, beard, light t-shirt, speaking directly to camera from a seated setup
- layout: presenter near bottom center while examples and interface screens rotate above and behind him
- product context: HeyGen AI Agent, UGC-style content creation, social media / ad creative workflow
- design language: creator tutorial, mobile-first, dark dashboard UI, concrete examples before tool explanation
- motion grammar: hard cuts between example clips and dashboard screens, no elaborate cinematic camera move
- lighting / grade: presenter evenly lit, warm-neutral skin tones, dark interface background, bright phone-screen examples
- audio style: concise, creator-education voice optimized for shorts/reels

Variables list (TWEAK THESE)
- exact UGC example faces and scenes
- exact dashboard panels and wording on HeyGen screens
- precise narration phrasing
- exact CTA wording beyond the comment-for-link mechanic

B) SHOTLIST

Shot 1
- shot_id: 1
- timecode_start: 00:00.00
- timecode_end: 00:10.00
- duration: 10.00s
- framing: presenter lower center beneath a large mobile-video example
- lens: presenter webcam/phone-style medium crop
- camera movement: static presenter crop, brisk background swaps
- subject: presenter introduces the HeyGen AI Agent use case for UGC content
- environment: female selfie-style UGC clip filling the upper frame, social-media-native layout
- speech/audio: Speaker A hook line about creating UGC-style content using the new feature

Shot 2
- shot_id: 2
- timecode_start: 00:10.00
- timecode_end: 00:22.00
- duration: 12.00s
- framing: more UGC proof clips and touch/point emphasis on screen
- camera movement: quick cuts and proof refreshes
- subject: presenter reinforces that the output looks like social-native creator content
- environment: phone-screen examples, finger pointing, comparative proof frames
- speech/audio: Speaker A highlights the outcome and use case

Shot 3
- shot_id: 3
- timecode_start: 00:22.00
- timecode_end: 00:38.00
- duration: 16.00s
- framing: HeyGen dashboard fills most of the frame, presenter remains lower center
- camera movement: rapid UI cuts
- subject: presenter explains AI Agent setup / workflow
- environment: dark product interface, cards, toggles, and pipeline sections
- speech/audio: Speaker A turns practical and tool-specific

Shot 4
- shot_id: 4
- timecode_start: 00:38.00
- timecode_end: 00:49.00
- duration: 11.00s
- framing: deeper project/media management screens
- camera movement: hard cuts through interface states
- subject: presenter explains scaling or organizing content generation
- environment: asset grid, project thumbnails, management view
- speech/audio: Speaker A continues the workflow explanation

Shot 5
- shot_id: 5
- timecode_start: 00:49.00
- timecode_end: 00:57.79
- duration: 8.79s
- framing: presenter-forward close with remaining dashboard context behind him
- camera movement: mostly static close
- subject: presenter lands the CTA and link offer
- environment: dark interface or blurred dashboard backdrop
- speech/audio: Speaker A asks viewers to comment “AI” for the link
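
The shotlist timecodes can be sanity-checked with a short script. This is a minimal Python sketch, assuming the `MM:SS.ss` timecode format used above and the 24 fps figure from the reference summary. The `Shot` class and the helper names are hypothetical.

```python
from dataclasses import dataclass


def parse_timecode(tc: str) -> float:
    """Convert an 'MM:SS.ss' timecode to seconds."""
    minutes, seconds = tc.split(":")
    return int(minutes) * 60 + float(seconds)


@dataclass
class Shot:
    shot_id: int
    start_s: float
    end_s: float

    @property
    def duration_s(self) -> float:
        return round(self.end_s - self.start_s, 2)


FPS = 24  # from the reference summary

# Start and end timecodes copied from Shots 1 to 5 above.
timecodes = [
    ("00:00.00", "00:10.00"),
    ("00:10.00", "00:22.00"),
    ("00:22.00", "00:38.00"),
    ("00:38.00", "00:49.00"),
    ("00:49.00", "00:57.79"),
]
shots = [
    Shot(i + 1, parse_timecode(start), parse_timecode(end))
    for i, (start, end) in enumerate(timecodes)
]


def is_contiguous(shot_list: list[Shot]) -> bool:
    """True when each shot starts exactly where the previous one ends."""
    return all(a.end_s == b.start_s for a, b in zip(shot_list, shot_list[1:]))


total_s = round(sum(s.duration_s for s in shots), 2)
frame_counts = [round(s.duration_s * FPS) for s in shots]
```

The five listed durations (10.00, 12.00, 16.00, 11.00, and 8.79 seconds) sum to 57.79 seconds, which matches the duration in the reference summary.
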

C) STYLE BIBLE (GLOBAL)

- visual_style: AI creator tutorial reel, UGC marketing workflow breakdown
- camera_signature: persistent talking-head lower-third with changing proof and interface backgrounds
- lighting_signature: soft creator lighting on presenter; bright mobile examples contrasted with dark software UI
- grade_signature: warm-neutral presenter, darker dashboard, high-contrast phone-screen inserts
- texture_signature: crisp app interface, handheld/phone-look proof clips, creator desk setup feel
- pacing_signature: quick promise, quick proof, practical workflow, CTA
- speech_style: direct-to-camera tutorial narration
- speaker_profile: enthusiastic, practical, creator-marketer tone
- pronunciation_profile: casual English, medium-fast, emphasis on tool name and outcome
- mic_mix_profile: dry, clear creator audio with light compression

D) PROMPT SYNTHESIS

MASTER PROMPT

GLOBAL LOCK: Create a vertical 9:16 creator tutorial reel about using HeyGen’s new AI Agent feature to make UGC-style content. Keep one male creator presenter seated near the bottom center for most of the video. He has a short beard, baseball cap, casual light t-shirt, and speaks directly to camera with energetic but practical tutorial cadence. The background rotates between UGC-style phone footage, mobile-screen examples, dark HeyGen dashboard screens, AI Agent workflow panels, media-management views, and a final comment CTA. Preserve a mobile-first, scroll-stopping structure: proof first, interface next, conversion close. Lighting on the presenter stays soft and even, with a clean creator-desk feel.

[00:00-00:10.00] Open with a realistic UGC-style female selfie or creator clip filling the upper frame, as if viewed on a phone screen, while the presenter appears lower center and introduces how to create this kind of content using HeyGen’s new AI Agent feature. Keep the frame immediately legible for social media: the viewer should instantly understand that the end goal is ad-ready, creator-native short-form content. Speaker A is upbeat and explanatory, lips visible, medium lip-sync strictness.

[00:10.00-00:22.00] Continue with more proof-driven UGC examples and mobile-native frames. Include finger-pointing or screen-emphasis moments to make the tutorial feel tactile and practical rather than abstract. The presenter keeps speaking and gesturing while showing that the output can pass as social-ready creator content. Use quick cuts with clear result-first momentum.

[00:22.00-00:38.00] Transition into the HeyGen product interface. Show a dark dashboard with AI Agent workflow blocks, setup cards, toggles, and configuration panels. Keep the presenter lower center and have him explain how the feature works in practice. The background should clearly read as real software, not a mockup. Sync sentence accents to UI changes.

[00:38.00-00:49.00] Show deeper operational screens such as a media grid, project organization view, content assets, or an editor-style management panel. The presenter continues with a practical explanation about building, organizing, or scaling UGC outputs through the tool. Maintain a creator-tutorial pace with clean hard cuts and readable interface detail.

[00:49.00-00:57.79] Close with the presenter more dominant in the frame while HeyGen context remains visible behind him. End with a direct CTA asking viewers to comment “AI” for the link. Make the final frame readable, conversion-oriented, and clearly tied to the value already demonstrated.

NEGATIVE PROMPT

Avoid warped phone screens, unreadable dashboard text, messy cutout edges around the presenter, drifting face identity, fake-looking UGC footage, over-animated transitions, robotic narration, slurred speech, lip-sync mismatch, clipping, room echo, low-contrast CTA text, random wardrobe changes, muddy UI panels, flicker, frame jitter, and generic ad visuals that do not feel native to social feeds.

SHOT PROMPTS

- Hook delta: mobile-native UGC proof clip with presenter lower center
- Proof delta: more creator-style examples and finger-point emphasis
- Dashboard delta: dark HeyGen AI Agent setup interface
- Management delta: media grid / project organization view
- CTA delta: presenter-forward finish with comment-for-link ask

SPEECH PACK

Timecoded transcript (best-effort observable reconstruction)
- [00:00.00-00:10.00] Speaker A: “Here’s how to create UGC-style content using HeyGen’s new AI Agent feature.” Emotion: upbeat, hook-first.
- [00:10.00-00:22.00] Speaker A: “This lets you generate social-native creator content much faster while keeping the output usable for marketing.” Emotion: confident, proof-oriented.
- [00:22.00-00:38.00] Speaker A: “Let me show you the HeyGen workflow and how the AI Agent part fits in.” Emotion: practical, tutorial-focused.
- [00:38.00-00:49.00] Speaker A: “From here you can manage the content, examples, or project setup inside the dashboard.” Emotion: tactical, steady pace.
- [00:49.00-00:57.79] Speaker A: “Comment ‘AI’ for the link.” Emotion: punchy CTA close.

TAKE_A
- Keep the wording close to the lines above with creator-marketing energy.

TAKE_B
- Same meaning, slightly faster and more ad-operator focused.

TAKE_C
- Same meaning, calmer and more educational.

Closest audible version
- Exact speech was not transcribed verbatim, so the lines above represent closest observable tutorial intent supported by caption, UI context, and pacing.

Safe paraphrase version
- The reel explains how to use HeyGen AI Agent to create UGC-style content and ends by asking viewers to comment “AI” for the link.

AI Face Generator

AI face generator content is most valuable when it clearly separates creating a synthetic portrait from editing a photo of a real person. People searching this topic usually want a face that looks believable but does not belong to an existing person. That makes the page especially useful for design mockups, stock-style placeholders, anonymous profile concepts, and interfaces that need human variety without real-model licensing issues.

The strongest examples on this page should help a creator judge realism, diversity, and practical use. A good synthetic face is not only technically convincing. It also needs to feel usable in product, editorial, or prototype contexts where trust, neutrality, and variation matter.

FAQ

What is an AI face generator best for?

It is best for making believable synthetic faces for mockups, profile placeholders, anonymous personas, and creative projects that need non-real people.

How is this different from a headshot tool?

A headshot tool usually transforms or improves an image of a real person. A face generator creates a face that does not correspond to any real person.

Who uses synthetic faces most often?

Designers, developers, marketers, and creators use them when they need human imagery without model releases or identity complications.

What should I compare on this page?

Look for realism, range, and whether the face feels usable in actual design or profile contexts rather than only as a novelty output.