AI mockup generator pages are for people who need a design to look real before it exists in the real world. That usually means brand owners, online sellers, or designers trying to show a logo, print, or package on a believable product surface. This page helps you compare mockup directions that feel presentation-ready, faster to iterate, and more useful than flat design files alone.

Video
GLOBAL LOCK: vertical 3:4 Adobe Firefly Boards style promo card, static held frame, red brand treatment over a gloomy downtown city block. Main image shows a tall monolithic concrete tower tinted deep Firefly red, torn open by two vertical cracks, with a masked cyberpunk antihero figure emerging from the fissure. Character design: short white hair, white or silver face mask with dark eye slits, dark tactical armor or jacket, menacing upright posture. Preserve Firefly square 'Fi' logo at top left, bold white headline stacked center reading 'From Idea to Branded Mockup' with a red capsule beneath reading 'in minutes', smaller white subhead explaining how AI-first Firefly Boards help visualize concepts without leaving the flow, lower-left hashtags for Adobe Firefly ambassadors and Firefly Boards, and a small swipe cue at lower right. Rainy traffic, buses, taxis, and pedestrians anchor scale at street level.
[00:00-00:11] Hold on the same branded hero frame throughout with only subtle export shimmer. The red building, cracked facade, cyberpunk figure, overcast clouds, and downtown traffic remain static while the large white headline and red capsule emphasize the message that Firefly Boards turns an idea into a branded mockup in minutes.
Video
GLOBAL LOCK:
The subject is a stylized, anthropomorphic orangutan with a friendly, smiling expression. The orangutan has reddish-brown fur, wears bright orange-tinted sunglasses, and a vibrant blue Hawaiian shirt with tropical leaf patterns. The environment is a sunny tropical beach with white sand, turquoise water, and palm trees in the background. The lighting is bright, warm, and cinematic, mimicking a high-end commercial photoshoot. The color grade is highly saturated with a focus on yellows, blues, and oranges. The product is a bright yellow "BANERGY" energy drink can with black text. Speech is not present, but the visual rhythm follows a tutorial UI flow.

[00:00–00:03]
A clean, dark gray background with centered white bold text: "Tutorial 1: A quick walkthrough of how I created the BANERGY product placement ad using Nano Banana inside Freepik". At the bottom right, a small white icon of a hand swiping left with the text "Swipe for more".

[00:03–00:07]
Screen recording of a web browser showing the Freepik.com homepage. A cursor moves to the left sidebar, hovers over "AI Suite," and clicks on "Image Generator" from the dropdown menu. The UI is clean and modern.

[00:07–00:12]
The browser navigates to the Freepik Image Generator page. The cursor clicks on the "Model" selection box and chooses "Nano Banana." Then, the cursor moves to the "Image References" section and clicks "Add" to upload a high-resolution image of a yellow "BANERGY" energy drink can. The can is shown as a thumbnail in the reference box.

[00:12–00:18]
The cursor clicks into the "Describe your image" text box. A detailed prompt is typed out: "Combine the smiling orangutan wearing orange sunglasses and tropical banana shirt with the bright yellow 'BANERGY' energy drink can. Place the can in the monkey's hand as if he's proudly holding it up for a commercial photo shoot. Keep the tropical beach background with palm trees, vibrant summer colors, and fun advertisement aesthetic. Ultra-realistic photography, high-resolution, playful but professional style."

[00:18–00:23]
The cursor clicks the "Generate" button. A loading animation appears briefly. Then, the final result is revealed: a cinematic, high-quality image of the orangutan in the Hawaiian shirt and sunglasses, holding the yellow BANERGY can on a sun-drenched beach. The camera is at a medium-close-up angle, slightly low to make the subject look heroic. The lighting is vibrant with soft highlights on the fur and sharp reflections on the can.

NEGATIVE PROMPT:
Visual: blurry, distorted hands, extra fingers, low resolution, text artifacts (except for BANERGY), dull colors, messy fur, inconsistent lighting, watermark, logo on shirt, distorted sunglasses.
Speech: N/A (No audio/speech in reference).

SPEECH PACK:
(No speech present in the original video. The video relies on text overlays and UI interaction.)
TRANSCRIPT:
[00:00] "Tutorial 1: A quick walkthrough of how I created the BANERGY product placement ad using Nano Banana inside Freepik"
[00:22] "Swipe for more" (Visual cue)

TAKE_A (Narrator Style - for recreation):
[00:00] "Here is a quick walkthrough of how I made this BANERGY ad using Freepik."
[00:03] "First, head over to Freepik and open the AI Image Generator."
[00:07] "Select the Nano Banana model and upload your product as a reference."
[00:12] "Type in a detailed prompt describing your scene and character."
[00:18] "Hit generate, and you've got a professional ad in seconds."

TAKE_B (Energetic/Hype):
[00:00] "Stop paying for photoshoots! Check out how I made this ad with AI."
[00:03] "Go to Freepik's AI Suite."
[00:07] "Use the Nano Banana model and drop your product photo right here."
[00:12] "Tell the AI exactly what you wantβ€”like this cool monkey on a beach."
[00:18] "Boom! Look at that quality. Insane, right?"
Video
Claye Ai
GLOBAL LOCK:
Subject: A female host, mid-20s, South Asian ethnicity, warm skin tone, long wavy brown hair, wearing a cozy lavender/purple knit sweater. She sits in a home office with a professional black condenser microphone on a boom arm. Background features a dark wooden bookshelf filled with books and small plants, softly blurred.
AI Subject Consistency: A high-fashion female model, European features, sharp jawline, sleek dark hair, wearing a white luxury power suit.
Environment: High-end studio settings for AI ads; warm home office for host.
Lighting: Soft, three-point lighting for the host; dramatic, high-contrast, cinematic lighting for AI outputs.
Color Grade: Warm, saturated tones for the host; cool blues and deep blacks for the "Dior-style" AI ads.
Speech: Clear, energetic female voice, professional cadence, direct-to-camera address.

[00:00–00:05]
Visual: Rapid montage of luxury brand ads. A model with green eyes holds a perfume bottle; a red "HERA" perfume ad; a man wearing a Calvin Klein watch; a woman with flowers on her face holding perfume.
Camera: Extreme close-ups and medium shots, static with slight internal motion.
Lighting: High-fashion studio lighting, dramatic shadows.
Speech: "You don't need to hire models or designers to create brand ads anymore." (Fast-paced, hook delivery).

[00:05–00:12]
Visual: Cut to host in her office. She gestures towards the camera. A screen overlay shows the URL "lovart.ai/home".
Camera: Medium shot, static.
Lighting: Warm, soft key light from the side.
Speech: "AI can do the full photoshoot and video for you. Just go to lovart.ai and start a new project."

[00:12–00:20]
Visual: Screen recording of the Lovart.ai interface. A cursor clicks "Upload Image," then selects "Nano Banana Pro" from a dropdown menu. A prompt is typed: "Luxury studio photoshoot of a model holding my product, cinematic lighting, premium brand look."
Camera: Screen capture, focused on the UI elements.
Speech: "Upload your product image, choose Nano Banana Pro, and describe the ad you want."

[00:20–00:26]
Visual: The AI generates a photo of a model in a white suit holding a Dior bag. The host is shown in a small window, reacting. The screen shows "Edit Text" and "Model Pose" options.
Camera: Split screen: UI on top, Host on bottom.
Speech: "Lovart's design agent will generate a professional ad visual in seconds. You can adjust anything: text, background, model pose, lighting."

[00:26–00:35]
Visual: A "Before" and "After" comparison. The "Before" is warm-toned; the "After" is cool blue with dramatic shadows. The cursor then selects "Kling 3.0" for video generation.
Camera: Side-by-side comparison, then UI focus.
Speech: "And the best part? These edits don't damage or overwrite your original base image. Once your poster looks perfect, you can turn it into a cinematic video using Kling 3.0 inside Lovart."

[00:35–00:40]
Visual: The final video output shows the model in the white suit subtly moving, adjusting her hand on the bag. The host returns to full screen with a "Comment ART" graphic overlay.
Camera: Full-screen AI video, then Medium Shot of host.
Speech: "Just add a motion prompt, generate, and your animated brand ad is ready. Comment ART and I'll send you the tool link."

NEGATIVE PROMPT:
Visual: Blurry faces, extra fingers, distorted product logos, flickering lights in video, unnatural skin texture, messy background in host segments, low resolution, watermarks on AI outputs.
Speech: Robotic tone, background noise, muffled audio, lip-sync mismatch, long pauses between sentences, harsh "S" sounds.

SPEECH PACK:
[00:00-00:05]
Transcript: "You don't need to hire models or designers to create brand ads anymore."
TAKE_A: (Energetic, fast) "You don't need to hire models or designers to create brand ads anymore!"
TAKE_B: (Authoritative, measured) "You don't need to hire models... or designers... to create brand ads anymore."

[00:05-00:12]
Transcript: "AI can do the full photoshoot and video for you. Just go to lovart.ai and start a new project."
TAKE_A: (Helpful, inviting) "AI can do the full photoshoot and video for you. Just go to lovart dot a-i and start a new project."

[00:35-00:40]
Transcript: "Comment ART and I'll send you the tool link."
TAKE_A: (Direct, friendly) "Comment ART and I'll send you the tool link!"
TAKE_B: (Whispered/Secretive) "Comment ART... and I'll send you the tool link."
Video
A) MISE EN PLACE
2) Segment the video into scenes/shots:
- [00:00-00:03] Shot 1: ECU face, talking.
- [00:03-00:05] Shot 2: CU face, holding product.
- [00:06-00:09] Shot 3: MS, head turn, dramatic shadow.
- [00:10-00:12] Shot 4: CU, applying product.
- [00:13-00:15] Shot 5: WS, sitting on floor.
- [00:16-00:18] Shot 6: CU, touching neck.
- [00:19-00:21] Shot 7: MS, sitting on stool, talking.
- [00:22-00:24] Shot 8: MS, holding hair up.
- [00:25-00:27] Shot 9: CU, wind in hair.

3) Extract visual evidence:
- Keyframes: 00:01 (talking face), 00:04 (holding product), 00:07 (shadow face), 00:11 (applying product), 00:14 (full body), 00:17 (touching neck), 00:20 (sitting talking), 00:23 (holding hair), 00:26 (wind in hair).

4) Extract speech evidence:
- Speaker: 1 female voice (Speaker A).
- Transcript:
  [00:00-00:03] "What if I told you I'm not even real."
  [00:03-00:05] "But the product I'm holding is Hailey Bieber's Rhode lip balm."
  [00:06-00:09] "Everything you're seeing was created with AI, no camera, no studio."
  [00:10-00:12] "Just one image and a few prompts."
  [00:13-00:15] "Every reflection, every highlight, every detail was generated in seconds."
  [00:16-00:18] "Real product, unreal possibilities."
  [00:19-00:21] "You don't need a full setup anymore."
  [00:22-00:24] "Just imagination."
  [00:25-00:27] "Comment guide to learn how."
- Lip visibility: Full visibility in shots 1 and 7. Partial/implied in others.
- Sync strictness: High for shots 1 and 7.

5) Invariants list (LOCK THESE):
- Visuals: Asian woman, mid-20s, flawless glowing skin, dark brown hair, fitted white ribbed sleeveless turtleneck tank top, small silver hoop earrings. Cinematic studio lighting, 85mm lens feel, photorealistic texture.
- Speech: Female voice, warm, confident, commercial beauty tone, close-mic studio sound, dry room.

6) Variables list (TWEAK THESE):
- Visuals: Lighting direction (soft beauty vs. hard directional), hair state (tied back vs. loose), background color (black, grey, white), pose, camera framing (ECU to WS).
- Speech: Pacing, emphasis on key words ("real", "AI", "seconds").

B) SHOTLIST
[00:00-00:03]
- framing: ECU, eye level.
- lens: 85mm, shallow DoF.
- camera movement: Static.
- subject: Looking directly at lens, speaking.
- environment: Dark studio background.
- lighting: Soft beauty lighting, high contrast.
- speech: Speaker A, on-camera. "What if I told you I'm not even real." High lip-sync strictness.

[00:03-00:05]
- framing: CU, eye level.
- lens: 85mm, shallow DoF.
- camera movement: Slight drift.
- subject: Holding a pink lip balm tube near her cheek, looking at camera.
- environment: Neutral studio background.
- lighting: Soft diffused lighting.
- speech: Speaker A, VO. "But the product I'm holding is Hailey Bieber's Rhode lip balm."

[00:06-00:09]
- framing: MS, eye level.
- lens: 50mm.
- camera movement: Slow pan following head turn.
- subject: Turns head from profile to face camera.
- environment: Dark studio background.
- lighting: Dramatic hard directional light, sharp diagonal shadow across face.
- speech: Speaker A, VO. "Everything you're seeing was created with AI, no camera, no studio."

[00:10-00:12]
- framing: CU, tight on mouth.
- lens: 100mm macro feel.
- camera movement: Static.
- subject: Applying pink lip balm to lips, eyes looking slightly down.
- environment: Neutral background.
- lighting: Bright, even beauty lighting.
- speech: Speaker A, VO. "Just one image and a few prompts."

[00:13-00:15]
- framing: WS, full body.
- lens: 35mm.
- camera movement: Static.
- subject: Sitting on floor, one leg bent, wearing black trousers with the white tank top.
- environment: Grey studio floor and wall.
- lighting: Soft overhead lighting.
- speech: Speaker A, VO. "Every reflection, every highlight, every detail was generated in seconds."

[00:16-00:18]
- framing: CU.
- lens: 85mm.
- camera movement: Slight push-in.
- subject: Touching neck and jawline with both hands.
- environment: Dark background.
- lighting: Warm rim light, deep shadows.
- speech: Speaker A, VO. "Real product, unreal possibilities."

[00:19-00:21]
- framing: MS.
- lens: 50mm.
- camera movement: Static.
- subject: Sitting on a metal stool, leaning forward, speaking to camera.
- environment: Neutral studio background.
- lighting: Neutral studio lighting, slight vignette.
- speech: Speaker A, on-camera. "You don't need a full setup anymore." High lip-sync strictness.

[00:22-00:24]
- framing: MS, slight low angle.
- lens: 50mm.
- camera movement: Static.
- subject: Arms raised, holding hair up in a high ponytail.
- environment: White studio background.
- lighting: Bright, high-key lighting.
- speech: Speaker A, VO. "Just imagination."

[00:25-00:27]
- framing: CU.
- lens: 85mm.
- camera movement: Static.
- subject: Looking intensely at camera, hair blowing.
- environment: Dark background.
- lighting: Soft dramatic lighting.
- motion cues: Wind blowing hair.
- speech: Speaker A, VO. "Comment guide to learn how."

C) STYLE BIBLE
- visual_style: Photorealistic cinematic commercial beauty portrait.
- camera_signature: 85mm portrait lens dominance, shallow depth of field, mostly static or slow, deliberate movements.
- lighting_signature: Highly variable but always professional studio quality, ranging from soft high-key beauty to dramatic low-key hard shadows.
- grade_signature: High contrast, natural skin tones, deep blacks, clean whites.
- texture_signature: Flawless skin detail, sharp focus on eyes and product.
- pacing_signature: Fast-paced cuts every 2-3 seconds.
- speech_style: Commercial beauty VO, confident, direct-to-camera hybrid.
- speaker_profile: Female, warm, articulate, modern vocal fry.
- mic_mix_profile: Close-mic, dry studio, high clarity, compressed for social media.

D) PROMPT SYNTHESIS

1. MASTER PROMPT
GLOBAL LOCK: Photorealistic cinematic commercial style. Subject: Asian woman, mid-20s, flawless glowing skin, dark brown hair, wearing a fitted white ribbed sleeveless turtleneck tank top, small silver hoop earrings. Environment: Minimalist studio setting with solid neutral backgrounds (white/grey/black). Lighting: High-end beauty lighting, varying from soft diffused to dramatic hard shadows. Camera: 85mm lens, shallow depth of field. Speech: Single female speaker, warm commercial tone, close-mic studio sound.

[00:00-00:03] ECU of the woman's face against a dark background. Soft beauty lighting. She is looking directly at the lens, speaking. Lips are moving in sync with speech.
[00:03-00:05] CU. The woman holds a pink lip balm tube next to her cheek. Soft diffused lighting. She looks at the camera. Slight camera drift.
[00:06-00:09] MS. The woman is turned slightly away in profile, then turns her head towards the camera. Dramatic lighting with a harsh diagonal shadow cutting across her face. Slow pan following the head turn.
[00:10-00:12] CU tight on the mouth. The woman is applying the pink lip balm to her lips. Eyes looking slightly down. Bright, even beauty lighting highlighting skin texture.
[00:13-00:15] WS. The woman is sitting on the floor, wearing black trousers with the white tank top. One leg bent. Grey studio background. Soft overhead lighting. Static camera.
[00:16-00:18] CU. The woman touches her neck and jawline with both hands. Warm, glowing rim light, deep shadows on the opposite side. Slight camera push-in.
[00:19-00:21] MS. The woman is sitting on a metal stool, leaning forward slightly, speaking directly to the camera. Lips moving in sync. Neutral studio lighting, slight vignette. Static camera.
[00:22-00:24] MS, slight low angle. The woman has her arms raised, holding her hair up in a high ponytail. Bright, high-key lighting, white background. Static camera.
[00:25-00:27] CU. The woman's hair is blowing in the wind. She looks intensely at the camera. Soft dramatic lighting, dark background. Static camera.

2. NEGATIVE PROMPT
Visuals: cartoon, illustration, anime, 3d render, deformed anatomy, extra fingers, mutated hands, unnatural skin texture, plastic skin, temporal jitter, flickering lighting, morphing objects, text, watermarks, logos, low resolution, blurry, out of focus.
Audio: robotic voice, unnatural cadence, harsh sibilance, plosives, clipping, background noise, room echo, lip-sync mismatch, slurred words.

4. SPEECH PACK
Speaker: Female, 20s, warm, confident, commercial beauty tone.
[00:00-00:03] "What if I told you... I'm not even real." (Pause for dramatic effect, direct eye contact).
[00:03-00:05] "But the product I'm holding... is Hailey Bieber's Rhode lip balm." (Slight emphasis on 'Rhode').
[00:06-00:09] "Everything you're seeing was created with AI... no camera... no studio." (Paced, emphasizing the negatives).
[00:10-00:12] "Just one image... and a few prompts." (Smooth, instructional tone).
[00:13-00:15] "Every reflection... every highlight... every detail... was generated in seconds." (Staccato emphasis on 'every').
[00:16-00:18] "Real product... unreal possibilities." (Contrast emphasis).
[00:19-00:21] "You don't need a full setup anymore." (Direct, conversational).
[00:22-00:24] "Just imagination." (Soft, aspirational).
[00:25-00:27] "Comment guide... to learn how." (Clear CTA, energetic).
Video

Vertical AI fashion-tech tutorial showing how to generate virtual try-on and outfit variations from a single street-style image. The video opens with a woman standing in a sunlit city street lined with yellow taxis and brick buildings, framed like a fashion campaign shot. Her original look is then transformed into multiple outfit variations, including a structured pale blue blazer dress, a dramatic deep-blue gown, a dark tailored look, and a short blue dress with bright shoes. The visual contrast between these versions immediately establishes the core promise: one subject, many AI-generated fashion outcomes.

The middle of the video moves into a software walkthrough. We see desktop interface screens with product selection, camera angle choices, photography style references, model or garment grids, and form fields for configuration. The creator clicks through options like selecting products, choosing visual style, adjusting photo settings, and generating new outfit versions. The tutorial keeps alternating between the interface and the finished fashion outputs, making it easy to connect each UI step to the visual change in the model image.

Later sections show the generated results in a side-by-side or sequential way, emphasizing how the same woman can be restyled into multiple commercial-ready looks. The workflow expands into broader catalog-style selection views and a final β€œCreate Your Video” screen, suggesting this tool can turn fashion product choices into dynamic visual content. Overall, the clip should feel like a clean AI styling and virtual fashion try-on tutorial, blending product UI, model transformations, street-style fashion imagery, and creator-driven instructional pacing.
Video
GLOBAL LOCK:
Subject is a fit Caucasian woman in her mid-20s, blonde hair tied in a high sleek ponytail, wearing a white zip-up athletic jacket. Skin tone is fair with warm undertones. Environment is a bright, high-key fitness studio with visible softbox lights and industrial windows. Lighting is soft, professional studio lighting. Color grade is clean, bright, high-contrast with vibrant whites. Speech is energetic, direct-to-camera UGC style. Mic signature is clean, close-proximity studio sound.

[00:00–00:04]
The subject is in a medium close-up, facing the camera. She holds a silver "GoodDay" sports drink can in her right hand at chest level. She is speaking directly to the camera with an energetic and friendly expression. Her head tilts slightly as she talks. Background shows a blurred studio setting with soft lighting.
Speech: "After training this hard, I need to recharge."
Lip-sync: High strictness required.

[00:04–00:06]
The subject continues speaking, gesturing slightly with her left hand while holding the can steady. She maintains a bright smile and high energy. The camera remains static in a medium close-up.
Speech: "That's why I grab my sports drink. What about you?"
Lip-sync: High strictness required.

[00:06–00:08]
The subject asks the final question and then pushes the silver can directly toward the camera lens, transitioning from a medium close-up to a close-up of the product. The motion is smooth and deliberate. The video ends with the can dominating the frame.
Speech: "Ready to feel unstoppable?"
Action: Forward push of the can toward the lens.
Lip-sync: High strictness on the final words.

NEGATIVE PROMPT:
Visual: blurry textures, distorted fingers, flickering background, inconsistent hair strands, unnatural eye movement, product label warping, low resolution, muddy colors, robotic limb movement.
Speech: robotic monotone, muffled audio, background hiss, lip-sync delay, unnatural pauses, slurred consonants, clipping audio.

SPEECH PACK:
Transcript:
00:00-00:02: "After training this hard,"
00:02-00:04: "I need to recharge."
00:04-00:06: "That's why I grab my sports drink. What about you?"
00:06-00:08: "Ready to feel unstoppable?"

Delivery Takes:
TAKE_A (Energetic): High pitch, fast pace, emphasis on "hard" and "unstoppable."
TAKE_B (Relatable): Medium pace, warm tone, slight breathiness after "hard."
TAKE_C (Authoritative): Crisp enunciation, steady pace, punchy delivery on "recharge."

Prosody Markup:
"After training THIS hard [pause], I need to RECHARGE. [pause] That's why I grab MY sports drink. What about YOU? [pause] Ready to feel UNSTOPPABLE?"
Video
Tim Koda

MASTER PROMPT
GLOBAL LOCK: Vertical 9:16 creator workflow reel showing how a low-quality phone product photo becomes a premium editorial brand campaign. The piece combines direct-to-camera creator explanation, smartphone screen inserts, dark AI prompt interfaces, polished fragrance and beauty packshots, high-fashion poster layouts, icy blue lighting setups, red background title cards, and glossy close-up beauty shots. Maintain a commercial luxury aesthetic with clean graphic design, sharp product isolation, premium reflections, and fast but readable tutorial pacing. One male creator speaks with confident agency-style cadence, close mic, dry audio, and repeated CTA emphasis on the keyword SNAP.

[00:00-00:05] Open on the creator holding up a poor-quality phone photo of a product while bold on-screen text frames the client problem. Quickly cut to the original image on a smartphone screen and the raw product reference. The creator states the hook: turning one bad iPhone product shot into a full brand campaign.

[00:05-00:10] Show the source product clearly, including dark fragrance bottle imagery and rough input materials. Keep the pace quick and problem-solution oriented. The host explains that the workflow starts with product extraction and building a 2x2 reference grid.

[00:10-00:17] Move into dark AI workflow screens with prompt boxes, image tiles, and reference inputs. The product appears in multiple isolated views while the creator describes feeding references into an LLM to generate several custom prompts. Keep interfaces crisp, black or charcoal, with white type and subtle UI highlights.

[00:17-00:25] Transition into generated campaign outputs. Show premium editorial product renders: dramatic blue-light bottle shots, luxury tabletop scenes, stylized poster frames, and fashion-adjacent compositions. The visual language should alternate between clean packshot precision and moody brand storytelling.

[00:25-00:32] Display a grid or carousel of multiple campaign variants, including print-poster style layouts, branded title cards, and comparative presentation boards. The creator frames this as a scalable shoot process that can create multiple deliverables from one starting photo.

[00:32-00:37] Show high-end beauty close-ups with glossy lips and refined skin detail, suggesting companion campaign imagery beyond the product packshot itself. The grade stays polished, magazine-like, and editorial.

[00:37-00:41] End on a clean CTA beat with a minimal branded frame or title card. The creator closes by telling viewers to comment SNAP to get the full creative shoot process.

NEGATIVE PROMPT
Avoid cheap e-commerce lighting, flat product cutouts, muddy reflections, fake luxury materials, unreadable prompt UI, weak poster typography, inconsistent bottle shape, warped labels, plastic skin on beauty close-ups, noisy shadows, and robotic narration. Keep every asset premium and campaign-ready.

SPEECH PACK
[00:00-00:05]
Closest audible: Comment SNAP to get the full creative shoot process.
Safe paraphrase: Open with a keyword CTA tied to the full workflow.

[00:05-00:17]
Closest audible: Your client sends a trash iPhone photo and expects a full brand campaign, and here is the workflow.
Safe paraphrase: He frames the challenge and explains the early extraction and prompt-building steps.

[00:17-00:32]
Closest audible: Product extraction, 2x2 grid, LLM prompts, then Nano Banana or Flux, then upscale and Lightroom finish.
Safe paraphrase: He walks through the generation and finishing stack that turns one input image into multiple outputs.

[00:32-00:41]
Closest audible: Table top, on figure, lifestyle, print poster, one photo, one workflow; comment SNAP.
Safe paraphrase: Close by emphasizing output variety and repeating the CTA.
Video
GLOBAL LOCK: vertical 9:16 paid-social style SaaS promo video for an AI image and video generation platform aimed at marketers and brand teams. The visual structure alternates between a dark premium product UI and a talking-head founder/creator picture-in-picture window. The presenter is a young adult man with medium-length dark hair, short beard, baseball cap, dark jacket with contrast stitching, and a black microphone visible in front of him. He speaks directly to camera from a home-office setup while the main screen above and behind him demonstrates the product. The interface should feel modern, black-background, neon-accent, high-contrast, and polished, with glowing borders around product cards, upload panels, and preview windows. Branding should prominently feature the name β€œVERV.”

The product story is: marketers can generate ad-ready images and videos from product references. Show a sequence of examples: selecting a product, generating polished model photography, lifestyle placements, beverage shots, packaging concepts, candy/snack ads, and then turning an image into a short video through a guided interface. The emotional tone is practical, high-conviction, performance-marketing oriented, and aspirational. This is not a generic explainer deck. It should feel like a creator selling a real tool with fast visual proof.

[00:00-00:05] Open on a dark app interface with a glowing product-selection panel. A talking-head box with rounded corners sits near the lower portion of frame showing the presenter speaking and gesturing naturally. The UI highlights a simple white 3D mannequin or product placeholder inside a card labeled β€œProduct.” Cursor movement or selection states should suggest the user is choosing a base object to build content from.

[00:05-00:10] Transition to generated campaign imagery. Show fashion-model style results and clean lifestyle scenes built around the product: a white mannequin in a bright editorial room, then a woman standing at a doorway with the same mannequin or product-themed concept. The presenter remains in the lower picture-in-picture, continuing to explain the platform's use for marketing creatives.

[00:10-00:16] Cut to another example set: a colorful lifestyle frame with a subject near a washing machine or rooftop setup, followed by product packaging or bottle mockups. The interface intermittently returns to β€œProduct” cards and clean selection screens so the audience understands this is driven by software, not just a montage of random ads.

[00:16-00:24] Show beauty and beverage examples. A glamorous female model holds a sleek metallic bottle in a polished campaign shot. Another scene shows a bottle alone in an atmospheric setting. Keep these examples bright, premium, and commercially usable. The presenter window continues reacting and talking while the main visual area displays case-study style outputs.

[00:24-00:32] Shift into food and snack ad territory. Show a woman eating or posing with a chocolate or candy bar in bold studio colors. Return briefly to the platform UI where a product image card is highlighted and the software appears to generate matching branded creative variations. Keep the cuts fast and made for ad-world attention spans.

[00:32-00:40] Emphasize the software workflow. A large VERV screen shows a multi-panel dashboard and then a specific feature titled β€œTurn image into video.” The interface includes an original image preview and a prompt box with descriptive text, for example a person holding a bottle at an athletics track. The presenter remains visible below, explaining the workflow like a founder demoing a new release.

[00:40-00:45] End on branded product UI and a strong software payoff: VERV logo over a gradient or dark dashboard background, plus screens showing generated outputs and controls for creative generation. The feeling should be that this tool bridges AI image generation, ad creative production, and image-to-video in one polished marketing stack.

VISUAL DNA:
- Dark premium SaaS interface with black panels, glowing neon edges, gradient accents, and rounded cards.
- Persistent creator/founder talking-head picture-in-picture near the bottom of frame.
- Product cards labeled β€œProduct,” upload areas, prompts, dashboards, and output galleries.
- Generated ad creative spanning fashion, beauty, beverage, lifestyle, and snack categories.
- Strong β€œVERV” branding integrated into multiple scenes.

CAMERA AND EDITING LOCK:
- Fast social-ad pacing with clear section changes every few seconds.
- Mix of screen-recording style UI shots and static talking-head reactions.
- No cinematic handheld realism; this is polished app-demo editing.
- Keep the presenter's face small enough to support the product visuals, not dominate them.

NEGATIVE PROMPT: generic corporate slideshow, boring webinar screen share, plain white software UI, no presenter, no branding, random stock footage with no app context, chaotic meme editing, gaming UI, crypto dashboard, coding IDE, horror tone, political content, travel vlog, subtitles burned in, low-quality screen capture, messy desktop clutter, unrelated ecommerce website.

SHOT PROMPTS:
SHOT 1: dark VERV interface with glowing product card and founder talking-head inset.
SHOT 2: generated fashion and lifestyle ad visuals switching above the presenter's commentary window.
SHOT 3: premium bottle and beauty campaign images created from product references.
SHOT 4: snack and consumer-goods creative examples with quick app returns.
SHOT 5: β€œTurn image into video” workflow screen with prompt field and original image preview.
SHOT 6: final VERV-branded dashboard and outputs montage.

SPEECH PACK:
[00:00-00:45] Conversational founder-style ad read, confident and concise, explaining that VERV helps marketers generate high-quality AI images and videos for brand content, product ads, and creative testing. Natural spoken delivery, no robotic narration.
Video
GLOBAL LOCK: 
Subject is a Caucasian male in his mid-30s with a well-groomed brown beard and medium-length wavy brown hair. He consistently wears a white and olive-green "VANS" trucker hat and a plain, high-quality white crew-neck t-shirt. The environment for the creator's shots is a warm, indoor setting with soft ambient lighting and a neutral, slightly out-of-focus background. The AI-generated content features a cinematic, high-contrast aesthetic with vibrant colors (primarily deep reds and blacks). The speech is energetic, clear, and direct-to-camera, delivered with a "tech-enthusiast" persona.

[00:00–00:05]
Visual: A cinematic, deep red Porsche 911 is shown from multiple angles: top-down, rear view, and 3/4 side profile. The car has a metallic finish and is set against a dark, moody red background with dramatic studio lighting. Text overlay reads "Multiview Perspective Change."
Subject: The creator appears in a small, rounded-square overlay at the bottom center, pointing upwards with both index fingers.
Camera: Smooth transitions between static product shots.
Speech: "This genuinely feels like a cheat code to create high-quality AI visuals for your brand or business."
Sync: Cut to the next shot on the word "business."

[00:05–00:19]
Visual: A rapid-fire montage of the creator's face swapped into various AI-generated scenes: 
1. A close-up of the VANS hat.
2. A model holding a smartphone.
3. A bold fisheye portrait wearing colorful puffer jackets and sunglasses.
4. An "Indie Garden Polaroid" shot with sunflowers and a guitar.
5. A "Halloween Party" shot of the creator in a yellow duck costume holding a red cup.
6. An "Urban Glare Portrait" in a city street.
Subject: Creator remains in the bottom overlay, gesturing with his hands as if explaining the variety.
Motion: Fast cuts (approx. 1-2 seconds each) with slight zoom-ins.
Speech: "This is called Blueprints, and it allows you to create multiple angled shots of any scene. You can upload product reference images and you can even replicate certain styles of images with a simple VFX template they've created for you."

[00:20–00:35]
Visual: Screen recording of the Leonardo.ai interface. The cursor moves to the left sidebar, hovering over and clicking the "Blueprints (Beta)" button highlighted with a red box. It then scrolls through a gallery of templates, selecting "Product Studio Photoshoot."
Subject: Creator in the overlay, looking slightly off-camera as if watching the screen, pointing to the UI elements.
Speech: "All you have to do is upload an image of yourself, and here's how to do it. To get started on Leonardo, you can go to the Blueprints section, and they have all of these different templates."

[00:36–00:45]
Visual: The UI shows the "Upload Person Photo" step. A photo of the creator in his white t-shirt and VANS hat is uploaded. Then, a "Product Photo" of a black smartphone is uploaded. The "Generate" button is clicked. The result shows the creator holding the phone in a professional studio setting.
Subject: Creator in the overlay, nodding and smiling as the result is revealed.
Speech: "You can then select one you want and upload a reference image of your face, for example, and then hit next. Now you can upload a reference image of a product, and then boom! You can actually create images of you holding the product in that environment."

[00:46–00:51]
Visual: The UI shows a "Multiview Perspective Change" generation of the creator sitting on a park bench from different angles (back view, side view, top-down). The video ends with the creator full-screen (or large overlay) against a dark background with the text "TYPE AI COMMENTS."
Subject: The creator winks at the camera and points forward.
Speech: "But it gets crazier because you can use different templates like multiview perspective... if you want to try it out for yourself, type AI in the comments and I'll send you the link."
Sync: Final wink lands exactly on the last word.

NEGATIVE PROMPT:
Visual: blurry face, inconsistent beard length, distorted VANS logo, extra fingers, flickering background, low-resolution UI, robotic body movements, unnatural skin texture, messy hair transitions.
Speech: monotone delivery, background noise, muffled audio, robotic cadence, misaligned lip-sync, harsh "S" sounds, long pauses between sentences.

SPEECH PACK:
[00:00-00:05]
Transcript: "This genuinely feels like a cheat code to create high-quality AI visuals for your brand or business."
TAKE_A: (Energetic, emphasizing "cheat code" and "business")
TAKE_B: (Fast-paced, breathless excitement)
TAKE_C: (Confident, authoritative tone)

[00:46-00:51]
Transcript: "If you want to try it out for yourself, type AI in the comments and I'll send you the link."
TAKE_A: (Friendly, inviting, with a wink at the end)
TAKE_B: (Direct, urgent, pointing at the camera)
TAKE_C: (Casual, "by the way" style delivery)
Video

MASTER PROMPT
GLOBAL LOCK: Vertical 9:16 creator-style AI image generation tutorial reel. Keep the visual structure consistent: dark background, stacked demo windows, rounded-corner presenter overlay near the lower half, and product screenshots or generated outputs occupying the upper area. The presenter is a bearded man in a beige baseball cap and brown hoodie speaking directly to camera with expressive hand gestures. The tutorial should open with a polished luxury ad-style image, then transition into a dark Generate Image interface with prompt and reference controls, and finish with generated lifestyle portraits and result examples. Preserve fast creator-educator pacing, practical workflow clarity, and social-media-friendly text hierarchy.

[00:00-00:10.00] Open with a strong proof-first visual: a luxury perfume bottle ad image against a rich purple satin-like backdrop. Place the presenter in a rounded picture-in-picture window at the bottom, speaking energetically to camera. The hook should feel like, "here is the kind of polished ad-style result you can create," with the upper image doing most of the persuasive work.

[00:10.00-00:28.00] Shift into the process section. Show a dark image-generation interface labeled around concepts like Generate Image, prompt box, reference styles, remix, auto prompt, or similar controls. Keep the presenter visible in the lower area while he explains how the workflow works. Include reference image boards, prompt panels, or app modules that make the system feel practical and reproducible.

[00:28.00-00:48.92] Move into the results and proof section. Show polished generated portraits or fashion-style outputs, app previews, and example result screens, including a casually dressed bearded man in a city street portrait. The presenter continues narrating while the upper content cycles through outputs, reinforcing that the workflow produces believable, commercially useful visuals. End on the strongest lifestyle result.

NEGATIVE PROMPT
Avoid cluttered multi-window chaos, unreadable UI, generic office stock footage, weak hook visuals, random unrelated outputs, corporate webinar styling, tiny text, dark muddy colors, or a tutorial sequence that explains too much before showing a compelling result.

SHOT PROMPTS
[00:00-00:10.00] Luxury perfume ad visual with presenter overlay.
[00:10.00-00:28.00] Dark Generate Image UI, prompt controls, reference boards, presenter explanation.
[00:28.00-00:48.92] Generated lifestyle portraits and result previews with presenter continuing narration.

SPEECH PACK
Timecoded transcript:
[00:00-00:48.92] Single-speaker tutorial explaining an AI image-generation workflow from polished ad example to interface steps to final outputs. Exact wording unclear; preserve concise creator-teacher delivery.

TAKE_A
[00:00-00:48.92] Fast creator-demo explanation with proof-first opening and simple step-by-step UI walkthrough.

TAKE_B
[00:00-00:48.92] Calm but confident tutorial tone emphasizing how to get polished commercial-looking results.

TAKE_C
[00:00-00:48.92] Slightly more enthusiastic creator cadence focused on workflow usefulness and output quality.
Video
GLOBAL LOCK: vertical 9:16 creator-tutorial video about making animated 3D poster designs with AI inside Lovart. The format combines a creator talking-head inset with a large design-canvas interface showing poster layouts, object compositions, typography placement, and brand-style concept iterations. The presenter is a young man in a blue baseball cap with yellow detail, neutral-colored t-shirt, and studio lighting, speaking directly to camera from a creator setup. His inset remains near the lower part of the frame while the main screen above demonstrates the design process.

The content should show multiple high-impact poster examples on a clean white design workspace: a sporty Wilson tennis poster with a player and oversized racket, a luxury-style watch poster inspired by Rolex, tech-product hand-and-watch layouts inspired by Apple and Casio, skate-fashion and shoe posters inspired by Vans β€œOff The Wall,” and bold graphic poster compositions with brand-like typography, textured backgrounds, and floating 3D product placements. The interface should feel like a modern AI design tool with draggable elements, scaling handles, layer boxes, and editable text areas. The emotional tone is practical, design-forward, visually trendy, and aimed at creators, brands, and marketers who want poster-level visuals without complex 3D software.

[00:00-00:07] Open on a clean white design canvas showing a bold AI-generated poster mockup. Examples include a yellow/blue sports-fashion poster and a Wilson-themed tennis composition with oversized typography and a player holding a racket. The creator appears in a small lower inset explaining that Lovart can be used to create animated 3D posters without traditional complex 3D workflows.

[00:07-00:15] Move through several poster variations on the canvas. Show a seated figure in a fashion-style poster, a tennis player composition with giant racket framing, and editable layout elements with bounding boxes or control handles around the artwork. The interface should make it obvious that these are being designed inside one platform rather than assembled manually across multiple apps.

[00:15-00:24] Shift to luxury and product-poster examples. Show close-up hand shots wearing watches against bold green, red, and beige background blocks, evoking Rolex, Apple, and Casio-inspired poster layouts. The creator continues talking while the main design workspace cycles through these premium, brand-like concepts.

[00:24-00:34] Transition into sneaker and skate posters. Feature a black sneaker in dynamic perspective over checkerboard or gradient backgrounds, large β€œOff The Wall” / Vans-like typography, and poster-style compositions that feel slightly three-dimensional or layered. Include multiple variations to emphasize iteration and creative control.

[00:34-00:45] End on the strongest Lovart workflow payoff: several poster versions shown in sequence with editable text fields, layer guides, and template-style composition zones. The message should be clear that AI can accelerate poster ideation, visual styling, and animated design output for brands and creators, all within a single polished design environment.

VISUAL DNA:
- Clean white or light design-canvas UI with editable poster boards.
- Creator talking-head inset at the bottom.
- Brand-inspired poster concepts across sports, watches, tech, and skate culture.
- Bold typography, oversized product framing, layered image cutouts, and 3D-ish design depth.
- Lovart branding and AI design-tool positioning.

STYLE LOCK:
- Creator tutorial and software demo, not generic inspiration slideshow.
- Graphic-design-first visuals with polished poster aesthetics.
- Multiple case-study style poster examples to prove breadth.
- Modern social design energy with premium art-direction sensibility.

NEGATIVE PROMPT: dark coding dashboard, plain webinar slideshow, no creator inset, generic stock ad montage, no editable canvas, no poster composition tools, boring corporate presentation, random unrelated brand assets, political content, horror visuals, sports broadcast footage, no typography, low-quality Canva template feel, subtitles burned in, fully finished commercial with no design-process context.

SHOT PROMPTS:
SHOT 1: Lovart design canvas with sports-fashion and Wilson-style poster examples plus creator inset.
SHOT 2: editable layout handles around poster boards on a clean white workspace.
SHOT 3: luxury watch posters with hand close-ups and bold background color fields.
SHOT 4: Vans-style skate sneaker posters with β€œOff The Wall” graphic composition energy.
SHOT 5: final multi-poster workflow view showing text edits and AI design control.

SPEECH PACK:
[00:00-00:45] Natural creator tutorial commentary explaining how to use Lovart's AI-powered design tools to create animated 3D posters quickly for brands, designers, and content creators.
Video
GLOBAL LOCK: vertical 3:4 static concept-marketing post showing a product mockup visualization in a gloomy downtown city. Main image is a tall brutalist concrete tower on a rainy overcast street corner, with buses, taxis, pedestrians, and wet crosswalks below. A giant jagged rupture tears open the building facade, and from the cavity a cyberpunk female-presenting character emerges mid-climb: short platinum hair, dark respirator mask, black futuristic tactical jacket with metallic armored shoulders and forearms, dark cargo-style pants, hands gripping the broken concrete edges. Preserve large white headline at top reading 'Mockup visual', a multi-line explanatory paragraph about using generated assets to build product mockups, translucent oversized lower text reading 'PHASE 3: VISUALIZATION', and a small 'SWIPE' cue in the lower right. Mood is dystopian, premium, presentation-ready, like a campaign concept board rather than a narrative scene.
[00:00-00:11] Hold on the same hero mockup frame the entire time with only subtle compression shimmer or micro-movement from the social post video export. The broken skyscraper mural-like visual dominates center frame, stormy sky remains dark grey, street traffic and pedestrians sit small at the bottom for scale, and all white overlay text stays fixed and readable throughout, emphasizing the message that AI-generated assets help clients visualize the campaign before production.
Video
Rourke Sefton-Minns

GOAL
Maximize similarity to the reference tutorial-style creator video by preserving the same split layout: a host speaking to camera in the lower portion while a white-background AI design-agent workspace occupies the upper portion and demonstrates how to create animated imagery and poster-style visuals. Prioritize the monochrome floral poster examples, the clean UI with left-side tool icons and right-side chat or prompt panel, the host's black cap and dark sweatshirt, the desk microphone in the foreground, and the later transition into product-style mockups and a CTA asking viewers to comment β€œAI.” The clip should feel like a practical workflow tutorial for creators, not a pure product announcement.

MISE EN PLACE
- Format: 9:16 vertical tutorial explainer, approximately 29.74 seconds.
- Presenter lock:
  - Young adult white male creator with medium-length dark hair, short beard, wearing a black baseball cap and dark navy or black sweatshirt.
  - He speaks directly to camera from a minimal creator setup with a large black desk microphone visible at lower right or center foreground.
- Interface / demo lock:
  - Upper portion shows a mostly white design workspace with a left vertical toolbar, floating canvas area, and a right-side prompt/chat panel.
  - Early examples focus on high-contrast black-and-white floral poster design with large typography such as β€œDESIGN” and β€œAI.”
  - Later examples show prompt text, generated poster refinements, and animated or product-style compositions such as orange headphones or a phone emerging from clouds.
  - Closing CTA visually includes β€œComment β€˜AI’”.
- Scene segmentation:
  - Shot 1: 00:00-00:07.00, hook with monochrome floral poster visuals and the host introducing animated imagery with AI design agents.
  - Shot 2: 00:07.00-00:15.00, clearer UI walkthrough showing prompt panel and generated flower poster iterations.
  - Shot 3: 00:15.00-00:23.00, expanded design workflow with poster composition and product-mockup examples.
  - Shot 4: 00:23.00-00:29.74, wrap-up and CTA with final animated visual and β€œComment β€˜AI’” prompt.
- Key visual evidence:
  - Bold black typography on white poster layouts.
  - Metallic or reflective flower forms and grayscale floral framing.
  - Right-side text prompt area with generated-result thumbnails.
  - Left-side vertical UI icons suggesting a design canvas workflow.
  - Orange product visuals later in the clip, especially headphones floating in clouds and phone imagery.
- Speech evidence inferred from caption and visuals:
  - Main topic: how to create animated imagery with AI design agents.
  - Delivery: instructional, fast, creator-friendly, focused on practical workflow.
- LOCK THESE invariants:
  - host-on-bottom + UI-on-top format, floral poster tutorial, white interface, black cap and dark sweatshirt host, microphone visible, CTA comment mechanic.
- Variables that may drift slightly:
  - exact host gestures, exact arrangement of the generated poster variations, specific cursor or panel microstates.

SHOTLIST
Shot 1
- shot_id: 1
- timecode_start: 00:00.00
- timecode_end: 00:07.00
- duration: 7.0s
- framing: split-screen with host medium close at bottom and large poster examples above.
- lens: 50mm equivalent creator lens on host.
- camera movement: host static, upper visuals cut or animate through poster variants.
- subject: host points upward while black-and-white floral poster layouts with β€œDESIGN” and β€œAI” cycle above him.
- environment: minimal creator room or neutral background, microphone in foreground.
- lighting: soft frontal key on host, bright flat white UI background above.
- speech/audio: direct instructional hook about using AI design agents.
- must match: editorial black-and-white flower aesthetic.

Shot 2
- shot_id: 2
- timecode_start: 00:07.00
- timecode_end: 00:15.00
- duration: 8.0s
- framing: same split layout, more interface-visible.
- lens: same host framing.
- subject: upper panel shows a prompt/chat panel and canvas with glowing flower or poster iterations while the host explains the workflow steps.
- environment: white workspace with left toolbar icons and right-side conversation panel.
- lighting: consistent soft host light, bright UI above.
- motion cues: UI transitions, generated image swaps, subtle host hand motions.
- speech/audio: concise explanation of generating and iterating poster concepts.
- must match: actual design-agent workflow feel, not just static images.

Shot 3
- shot_id: 3
- timecode_start: 00:15.00
- timecode_end: 00:23.00
- duration: 8.0s
- framing: upper panel rotates into more applied examples.
- lens: same host lens.
- subject: product-style poster compositions appear, including orange headphones and a smartphone-like device in cloud-themed layouts, while the host explains expanding the same workflow into animated imagery or campaign-ready assets.
- environment: same interface structure persists with generated thumbnails and poster artboards.
- lighting: bright clean digital design surface above, natural creator lighting below.
- speech/audio: host discusses moving from static visuals into more polished animated outputs.
- must match: product-marketing style examples and clean white workspace.

Shot 4
- shot_id: 4
- timecode_start: 00:23.00
- timecode_end: 00:29.74
- duration: 6.74s
- framing: final split-screen CTA.
- lens: same host framing.
- subject: upper panel lands on an orange headphone-in-cloud visual with text prompt β€œComment β€˜AI’” while the host closes with a direct call to action.
- environment: same clean white interface or finished hero visual.
- lighting: consistent, simple, social-native.
- speech/audio: closing CTA and final takeaway.
- must match: comment-driven CTA ending.

STYLE BIBLE
- visual_style: creator workflow tutorial, clean design-tech explainer, editorial AI poster demo.
- camera_signature: mostly static host frame with UI-driven motion above.
- lighting_signature: soft creator key light below, bright white interface above.
- grade_signature: minimal white-gray-black palette early, then selective warm orange product accents later.
- texture_signature: crisp UI, sharp typography, reflective grayscale flowers, polished mockup visuals.
- pacing_signature: hook with pretty output, reveal the workflow, broaden use cases, close with CTA.
- speech_style: practical, creator-to-creator teaching tone.
- mic_mix_profile: clean close-mic voice, low ambient room noise, light tutorial pacing.

MASTER PROMPT
GLOBAL LOCK: Keep a vertical split-screen tutorial video. The host stays visible in the lower section for most of the runtime, wearing a black cap and dark sweatshirt, speaking directly to camera with a large black microphone visible beside him. The upper section shows a bright white AI design-agent interface and generated poster outputs. The overall feel should be modern creative-software education for social media, with sharp typography, polished poster visuals, and a practical workflow tone. No subtitles are necessary in the prompt body, but the showcased design outputs may contain text that is visibly part of the examples.

[00:00-00:07.00] Start with large monochrome poster examples in the upper section: grayscale flowers, floral wreaths, metallic petals, and bold text like β€œDESIGN” and β€œAI.” Below, the host points and explains that he is about to show how to create animated imagery with AI design agents. Keep his expression engaged and instructive.

[00:07.00-00:15.00] Make the upper section show more of the interface itself: a left toolbar, a canvas with poster artwork, and a right-side prompt or chat panel generating and refining flower-based poster designs. The host below explains the workflow, likely how he prompts and iterates the output.

[00:15.00-00:23.00] Expand the example set in the upper section from floral editorial posters into product-style designs, including orange headphones and a phone-like object emerging from clouds. The host explains how the same design-agent workflow can be used to make more applied animated imagery and campaign-ready assets.

[00:23.00-00:29.74] Finish with a polished hero-style visual in the upper section, such as orange headphones floating in soft white clouds, plus a clear CTA reading β€œComment β€˜AI’.” The host below closes with a direct final recommendation and invitation to engage.

NEGATIVE PROMPT
Avoid dark moody coding interfaces, messy cluttered rooms, enterprise webinar style, shaky handheld footage, neon gamer lighting, generic stock AI art unrelated to the workflow, overcomplicated UI clutter, overly corporate voice, or cinematic narrative scenes that break the tutorial format.

SHOT PROMPT DELTAS
- Shot 1 delta: emphasize bold monochrome floral poster designs with big typography.
- Shot 2 delta: emphasize the actual design-agent UI with prompt panel and iteration flow.
- Shot 3 delta: emphasize product-style outputs such as orange headphones and phone/cloud visuals.
- Shot 4 delta: emphasize the β€œComment β€˜AI’” CTA and clean final hero image.

SPEECH PACK
- Timecoded speech intent:
  - [00:00-00:07.00] introduce the goal of making animated imagery with AI design agents.
  - [00:07.00-00:15.00] explain the core workflow and prompt-based iteration.
  - [00:15.00-00:23.00] explain how to turn the workflow into more commercial product-style outputs.
  - [00:23.00-00:29.74] deliver the CTA and recap.
- Delivery direction: energetic but instructional, medium-fast pace, clear creator language, lightly persuasive close.
Video
Rourke Sefton-Minns
Core format and topic lock: a vertical creator tutorial about the Lists feature inside Freepik Spaces. The interface is a dark node-based workflow canvas showing a structured AI pipeline that generates multiple outputs from one product concept. The featured example is a Dyson-style cordless vacuum workflow that includes rendered product components, variation grids, modern interior imagery, target demographic portraits, and final product-use advertising scenes. A male presenter with shoulder-length brown hair, beard, cream shirt, and blue cap appears in a webcam box at the bottom, explaining how the workflow scales.

Shot-by-shot reconstruction

0.0s-10.0s
Open on a dramatic product-pipeline example in Freepik Spaces. A Dyson-style vacuum is exploded into separate rendered components and connected through list-style workflow outputs. The presenter appears below, gesturing and reacting to the scale of the setup.

10.0s-22.0s
Show additional outputs created from the structured workflow, including interior lifestyle scenes and product variation boards. Emphasize that the lists feature is producing multiple connected results instead of a single image.

22.0s-40.0s
Zoom out to reveal the wider Freepik Spaces canvas with many modules and linked sections. Then zoom back into important blocks such as β€œ3D renders of product parts” and image generation for target demographic personas. Keep the presenter visible in the lower webcam frame.

40.0s-55.5s
Display demographic portrait outputs and finished product-use visuals, such as the vacuum being used inside a bright kitchen. End with the broader claim that the lists feature enables scalable prompt sequences for design, product, and marketing workflows, plus a CTA inviting viewers to comment β€œAI” for the free workflow.

Visual style
Dark UI creator-tech tutorial, node-based workflow overview, clean screen-recorded interface, talking-head explainer overlay, product-design and marketing example outputs, no cinematic cuts beyond interface navigation.

Motion notes
Motion should come from interface scrolling, zooming across workflow sections, output swaps, and the presenter’s hand gestures. Keep the same Dyson-style product case study and same presenter placement throughout the clip.

Negative prompt
messy interface, unreadable workflow blocks, extra webcam windows, unrelated software, random artwork changes, watermark, subtitles unrelated to tutorial, gaming UI, shaky handheld camera, non-product examples replacing the vacuum case study

Speech pack
English creator narration explaining that Lists in Freepik Spaces can connect prompt sequences to an LLM and generate scalable structured outputs for design systems, product variations, and marketing assets.
Video
Adriana Bubori
GLOBAL LOCK: 
The video features two distinct subjects. Subject 1 is a Caucasian female in her mid-30s with long, straight light-brown hair, wearing a black sleeveless turtleneck top, sitting at a light wood table in a minimalist room with abstract art. Subject 2 is a Black male model in his mid-20s with short, neat braids, an athletic build, wearing a structured brown zip-up jacket and amber-tinted sunglasses with "MEC" branding on the temples. The environment shifts between a minimalist home office and a high-end, sun-drenched luxury studio/penthouse. The lighting is consistently warm, cinematic, and editorial, with soft-focus backgrounds. The color grade is a warm, high-contrast filmic look with rich browns and creams. The camera language uses a mix of static medium shots and dynamic, handheld-style close-ups with shallow depth of field. Speech is a warm, professional female voice with clear articulation.

[00:00–00:04]
Subject 1 (female creator) is centered in a medium shot, speaking directly to the camera with expressive hand gestures. She is in a bright, minimalist room. The lighting is soft natural light from the side. Her expression is welcoming and authoritative.
Speech: "What if I told you that you could turn a simple image of your product..."
Lip-sync: High strictness.

[00:04–00:07]
A fast-paced 4-grid montage. Top-left: Subject 2 (male model) sitting on a tan leather sofa. Top-right: Close-up of the amber sunglasses. Bottom-left: Subject 2 standing in a low-angle shot. Bottom-right: Close-up of Subject 2's face. The lighting is high-end editorial with strong highlights.
Speech: "...into a full set of campaign images, into studio shots..."

[00:07–00:09]
Subject 2 is in a medium close-up, reaching both hands towards the camera lens, creating a sense of depth. The background is a clean, off-white studio wall. He wears the brown jacket and sunglasses.
Speech: "...and even ads with AI?"

[00:09–00:10]
A sharp close-up of Subject 2's face, looking slightly off-camera. The focus is on the sunglasses. The lighting is dramatic, coming from a window.
Speech: "Now let's say you want to..."

[00:10–00:11]
A match-cut transition to a different model: a Caucasian male with wavy blonde hair, wearing the same sunglasses and brown jacket. Same framing and lighting as the previous shot.
Speech: "...change the model, do it."

[00:11–00:12]
Subject 2 is shown in a full-body low-angle shot, wearing the brown jacket and matching trousers, standing against a bright white background.
Speech: "You don't like the clothes..."

[00:12–00:13]
A quick cut where Subject 2's outfit swaps from the brown jacket to a clean white oversized t-shirt. The pose and background remain identical.
Speech: "...swap them."

[00:13–00:14]
Subject 2 is sitting on a tan leather sofa in a medium shot, looking directly at the camera with a neutral, cool expression.
Speech: "You need one more angle..."

[00:14–00:15]
An extreme close-up of Subject 2's hand adjusting the sunglasses on his face. The "MEC" logo on the temple is clearly visible.
Speech: "...easy."

[00:15–00:18]
Close-up of Subject 2 speaking. His lips move in perfect sync with the voiceover. He looks confident.
Speech: "You want to change the voice? Change it."
Lip-sync: High strictness.

[00:18–00:20]
Subject 2 is sitting in a leather armchair next to a large floor-to-ceiling window overlooking a park. The lighting is golden hour, creating long shadows.
Speech: "Creating content for your brand..."

[00:20–00:23]
Final close-up of Subject 2 looking into the camera. A white button with the text "AI" and a cursor clicking it appears over his chest.
Speech: "...has never been easier. Comment AI to learn how."

NEGATIVE PROMPT: 
Visual: blurry faces, inconsistent sunglasses shape, flickering lighting, distorted hands or fingers, floating objects, low resolution, grainy texture, unnatural skin smoothing, robotic movement, text artifacts on clothing.
Speech: robotic monotone, mismatched lip-sync, background noise, muffled audio, unnatural pauses, harsh 's' sounds, clipping audio, inconsistent volume.

SPEECH PACK:
[00:00-00:08]
Transcript: "What if I told you that you could turn a simple image of your product into a full set of campaign images, into studio shots, and even ads with AI?"
TAKE_A: (Curious, rising intonation on "What if I told you", energetic pace)
TAKE_B: (Professional, steady cadence, emphasis on "simple image" and "full set")
TAKE_C: (Slow, dramatic, pausing after "ads" for impact)

[00:08-00:18]
Transcript: "Now let's say you want to change the model, do it. You don't like the clothes, swap them. You need one more angle, easy. You want to change the voice? Change it."
TAKE_A: (Punchy, fast-paced, authoritative "do it" and "swap them")
TAKE_B: (Conversational, light-hearted, shrug-like cadence on "easy")
TAKE_C: (Instructional, clear pauses between each command)

[00:18-00:23]
Transcript: "Creating content for your brand has never been easier. Comment AI to learn how."
TAKE_A: (Warm, smiling tone, direct and clear CTA)
TAKE_B: (Confident, slightly slower pace on "never been easier")
TAKE_C: (Encouraging, upbeat, emphasis on "Comment AI")
Video
Rourke Sefton-Minns
Create a vertical 9:16 minimal premium design-poster visual for an AI creative workflow, featuring a bright yellow tennis ball floating just above an outstretched human hand against a clean blue sky. The hand should rise from the lower portion of the frame wearing a white wristband, with the ball suspended in crisp sunlight so it feels like a polished 3D object hovering in space. Bold yellow Lovart text repeats in the upper left, while repeated Design text appears in the lower right like confident editorial poster typography. The overall result should feel like a high-end animated 3D poster concept for designers: simple, modern, vector-friendly, and easy to manipulate as a motion design asset. No clutter, no subtitles, no extra objects, no cartoon style.

AI Mockup Generator

AI mockup generator content becomes useful when a creator already has a design asset and needs to show how it will look in context. The goal is not abstract image generation. It is realistic presentation. A logo on packaging, artwork on a shirt, or branding on a phone case becomes easier to judge when it appears on something that looks close to a finished product.

That is why the best examples on this page should be evaluated like presentation tools, not just image effects. A strong mockup helps a client, customer, or internal team understand the design faster. When you compare ideas here, focus on whether the preview looks believable, whether the product context supports the brand, and whether the result makes iteration easier before anyone spends money on samples or production.

FAQ

What is an AI mockup generator best for?

It is best for turning flat designs into realistic product previews for apparel, packaging, merchandise, and branded presentation work.

Who usually needs this kind of page?

Designers, e-commerce sellers, brand teams, and freelancers often use mockups to present ideas before producing physical items.

Why not just show the original design file?

A flat file does not show scale, material context, or how the design feels on a real product, which is often what clients and customers need to see.

What should I compare on this page?

Compare realism, product context, and whether each mockup style helps the design feel more convincing in a client or customer setting.

AI Mockup Generator: Product Preview Ideas for Design Work | Alici.AI