AI Interior Design Generator

An AI interior design generator is most useful when it stays close to real room decisions. Most people here want to upload a photo of their current space, try a style direction, and see whether a redesign is worth pursuing before they buy anything. This page helps you compare interior ideas that are practical for real rooms, layouts, and decor choices, rather than just pretty mood images.

Video
GLOBAL LOCK: 
Subject is a Caucasian male in his mid-30s with a well-groomed brown beard and medium-length wavy brown hair. He consistently wears a white and olive-green "VANS" trucker hat and a plain, high-quality white crew-neck t-shirt. The environment for the creator's shots is a warm, indoor setting with soft ambient lighting and a neutral, slightly out-of-focus background. The AI-generated content features a cinematic, high-contrast aesthetic with vibrant colors (primarily deep reds and blacks). The speech is energetic, clear, and direct-to-camera, delivered with a "tech-enthusiast" persona.

[00:00–00:05]
Visual: A cinematic, deep red Porsche 911 is shown from multiple angles: top-down, rear view, and 3/4 side profile. The car has a metallic finish and is set against a dark, moody red background with dramatic studio lighting. Text overlay reads "Multiview Perspective Change."
Subject: The creator appears in a small, rounded-square overlay at the bottom center, pointing upwards with both index fingers.
Camera: Smooth transitions between static product shots.
Speech: "This genuinely feels like a cheat code to create high-quality AI visuals for your brand or business."
Sync: Cut to the next shot on the word "business."

[00:05–00:19]
Visual: A rapid-fire montage of the creator's face swapped into various AI-generated scenes: 
1. A close-up of the VANS hat.
2. A model holding a smartphone.
3. A bold fisheye portrait wearing colorful puffer jackets and sunglasses.
4. An "Indie Garden Polaroid" shot with sunflowers and a guitar.
5. A "Halloween Party" shot of the creator in a yellow duck costume holding a red cup.
6. An "Urban Glare Portrait" in a city street.
Subject: Creator remains in the bottom overlay, gesturing with his hands as if explaining the variety.
Motion: Fast cuts (approx. 1-2 seconds each) with slight zoom-ins.
Speech: "This is called Blueprints, and it allows you to create multiple angled shots of any scene. You can upload product reference images and you can even replicate certain styles of images with a simple VFX template they've created for you."

[00:20–00:35]
Visual: Screen recording of the Leonardo.ai interface. The cursor moves to the left sidebar, hovering over and clicking the "Blueprints (Beta)" button highlighted with a red box. It then scrolls through a gallery of templates, selecting "Product Studio Photoshoot."
Subject: Creator in the overlay, looking slightly off-camera as if watching the screen, pointing to the UI elements.
Speech: "All you have to do is upload an image of yourself, and here's how to do it. To get started on Leonardo, you can go to the Blueprints section, and they have all of these different templates."

[00:36–00:45]
Visual: The UI shows the "Upload Person Photo" step. A photo of the creator in his white t-shirt and VANS hat is uploaded. Then, a "Product Photo" of a black smartphone is uploaded. The "Generate" button is clicked. The result shows the creator holding the phone in a professional studio setting.
Subject: Creator in the overlay, nodding and smiling as the result is revealed.
Speech: "You can then select one you want and upload a reference image of your face, for example, and then hit next. Now you can upload a reference image of a product, and then boom! You can actually create images of you holding the product in that environment."

[00:46–00:51]
Visual: The UI shows a "Multiview Perspective Change" generation of the creator sitting on a park bench from different angles (back view, side view, top-down). The video ends with the creator full-screen (or large overlay) against a dark background with the text "TYPE AI COMMENTS."
Subject: The creator winks at the camera and points forward.
Speech: "But it gets crazier because you can use different templates like multiview perspective... if you want to try it out for yourself, type AI in the comments and I'll send you the link."
Sync: Final wink lands exactly on the last word.

NEGATIVE PROMPT:
Visual: blurry face, inconsistent beard length, distorted VANS logo, extra fingers, flickering background, low-resolution UI, robotic body movements, unnatural skin texture, messy hair transitions.
Speech: monotone delivery, background noise, muffled audio, robotic cadence, misaligned lip-sync, harsh "S" sounds, long pauses between sentences.

SPEECH PACK:
[00:00-00:05]
Transcript: "This genuinely feels like a cheat code to create high-quality AI visuals for your brand or business."
TAKE_A: (Energetic, emphasizing "cheat code" and "business")
TAKE_B: (Fast-paced, breathless excitement)
TAKE_C: (Confident, authoritative tone)

[00:46-00:51]
Transcript: "If you want to try it out for yourself, type AI in the comments and I'll send you the link."
TAKE_A: (Friendly, inviting, with a wink at the end)
TAKE_B: (Direct, urgent, pointing at the camera)
TAKE_C: (Casual, "by the way" style delivery)
Video

GLOBAL LOCK: Vertical architecture-process explainer video presented as a fast-moving design workflow montage. The visual language alternates between bold white text on dark backgrounds, screenshots of AI chat or prompt interfaces, CAD-like plan views, and polished renders of modern residential architecture and interiors. The core narrative is that a house was designed without traditional CAD, using AI-driven prompts and iterative visual direction. Featured outputs include bright modern facades, colorful interiors, a green sculptural building, a compact concrete-and-brick house with stairs, vivid blue and red dining spaces, tiled bathrooms, and small furniture details. The overall tone is confident, design-forward, and instructional, with strong emphasis on iteration, composition, material consistency, and speed.

[00:00-00:06] Open with high-contrast title cards and quick flashes of sketch-like architecture imagery, material references, and CAD-style plan screenshots. The message establishes that the creator designed a house without CAD and is about to explain how. Keep typography large, centered, and punchy over dark backgrounds between the visuals.

[00:06-00:14] Transition into screenshots of an AI chat or prompt interface showing architecture references, prompts, and iterative instructions. The workflow should imply feeding images, style cues, and spatial direction into a conversational design process rather than drafting conventionally. The screen captures should feel like a real tool-assisted design pipeline.

[00:14-00:22] Reveal the first polished outputs: a storefront-like facade, vivid red interior corridor, blue entry, and a green sculptural building form. Use clean cuts between exterior and interior renders to show that the process is controlling composition and architectural language, not just generating random images. Text overlays should stress things like solving composition, prompting, and consistency.

[00:22-00:30] Move into additional examples such as a concrete-and-brick house with stairs, a green cylindrical or perforated interior, and other modern architectural studies. Screenshots of prompt refinements and AI tool panels continue to appear between renders, reinforcing the iterative nature of the method.

[00:30-00:42] End with more refined interior and furniture-like details: a colorful dining setup with red pendant lights, tiled bathrooms with a blue stool, a red side table, and a final exterior volume in dark red. Closing text emphasizes that consistency and speed come from the right workflow, and that AI can handle the visual iteration usually associated with CAD-heavy concept development.
Video
Gizem Akdag
GLOBAL LOCK:
Subject is a single woman, slender build, long dark hair. The environment is a surreal, immersive forest made of dense, cascading moss-like foliage that hangs in thick, vertical sculptural volumes. The color palette is dominated by deep muted greens and earthy browns. The lighting is cinematic, late afternoon sun. High-quality editorial photography style, ultra-detailed textures, 4k resolution.

[00:00–00:07]
A wide shot of a woman standing in a surreal landscape of dense, hanging moss. She is wearing a simple tan/beige outfit. The lighting is soft and diffused, coming from the top. The camera is static. The mood is calm and introspective.

[00:08–00:11]
The woman's outfit transforms into a vivid, saturated rich red flowing dress. The dress is massive and billows dramatically to the right as if caught by a strong wind. The fabric has dynamic movement and high volume. The background remains the same muted green moss forest. The contrast between the red dress and green background is sharp and striking.

[00:12–00:14]
The scene's lighting shifts dramatically to high-contrast chiaroscuro. Brilliant golden god rays and volumetric light shafts pierce through the foliage from the top right. Deep, crushed shadows in the background. The red dress pops intensely. Floating particles and leaves are visible in the light beams. The camera zooms in slightly on the subject.

NEGATIVE PROMPT:
Visual: distorted anatomy, extra limbs, blurry face, low resolution, plastic skin texture, flat lighting, messy foliage, text, watermark, logo, flickering shadows, inconsistent dress movement.
Audio/Speech: N/A (No speech in reference).

SPEECH PACK:
(No speech present in the original video. The video relies on visual text overlays and background music.)
Video
Kallaway
GLOBAL LOCK:
Subject is a Caucasian male in his mid-30s, short dark hair, wearing a black baseball cap and a black KITH brand t-shirt. Environment is a moody studio with warm practical lighting on shelves and a window with blinds. The video style is a mix of high-end talking head and clean, minimalist 3D architectural software UI. Color grade is warm and cinematic for the host, and bright, high-key for the software. Speech is energetic, clear, and direct-to-camera.

[00:00–00:03]
Subject: 3D render of a modern, two-story minimalist house with a flat roof and large glass windows.
Environment: A desert-like landscape with soft blue sky.
Action: A blue semi-transparent plane moves across the top of the house model.
Camera: Wide shot, slight zoom in.
Lighting: Bright, simulated daylight.
Speech: "This is the future of home design."

[00:03–00:07]
Subject: Host talking to camera, making an 'OK' gesture.
Environment: Studio background.
Action: Host speaks with hand gestures; a red ZURU logo appears on screen.
Camera: Medium close-up, static.
Lighting: Warm key light, blue/orange accent lights in background.
Speech: "It's called Zuru. And their AI software makes building your dream house feel like..."

[00:07–00:18]
Subject: Screen recording of Zuru software.
Environment: White digital workspace.
Action: A cursor drags a bed into a room; a 2D floor plan transitions into a 3D view.
Camera: Top-down view transitioning to perspective.
Lighting: Even, digital white light.
Speech: "...playing a video game. It's pretty wild. Now on the platform, you can literally drag and drop any component you'd want in your home."

[00:18–00:24]
Subject: Split screen: Zuru software on top, Fortnite gameplay on bottom.
Environment: Digital workspace vs. colorful game world.
Action: Host points to the screen; Fortnite character builds a wooden structure.
Camera: Split screen.
Lighting: High contrast.
Speech: "And it feels like a video game because they literally built the entire platform on top of the same game engine as Fortnite."

[00:24–00:43]
Subject: Technical 3D models of a house.
Environment: Wireframe and structural analysis views.
Action: Red and yellow heat maps appear on the house frame; purple lines show electrical and plumbing layouts.
Camera: Close-up on 3D details.
Lighting: Technical, high-contrast colors.
Speech: "But here's why this is so powerful. Because the software is built with AI, all of your design choices are automatically pressure tested for real building codes. Structural support, energy efficiency, it's all checked."

[00:43–00:52]
Subject: Cinematic renders of modern interiors and exteriors.
Environment: A luxury living room with a view; a house with a pool at sunset.
Action: Smooth camera pans across the rendered rooms.
Camera: Slow cinematic pan.
Lighting: Soft, golden hour lighting.
Speech: "This means literally anyone, even a five-year-old, could design a real home that could actually be built safely anywhere in the world."

[00:52–01:05]
Subject: Industrial factory setting.
Environment: A clean, automated manufacturing plant.
Action: Large red and white robotic arms move and assemble large white panels.
Camera: Medium shots of the robots.
Lighting: Bright industrial lighting.
Speech: "Once you design the home you want, you just press print and the robot factory will build and assemble the entire thing for you."

[01:05–01:16]
Subject: 3D house model with a cost breakdown UI.
Environment: Digital workspace.
Action: A list of costs (Walls, Windows, Doors) appears, totaling $92,598. A large "75% CHEAPER" text overlay appears.
Camera: Static UI shot.
Lighting: High contrast.
Speech: "They're bringing the cost of homebuilding down significantly. People are going to be able to build homes for 75% cheaper than normal."

[01:16–01:26]
Subject: News article and drone footage.
Environment: Malibu coastline.
Action: Headline about Malibu rebuilding appears; drone shot shows a road along the ocean at sunset.
Camera: Aerial drone shot.
Lighting: Natural sunset light.
Speech: "And get this, Zuru bought up a bunch of the Malibu beachfront property from the wildfires. Their next project is to rebuild the California coast."

NEGATIVE PROMPT:
Avoid shaky handheld camera, low-resolution screen captures, robotic or flat vocal delivery, inconsistent lighting on the host's face, cluttered background, blurry 3D renders, and poor lip-sync. No harsh shadows or overexposed highlights in the studio shots.

SPEECH PACK:
[00:00–00:03] "This is the future of home design." (TAKE_A: Enthusiastic, TAKE_B: Serious, TAKE_C: Whispered/Intense)
[00:03–00:07] "It's called Zuru. And their AI software makes building your dream house feel like..." (TAKE_A: Fast-paced, TAKE_B: Emphasizing 'AI', TAKE_C: Casual)
[00:07–00:18] "playing a video game. It's pretty wild. Now on the platform, you can literally drag and drop any component you'd want in your home." (TAKE_A: Explanatory, TAKE_B: Amazed, TAKE_C: Direct)
[01:05–01:16] "People are going to be able to build homes for 75% cheaper than normal." (TAKE_A: Punchy, TAKE_B: Slow for emphasis, TAKE_C: Shocked)
Video
Core format and topic lock: a vertical creator tutorial about replacing clothing in videos using AI, likely with Kling AI or a similar try-on workflow. The layout combines a main sample video or interface demo above with a talking-head presenter below. The featured subject is the creator himself in a plain indoor room. He first appears in a simple light t-shirt and neutral pants, then provides front and back references, then works in an interface with masking and region controls (subject, face, costume, and manual adjustments), and finally shows a new outfit generated onto the same motion clip.

Shot-by-shot reconstruction

0.0s-12.0s
Open with the raw driving-input video of the creator standing in a room and reaching toward the camera. The presenter in the lower talking-head frame explains that this source clip will be used to replace the clothing while preserving the motion.

12.0s-24.0s
Show front and back stills or frames of the subject so the workflow can understand how the clothing wraps around the body from multiple angles. Keep the emphasis on gathering better reference coverage for more accurate outfit replacement.

24.0s-40.0s
Display the interface where the creator selects or confirms the relevant regions of the frame. Show mask and control options such as subject, face, costume, and manual refinement. This section should read as the setup stage for the virtual try-on transformation.

40.0s-53.3s
Reveal the output clip in which the creator’s clothing has been changed while the same room, pose, and camera angle remain intact. End on the transformed outfit result and creator commentary encouraging viewers to comment for the workflow link.

Visual style
Vertical AI fashion-tech tutorial, clean screen-recorded UI, talking-head explainer overlay, indoor webcam-like source footage, practical virtual try-on workflow, no cinematic scene changes.

Motion notes
Motion comes from the source video demonstration, interface selection steps, and the presenter’s gestures. Preserve the same performer identity, room, and body movement between input and output so the value of the clothing replacement is obvious.

Negative prompt
messy interface, unrelated clothing examples, unreadable UI, extra presenters, watermark, subtitles unrelated to tutorial, broken body anatomy, changing room layout, random fashion runway shots, non-human subjects, unrelated software screens

Speech pack
English creator narration explaining how to capture a driving input, provide front and back references, mask the right regions, and generate a believable outfit replacement video.
Video
GLOBAL LOCK: Vertical 9:16 UGC tutorial reel with a persistent two-layer presentation style: the upper 60 to 70 percent of the frame shows demonstrations, screenshots, typed prompts, and generated image results; the lower portion shows the same male creator speaking directly to camera in a rounded-corner selfie window for most of the video. The creator is a white male in his late 20s to mid 30s, medium-length wavy dark brown hair, short beard and mustache, expressive eyebrows, average build, casual creator aesthetic. Keep his delivery energetic, friendly, and persuasive. Wardrobe changes are intentional by section: white tee and cream Vans cap at the opening studio desk, blue polo and backward cap for the main explainer section, yellow suit jacket and black top hat for the final gag CTA. Upper-frame design alternates between a white studio opening, black presentation slides branded "Google Nano Banana" with a banana emoji, product-demo image canvases, and dark Freepik interface screens on a soft orange-blue gradient background. The reel should feel like an AI creator tutorial ad: quick but readable, clean text overlays, obvious prompt boxes, high contrast UI, fast social pacing, light jump cuts, and consistent bottom talking-head commentary. Speech style is single-speaker direct-to-camera tutorial English with crisp articulation, upbeat cadence, short persuasive sentences, and creator-economy CTA energy. Audio should sound like a close phone or lav mic in a quiet room, lightly compressed, dry, intelligible, and synced to the speaker window.

[00:00-00:04.50] Open on a bright white studio setup. The upper frame shows the colorful Google wordmark above the title "Nano Banana" with a banana emoji. Centered below it, the creator sits behind a white table in a cream Vans cap and light shirt, leaning toward a turquoise striped cup-shaped microphone or tumbler. Softbox lights are visible on both sides, making the setup feel like a casual creator studio. In the lower portion of frame, a separate rounded-corner selfie video of the same man begins speaking directly to camera. He introduces the tool with immediate enthusiasm. Lips are fully visible in the lower video; lip-sync strictness high for the first spoken hook.

[00:04.50-00:10.00] Cut to a black presentation layout branded "Google Nano Banana" at the top. The upper demo area shows a bright outdoor image of the creator on a Grand Canyon style cliff-edge walkway, arms stretched, backpack on, huge sky and canyon behind him. A prompt box appears under the image and begins typing "Make it into a youtube thumbnail". The lower selfie speaker remains on screen in the blue polo and backward cap, gesturing with one hand while explaining the edit. The tone is excited, helpful, and a little amazed. Keep the typed prompt animation readable and central.

[00:10.00-00:14.50] The same canyon image updates into a louder thumbnail treatment with giant curved yellow "GRAND CANYON" text behind the creator’s head. Emphasize the before-and-after value clearly: same base photo, more clickable YouTube-style packaging. The lower speaker continues talking in sync with hand gestures. Audio remains a crisp tutorial voice, no music overpowering the speech.

[00:14.50-00:20.50] Transition to a luxury product-edit example. In the upper frame, a prompt card reads "Replace the bottle" with a small reference thumbnail, then the output becomes a glossy Dior Sauvage-style perfume bottle on swirling golden light trails over a dark brown-black studio background. Maintain premium ad aesthetics, reflective glass, centered bottle, and luminous streaks. The lower talking-head explains the edit use case, likely referencing product replacement or image transformation. Speech stays fast, punchy, and creator-friendly.

[00:20.50-00:24.00] Briefly show another generated image example in the upper area, including a polished portrait-style output that demonstrates broader image editing capability beyond product swaps. Keep the cut quick and social-first, serving as visual proof rather than a full tutorial pause. The bottom speaker window continues uninterrupted, preserving continuity.

[00:24.00-00:31.50] Move into the software walkthrough. The upper frame now shows the Freepik dark UI over a soft gradient backdrop, starting with an AI Suite menu containing categories like image tools, video tools, audio tools, and design tools. Then zoom into the model panel where "Google Nano Banana" is selected, with image reference slots, style/composition/effects/character/object controls, and a beta disclaimer about aspect ratio. The creator in the lower window counts features with his fingers while describing how to access the workflow. Keep the UI readable enough for social tutorial viewing, but still fast-paced.

[00:31.50-00:36.50] Continue the interface demo with more dark UI panels, prompt fields, thumbnails, and settings sections scrolling or cutting through the workflow. The creator keeps speaking in direct, practical language, as if walking viewers through where to click and how to upload references. Camera on the lower speaker remains static, head-and-shoulders, neutral indoor room with door and wall behind him.

[00:36.50-00:43.00] End with a comedic CTA transformation. The upper frame shows a prompt reading "Give him a sign to hold" while the creator appears dressed like a theatrical ringmaster or showman in a yellow jacket and tall black top hat on a sunlit balcony. He holds a handmade cardboard sign that reads "Comment AI and I'll send you the link!" The lower talking-head still speaks beneath, landing the call to action. The final beat should feel playful, persuasive, and optimized for comments. Lip-sync remains visible in the lower window; key sync accents should land on the CTA words "comment AI" and "send you the link".

NEGATIVE PROMPT: extra fingers, warped hands during gesturing, drifting facial hair, inconsistent eye color, duplicated selfie windows, unreadable UI, misspelled "Google Nano Banana", broken prompt boxes, random logos, muddy text, incorrect YouTube thumbnail lettering, deformed perfume bottle glass, floating product shadows, overexposed softboxes, messy background clutter, cinematic bokeh that hides the tutorial content, abrupt framing jumps, desynced speech, robotic cadence, slurred consonants, harsh sibilance, echoey room tone, loud background music, clipping, pumping compression, lip-sync mismatch, subtitle blocks covering the demo.

SHOT PROMPTS:
SHOT_1 [00:00-00:04.50]: White studio opener, Google Nano Banana title, creator at desk with Vans cap and turquoise cup, bottom selfie explainer starts.
SHOT_2 [00:04.50-00:10.00]: Black branded demo screen, Grand Canyon reference photo, typed prompt box for YouTube thumbnail conversion, bottom speaker explains.
SHOT_3 [00:10.00-00:14.50]: Thumbnail result reveal with giant GRAND CANYON text, same split-screen layout, energetic creator commentary.
SHOT_4 [00:14.50-00:20.50]: Product-edit demo, perfume bottle replacement prompt, luxury golden-light result, bottom speaker continues.
SHOT_5 [00:20.50-00:24.00]: Quick alternate polished image result proving editing range.
SHOT_6 [00:24.00-00:31.50]: Freepik AI Suite walkthrough, dark UI menus, Google Nano Banana model selected, image reference slots and controls visible.
SHOT_7 [00:31.50-00:36.50]: More UI steps, prompt/settings panels, creator explains workflow and uploads.
SHOT_8 [00:36.50-00:43.00]: Final joke CTA, top hat outfit, cardboard sign asking viewers to comment AI for the link, bottom talking-head closes the pitch.

SPEECH PACK:
Timecoded transcript (best-effort, inferred from visible overlays and tutorial cadence):

[00:00-00:04.50]
TAKE_A: "Please use this if you have not already. It is a game changer."
TAKE_B: "If you are not using this yet, you need to. It is a total game changer."
TAKE_C: "This tool is a game changer, and you should absolutely be using it already."
Prosody: fast hook, confident, slightly urgent, friendly creator tone.

[00:04.50-00:10.00]
TAKE_A: "You can take an image like this and ask Nano Banana to turn it into something more clickable."
TAKE_B: "Watch this. I can upload a photo and prompt Nano Banana to make it into a YouTube thumbnail."
TAKE_C: "Here is a simple example. Drop in an image and tell it to make a YouTube-ready thumbnail."
Prosody: explanatory, upbeat, demonstration-first.

[00:10.00-00:14.50]
TAKE_A: "It keeps the subject but gives you a much stronger thumbnail treatment."
TAKE_B: "Same image, better packaging. That is why this is so useful for creators."
TAKE_C: "This is the kind of upgrade that makes basic content feel publish-ready."
Prosody: impressed, selling practical value.

[00:14.50-00:20.50]
TAKE_A: "You can also do product swaps, like replacing the bottle and turning it into a premium ad."
TAKE_B: "It is not just thumbnails. You can replace products and restyle the entire scene."
TAKE_C: "This works for product creatives too. Swap the object and it rebuilds the shot around it."
Prosody: persuasive, slightly faster, feature-stack delivery.

[00:20.50-00:24.00]
TAKE_A: "And it is not limited to one type of image either."
TAKE_B: "You can use the same workflow across different visual styles."
TAKE_C: "That flexibility is what makes the tool stand out."
Prosody: transitional, concise.

[00:24.00-00:31.50]
TAKE_A: "Inside Freepik, open the AI Suite, choose Google Nano Banana, and upload your image references."
TAKE_B: "If you want to try it, go into AI Suite, pick the Nano Banana model, then add your reference image here."
TAKE_C: "This is where it lives in Freepik. Select the model, drop your images in, and start prompting."
Prosody: instructional, practical, clear enunciation.

[00:31.50-00:36.50]
TAKE_A: "Then you can use the style, composition, effects, character, and object controls to shape the result."
TAKE_B: "From here you fine-tune the edit with the controls and prompt box."
TAKE_C: "Once the image is in, the rest is just directing the model with these tools."
Prosody: matter-of-fact, tutorial rhythm.

[00:36.50-00:43.00]
TAKE_A: "Want to try it? Comment AI and I will send you the link with unlimited generations on Freepik."
TAKE_B: "If you want access, comment AI and I will send you the link."
TAKE_C: "Comment AI for the link and I will send it over."
Prosody: bright CTA, direct ask, strong emphasis on "comment AI".
Video
GLOBAL LOCK: A consistent female model with vibrant red hair, fair skin with visible natural freckles across her nose and cheeks, and a slender build. She wears a simple white ribbed t-shirt and white cotton shorts. The environment features premium pale yellow bedding (duvet and pillows) with a soft, matte linen texture. Lighting is consistently soft, warm, and cinematic, mimicking high-end interior photography. The color grade is warm with creamy highlights and soft shadows. All speech is delivered by a warm, professional female voiceover with a clear, encouraging cadence.

[00:00–00:03]
Subject: Extreme close-up of the red-haired model's face. She is tightly wrapped in a pale yellow duvet so only her face is visible.
Action: She has her eyes closed, then opens them and smiles warmly and authentically at the camera.
Camera: Static ECU, shallow depth of field, soft bokeh on the fabric.
Lighting: Soft, diffused light from the side, highlighting skin texture and freckles.
Speech: "What if you could create any image you've ever imagined for your own product?" (Warm, curious tone)
Sync: High lip-sync strictness as she smiles and speaks.

[00:03–00:04]
Subject: Close-up of a fair-skinned hand with natural nails gently pressing into the pale yellow duvet.
Action: The fingers sink slightly into the soft, plush fabric to show texture.
Camera: Macro shot, slight handheld shake for realism.
Lighting: Bright, natural daylight.
Speech: "...for your own product?" (Continuing phrase)

[00:04–00:05]
Subject: A studio shot of a pale yellow duvet and two matching pillows neatly folded on a light wood minimalist stool.
Action: Static product shot.
Camera: Medium shot, eye-level, clean off-white background.
Lighting: Even studio lighting with soft shadows.
Speech: "Studio shots," (Punchy delivery)

[00:05–00:06]
Subject: The red-haired model in the white t-shirt, hugging a large pale yellow pillow to her chest.
Action: She squeezes the pillow and laughs joyfully, looking slightly off-camera.
Camera: Medium Close-Up, slight zoom-in.
Lighting: Warm, golden-hour style light.
Speech: "model shots," (Energetic delivery)

[00:06–00:08]
Subject: A surreal wide shot from a high bird's-eye view. A full bed with pale yellow bedding is floating on the surface of a calm, deep blue lake.
Action: The model is lying flat on her back on the bed, arms out, holding a coffee cup. Subtle ripples in the water.
Camera: High-angle Wide Shot, static.
Lighting: Natural outdoor light, bright sun.
Speech: "and even complex campaign shots..." (Awe-filled tone)

[00:08–00:10]
Subject: The model sitting up on the floating bed in the middle of the lake.
Action: She sits up, looking out over the water, her red hair catching the light.
Camera: Low-angle Medium Shot, tracking slightly with the movement of the bed on water.
Lighting: Backlit by the sun, creating a rim light on her hair.
Speech: "...like these." (Confident delivery)

[00:10–00:14]
Subject: A split screen or overlay showing the Invideo AI interface. A text prompt is being typed: "Ultra close-up beauty portrait of the red-haired model..."
Action: The UI shows the generation process, then transitions back to the model hugging the pillow in a bedroom.
Camera: Screen recording transition to Medium Shot.
Lighting: Soft bedroom light.
Speech: "You can do all of this in Invideo with the brand new Nano Banana 2." (Informative, tech-focused tone)

[00:14–00:16]
Subject: The same wooden stool from [00:04], but the bedding quickly swaps from pale yellow to soft pink.
Action: A quick cut transition showing product variety.
Camera: Static Medium Shot.
Lighting: Consistent studio light.
Speech: "As a business owner, that means no more costly reshoots..." (Problem-solving tone)

[00:16–00:18]
Subject: High-angle shot of the model lying on the bed in a bright bedroom.
Action: She stretches her arms above her head, looking relaxed and happy.
Camera: Wide Shot, high angle.
Lighting: Bright, airy, morning light through a window.
Speech: "...whenever you launch a new product..." (Relieved tone)

[00:18–00:20]
Subject: Close-up of the model's face as she lies on the pillow, eyes closed, then smiling.
Action: She turns her head slightly on the pillow, looking peaceful.
Camera: Close-up, side profile.
Lighting: Soft, warm glow on her skin.
Speech: "...or need one more angle from the same set." (Practical tone)

[00:20–00:23]
Subject: The model sitting up, holding two pillows, smiling directly into the lens.
Action: She gives a big, friendly smile and a small nod.
Camera: Medium Shot, eye-level.
Lighting: Bright, flattering studio light.
Speech: "This is how smart brands scale today. Comment 'invideo' to try it yourself." (Call to action, authoritative yet friendly)
Sync: High lip-sync strictness for the final CTA.

NEGATIVE PROMPT: blurry faces, inconsistent hair color, distorted fingers, extra limbs, flickering light, text logos on bedding, plastic skin texture, robotic mouth movements, mismatched lip-sync, harsh shadows, low resolution, jittery motion, floating artifacts in bedroom scenes.

SPEECH PACK:
[00:00–00:03] "What if you could create any image you've ever imagined for your own product?"
TAKE_A: (Slow, dreamy, emphasizing "any image")
TAKE_B: (Fast, excited, emphasizing "your own product")
TAKE_C: (Natural, conversational, balanced emphasis)

[00:04–00:10] "Studio shots, model shots, and even complex campaign shots like these."
TAKE_A: (Rhythmic, pausing after each category)
TAKE_B: (Building excitement towards "like these")

[00:20–00:23] "This is how smart brands scale today. Comment 'invideo' to try it yourself."
TAKE_A: (Direct, professional, clear CTA)
TAKE_B: (Friendly, warm, inviting)
PROSODY: [00:00] What if... [pause] you could create ANY image... [00:20] This is how SMART brands scale today. [pause] Comment 'invideo' [emphasis] to try it yourself.
Video
GLOBAL LOCK:
Subject is a Caucasian male, mid-20s, with short brown hair and a light beard, wearing a tan "VANS" trucker hat and a plain white t-shirt. He is positioned in the bottom third of the frame in a talking-head format. The top two-thirds of the frame is a digital workspace. The environment for the subject is a cozy room with warm, out-of-focus background lighting. The digital workspace is a clean, modern software UI with a white background. The video has a high-energy, fast-paced UGC tutorial style. Speech is enthusiastic, clear, and direct-to-camera.

[00:00–00:03]
The top 2/3 shows a rapid succession of Taylor Swift posters. First, a red and black vintage-style poster with "TAYLOR" in large block letters. Then, a collage-style poster with denim textures and "TAYLOR SWIFT" in a stylized font. The subject at the bottom is talking excitedly, gesturing with his hands.

[00:04–00:06]
The top 2/3 switches to Post Malone posters. One is a gritty, black-and-white screen-print with a red star over his eye and "POST" in red spray-paint font. The next is a profile shot with "F-1 Trillion" text in pink. The subject continues his energetic narration.

[00:07–00:14]
The top 2/3 shows a breakdown of a Leonardo DiCaprio poster. A portrait of DiCaprio appears on the left, a text prompt on the right. A progress bar fills, and a "Wolf of Wall Street" poster is revealed, featuring a screen-print texture and yellow/black color scheme. The subject points upwards toward the visuals.

[00:15–00:25]
The top 2/3 shows the "Lovart" website interface. A cursor clicks "New Project." The subject explains the tool. The cursor types "Create me a poster for Ed Sheeran" into a chat box. A model selection menu pops up, and "Nano Banana Pro" is selected.

[00:26–00:37]
The top 2/3 shows an Ed Sheeran poster being generated. It features him with a guitar against a sunset background. The subject demonstrates iterations: the text at the bottom changes to "NEW YEAR'S EVE" and "LAS VEGAS SPHERE." The style then shifts to a high-contrast green and black screen-print.

[00:38–00:42]
The entire frame transitions to a real-world scene. A man in a tan jumpsuit, seen from behind, is taping a large white poster onto a red brick wall. The poster features a black circular logo and the text "COMMENT AI." The subject appears in a small bubble at the bottom, saying "type AI in the comments."

NEGATIVE PROMPT:
Visual: blurry face, distorted hands, flickering UI elements, inconsistent hat logo, low resolution, messy background, unnatural eye movements.
Speech: robotic tone, monotone delivery, background noise, muffled audio, lip-sync mismatch, stuttering, long pauses.

SPEECH PACK:
[00:00–00:06]
TAKE_A: "Google Nano Banana Pro is mind-blowing when it comes to creating graphic design work. You can take any character and create any poster design."
TAKE_B: "Nano Banana Pro is a total game-changer for design. Take any celeb, any style, and boom—instant professional posters."
TAKE_C: "This new AI model is insane for graphics. One reference photo is all you need to make these incredible celebrity posters."

[00:07–00:14]
TAKE_A: "With one reference image of their face and a basic prompt. So I'm going to show you exactly how you can get the best results."
TAKE_B: "Just one photo and a simple sentence. I'll show you the secret to getting these high-end results every single time."
TAKE_C: "Reference photo plus a basic prompt equals this. Let me walk you through the process for the best output."

[00:15–00:25]
TAKE_A: "To get started you want to go to Lovart, which is a dedicated AI design tool. You can now write in a basic prompt, then select Google Nano Banana Pro."
TAKE_B: "Head over to Lovart—it's built for designers. Type your idea, pick the Nano Banana Pro model, and you're ready."
TAKE_C: "Step one: open Lovart. It’s an AI design powerhouse. Enter your prompt, choose the Google model, and watch the magic."

[00:26–00:42]
TAKE_A: "Once you hit generate, it will use its own prompt enhancer. Now you can iterate, change text or backgrounds. Type AI in the comments for the link!"
TAKE_B: "Hit generate and let the AI enhance your prompt. Tweak the text, swap the background, it's that easy. Comment AI for access!"
TAKE_C: "Generate, iterate, and perfect. Change anything you want in seconds. If you want to try this, just type AI below!"
Video
Create a vertical 9:16 premium AI model promo visual featuring an ultra-realistic close-up portrait of a young woman facing directly into camera against a dark teal background. She has fair skin, dark hair pulled back, subtle natural makeup, and translucent amber-orange eyeglasses catching a precise highlight across the frame. The lighting should be soft but dramatic, sculpting the face with studio precision and emphasizing realistic skin texture, calm eyes, and balanced symmetry. In the composition, glowing yellow "ImagineArt 1.0" text appears in the upper right, while "Most Realistic AI Model" is set large at the bottom in bold creator-marketing typography. The overall feeling should be a polished product ad announcing a highly realistic character-generation model for creators and brands. No clutter, no subtitles, no cartoon styling.
Video

GLOBAL LOCK: A vertical 9:16 split-screen social proof video featuring the same white European-looking man in his late 20s to early 30s with fair neutral skin, brown side-swept hair, athletic build, clean-shaven face, fitted dark t-shirt, thin silver necklace, and dark smartwatch, seated at a round table using a space gray laptop. Keep his identity, face shape, hair, posture, laptop position, hand placement, watch, necklace, and down-looking focused expression consistent across the full sequence. The lower half of the frame is always the original source clip: a clean but ordinary bright apartment interior with white walls, hallway opening, wall-mounted TV on the left, soft daylight, and neutral consumer-camera realism. The upper half is always the AI-transformed version of the same moment, preserving pose and laptop interaction while swapping only wardrobe details slightly and dramatically changing the environment. Camera remains static, eye-level to slightly high, medium shot, portrait framing. Motion is minimal and realistic: typing, brief thinking gesture to chin, subtle head angle changes. Text overlays read “AI:” at top left, “Original:” above the lower section, and “Comment ‘AI’ for the prompts” centered between the halves. Style is crisp creator-demo proof, optimized for instant comparison and save/share behavior.

[00:00-00:01] Show the first split-screen comparison. In the upper half, place the creator in a warm wooden cabin interior with large windows, mountain view, practical lamp glow, and cozy brown timber walls while he types on the laptop. In the lower half, show the original bright apartment scene with the same seated pose and laptop placement. Keep the comparison clean and immediately readable.

[00:01-00:02] Swap only the upper half environment to a Santorini-style terrace at golden hour with blue railing, sea cliffs, and warm sunset light. The creator remains seated with matching body angle and laptop orientation. Lower half stays unchanged as the original apartment plate.

[00:02-00:03] Change the AI upper half to a Mediterranean villa interior with arched windows, cream stucco walls, sunlit floor, and olive trees visible outside. The creator briefly raises a hand toward his face in a thinking pose; mirror that motion in the original bottom half.

[00:03-00:04] Move the upper half into a high-rise luxury apartment with floor-to-ceiling windows and orange city sunset. Keep the creator’s pose, laptop, and chin-touch gesture aligned to the original. Preserve the centered comparison layout and CTA text.

[00:04-00:05] Transform the upper half into a dark wood library office with desk lamp, warm pools of light, bookshelves, and a more formal mood. The creator’s hands return to the keyboard. The original lower clip remains a plain daylight apartment with no background change.

[00:05-00:06] Hold on the same library-office transformation for an extra beat to let the comparison land. Maintain fixed camera, no zoom, and the same overlay text.

[00:06-00:07] Replace the upper half with a moody rainy-window lounge scene in teal and amber tones, soft reflections on glass, and a dim modern sofa in the back. The creator continues typing with serious concentration. Bottom half remains the bright apartment.

[00:07-00:08] Switch the upper half to a tropical outdoor workspace with wood structure, large tropical leaves, bright sun patches, and warm travel-lifestyle energy. The creator stays locked in the same seated laptop pose.

[00:08-00:09] Change the upper half to a glass house surrounded by green forest, soft daylight filtered through large panes, and minimalist modern architecture. Preserve the same shirt silhouette, watch, necklace, laptop size, and head tilt.

[00:09-00:10] Move the upper half to a luxury hotel suite at night with warm lamps, city lights outside, beige furnishings, and premium travel ambience. Keep the original lower half unchanged and clearly labeled.

[00:10-00:11] End on the final split-screen comparison with the same city-hotel AI background held long enough for viewers to read the CTA: Comment “AI” for the prompts. No extra camera motion, just a clean proof-driven finish.

NEGATIVE PROMPT: do not alter identity, face proportions, hairstyle, skin tone, build, laptop scale, or seated posture between scenes; avoid warped hands on keyboard, broken wrists, floating elbows, inconsistent necklace, or missing watch; avoid morphing furniture, flicker, unstable split line, typography corruption, or mismatched perspective between AI and original; do not change the lower original frame at all except natural motion from the source clip; no surreal lighting, extra people, extra laptops, bent table edges, or melting architecture; avoid jittery transitions, logo clutter, artifacting, blurred facial features, or unnatural eye direction.
Video
MASTER PROMPT

Create a vertical 9:16 creator reel that rounds up useful AI tools for image and creative-media generation. A male host appears in a lower-frame talking-head window and rapidly walks through different examples above him: dreamy cloud-and-cliff fantasy artwork, a lifestyle portrait sitting above the clouds, beauty-product ad imagery, fashion mockups, tool brand cards such as Hautech.ai and Hugging Face, and large thumbnail grids that suggest broader tool libraries. The tone should be energetic, opinionated, and built for creators looking for new AI resources.

GLOBAL LOCK

- Format: 9:16 AI-tools roundup reel with persistent host commentary.
- Host anchor: bearded male creator in a cap, speaking directly to camera from a lower cutout.
- Topic anchor: curated list of AI tools for image generation, stylized concepts, ad mockups, and creative workflows.
- Visual anchor: each tool or example gets a clean showcase card or full-screen sample image above the host.
- Pace: fast but readable, with each new tool feeling like a fresh recommendation or proof point.

TIMELINE

0.0s - 8.0s
Open with the broad theme of AI tools and a strong visual example such as a giant floating cliff in the clouds. Let the host introduce the roundup while a cinematic fantasy image above him sets the aspirational tone.

8.0s - 18.0s
Move into more polished generative examples: a seated man above the clouds, a beauty or beverage ad image, and clean commercial-style renders. This section should establish that the tools are useful for both artful concepts and marketing visuals.

18.0s - 30.0s
Show specific tool references and interface-adjacent cards, including names like Hautech.ai. Use fashion imagery, lifestyle product scenes, and creative thumbnails to suggest what each tool is good for without becoming a full software walkthrough.

30.0s - 43.0s
End with broader ecosystem references such as Hugging Face or large grids of options, implying deeper exploration beyond the first few tools. The host should close with the sense that this is a curated stack for creators who want practical AI image resources and inspiration.

NEGATIVE PROMPT

No coding-terminal deep dive, no dry enterprise software demo, no overly technical machine-learning jargon on screen, no horror imagery, no unrelated gaming footage, no chaotic meme editing. Keep it creator-focused, visual, and recommendation-driven.

SHOT PROMPTS

- Vertical creator-roundup shot with a male host in a lower commentary box and a floating-cliff fantasy image labeled AI Tools above him.
- Lifestyle concept art example of a man sitting on white steps above the clouds, used as proof of generative image quality.
- Beauty or beverage ad mockup with polished commercial lighting and product-in-hand framing, shown as an AI creative use case.
- Tool recommendation card featuring Hautech.ai with fashion-style imagery and clean presentation.
- Broader ecosystem reference frame featuring Hugging Face and other tool or thumbnail grids to suggest a larger creative AI stack.

SPEECH PACK

- Spoken delivery should sound like a concise creator recommendation reel highlighting which AI tools are worth trying and what kinds of visuals they help produce.
- Audio should prioritize the host with a light, modern background track.
Video
Create a vertical 9:16 minimal premium design-poster visual for an AI creative workflow, featuring a bright yellow tennis ball floating just above an outstretched human hand against a clean blue sky. The hand should rise from the lower portion of the frame wearing a white wristband, with the ball suspended in crisp sunlight so it feels like a polished 3D object hovering in space. Bold yellow "Lovart" text repeats in the upper left, while repeated "Design" text appears in the lower right like confident editorial poster typography. The overall result should feel like a high-end animated 3D poster concept for designers: simple, modern, vector-friendly, and easy to manipulate as a motion design asset. No clutter, no subtitles, no extra objects, no cartoon style.
Video
GOAL
Maximize visual + motion + speech similarity to the reference video, prioritizing the composite nature of the shot (talking head overlaid on screen recording), the specific UI elements shown, and the energetic, fast-paced tutorial delivery.

WORKFLOW
A) MISE EN PLACE (prep)
- Invariants: The composite layout (split-screen style with the subject keyed over the recording in the bottom center). The subject's appearance (Caucasian male, 30s, short beard, beige "Vans Off The Wall" cap, plain white t-shirt). The background environment (a crisp screen recording of a web browser showing the Lovart.ai interface). The lighting on the subject (soft, even, frontal). The audio signature (close-mic, dry, energetic podcast style).
- Variables: The specific content shown on the background screen recording (which changes rapidly), the subject's hand gestures and facial expressions, the text overlays.

B) SHOTLIST (blueprint)
- shot_id: 1, timecode_start: 00:00, timecode_end: 00:54, duration: 54s
- framing: Static MCU (Medium Close-Up) for the subject in the bottom center. The background is a full-screen capture.
- lens: 35mm equivalent for the subject, sharp focus. Infinite depth of field for the screen recording.
- camera movement: Static camera for the subject. The background screen recording features digital zooms and pans to highlight UI elements.
- subject: Caucasian male, 30s, short beard, beige "Vans" cap, white t-shirt. Energetic, using open-palm gestures, pointing upwards.
- environment: Background is a digital screen recording of a UI.
- lighting: Subject has soft, even ring-light style illumination.
- color grade: Natural, warm skin tones for the subject. High contrast, vibrant colors for the UI elements (especially the red handbag and blue Dior bottle).
- motion cues: Subject's hands moving rapidly, screen recording UI elements changing instantly without loading times.

C) STYLE BIBLE (global)
- visual_style: UGC tech tutorial composite.
- camera_signature: Locked-off camera for the subject, dynamic digital screen capture for the background.
- lighting_signature: High-key, flat lighting on the subject to ensure a clean chroma key look.
- grade_signature: Clean, sharp, high-definition digital look.
- pacing_signature: Extremely fast-paced, cutting out all dead air and loading times.
- speech_style: Energetic, authoritative, direct-to-camera tech tutorial voiceover.
- speaker_profile: Male, mid-30s, enthusiastic, fast talker, clear enunciation.
- mic_mix_profile: Dry, close-mic, heavily compressed for maximum clarity and presence, no room reverb.

D) PROMPT SYNTHESIS

MASTER PROMPT
GLOBAL LOCK: A continuous composite video. The background is a crisp, high-resolution screen recording of a web browser navigating a design tool interface. In the bottom center of the frame, overlaid on top of the screen recording, is a talking-head subject: a Caucasian male in his 30s, medium-length brown hair, a short beard, wearing a beige "Vans Off The Wall" baseball cap and a plain white crew-neck t-shirt. The subject is well-lit with soft, even frontal lighting, casting no harsh shadows, and has a clean cutout (green screen effect). The camera angle for the subject is a static medium close-up (chest up). The overall visual style is a polished UGC tech tutorial. The speech is energetic, fast-paced, and authoritative, recorded with a close-mic, dry podcast-style audio signature.

[00:00–00:10] The background screen shows a collage of high-end design posters, including a glossy red patent leather handbag and a dark blue Dior Sauvage perfume bottle. The subject in the bottom center gestures enthusiastically with both hands, palms open, explaining the concept. A text overlay "Nano Banana x Design" appears at the top.
[00:10–00:20] The background screen transitions to show the process of generating a Dior Sauvage ad. The screen zooms slightly on the generated images featuring a model holding the bottle. The subject continues to talk rapidly, using small hand chops to emphasize points. A red circle graphic appears around the text "AI Design Agent" on the screen.
[00:20–00:35] The background screen shows the Lovart.ai interface. A red handbag image is uploaded. The screen shows text being typed: "create me 2 posters of this handbag, one elegant, and one bold". The subject points up at the screen with his index finger.
[00:35–00:45] The background screen displays the generated handbag posters. The UI shows the user selecting one, and then editing it to add a female model in a black suit holding the bag. The subject brings his hands together in a clarifying gesture.
[00:45–00:54] The background screen shows a new prompt being entered to place the poster in Times Square. The screen instantly updates to show the red handbag poster displayed on a massive glowing billboard in a neon-lit Times Square at night. The subject points directly at the camera. A large text overlay "Comment 'AI'" appears at the top of the screen.

NEGATIVE PROMPT
Visual artifacts to avoid: green spill on the subject, messy chroma key edges, blurry screen recording text, UI elements that look like AI hallucinations (keep the UI looking like a real website), inconsistent clothing on the subject, lighting changes on the subject.
Speech negatives: robotic cadence, unnatural emphasis, slurred words, harsh sibilance, plosives, clipping, pumping compression, over-denoise artifacts, lip-sync mismatch, long pauses, breathing sounds.

SPEECH PACK
Speaker: Male, 30s, energetic, fast-paced tech tutorial voice.
Transcript:
[00:00–00:10] "Google's Nano Banana is insane when it comes to design work. You can take a product, clothes, and location and put it together. But when you use this for design work, it's wild."
[00:10–00:20] "You can upload a photo of a product into Lovart, for example, and it will create all these custom posters using Google Nano Banana. I'm going to show you how you can do this because this is the efficiency revolution for design."
[00:20–00:35] "You can create a new project on Lovart, upload a photo of the product you want in the poster. You can write in a prompt like 'create me 2 different poster designs'. Then select the image model, select Nano Banana, and the agent will create different styles."
[00:35–00:45] "Pick the ones you want. Now we've got these two images, you can actually prompt and edit anything you want about that poster. So I now have this model holding the product, we can remove things and change the size."
[00:45–00:54] "Then we can even reprompt this to put this on Times Square for example. Boom. If you want to try this out for yourself, type AI in the comments and I'll send you a link."
Video
Claye Ai
GLOBAL LOCK:
Subject: A female host, mid-20s, South Asian ethnicity, warm skin tone, long wavy brown hair, wearing a cozy lavender/purple knit sweater. She sits in a home office with a professional black condenser microphone on a boom arm. Background features a dark wooden bookshelf filled with books and small plants, softly blurred.
AI Subject Consistency: A high-fashion female model, European features, sharp jawline, sleek dark hair, wearing a white luxury power suit.
Environment: High-end studio settings for AI ads; warm home office for host.
Lighting: Soft, three-point lighting for the host; dramatic, high-contrast, cinematic lighting for AI outputs.
Color Grade: Warm, saturated tones for the host; cool blues and deep blacks for the "Dior-style" AI ads.
Speech: Clear, energetic female voice, professional cadence, direct-to-camera address.

[00:00–00:05]
Visual: Rapid montage of luxury brand ads. A model with green eyes holds a perfume bottle; a red "HERA" perfume ad; a man wearing a Calvin Klein watch; a woman with flowers on her face holding perfume.
Camera: Extreme close-ups and medium shots, static with slight internal motion.
Lighting: High-fashion studio lighting, dramatic shadows.
Speech: "You don't need to hire models or designers to create brand ads anymore." (Fast-paced, hook delivery).

[00:05–00:12]
Visual: Cut to host in her office. She gestures towards the camera. A screen overlay shows the URL "lovart.ai/home".
Camera: Medium shot, static.
Lighting: Warm, soft key light from the side.
Speech: "AI can do the full photoshoot and video for you. Just go to lovart.ai and start a new project."

[00:12–00:20]
Visual: Screen recording of the Lovart.ai interface. A cursor clicks "Upload Image," then selects "Nano Banana Pro" from a dropdown menu. A prompt is typed: "Luxury studio photoshoot of a model holding my product, cinematic lighting, premium brand look."
Camera: Screen capture, focused on the UI elements.
Speech: "Upload your product image, choose Nano Banana Pro, and describe the ad you want."

[00:20–00:26]
Visual: The AI generates a photo of a model in a white suit holding a Dior bag. The host is shown in a small window, reacting. The screen shows "Edit Text" and "Model Pose" options.
Camera: Split screen: UI on top, Host on bottom.
Speech: "Lovart's design agent will generate a professional ad visual in seconds. You can adjust anything: text, background, model pose, lighting."

[00:26–00:35]
Visual: A "Before" and "After" comparison. The "Before" is warm-toned; the "After" is cool blue with dramatic shadows. The cursor then selects "Kling 3.0" for video generation.
Camera: Side-by-side comparison, then UI focus.
Speech: "And the best part? These edits don't damage or overwrite your original base image. Once your poster looks perfect, you can turn it into a cinematic video using Kling 3.0 inside Lovart."

[00:35–00:40]
Visual: The final video output shows the model in the white suit subtly moving, adjusting her hand on the bag. The host returns to full screen with a "Comment ART" graphic overlay.
Camera: Full-screen AI video, then Medium Shot of host.
Speech: "Just add a motion prompt, generate, and your animated brand ad is ready. Comment ART and I'll send you the tool link."

NEGATIVE PROMPT:
Visual: Blurry faces, extra fingers, distorted product logos, flickering lights in video, unnatural skin texture, messy background in host segments, low resolution, watermarks on AI outputs.
Speech: Robotic tone, background noise, muffled audio, lip-sync mismatch, long pauses between sentences, harsh "S" sounds.

SPEECH PACK:
[00:00–00:05]
Transcript: "You don't need to hire models or designers to create brand ads anymore."
TAKE_A: (Energetic, fast) "You don't need to hire models or designers to create brand ads anymore!"
TAKE_B: (Authoritative, measured) "You don't need to hire models... or designers... to create brand ads anymore."

[00:05–00:12]
Transcript: "AI can do the full photoshoot and video for you. Just go to lovart.ai and start a new project."
TAKE_A: (Helpful, inviting) "AI can do the full photoshoot and video for you. Just go to lovart dot a-i and start a new project."

[00:35–00:40]
Transcript: "Comment ART and I'll send you the tool link."
TAKE_A: (Direct, friendly) "Comment ART and I'll send you the tool link!"
TAKE_B: (Whispered/Secretive) "Comment ART... and I'll send you the tool link."
Video
Tim Koda

MASTER PROMPT
GLOBAL LOCK: Vertical 9:16 creator workflow reel showing how a low-quality phone product photo becomes a premium editorial brand campaign. The piece combines direct-to-camera creator explanation, smartphone screen inserts, dark AI prompt interfaces, polished fragrance and beauty packshots, high-fashion poster layouts, icy blue lighting setups, red background title cards, and glossy close-up beauty shots. Maintain a commercial luxury aesthetic with clean graphic design, sharp product isolation, premium reflections, and fast but readable tutorial pacing. One male creator speaks with confident agency-style cadence, close mic, dry audio, and repeated CTA emphasis on the keyword SNAP.

[00:00-00:05] Open on the creator holding up a poor-quality phone photo of a product while bold on-screen text frames the client problem. Quickly cut to the original image on a smartphone screen and the raw product reference. The creator states the hook: turning one bad iPhone product shot into a full brand campaign.

[00:05-00:10] Show the source product clearly, including dark fragrance bottle imagery and rough input materials. Keep the pace quick and problem-solution oriented. The host explains that the workflow starts with product extraction and building a 2x2 reference grid.

[00:10-00:17] Move into dark AI workflow screens with prompt boxes, image tiles, and reference inputs. The product appears in multiple isolated views while the creator describes feeding references into an LLM to generate several custom prompts. Keep interfaces crisp, black or charcoal, with white type and subtle UI highlights.

[00:17-00:25] Transition into generated campaign outputs. Show premium editorial product renders: dramatic blue-light bottle shots, luxury tabletop scenes, stylized poster frames, and fashion-adjacent compositions. The visual language should alternate between clean packshot precision and moody brand storytelling.

[00:25-00:32] Display a grid or carousel of multiple campaign variants, including print-poster style layouts, branded title cards, and comparative presentation boards. The creator frames this as a scalable shoot process that can create multiple deliverables from one starting photo.

[00:32-00:37] Show high-end beauty close-ups with glossy lips and refined skin detail, suggesting companion campaign imagery beyond the product packshot itself. The grade stays polished, magazine-like, and editorial.

[00:37-00:41] End on a clean CTA beat with a minimal branded frame or title card. The creator closes by telling viewers to comment SNAP to get the full creative shoot process.

NEGATIVE PROMPT
Avoid cheap e-commerce lighting, flat product cutouts, muddy reflections, fake luxury materials, unreadable prompt UI, weak poster typography, inconsistent bottle shape, warped labels, plastic skin on beauty close-ups, noisy shadows, and robotic narration. Keep every asset premium and campaign-ready.

SPEECH PACK
[00:00-00:05]
Closest audible: Comment SNAP to get the full creative shoot process.
Safe paraphrase: Open with a keyword CTA tied to the full workflow.

[00:05-00:17]
Closest audible: Your client sends a trash iPhone photo and expects a full brand campaign, and here is the workflow.
Safe paraphrase: He frames the challenge and explains the early extraction and prompt-building steps.

[00:17-00:32]
Closest audible: Product extraction, 2x2 grid, LLM prompts, then Nano Banana or Flux, then upscale and Lightroom finish.
Safe paraphrase: He walks through the generation and finishing stack that turns one input image into multiple outputs.

[00:32-00:41]
Closest audible: Table top, on figure, lifestyle, print poster, one photo, one workflow; comment SNAP.
Safe paraphrase: Close by emphasizing output variety and repeating the CTA.
Video
GLOBAL LOCK:
Subject is a young blonde woman, light skin with warm undertones, approximately 20-25 years old. She has long, wavy blonde hair with curtain bangs. Her facial features are reminiscent of a pop star aesthetic (soft features, full lips). Wardrobe includes a pink camisole with strawberry prints and a white floral sundress. The environment for the subject is a soft-focus indoor bedroom or studio. The creator (narrator) is a blonde woman in her late 20s, wearing a black graphic t-shirt, holding a black Rode microphone. Creator's background is a dark studio with bookshelves and warm orange/red practical globe lights. Lighting for the subject transitions from flat/amateur to cinematic backlighting. Color grade is warm, saturated, and editorial.

[00:00–00:01]
Subject: Young blonde woman in pink strawberry-print camisole, sitting on a black stool.
Environment: Plain white wall background.
Action: Subject looks at the camera with a neutral, slightly pouty expression.
Framing: Medium shot, eye level.
Lighting: Flat, direct flash-style lighting, creating a slight shadow on the wall.
Motion: Static shot with a slight digital zoom-in.

[00:01–00:02]
Subject: Same woman, now in a white sundress with small red floral patterns.
Environment: Soft-focus indoor wall.
Action: Subject poses with a slight head tilt, looking into the camera.
Framing: Medium close-up.
Lighting: Soft, diffused side lighting.
Motion: Quick cut transition.

[00:02–00:03]
Subject: Same woman, close-up on face.
Action: She winks with her right eye and holds up a peace sign with her hand near her face.
Framing: Close-up (CU).
Lighting: Natural, low-contrast indoor light.
Motion: Quick cut transition.

[00:03–00:10]
Visual: iPhone screen recording showing the Google Gemini app interface. A circular overlay in the bottom center shows the creator speaking into a Rode microphone.
Action: On screen, the user selects "Create Image," uploads the winking photo, and pastes a long text prompt. The creator in the overlay is speaking and gesturing with her free hand.
Speech (Creator): "First head to Gemini and hit create image. Upload the photo, paste the prompt, hit generate, and just wait for the new image to be generated."
Cadence: Fast, instructional, energetic.
Mic Signature: Close-up, crisp, studio quality.

[00:10–00:14]
Subject: The creator (narrator) full screen.
Environment: Studio with bookshelves and warm orange/red glowing bulbs.
Action: Creator speaks directly to the camera, holding the Rode mic close to her mouth. She gestures towards the bottom of the screen.
Framing: Medium close-up (MCU).
Lighting: Dramatic studio lighting with warm rim lights.
Speech (Creator): "If you want the prompt I used, just comment 'prompt' and I'll send it over."
Cadence: Warm, inviting, clear call to action.
Lip Sync: High strictness; mouth movements must match the words "prompt" and "send it over."

NEGATIVE PROMPT:
Visual: distorted facial features, inconsistent hair color, blurry hands, extra fingers, flickering lights in background, low-resolution screen recording, robotic body movements, harsh shadows on face in the final "relit" version.
Speech: muffled audio, background hiss, robotic voice, misaligned lip-sync, long pauses between sentences, monotone delivery.

SPEECH PACK:
[00:03–00:10]
Transcript: "First head to Gemini and hit create image. Upload the photo, paste the prompt, hit generate, and just wait for the new image to be generated."
TAKE_A: (Fast, upbeat) "First head to Gemini and hit create image! Upload the photo, paste the prompt, hit generate... and just wait for the new image to be generated."
TAKE_B: (Instructional, steady) "First, head to Gemini and hit 'create image'. Upload your photo, paste the prompt, hit generate, and wait for the new image."
TAKE_C: (Casual, conversational) "So, first go to Gemini and click create image. You're gonna upload the photo, paste that prompt, hit generate, and then just wait for the magic."

[00:10–00:14]
Transcript: "If you want the prompt I used, just comment 'prompt' and I'll send it over."
TAKE_A: (Direct, friendly) "If you want the prompt I used, just comment 'prompt' and I'll send it over!"
TAKE_B: (Helpful, soft) "If you'd like the prompt I used here, just comment 'prompt' below and I'll send it right over to you."
TAKE_C: (Punchy, CTA-focused) "Want the prompt? Comment 'PROMPT' and I'll send it over now."

AI Interior Design Generator

AI interior design generator content is useful when it treats the room as a decision problem, not just a fantasy scene. People searching this topic often have a real living room, bedroom, or office they want to rethink. The strongest examples on this page should therefore help them compare layouts, decor directions, and style shifts in a way that stays connected to practical redesign choices.

This matters because room imagery is only valuable if it can guide action. A creator or homeowner needs more than a beautiful render. They need a sense of what a Scandinavian, modern, warm, or dramatic version of their space might actually feel like. When you compare examples here, focus on style clarity and whether the redesign direction feels believable enough to act on.

FAQ

What is an AI interior design generator best for?

It is best for testing room redesign directions, decor styles, and before-and-after concepts before buying furniture or committing to a makeover.

Why do people upload their real room photos?

They want to see a new style applied to a space they actually own or use, not just browse generic inspiration rooms.

What makes a strong interior example?

A strong example is style-specific and believable enough that you can use it as a reference for a real decor or layout decision.

What should I compare on this page?

Look for style clarity, room realism, and whether the redesign direction feels actionable rather than only decorative.