A) MISE EN PLACE
Reference summary
- Duration: 00:58.26
- Format: vertical 9:16, 720x1280, 24 fps
- Structure: tutorial reel combining talking-head presentation, interface demo, output examples, and CTA
- Audio: direct-to-camera spoken narration over tutorial visuals; transcript partly inferred from on-screen text, pacing, and the caption
Scene / shot segmentation
1. 00:00.00-00:04.50
Hook section. AI-generated snowy valley / blocky stylized environment fills the background while large centered white text reads “How to do this with FREEPIK.” Presenter appears as a cutout talking head seated in a chair at the bottom of frame.
2. 00:04.50-00:10.50
Fast examples and UI preview. The background alternates between a Freepik/Kling workflow poster and a glossy “Real Estate” example card on purple abstract waves. Presenter keeps gesturing directly to camera.
3. 00:10.50-00:26.00
Workflow explanation. Interface screens, prompt windows, and tutorial cards dominate the frame while presenter remains pinned lower center. Visual emphasis moves between chat-like instruction blocks and editing software panels.
4. 00:26.00-00:40.50
Technical implementation section. Close views of the prompt box, Freepik generation settings, and editing timeline appear, including readable concepts like a living-room camera glide prompt and After Effects layer names for day/night states.
5. 00:40.50-00:50.50
Result showcase. Bright luxury living-room renders and a moving circular ring/portal effect appear while presenter continues the explanation.
6. 00:50.50-00:58.26
CTA finish. Large text “Comment ‘AI’” fills the upper half while workflow graphics and the tutorial poster stack below it. Presenter lands the call to action.
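The segmentation above can be sanity-checked with a short script. This is a minimal sketch, assuming the six timecode ranges are copied by hand from the list; the helper names `to_seconds` and `check_contiguous` are illustrative and do not belong to any tool named in this breakdown.

```python
from decimal import Decimal

# Timecode ranges copied from the scene segmentation (MM:SS.ss).
SEGMENTS = [
    ("00:00.00", "00:04.50"),
    ("00:04.50", "00:10.50"),
    ("00:10.50", "00:26.00"),
    ("00:26.00", "00:40.50"),
    ("00:40.50", "00:50.50"),
    ("00:50.50", "00:58.26"),
]
TOTAL = "00:58.26"  # duration from the reference summary


def to_seconds(timecode: str) -> Decimal:
    """Convert an MM:SS.ss timecode to seconds (Decimal avoids float drift)."""
    minutes, seconds = timecode.split(":")
    return Decimal(minutes) * 60 + Decimal(seconds)


def check_contiguous(segments, total):
    """Return per-segment durations; raise if there is a gap, overlap, or shortfall."""
    durations = []
    cursor = Decimal("0")
    for start, end in segments:
        start_s, end_s = to_seconds(start), to_seconds(end)
        if start_s != cursor:
            raise ValueError(f"gap or overlap at {start}")
        if end_s <= start_s:
            raise ValueError(f"non-positive duration at {start}")
        durations.append(end_s - start_s)
        cursor = end_s
    if cursor != to_seconds(total):
        raise ValueError("segments do not cover the full duration")
    return durations


DURATIONS = check_contiguous(SEGMENTS, TOTAL)
print([str(d) for d in DURATIONS])
```

If a range is later retimed, rerunning the check flags any gap or overlap before the shotlist durations drift out of sync.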
Visual evidence keyframes
- 00:00.00: stylized snowy scene, bold white hook text, presenter bottom center
- 00:06.00: Freepik workflow card on screen with presenter gesturing
- 00:09.00: “Real Estate” result frame over glossy purple abstract background
- 00:15.00: dark UI and chat box explaining workflow steps
- 00:30.00: prompt box visible with cinematic living-room camera-glide prompt and blue Generate button
- 00:39.00: After Effects timeline with project label “Freepik Kling O1 Style Transfer” and DAYTIME / NIGHTTIME layers
- 00:48.00: interior render showcase with presenter still lower frame
- 00:55.00: bold “Comment ‘AI’” CTA card and Freepik branding
Speech evidence (best-effort)
- speaker_count: 1
- speaker A: male-presenting presenter, on-camera for most of the reel
- speech style: energetic tutorial narration, direct address, short explanatory bursts, occasional emphasis gestures matching the cuts
- likely content themes in order:
1) hook about how to create the shown transition with Freepik
2) quick proof that the style-transfer result works for practical use cases such as real estate
3) walkthrough of workflow steps and prompt usage
4) implementation notes around Kling / Freepik / After Effects
5) closing CTA to comment “AI” for prompts, images, or resources
- lip visibility: full; presenter is visible and speaking in most segments
- lip_sync_strictness: medium for recreation, because mouth motion is visible but precise wording is not the main retention driver
Invariants list (LOCK THESE)
- presenter identity: white-presenting man in his late 20s to early 30s with medium brown hair, short beard and moustache, blue baseball cap with yellow front logo, muted teal/gray athletic t-shirt with cream shoulder stripes, seated in a black chair
- layout: presenter cutout anchored near the bottom center while backgrounds switch between AI outputs, UI screenshots, tutorial cards, and editing timelines
- design language: dark backdrop or interface-heavy background, bold white headline typography for hooks and CTA, high-contrast tutorial-card overlays
- product context: Freepik and Kling style-transfer workflow, prompt box, generation settings, result previews, After Effects implementation
- motion grammar: rapid jump cuts every few seconds, presenter hand gestures synced to emphasis points, no cinematic camera move inside the talking-head layer
- lighting / grade: evenly lit presenter with soft frontal light, slightly warm skin tones, clean creator-video look
- audio style: concise teaching narration, upbeat but clear, no cinematic acting, creator-education cadence
Variables list (TWEAK THESE)
- exact scenic examples used behind the presenter
- exact software screens and UI crop choices
- precise phrasing of the narration
- title copy variations, as long as the first frame still clearly states the tutorial promise
- CTA wording around “Comment AI,” while preserving the comment-driving mechanic
B) SHOTLIST
Shot 1
- shot_id: 1
- timecode_start: 00:00.00
- timecode_end: 00:04.50
- duration: 4.50s
- framing: presenter lower-third cutout over a full-screen AI landscape, text centered above him
- lens: presenter framed like a webcam / phone medium close-up
- camera movement: static presenter crop; background video may subtly move
- subject: presenter talks directly to camera with open-hand gestures
- environment: snowy stylized AI environment in background
- lighting: soft, even, creator-studio frontal light on presenter
- color grade: bright scenic background contrasting with the darker, shadowed edges of the presenter cutout
- speech/audio: Speaker A introduces the tutorial promise, roughly “how to do this with Freepik”
- must match: instant value proposition and brand/tool clarity in frame one
Shot 2
- shot_id: 2
- timecode_start: 00:04.50
- timecode_end: 00:10.50
- duration: 6.00s
- framing: stacked workflow poster and result cards with presenter pinned at bottom
- lens: presenter crop unchanged
- camera movement: brisk editorial cuts between background examples
- subject: presenter continues gesturing while visual proof of output quality appears
- environment: Freepik workflow graphic, glossy purple abstract background, “Real Estate” sample card
- lighting: presenter remains constant; backgrounds are saturated and polished
- speech/audio: Speaker A explains what kind of transition or use case is being shown
- must match: quick proof section before deep tutorial
Shot 3
- shot_id: 3
- timecode_start: 00:10.50
- timecode_end: 00:26.00
- duration: 15.50s
- framing: interface screens and text panels dominate; presenter cutout remains lower center
- lens: medium crop on presenter
- camera movement: fast cuts, no slow camera motion
- subject: presenter emphasizes workflow steps with hand motions
- environment: dark UI panels, text blocks, buttons, workflow poster
- lighting: consistent creator lighting
- speech/audio: Speaker A explains the process step by step in short sentences
- must match: tutorial credibility through actual software views
Shot 4
- shot_id: 4
- timecode_start: 00:26.00
- timecode_end: 00:40.50
- duration: 14.50s
- framing: close interface crops, prompt box, settings, editing timeline
- lens: presenter crop unchanged, background takes priority
- camera movement: screen swaps and hard cuts
- subject: presenter points and speaks; UI shows prompt engineering and compositing logic
- environment: prompt panel with generation controls, After Effects timeline, day/night layer naming
- lighting: neutral creator lighting
- speech/audio: Speaker A gets more tactical, likely naming tools and steps
- must match: explicit practical detail, not vague inspiration talk
Shot 5
- shot_id: 5
- timecode_start: 00:40.50
- timecode_end: 00:50.50
- duration: 10.00s
- framing: output preview takes over, presenter still present
- lens: medium crop on presenter, wide interior examples in background
- camera movement: brisk output showcase cuts
- subject: presenter reinforces the use case and payoff
- environment: luxury living room renders, daylight and nighttime mood variants, circular portal/ring effect
- lighting: bright interiors contrast with dark tutorial backdrop
- speech/audio: Speaker A summarizes why the workflow feels premium / dynamic
- must match: result proof after the technical section
Shot 6
- shot_id: 6
- timecode_start: 00:50.50
- timecode_end: 00:58.26
- duration: 7.76s
- framing: large CTA text in upper frame, workflow graphics below, presenter lower center
- lens: presenter crop unchanged
- camera movement: mostly static CTA hold with minor cut refreshes
- subject: presenter lands the final ask
- environment: “Comment ‘AI’” headline, Freepik poster, dark background
- lighting: consistent creator lighting
- speech/audio: Speaker A invites comments to receive prompts / images / resources
- must match: strong comment-driving CTA at the end
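For editing, the shot timecodes can be converted to frame ranges at the 24 fps stated in the reference summary. This is a minimal sketch, not a required step of the workflow; rounding to the nearest frame and the half-open interval convention are assumptions, and `frame_range` is an illustrative helper name.

```python
FPS = 24  # frame rate from the reference summary

# shot_id -> (start_seconds, end_seconds), copied from the shotlist.
SHOTS = {
    1: (0.00, 4.50),
    2: (4.50, 10.50),
    3: (10.50, 26.00),
    4: (26.00, 40.50),
    5: (40.50, 50.50),
    6: (50.50, 58.26),
}


def frame_range(start_s, end_s, fps=FPS):
    """Map a [start, end) time span to (first_frame, last_frame), both inclusive."""
    first = round(start_s * fps)
    last = round(end_s * fps) - 1  # the end timecode belongs to the next shot
    return first, last


FRAME_RANGES = {shot_id: frame_range(*span) for shot_id, span in SHOTS.items()}
FRAME_COUNTS = {shot_id: last - first + 1 for shot_id, (first, last) in FRAME_RANGES.items()}

for shot_id, (first, last) in FRAME_RANGES.items():
    print(f"shot {shot_id}: frames {first}-{last} ({FRAME_COUNTS[shot_id]} frames)")
```

Because 58.26 s is not a whole number of frames at 24 fps, the final shot is rounded to the nearest frame under this convention.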
C) STYLE BIBLE (GLOBAL)
- visual_style: creator tutorial reel, clean UGC educator format, software-demo montage
- camera_signature: static cutout presenter layer plus rapidly changing background plates
- lighting_signature: soft frontal light on presenter with minimal drama, practical “studio desk creator” feel
- grade_signature: presenter stays warm-neutral while the backgrounds alternate between vibrant AI outputs and dark UI panels
- texture_signature: crisp app screenshots, bold text overlays, clean edges around the presenter cutout
- pacing_signature: immediate hook, proof fast, tutorial core in the middle, results near the end, CTA close
- speech_style: direct-to-camera educational narration
- speaker_profile: energetic male creator voice, conversational, confident, tutorial-first
- pronunciation_profile: relaxed but clear English, medium pace, emphasis on tool names and steps
- mic_mix_profile: dry creator audio, intelligible, lightly compressed, optimized for phone playback
D) PROMPT SYNTHESIS
MASTER PROMPT
GLOBAL LOCK: Create a vertical 9:16 creator tutorial reel. Keep one white-presenting male presenter in his late 20s to early 30s visible as a cutout near the bottom center for most of the video. He has medium brown hair, short beard, blue baseball cap with a yellow logo patch, muted teal-gray athletic t-shirt with cream stripes on the shoulders, and sits in a black office chair. He speaks directly to camera with energetic but clear tutorial cadence, frequent hand gestures, and a creator-education tone. The background changes rapidly between AI-generated example footage, Freepik / Kling workflow cards, software UI close-ups, prompt boxes, editing timelines, luxury interior outputs, and large white CTA text. Lighting on the presenter remains soft and even, like a YouTube short-form setup. The reel should feel premium, practical, and scroll-stopping, not chaotic. Keep typography bold and readable, especially in the opening hook and final CTA.
[00:00-00:04.50] Open with a dreamy stylized snowy valley or blocky cinematic environment filling the frame. Place the presenter as a bottom-center cutout, talking directly to camera with open-hand gestures. Large bold white text appears centered above him: a clear promise equivalent to “How to do this with FREEPIK.” Keep the frame immediately readable in under one second. Speaker A introduces the tutorial in a punchy sentence, upbeat, direct, and creator-friendly, lips fully visible, medium lip-sync strictness.
[00:04.50-00:10.50] Cut through a fast sequence of proof visuals while the presenter continues talking in the same lower-center position. Show a Freepik/Kling workflow poster, then a polished result card such as a real-estate style transformation over glossy purple abstract graphics. Keep the presenter gesturing to emphasize that this is a real usable workflow, not just a concept. Speaker A explains the type of transition and why it feels premium or dynamic. Maintain crisp readable branding and high contrast.
[00:10.50-00:26.00] Shift into the tutorial core. Background becomes darker UI panels, instruction cards, and software screens. The presenter keeps speaking with concise direct-teaching cadence and emphatic hand motions. Alternate between chat-like explanation boxes, workflow graphics, and screen recordings of the process. Keep every cut purposeful and easy to parse. Speaker A explains the steps in plain English, likely calling out the tool stack and the logic of layering AI visuals over video. Lips remain visible; sync important sentence accents to cut points.
[00:26.00-00:40.50] Push into the tactical detail section. Show a prompt interface with a large text box, generation controls, aspect ratio settings, and a blue generate button. Include a prompt concept like a smooth forward camera glide through a high-end living room with a floating ring and natural daylight. Then cut to an editing timeline such as After Effects with project naming around Freepik Kling style transfer and layers labeled DAYTIME and NIGHTTIME. The presenter continues speaking, now more instructional and specific, with slightly sharper emphasis on key terms. Keep the background sharp enough that viewers can read it as real software.
[00:40.50-00:50.50] Move into result showcase mode. Show bright luxury interior renders, window-heavy living rooms, daytime and nighttime examples, and a circular portal/ring motif suggesting the style-transfer effect. The presenter remains lower center, speaking with a satisfied “here is the result” energy. Cuts are brisk but less dense than the middle tutorial section so the viewer can appreciate the output quality.
[00:50.50-00:58.26] Finish with a comment CTA. Large bold white text equivalent to “Comment ‘AI’” fills the upper half of the frame while workflow graphics and the Freepik poster stack beneath it. The presenter looks into camera and lands a direct ask for viewers to comment in exchange for prompts, images, or workflow help. Keep the final frame highly screenshot-able and optimized for engagement comments. Lips visible, clear final emphasis on the call to action.
NEGATIVE PROMPT
Avoid messy cutout edges around the presenter, unreadable UI text, distorted hands, warped face identity, random wardrobe changes, off-brand tool names, muddy screen captures, cluttered overlapping graphics, weak hook typography, low-contrast captions, overdone motion graphics, cinematic shallow-depth glamour shots, robotic narration, slurred speech, lip-sync mismatch, clipped audio, heavy reverb, harsh de-essing, background noise pumping, strobing transitions, flicker, frame jitter, generic stock-office imagery, and CTA text that is too small to read on mobile.
SHOT PROMPTS
- Hook shot delta: snowy cinematic AI background, bold white tutorial text, presenter lower center
- Proof shot delta: workflow poster plus flashy real-estate sample, presenter gesturing
- Tutorial shot delta: dark UI screens, explanation boxes, practical workflow overlays
- Prompt shot delta (Shot 4, prompt portion): close prompt interface with readable cinematic living-room prompt and generate button
- Editing shot delta (Shot 4, timeline portion): After Effects timeline with DAYTIME and NIGHTTIME layer logic
- Result shot delta: high-end interior showcase and moving ring motif
- CTA shot delta: giant Comment “AI” text with branded workflow poster below
SPEECH PACK
Timecoded transcript (best-effort observable reconstruction)
- [00:00.00-00:04.50] Speaker A: “Here’s how to do this with Freepik.” Emotion: upbeat, hook-first, medium-fast pace.
- [00:04.50-00:10.50] Speaker A: “This workflow gives you a clean cinematic style-transfer transition, and it works for polished use cases.” Emotion: confident, explanatory.
- [00:10.50-00:26.00] Speaker A: “I’m showing the process step by step so you can layer AI visuals over your video inside Freepik and Kling.” Emotion: practical, tutorial-focused.
- [00:26.00-00:40.50] Speaker A: “Use a clear motion prompt, generate the shot, then bring it into your edit and organize the effect layers.” Emotion: precise, more technical, medium pace.
- [00:40.50-00:50.50] Speaker A: “This is where it starts to feel premium because the transition adds movement and visual depth.” Emotion: reinforcing payoff.
- [00:50.50-00:58.26] Speaker A: “Comment ‘AI’ if you want the prompts, images, or Freepik workflow.” Emotion: direct CTA, slightly punchier emphasis.
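A rough pacing check can show whether each reconstructed line plausibly fits its time slot. This is a minimal sketch, assuming a naive whitespace word count is a good-enough proxy for spoken length; `words_per_second` is an illustrative helper, and the lines are the best-effort reconstructions above, not a verified transcript.

```python
# Best-effort transcript lines with their time slots in seconds.
LINES = [
    (0.00, 4.50, "Here's how to do this with Freepik."),
    (4.50, 10.50, "This workflow gives you a clean cinematic style-transfer "
                  "transition, and it works for polished use cases."),
    (10.50, 26.00, "I'm showing the process step by step so you can layer AI "
                   "visuals over your video inside Freepik and Kling."),
    (26.00, 40.50, "Use a clear motion prompt, generate the shot, then bring it "
                   "into your edit and organize the effect layers."),
    (40.50, 50.50, "This is where it starts to feel premium because the "
                   "transition adds movement and visual depth."),
    (50.50, 58.26, "Comment 'AI' if you want the prompts, images, or Freepik workflow."),
]


def words_per_second(start, end, text):
    """Naive speaking rate: whitespace-separated words divided by slot length."""
    return len(text.split()) / (end - start)


WORD_COUNTS = [len(text.split()) for _, _, text in LINES]
RATES = [round(words_per_second(*line), 2) for line in LINES]

for (start, end, _), rate in zip(LINES, RATES):
    print(f"{start:05.2f}-{end:05.2f}s: {rate} words/s")
```

Slots with a comparatively low rate, such as the two long middle shots, suggest the reconstructed line summarizes a longer spoken passage. That reading is an assumption, since the original delivery pace was not measured.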
TAKE_A
- Keep the wording close to the lines above with confident creator-teacher cadence.
TAKE_B
- Same meaning but slightly faster and more sales-forward, with stronger emphasis on tool names and “Comment AI.”
TAKE_C
- Same meaning but slightly calmer, more instructional, and less hype-heavy.
Closest audible version
- Because the audio was not transcribed word for word, treat the lines above as the closest observable reconstruction of the tutorial's intent, anchored to on-screen text, pacing, and the caption.
Safe paraphrase version
- The reel teaches how to recreate a cinematic style-transfer transition in Freepik/Kling, shows the workflow, and ends by asking viewers to comment “AI” for the assets.