GLOBAL LOCK: A short-form tutorial reel hosted by a young light-skinned male creator in his early 20s with a slim build, short dark hair mostly hidden under a backwards black baseball cap, dark eyebrows, clean-shaven face, and a direct confident delivery style. He wears a black hoodie in a dark studio with magenta and blue edge lighting on his face and shoulders. Across the whole video, keep the creator visually consistent whenever he appears on camera. Alternate between direct-to-camera talking-head shots and desktop/screen-recording style inserts that show app interfaces, prompt builders, editing panels, and generated example outputs. The overall structure must feel like a practical creator education reel teaching how to make viral AI videos with ChatGPT, GPTs, Kling, and an editor workflow. Use social-video pacing, clear cut points, large readable interface elements, bold keyword captions, and crisp screen captures. Speech style is one energetic male speaker only, close-mic, dry room, high intelligibility, punchy cadence, creator-educator tone, with cuts landing on emphasized words.
[00:00–00:03] A hyper-stylized example montage opens the reel before the tutorial explanation begins. Show quick AI-generated insert shots: a yellow/orange plush-like character or pastry-like creature in a tiny kitchen set, exaggerated close framing, warm domestic lighting, toy-scale props, and a glossy social-media-ready finish. Add motion that feels like a viral AI clip rather than a static still: tiny hand gestures, object movement, short action beats, and a polished ad-like grade. Include large social-post overlays such as view counts or bold engagement graphics to imply virality. No host visible yet. No spoken words clearly visible on lips here if needed, or let the first line begin under the montage as voice-over. Audio should already feel like a tutorial hook.
[00:03–00:07] Cut hard to a centered talking-head medium close-up of the creator in the dark studio. The host looks straight into the lens and says the equivalent of “How to make viral videos,” with lips fully visible and sync strictness high. Frame him chest-up, camera at eye level, 35mm-to-50mm lens feel, shallow background, magenta-blue neon edge lights behind him. His expression is serious and helpful, with fast, clear articulation. The cut should feel like a strong tutorial promise after the flashy hook.
[00:07–00:12] Intercut between the host and the first example set. Show a vertical phone-style AI video example of a red cartoonish squishy character in a fleshy or surreal macro environment, then cut to generated household-object characters in a kitchen or interior setting, each with visible view-count overlays. Keep the host narration continuing over these inserts, explaining that viewers are asking how these kinds of videos are made. The examples should feel deliberately absurd, highly clickable, and visually varied. Maintain a social-app UI vibe on the inserts.
[00:12–00:16] Return to the host in the same neon studio framing. He explains that the process is easy or straightforward. Use a steady locked-off shot, close mic, no visible background clutter, and keep the delivery conversational but authoritative. Cut precisely on his emphasized keywords.
[00:16–00:21] Switch to screen-recording style visuals that show a desktop or browser workflow. Display recognizable AI tooling logos and interface tiles associated with ChatGPT, GPTs, custom tools, or image/video generation platforms. Cursor movement should be deliberate and readable. Then cut back briefly to the host as he explains the first step: going to GPTs or opening a custom GPT workflow. The speech remains one speaker, with no ambient distractions.
[00:21–00:27] Show actual interface navigation in a clean dark-themed desktop UI: menus, lists of GPTs, and prompt or tool panels. Include cursor clicks on fields and dropdowns. Briefly show a text or voice-input area and then a more advanced editing or story-generation screen. The host explains the setup step by step, describing where to go and what to choose. Keep the visuals aligned to the speech so every mention lands on the corresponding interface action.
[00:27–00:33] Continue in the software workflow with a tighter focus on prompt construction and asset preparation. Show text fields being filled, aspect-ratio settings such as 9:16, character/object references, and a “create story” or similar composition interface. Then reveal generated outputs: a stern milk-carton-like object character, a toast or bread-like character, and a colorful gadget character in a neon environment. The host explains that he is generating characters or story assets that can later be animated.
[00:33–00:39] Stay in screen-recording mode and move into the video-generation stage. Show the generated stills or character renders inside a platform interface, then a workflow where files are exported, selected, or prepared for upload into Kling AI or a comparable video generator. Interface panels should show thumbnails, upload areas, and generation controls. The host explicitly mentions Kling AI and a version number or model family, with cut-sync on the product name for emphasis.
[00:39–00:45] Demonstrate the final generation pipeline. Show the cursor uploading still images, selecting outputs, and previewing the finished short clips. Then display finished AI video shots of the angry milk-carton character and the colorful electronic character moving on their own in polished short scenes. The creator’s voice makes the pitch clear: upload the assets, run the generation, and turn them into videos like these. Keep the examples vivid and cute rather than realistic.
[00:45–00:48] End on the host back in the neon studio, now holding up a phone or printed visual reference while delivering the call to action. He tells viewers to comment for the prompt or follow for more. The shot is front-facing, centered, and slightly more animated than earlier, with confident hand motion and a creator CTA tone. Keep lips fully visible, close-mic audio dry and crisp, and land the final words right before the cut ends.
NEGATIVE PROMPT: inconsistent host identity, changing facial structure, different hats or wardrobe across talking-head shots, muddy UI text, unreadable screen captures, fake software logos replacing interface clarity, random extra speakers, robotic voice cadence, monotone narration, slurred words, lip-sync mismatch, soft unfocused screen recordings, flickering cursor, temporal jitter, duplicate objects in generated examples, malformed household characters, broken anatomy on host hands, blown-out neon highlights, crushed shadows hiding the face, excessive motion blur, abrupt camera zooms not present in the reference, noisy room echo, harsh sibilance, clipping, over-compressed dialogue, floating captions unrelated to speech, unrelated cutaway footage, low-resolution app panels, and generic “AI tutorial” visuals that ignore the specific ChatGPT-to-Kling workflow.
SHOT PROMPTS:
SHOT_01_HOOK: Viral AI example montage, tiny surreal kitchen set, pastry-like mascot, glossy toy-scale realism, warm light, social overlay metrics, ultra-clickable short-form hook.
SHOT_02_HOST_INTRO: Young male creator in backward black cap and black hoodie, neon magenta-blue studio, medium close-up, direct eye contact, says how to make viral videos, crisp close-mic tutorial delivery.
SHOT_03_EXAMPLES: Vertical examples of bizarre AI characters with high view overlays, red squishy mascot, household-object characters, meme-ready absurdity.
SHOT_04_GPTS_SETUP: Desktop UI with ChatGPT and GPT listings, cursor selecting custom GPT workflow, host explaining first setup step.
SHOT_05_PROMPT_BUILD: Dark-mode interface, text prompts, asset setup, aspect-ratio controls, create-story panel, generated character images appearing.
SHOT_06_KLING_STAGE: Exported character stills uploaded into Kling AI style interface, generation controls, preview windows, finished animated clips.
SHOT_07_CTA: Host returns to studio, holds visual reference, asks viewers to comment and follow, assertive creator-education ending.
SPEECH PACK
[00:00–00:03]
Closest audible transcript: "People keep asking how I make these viral AI videos."
Safe paraphrase: "A lot of people keep asking how these viral AI videos are made."
TAKE_A: [confident hook] People keep asking... how I make these viral AI videos.
TAKE_B: [fast, punchy] People keep asking how I make these viral AI videos.
TAKE_C: [teacherly emphasis] A lot of people keep asking how these viral AI videos are made.
Speaker: A
Lips visible: none or partial under montage
Lip-sync strictness: low
Mic-room signature: close mic, dry, clean, present
[00:03–00:07]
Closest audible transcript: "How to make viral videos."
Safe paraphrase: "Here is how to make viral AI videos."
TAKE_A: [direct] How to make viral videos.
TAKE_B: [slightly slower] Here's how to make viral AI videos.
TAKE_C: [emphasis on viral] How to make VIRAL videos.
Speaker: A
Lips visible: full
Lip-sync strictness: high
Cut sync: strong cut lands on "How"
[00:07–00:12]
Closest audible transcript: "A lot of you were asking me how these videos are made."
Safe paraphrase: "A lot of you asked how these kinds of videos get made."
TAKE_A: [friendly] A lot of you were asking me how these videos are made.
TAKE_B: [faster] A lot of you asked how these kinds of videos get made.
TAKE_C: [storytelling] So a lot of you have been asking... how these videos are actually made.
Speaker: A
Lips visible: mixed
Lip-sync strictness: medium
[00:12–00:16]
Closest audible transcript: "It's actually really easy."
Safe paraphrase: "It's way easier than people think."
TAKE_A: [reassuring] It's actually really easy.
TAKE_B: [casual] It's way easier than people think.
TAKE_C: [emphasis] This is actually super easy.
Speaker: A
Lips visible: full
Lip-sync strictness: high
[00:16–00:21]
Closest audible transcript: "Go to GPTs..."
Safe paraphrase: "First, open GPTs and start there."
TAKE_A: [instructional] Go to GPTs.
TAKE_B: [calm tutorial] First, open GPTs and start there.
TAKE_C: [step-by-step] Step one: go into GPTs.
Speaker: A
Lips visible: mixed
Lip-sync strictness: medium
[00:21–00:27]
Closest audible transcript: "Use any example..."
Safe paraphrase: "Use any example or template that fits the kind of video you want."
TAKE_A: [guide tone] Use any example that fits what you want to make.
TAKE_B: [clear] Use a template or example that matches the type of video you want.
TAKE_C: [slightly faster] Pick any example that lines up with the kind of video you're trying to make.
Speaker: A
Lips visible: partial
Lip-sync strictness: medium
[00:27–00:33]
Closest audible transcript: "Create... paste the... into..."
Safe paraphrase: "Create the assets, paste the prompt in, and set the format you want."
TAKE_A: [procedural] Create the assets, paste the prompt in, and set the format you want.
TAKE_B: [step-by-step] Build the assets, paste everything in, then choose your format.
TAKE_C: [faster tutorial cadence] Create it, paste the prompt, and set it up the way you need.
Speaker: A
Lips visible: mixed
Lip-sync strictness: medium
[00:33–00:39]
Closest audible transcript: "Go like Kling AI 2.6..."
Safe paraphrase: "Then take it into Kling AI and generate the motion from there."
TAKE_A: [brand emphasis] Then take it into Kling AI and generate the motion from there.
TAKE_B: [short] Next, use Kling AI for the video part.
TAKE_C: [tutorial tone] After that, bring the assets into Kling AI and run the generation.
Speaker: A
Lips visible: mixed
Lip-sync strictness: medium
Cut sync: emphasize "Kling AI"
[00:39–00:45]
Closest audible transcript: "Upload... and make videos like this."
Safe paraphrase: "Upload your images and turn them into videos like these."
TAKE_A: [instructional] Upload your images and turn them into videos like these.
TAKE_B: [punchy] Upload them... and make videos like this.
TAKE_C: [encouraging] Just upload the assets and you'll get videos like these.
Speaker: A
Lips visible: mixed
Lip-sync strictness: medium
[00:45–00:48]
Closest audible transcript: "Comment... follow..."
Safe paraphrase: "Comment if you want the prompt, and follow for more."
TAKE_A: [creator CTA] Comment if you want the prompt, and follow for more.
TAKE_B: [fast CTA] Comment for the prompt and follow for more.
TAKE_C: [friendly close] Drop a comment if you want it, and follow for more videos.
Speaker: A
Lips visible: full
Lip-sync strictness: high