GLOBAL LOCK: A vertical 9:16 social video led by a female-presenting virtual influencer with a straight neon-pink bob, blunt bangs, pale cool-neutral skin, glossy pink lips, subtle cat-eye liner, and a calm young-adult digital face. She sits upright in a black gaming chair in a warm bedroom-studio with blurred shelves, red decor, and a pink toy figure in the background. She wears an oversized black sweatshirt with giant warped white lettering across the chest. Keep the same identity, outfit, room layout, shallow depth of field, warm practical lighting, clean beauty-grade skin rendering, and direct-to-camera talking-head composition throughout. The camera language is mostly locked-off medium close-up framing on a 50mm-equivalent phone lens with occasional full-screen inserts and over-picture overlays. Speech style is upbeat UGC commentary with one on-camera speaker, dry close-mic sound, clear diction, lightly smiling cadence, and subtitle-driven editing. English captions are burned in with bold white text and selected yellow emphasis words, plus a second line of smaller Japanese subtitles. Lip sync should stay believable whenever the avatar face fills frame.
[00:00-00:06] A static medium close-up of the pink-haired virtual host speaking straight to camera in the chair. She introduces a brand called Nervous. Background remains softly blurred and warm, with shelf objects visible as color accents. Expression is friendly and informative, slight head movement, natural blink cadence, gentle mouth articulation. Subtitles appear centered in the lower third, with the key phrase emphasized in yellow. Audio is a single dry voice close to the mic, conversational and lightly energetic, with lips fully visible and high lip-sync strictness.
[00:06-00:11] Keep the same framing and room. The host explains that the brand is made by a Japanese designer who also works with 3D. Maintain steady locked camera, same light logic, and crisp avatar face render. The edit remains simple and speech-led, with emphasis words in yellow and Japanese translation below. Delivery is even, bright, and informative.
[00:11-00:16] Still on the host in the same setup. She says many of the designs are inspired by anime. The visual rhythm stays minimal so the viewer focuses on the spoken explanation and subtitles. Preserve the warm room tone and clean vertical social aesthetic.
[00:16-00:21] Keep the host as the anchor, but overlay or hard-cut to reference imagery tied to anime inspiration. Show a visual insert referencing the Longinus spear from Evangelion and Kurapika from Hunter x Hunter while the host continues speaking. The insert should feel like a mobile-first explainer collage, not a cinematic cutaway. Subtitle timing lands on the anime names for emphasis.
[00:21-00:27] Return to the host or keep her visible while an insert demonstrates that one of the glasses designs is modeled after the creator's own brand logo. Use a centered product or logo image overlay with bold subtitle emphasis on “my brand's logo.” Motion stays minimal, like a clean editorial explainer. Voice remains the same speaker, clear and close, with medium lip-sync priority if the insert covers the mouth.
[00:27-00:34] Show the host continuing the explanation that every piece is handmade, made from stainless steel. Insert a clean product-detail shot or isolated eyewear frame on a pale background to support the craftsmanship claim. Maintain the warm-to-neutral palette contrast between host footage and simple reference graphics. Subtitles keep the same bold English plus smaller Japanese translation format.
[00:34-00:42] The host explains that some other designs use deadstock frames, basically frames that were not used anymore and were recycled. Stay in the same talking-head setup with perhaps one or two quick supporting image overlays. Camera stays static, expression earnest and slightly excited, with small nods and precise mouth shapes. The edit cadence is still speech-first and education-first, not flashy.
[00:42-00:49] Show the host describing how much she loved working with these real materials. Insert a social-proof image or behind-the-scenes group photo in a vivid red-lit space featuring several people wearing or presenting the eyewear. This insert should create proof, texture, and community, while the host remains the narrative guide. Audio retains the same dry voice and smooth pacing.
[00:49-00:57] Return fully to the host for the closing CTA. She says everybody looked great in the pieces and asks who she should work with next, specifically which anime collaboration viewers want to see. Preserve the same locked framing, warm shelf background, pink hair, black sweatshirt, and subtitle treatment. End on a direct engagement prompt with a slight smile and clean stop, as if inviting comments.
NEGATIVE PROMPT: inconsistent avatar identity, changing hair color or haircut, realistic human skin pores replacing the clean digital beauty render, warped eyes, asymmetrical bangs, broken teeth, muddy lip-sync, rubbery mouth motion, extra fingers, deformed shoulders, floating chair edges, unstable shelf background, caption glitches, unreadable subtitles, wrong language subtitles, missing yellow emphasis words, flicker between host shots, noisy compression, camera shake, dramatic cinematic bokeh changes, random outfit changes, harsh blue lighting, incorrect inserts unrelated to anime or eyewear, logo hallucinations, melted metal glasses, jittery overlay compositing, robotic cadence, clipped consonants, harsh sibilance, plosives, over-compressed voice, distant room echo, and out-of-sync phrase timing.
SHOT PROMPTS:
Shot 1 anchor host: pink-bob virtual influencer in black oversized sweatshirt, seated in black gaming chair, warm bedroom shelf background, medium close-up, static camera, direct eye contact, clean beauty CGI render, lower-third English subtitles with yellow emphasis and smaller Japanese line.
Shot 2 anime reference insert: mobile-style explainer overlay showing anime inspiration references while host voice continues, crisp graphic insert, no cinematic transition flourishes.
Shot 3 logo and product insert: isolated logo-inspired eyewear visual or brand mark overlay, pale background, centered composition, minimal motion.
Shot 4 craftsmanship support: simple product detail image emphasizing handmade stainless steel construction, clean neutral backdrop.
Shot 5 social proof insert: group photo in strong red lighting with people gathered around branded eyewear, energetic but still framed for a vertical social reel.
Shot 6 closing host CTA: same talking-head setup, inviting question to audience about future anime collaborations.
SPEECH PACK:
[00:00-00:06] closest audible: “This is a brand called Nervous.” safe paraphrase: “I want to show you a brand called Nervous.” TAKE_A: bright and immediate, slight smile on “Nervous.” TAKE_B: slower intro with a short pause before the brand name. TAKE_C: punchier emphasis on “brand” and “Nervous.”
[00:06-00:11] closest audible: “Made by a Japanese designer who also works with 3D.” safe paraphrase: “The brand comes from a Japanese designer with a 3D background.” TAKE_A: informative and smooth. TAKE_B: mild emphasis on “Japanese designer.” TAKE_C: stronger emphasis on “works with 3D.”
[00:11-00:16] closest audible: “A lot of his designs are inspired from anime.” safe paraphrase: “Many of the pieces pull from anime references.” TAKE_A: neutral explanation. TAKE_B: slightly excited on “anime.” TAKE_C: brief pause before “inspired.”
[00:16-00:21] closest audible: “Longinus from Evangelion and Kurapika from Hunter x Hunter.” safe paraphrase: “He references anime icons like Evangelion's Longinus spear and Kurapika.” TAKE_A: crisp name reading. TAKE_B: more playful delivery on the franchise names. TAKE_C: deliberate pause between the two references.
[00:21-00:27] closest audible: “This design is based on my brand's logo.” safe paraphrase: “One of the shapes is modeled after the creator's logo.” TAKE_A: clean explainer tone. TAKE_B: slightly punch “brand's logo.” TAKE_C: short pause after “design.”
[00:27-00:34] closest audible: “Every piece is handmade and made from stainless steel.” safe paraphrase: “Each pair is handmade with stainless steel construction.” TAKE_A: admiration in tone. TAKE_B: slower and more premium. TAKE_C: stronger emphasis on “handmade.”
[00:34-00:42] closest audible: “Some of his other designs use deadstock frames, basically frames that aren't used anymore and he recycled.” safe paraphrase: “He also reworks unused deadstock frames into new designs.” TAKE_A: explanatory and steady. TAKE_B: clearer pause before “basically.” TAKE_C: emphasis on “recycled.”
[00:42-00:49] closest audible: “I loved working with these real materials.” safe paraphrase: “Working with the real materials was one of my favorite parts.” TAKE_A: warm and appreciative. TAKE_B: softer voice with reflective tone. TAKE_C: upbeat emphasis on “loved.”
[00:49-00:57] closest audible: “Everybody looked really good in them. Who do you think I should work with next? What anime should I collab with?” safe paraphrase: “Everyone looked great in the pieces, so tell me what anime collaboration I should do next.” TAKE_A: community-building and inviting. TAKE_B: playful question cadence at the end. TAKE_C: stronger emphasis on “next” and “anime.”
Delivery direction: one female-presenting young-adult digital speaker, warm and internet-native, medium pace around 130-145 WPM, clean articulation, light smile, expressive emphasis on nouns, short pauses before brand or anime names. Mic-room signature should feel close, dry, low-noise, lightly compressed, with no music overpowering the voice. Lip-sync strictness is high in host shots and medium during full-screen inserts.