This new worldbuilding technology is going to change marketing. You can now use AI to put yourself in a 3D world. You can control the scene, the motion, the movement, you can even lipsync the audio to match any vocal reference you’ve recorded. If you wanna do this for yourself, these are the tools I used. And shoutout Rourke, he’s the OG for this.

First you need spatial reference photos of your world. You can use World Labs, Genie 3, or even a VR headset. We use World Labs cause they have a bunch of different preset environments you can pick from for free. But also, you can upload any photo and it will turn it into a 3D spatial world for you. When you’re in the world, you want to take screenshots from a bunch of different perspectives. I like to think of these as sets, that you can build on top of. Once you have shots of the sets, splice yourself into the scenes using Google’s Nano Banana image model. We use Freepik to do this and it’s super super easy. When you have the combined image, you can take these into an AI video generator like Veo 3 or Sora 2 to bring them to life. And here’s the sauce layer on top… If you want to lipsync so your character talks smoothly you can use Wan 2.2, Infinite Talk, or the new Veo 3.1 video model. And that’s the whole stack.

Now to me, this workflow is the canary in the coal mine for where marketing is going. If you’re a clothing brand, you’re gonna be able to build a world that becomes a collection of sets for your fans to make content in. My favorite brands are Kith and Represent, so we just took lifestyle photos from their recent shoots and turned them into playable, immersive environments. We are just in the early innings of what synthetic experiential marketing is gonna look like. And I’m telling you, worldbuilding is the next frontier.

Follow @kallaway for more videos like this. #ai #artificialintelligence #tech #technology #marketing #worldbuilding #experientialmarketing #newtech #world
How kallaway Made This AI Worldbuilding Marketing Video
This case study analyzes a high-impact tech-educational video by creator @kallaway, focusing on the emerging frontier of AI-driven worldbuilding. The video demonstrates how creators can use a stack of AI tools to "teleport" themselves into fully realized 3D environments, effectively turning a simple home office into a cinematic digital set. Utilizing a mix of cinematic editorial portraiture, iPhone-style UGC (User Generated Content) authenticity, and high-end 3D renders, the video captures the "future of marketing" aesthetic. Key visual elements include warm, motivated lighting in indoor scenes contrasted with the vibrant, saturated palettes of AI-generated tropical beaches and futuristic interiors. By blending technical tutorial value with high-production "magic" reveals, the content targets indie creators and marketers looking to scale their visual storytelling without a Hollywood budget. The core hook relies on the "synthetic experiential marketing" concept, promising a world where brands and fans can co-create in immersive, playable digital spaces.
What You’re Seeing: A Visual Breakdown
The video is a masterclass in dynamic editing, alternating between "talking head" segments and high-fidelity AI B-roll. The subject, a male creator in a black KITH long-sleeve and baseball cap, maintains a consistent persona while the world around him shifts seamlessly. The lighting transitions from a moody, purple-rimmed studio setup to bright, naturalistic outdoor environments. The editing rhythm is fast-paced, using "punch-ins" (digital zooms) to emphasize key points and text overlays to reduce the cognitive load of technical explanations.
Shot-by-Shot Breakdown
| Time Range | Visual Content | Shot Language | Lighting & Tone | Viewer Intent |
|---|---|---|---|---|
| 00:00–00:02 | Tropical beach with a tiny figure in the distance. | Wide Shot (WS) / Drone feel | Vibrant, high-saturation turquoise | Hook: Create scale and wonder. |
| 00:02–00:05 | Creator appears in a circular overlay, then full screen. | Medium Close-Up (MCU) | Natural day-light feel | Establish authority and subject. |
| 00:05–00:08 | Creator walking in a 3D jungle path. | Side Profile / Tracking | Dappled sunlight, cinematic | Demonstrate "immersion" reality. |
| 00:12–00:17 | Creator sitting in a wicker chair in a modern home. | Medium Shot (MS) | Warm, airy, high-key | Showcase "Digital Set" application. |
| 00:18–00:24 | Rapid montage of AI tool interfaces (World Labs). | Screen Recording / UI | Clean, tech-focused | Tutorial value: "How it's done." |
| 00:38–00:45 | Splicing UI showing multiple reference images. | Grid Layout / UI | Neutral, functional | Reinforce consistency mechanism. |
| 01:06–01:12 | Models in a subway and crosswalk (AI generated). | Cinematic MS / Handheld feel | Moody, urban, high contrast | Expand use-case to fashion/brands. |
Why It Went Viral: The Mechanics of "Magic"
The "Future-Proofing" Hook
This video taps into the psychology of early adoption. By framing AI worldbuilding as the "next frontier" of marketing, it triggers a "Fear Of Missing Out" (FOMO) in professional creators and marketers. The topic selection isn't just about a cool tool; it's about a fundamental shift in how content is produced. It addresses the pain point of high production costs by offering a "synthetic" alternative that looks nearly indistinguishable from reality. This "magic trick" element—seeing a person in a room one second and a beach the next—is a primal attention hook that works through sheer visual novelty.
The "Tool-Stack" Authority
The creator doesn't just show one tool; he demonstrates a workflow. This "Tool Stacking" (World Labs + Google Nano Banana + Freepik + Sora/Veo) creates a high perceived value. Viewers save the video not just because it's cool, but because it serves as a technical roadmap. The mention of specific, high-profile models like "Sora 2" or "Veo 3" leverages the hype cycles of major tech companies, positioning the creator as an insider with access to cutting-edge tech.
Platform Signals & Algorithm Triggers
From a platform perspective, the video excels in Watch Time and Saves. The 0–3 second hook uses a massive scale shift (tiny person on a huge beach) which is visually arresting. The pacing is relentless; there is a visual change or a new piece of information every 1.5 to 2 seconds, preventing "scroll-away" boredom. The use of yellow bold subtitles makes the video consumable even without sound, a critical factor for Instagram's "sound-off" browsing habit. The "Save" rate is likely driven by the tutorial nature—users want to reference the tool names later.
5 Testable Viral Hypotheses
- The "Magic Portal" Effect: If you show a seamless transition from a mundane setting to an extraordinary one, watch time increases by 40% due to the "how did they do that?" factor.
- The Authority Name-Drop: Mentioning unreleased or "beta" tools (e.g., Sora 2) increases shares by 25% as users want to spread "insider" news.
- The Brand Association: Using a well-known brand (KITH) as a case study makes the abstract tech feel "real-world" and applicable.
- The "UGC-to-CGI" Contrast: Starting with a low-fi talking head and transitioning to high-fi CGI creates a "Production Value Gap" that keeps viewers engaged.
- The "Playable World" Promise: Using keywords like "immersive" and "playable" triggers interest from the gaming and tech-enthusiast demographics.
How to Recreate: From 0 to 1
Step 1: Topic Selection & Positioning
Identify a "Future Tech" angle. This works best for accounts in the Marketing, AI, Filmmaking, or E-commerce niches. Your goal is to show a "shortcut" to high-end results.
Step 2: Capture Your "Base" Persona
Film yourself in a neutral environment (like a home office) with consistent lighting. Wear something with a clear silhouette or a recognizable logo (like the KITH shirt) to help the AI maintain character consistency.
Step 3: Generate the "World" Reference
Use a tool like World Labs to create your navigable 3D environment (Midjourney works if you only need flat 2D reference images rather than a true spatial world). Prompt Tip: "Cinematic 3D render of a [Location], 8k, photorealistic, architectural photography style."
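If you plan to generate several "sets," it helps to template the prompt tip above rather than retype it. A minimal sketch; the function name and the idea of batching are my own, and the template wording comes straight from the tip:

```python
def world_prompt(location: str, style: str = "architectural photography style") -> str:
    """Fill the prompt template from the tip above for a given location.

    Adjust the trailing style keywords per tool -- different models
    respond to different style vocabulary.
    """
    return f"Cinematic 3D render of a {location}, 8k, photorealistic, {style}"

# One prompt per "set" you plan to build on later
sets = ["tropical beach at golden hour", "modern living room with wicker chair"]
prompts = [world_prompt(s) for s in sets]
```

Keeping the template in one place means every environment in your world shares the same rendering vocabulary, which makes the final edit feel like one coherent location rather than a collage of styles.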
Step 4: Create Spatial Reference Photos
Take 5-10 screenshots of your generated world from different angles (Wide, Medium, Close-up). These will act as your "Digital Sets."
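With 5-10 screenshots per set across multiple sets, filenames get messy fast. A tiny helper that enforces a predictable set/angle/take naming scheme; the convention itself is my own suggestion, not from the video:

```python
ANGLES = ["wide", "medium", "closeup"]

def set_filename(set_name: str, angle: str, take: int = 1) -> str:
    """Build a predictable filename like 'beach_wide_01.png' for a digital set."""
    if angle not in ANGLES:
        raise ValueError(f"angle must be one of {ANGLES}")
    return f"{set_name}_{angle}_{take:02d}.png"
```

Consistent names pay off in Step 5, when you need to match each spliced subject image back to the exact set and camera angle it came from.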
Step 5: Splicing the Subject
Use an image-to-image tool (like Freepik or Google Nano Banana) to place a photo of yourself into these screenshots. Use "Reference Image" features to ensure your face and clothes stay consistent.
Step 6: Video Generation (The Motion)
Upload your spliced images into a video model like Luma Dream Machine, Kling, or Runway Gen-3. Prompt: "The man in the black cap waves at the camera, subtle wind in the trees, cinematic camera movement."
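If you run this step through a provider's API instead of a web UI, the request generally pairs a start image with a motion prompt. A hedged sketch of assembling such a payload; the field names (`image`, `prompt`, `duration_seconds`, `aspect_ratio`) are generic placeholders, not the actual schema of Luma, Kling, or Runway, so check the provider's docs before sending anything:

```python
def image_to_video_payload(image_path: str, prompt: str,
                           duration_s: int = 5, aspect: str = "9:16") -> dict:
    # Generic payload shape for an image-to-video request. Real APIs
    # each use their own field names, auth, and endpoints.
    return {
        "image": image_path,           # the spliced "digital set" frame
        "prompt": prompt,              # describe MOTION, not the scene itself
        "duration_seconds": duration_s,
        "aspect_ratio": aspect,        # vertical for Reels/TikTok
    }

payload = image_to_video_payload(
    "sets/beach_medium_01.png",
    "The man in the black cap waves at the camera, subtle wind in the trees, "
    "cinematic camera movement",
)
```

Note the comment on `prompt`: the image already carries the scene, so the prompt's job is only to describe what moves and how the camera behaves.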
Step 7: Lip-Syncing for Realism
To make your AI avatar talk, use LivePortrait or Sync Labs. Upload your generated video and a voice recording of your script. This ensures the mouth movements match your actual speech.
Step 8: Final Edit & Overlay
Assemble the clips in CapCut. Add dynamic captions (Yellow/White), fast transitions, and a tech-forward BGM. Use "Punch-ins" every time you introduce a new tool name.
Growth Playbook: Distribution & Scaling
3 Ready-to-Use Opening Hooks
- "Stop building sets. Start building worlds. Here’s how..."
- "I just teleported to a 3D beach using 3 AI tools. Marketing is changing forever."
- "This is the 'Canary in the Coal Mine' for the future of content creation."
Caption Template: The "Insider Roadmap"
Hook: You can now put yourself in any 3D world using AI. 🤯
Value: I used to spend thousands on location scouts. Now, I use World Labs and Sora to build digital sets in minutes. Here’s the exact stack...
Engagement: Which brand do you think will be the first to launch a 'playable' world? Let me know below! 👇
CTA: Follow for more AI workflows that actually save you time.
Hashtag Strategy
- Broad (Reach): #AI #Marketing #FutureTech #ContentCreator
- Mid-Tier (Niche): #WorldBuilding #AIVideo #SyntheticMedia #DigitalMarketing
- Long-Tail (Specific): #WorldLabsTutorial #SoraAI #KallawayStyle #AIWorkflows
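When you post multiple variants, the three-tier mix above is easy to assemble programmatically. A small sketch; the tier names and tags come from this playbook, while the helper and the per-tier cap are mine:

```python
TIERS = {
    "broad":    ["#AI", "#Marketing", "#FutureTech", "#ContentCreator"],
    "mid":      ["#WorldBuilding", "#AIVideo", "#SyntheticMedia", "#DigitalMarketing"],
    "longtail": ["#WorldLabsTutorial", "#SoraAI", "#KallawayStyle", "#AIWorkflows"],
}

def hashtag_block(per_tier: int = 3) -> str:
    """Take up to `per_tier` tags from each tier and join them into one caption line."""
    tags = []
    for tier in ("broad", "mid", "longtail"):
        tags.extend(TIERS[tier][:per_tier])
    return " ".join(tags)
```

Capping each tier keeps the total tag count restrained while preserving the reach/niche/specific spread the strategy calls for.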
Frequently Asked Questions
What tools make the character look the most consistent?
Using "Image References" in tools like Freepik or Midjourney's --cref (Character Reference) is essential for keeping your face the same.
What are the 3 most important words in the prompt?
"Photorealistic," "Cinematic Lighting," and "Subject Consistency."
Why does the generated face look inconsistent?
Usually due to low-quality reference photos or not using a dedicated "Face Swap" or "Character Lock" feature in the AI model.
How can I avoid making it look like 'uncanny' AI?
Keep the motion subtle; high-intensity movement often breaks the AI's understanding of anatomy.
Is it easier to go viral on Instagram or TikTok with this?
Instagram currently favors high-production "aesthetic" AI content, while TikTok prefers the "raw" tutorial process.