Text to Video

Tokyo Travel Vlogger Collage — AI Video Prompt

A detailed image generation prompt for a 13-frame grid collage capturing various candid moments of a travel vlogger exploring Tokyo. - AIPinMaker

Seedance 2.0Cinematic VideoText to Video
Tokyo Travel Vlogger Collage
Text to Video

Prompt

Generate image of young female travel vlogger exploring Tokyo across multiple candid moments, extremely beautiful with long dark wavy hair, wearing stylish Japanese -inspired oversized streetwear, expressive and spontaneous personality, captured across a 13-frame grid collage (Row 1: 4 frames | Row 2: 5 frames | Row 3: 4 frames), each frame feels like a real casual phone capture with imperfect framing and natural inconsistencies Frame Breakdown Row 1 — Frame 1 (Tokyo Tower) low angle selfie with Tokyo Tower behind her, slightly tilted horizon, sky overexposed to near white, she’s mid-natural smile adjusting into frame, small lens smudge in corner, candid arrival energy Row 1 — Frame 2 (Shibuya street level) walking through Shibuya crossing at night, handheld motion blur, she glances back at camera mid-step, neon reflections on wet pavement, slightly shaky framing, imperfect crop Row 1 — Frame 3 (Convenience store) inside a brightly lit convenience store, she is holding onigiri close to camera, playful raised eyebrows, harsh fluorescent lighting flattening the scene, slight overexposure on whites, shelves softly blurred behind Row 1 — Frame 4 (Yoyogi park) sitting on a bench under trees, looking away from camera, quiet candid moment, soft dappled daylight, slight focus miss on eyes, subtle lens smudge bottom corner Row 2 — Frame 5 (Takoyaki reaction) close-up while eating single takoyaki, reacting mid-bite with laughter, slight motion blur, warm lantern light behind, face slightly out of focus due to movement Row 2 — Frame 6 (Vending machine night) standing in front of glowing vending machine, blue and pink light casting uneven tones on her face, pressing button casually, high ISO grain visible Row 2 — Frame 7 (Train reflection) her face reflected in train window at night, city lights streaking behind, layered reflection effect, slight ghosting on glass, she looks away from camera Row 2 — Frame 8 (Torii gate) low angle shot of her walking through a torii gate, captured from slightly behind, motion blur in her movement, bright sky causing mild lens flare Row 2 — Frame 9 (Digital art exhibit) inside immersive light exhibit, colored lights (blue, violet, gold) washing across her face, she looks upward in awe, slight motion blur, low-light noise visible Row 3 — Frame 10 (Gundam statue) low angle wide shot, she stands small in frame while large statue towers above, slight barrel distortion from phone lens, sky slightly blown out Row 3 — Frame 11 (Lantern alley) walking through narrow alley lit by warm lanterns, shot from behind, she glances back briefly, underexposed shadows, grainy night detail Row 3 — Frame 12 (Rooftop skyline) selfie with Tokyo skyline behind, arm extended, wind blowing hair partially across lens, horizon slightly tilted, warm city glow overexposed in distance

(Row 3 — Frame 13)

crouching down in a park in Nara, she holds out a deer cracker toward a deer, the deer bows its head toward her, she bursts into genuine laughter mid-reaction, surprised and slightly leaning back, another deer nudges her from behind causing her to lose balance slightly, camera captures the moment imperfectly with slight shake and off-center framing, soft natural daylight, open park background with trees and space, unscripted chaotic energy, candid expression

young female travel vlogger exploring Tokyo across 13 candid moments in a grid collage, extremely beautiful with long dark wavy hair, wearing stylish Japanese streetwear, expressive and spontaneous, scenes include Tokyo Tower selfie, Shibuya crossing at night, convenience store snack moment, park bench quiet scene, takoyaki reaction, vending machine glow, train window reflection, torii gate walk, immersive light exhibit, Gundam statue low angle, lantern alley walk, rooftop skyline selfie, Harajuku street ending, each frame slightly imperfect with motion blur

Video prompt 👇39:Tf51,Generate image of young

Prompt breakdown

Subject
Young female travel vlogger with long dark wavy hair in stylish Japanese-inspired oversized streetwear across 13 specific moments including Tokyo Tower low-angle selfie, Shibuya wet-pavement crossing, convenience-store onigiri, Yoyogi bench, takoyaki mid-bite laugh, vending-machine button press, train-window reflection, torii-gate walk, colored-light exhibit, Gundam statue, lantern alley, rooftop skyline, and Nara deer-cracker chaos
Style
Casual phone-capture aesthetic with imperfect framing, motion blur, lens smudges, barrel distortion, ghosting, and natural inconsistencies across every frame
Lighting
Scene-specific: overexposed near-white skies at Tokyo Tower, harsh fluorescent flattening in the konbini, neon reflections on wet Shibuya pavement, warm lantern glow on takoyaki, blue-pink vending-machine spill, soft dappled park daylight, and low-light exhibit color washes
Composition
13-frame grid collage (Row 1: 4 frames, Row 2: 5 frames, Row 3: 4 frames) with varied phone angles—low-angle selfies, close-ups, reflections, rear three-quarter views, and off-center handheld crops
Mood
Spontaneous and unscripted energy—candid arrival smiles, genuine mid-bite laughter, upward awe in the light exhibit, and surprised imbalance when deer nudge her from behind

Remix ideas

  • Swap the Nara deer frame for a Harajuku Takeshita-dori snap while keeping the same shaky off-center framing
  • Add stronger motion blur and wet-pavement reflections only to the Shibuya crossing panel
  • Change the train-reflection frame to a daytime Sumida River view with cherry-blossom petals on the glass

Reference images

Tokyo Travel Vlogger Collage reference
Text to Video

How to use this AI Video prompt template

  1. AiVideo Maker stepOne1
    Copy the prompt — grab this template’s prompt and negative prompt.
  2. iVideo Maker stepTwo2
    Pick a model — choose a recommended AI model for the best match.
  3. AiVideo Maker stepThree3
    Generate — open the studio with one click and create your result.

Related templates

NBA Courtside Influencer POV
Seedance 2.0

NBA Courtside Influencer POV

Young American Gen-Z female AI content creator, around 22 years old, maintaining a naturally attractive and trendy facial aesthetic: long voluminous dark-brown hair with soft curls, glowing warm fair skin tone, expressive hazel-brown eyes, glossy lips, subtle clean-girl makeup, soft defined jawline, and effortlessly charismatic influencer-style features. Confident yet approachable vibe, natural smile, relaxed courtside energy, authentic celebrity guest aesthetic. Wearing premium Knicks-inspired Gen-Z streetwear — oversized beige varsity-style jacket layered over a fitted blue-and-orange cropped fan top, loose baggy jeans, fashionable sneakers, minimal gold jewelry, and trendy accessories. Realistic live NBA broadcast shot during a Knicks vs 76ers Eastern Conference Semifinals game in Philadelphia. ESPN-style TV cutaway showing her seated courtside the entire time. One continuous shot, no cuts or angle changes. She naturally switches attention between the court and the camera like a real viral fan moment captured live on national television. Action flow: 0–4s: smiling casually while watching the game, fixing her hair slightly, relaxed Gen-Z influencer energy. 4–7s: notices herself on the Jumbotron and gives a playful confident wave toward the camera with a bright smile. 7–11s: cheers briefly, leans toward her friend beside her while laughing naturally, authentic courtside interaction. 11–15s: claps while smiling, subtle realistic movements only, playful facial reactions as crowd energy rises. Style: ultra-realistic sports broadcast aesthetic, viral TikTok/Instagram Gen-Z energy, telephoto camera feel, cinematic arena lighting, slight ESPN-style TV grain and compression artifacts, authentic crowd movement, shallow depth of field, realistic skin texture, cinematic sports framing, persistent unchanged playoff scorebug and lower-third graphic identifying her as an AI content creator, natural candid celebrity fan atmosphere, 16:9 aspect ratio.3b:T7c4,Yo

womanphotorealisticinstagram
Text to Video
1965 Retro Living Room Scene
Grok Imagine

1965 Retro Living Room Scene

Vintage 1990s theatrical action-adventure film scene, 16:9 widescreen, practical effects only, shot on 35mm anamorphic film, Kodak-style grain, dusty desert heat, sun-baked train yard, handheld camera energy, real stunt blocking, no CGI.\n\nA fiery red-haired woman in a tan work shirt, fitted blue jeans, leather belt, holster, and boots runs beside a rusted freight train as sparks and smoke burst from the train wall behind her. She grips a realistic prop revolver low in her right hand with correct finger placement, clean wrist alignment, and believable running posture. Her left arm pumps naturally as she sprints, focused and urgent, hair whipping in the hot wind.\n\n0: 00–0:03\nStart exactly from the image. Medium-wide tracking shot beside the train. She runs toward camera-left, breathing hard, looking over her shoulder as sparks shower from the train. Smoke rolls across the frame. The revolver stays pointed safely downward while she runs, her grip firm and anatomically correct.\n\n0: 03–0:06\nHandheld camera pushes closer. She ducks as a practical squib blast punches through the train car behind her, sending orange sparks and black smoke outward. Dust kicks up around her boots. Her face is determined, scared but controlled, like a grounded 1990s action heroine.\n\n0: 06–0:10\nShe slides behind a steel rail post for cover, pivots sharply, raises the revolver with both hands for one clean defensive aim. Keep the gun realistic: correct barrel shape, cylinder, trigger guard, natural two-handed grip, no warped fingers, no extra fingers. She does not fire yet — she listens, eyes scanning through drifting smoke.\n\n0: 10–0:14\nA second explosion erupts farther down the train. She flinches, then commits, sprinting across the tracks toward a gap between freight cars. Camera follows with intense handheld motion blur, sparks falling behind her like a shower of fire. End on her disappearing into smoke, hair and shirt snapping in the wind, the burning train looming behind her.\n\nStyle / camera: late-1980s / early-1990s theatrical film, 35mm anamorphic Panavision look, 50mm lens, shallow depth of field, practical pyrotechnics, real smoke, real dust, imperfect focus, film grain, slight gate weave, warm desert sunlight, rusty reds and dusty tans, gritty but colorful adventure-thriller tone.53:T917,Vi

womanvehiclewide-angle
Text to Image
Warrior Woman Riding a Dragon Cinematic Video Prompt
Seedance 2.0

Warrior Woman Riding a Dragon Cinematic Video Prompt

The camera rockets up from below, tearing past jagged cliff walls, then whips into a downward tilt to reveal the ocean below in full violent eruption, waves detonating against black rock. On the cliff edge stands a warrior woman, utterly fearless, gazing into the maelstrom as if daring it to rise higher. She steps forward without hesitation and leaps off the edge, arms spread wide, plummeting through howling wind and salt spray. The camera dives alongside her in freefall, her hair and cloak thrashing upward as the black rocks and churning sea rush closer. Then a massive shadow tears through the mist below — a dragon surges upward with wings unfurling like storm sails, scales gleaming wet with ocean spray. It swoops beneath her mid-fall, and she lands on its back with practiced grace, fingers gripping the ridges along its neck. The dragon banks hard, climbing in a spiraling arc back up past the cliff face. The camera keeps pace, circling tight, then pulls ahead into a final close, frontal reveal: she rides the beast skyward, wind-whipped, calm, and burning with determination, the raging ocean now small and powerless far below.40

womananimalwide-angle
Text to Video
Cinematic Grocery Store Sequence
Seedance 2.0

Cinematic Grocery Store Sequence

Create a 15-second ultra-realistic vertical (9:16) cinematic video of a young woman shopping in a modern grocery store. Scene Style: Bright, clean, and aesthetically pleasing supermarket with soft natural lighting, slightly warm tones, shallow depth of field, and smooth cinematic camera movement. Sequence: 0–3s: Wide establishing shot of a modern grocery store aisle. Shelves neatly stocked with fresh fruits, vegetables, and packaged goods. Soft ambient store sounds. 3–7s: Medium tracking shot of a young woman wearing casual stylish outfit (white shirt, light denim jeans, minimal makeup). She pushes a shopping cart slowly while scanning shelves thoughtfully. 7–11s: Close-up shots: • Her hand picking fresh apples and checking quality • Slow-motion of fruits being placed into cart • Subtle smile as she compares items on a list 11–15s: Cinematic side profile shot as she walks down the aisle. Soft sunlight beams through store windows, creating a dreamy glow. Camera slowly pulls back as she continues shopping calmly. Mood: Peaceful, everyday lifestyle elegance, slightly cinematic commercial feel. Visual Quality: Ultra-realistic, 4K detail, smooth motion, natural skin tones, shallow focus, soft bokeh background.3d:T4de,Create

womanwide-anglephotorealistic
Text to Video
Jungle Explorer One-Shot
Seedance 2.0

Jungle Explorer One-Shot

One fearless female explorer in beige crop top, jungle shorts, rope belt, muddy skin, long dark hair loose. Swinging from giant vines above ancient forest. Main frame from rear. Small rectangular inset top-right shows front face and torso. SCENE: Camera locked behind her as she is already mid-swing at high speed over massive jungle canyon. No cuts. Maintain one fluid take. Rear frame shows legs kicking, hair whipping, vine tension stretching realistically. Trees thousands of feet tall, waterfalls pouring through branches. Inset shows laughter, shock, intense focus, wind slamming cheeks. Route progression: giant swing over canyon → releases to next vine midair → swings through temple ruins → ducks under broken bridge → near miss with flock of giant birds → slides through waterfall curtain → lands onto moving tree branch rail → branch bends and launches her to another vine → final swing returns toward first canyon. Same momentum and shakes in both feeds. MOOD: Freedom, primal thrill, discovery. STYLE: Epic jungle realism.39:T41b,One fear

womanwide-anglephotorealistic
Text to Video
Cinematic Fashion Pageant Storyboard
Seedance 2.0

Cinematic Fashion Pageant Storyboard

Professional fashion pageant storyboard image, 9-panel cinematic layout, luxury runway stage packed with audience and judges, dramatic spotlights, LED screens, fashion week atmosphere. Panel 1: Wide establishing shot of all contestants lined up backstage and on the runway. Panels 2–8: Individual contestant spotlight frames connected by a glowing red camera path. Each contestant wears a completely different outfit, color, and style: 1. Isabella – Champagne gold couture gown 2. Sooyeon – Midnight blue satin slit dress 3. Lilith – Black sequined evening gown 4. Natalie – White asymmetric designer dress 5. Mei – Ruby red luxury gown 6. Araya – Gold metallic couture dress 7. Valentina – Emerald green silk gown 8. Jessica – Blush pink crystal-embellished dress Each panel shows the contestant performing a runway walk, stopping center stage, posing confidently, and introducing herself. Floating name tags beside each contestant show name and country. A glowing red FPV camera path connects every contestant in sequence with arrows showing movement direction. Motion pattern box in the corner reads: Walk In → Ramp Walk → Close-Up → Say Name → Pull Back → Next Contestant Final panel: All contestants together on stage during the grand finale while the camera pulls back to reveal the audience, judges, spotlights, and full runway. Luxury fashion storyboard aesthetic, production-planning infographic style, cinematic camera annotations, glowing route lines, professional previsualization board, ultra-realistic photography, Vogue editorial quality, 8K, highly detailed.3d:T64a,Professional fashion

womanstudiowide-angle
Text to Video

Explore more prompts

Browse more AI image and video prompts by category.

FAQ

Why does the prompt specify lens smudges and tilted horizons?
These details replicate authentic phone-vlog footage so the entire collage feels like real casual captures rather than staged shots.
What makes the final Nara frame stand out from the Tokyo scenes?
It introduces unscripted chaotic energy with two deer interacting at once, genuine laughter, and slight loss of balance captured through imperfect shake and off-center framing.