Text to Video
Tokyo Vlogger Collage — AI Video Prompt
A compilation of candid moments of a Japanese travel vlogger in Tokyo, ranging from Shibuya crossings to ramen shops. - AIPinMaker

Prompt
Generate image of her Japanese travel vlogger exploring Tokyo across exactly 13 candid moments, beautiful with long wavy sunlit brown hair, soft glam makeup, wearing stylish bold Tokyo streetwear outfits — cropped jackets, oversized shirts, mini skirts, fitted tops, loose cargo pants, trendy sneakers — carefree adventurous personality, authentic handheld iPhone collage aesthetic with natural imperfections. Frame breakdown includes: — neon Shibuya crossing selfie at night, slightly messy hair from wind, blurry passing crowds — convenience store candid holding colorful Japanese snacks and canned coffee — rainy Tokyo alley shelter moment under transparent umbrella, reflections everywhere — karaoke booth selfie laughing uncontrollably with microphone in hand — Harajuku fashion street candid with shopping bags and chaotic background energy — sushi bar close-up with messy table, half-eaten sushi and drinks — train window reflection shot on Tokyo metro during golden evening light — arcade gaming reaction mid-laugh surrounded by glowing machines in Akihabara — temple stairs candid in Asakusa with humid lens fog and tourists passing by — rooftop Tokyo skyline selfie with wind blur and glowing city lights — blurry dancing shot inside tiny underground Japanese club — late-night ramen shop candid with steam fogging the camera lens Final frame: feeding stray cats near quiet Tokyo alley at sunset while laughing after one climbs onto her lap unexpectedly, shaky candid capture, warm cinematic glow, messy authentic energy. Style: realistic Tokyo vlog collage, imperfect smartphone camera feel, Japanese neon atmosphere, motion blur, authentic travel storytelling, casual handheld framing, realistic skin texture, natural ambient lighting, subtle grain, humidity haze, candid Gen Z social media realism, no studio polish, chaotic beautiful travel memories, cinematic urban energy.39:T77e,Generate image of he
Prompt breakdown
- Subject
- Japanese travel vlogger with long wavy sunlit brown hair, soft glam makeup, and bold Tokyo streetwear across 13 specific candid moments from Shibuya to Asakusa cat alley
- Style
- realistic Tokyo vlog collage with imperfect smartphone camera feel, motion blur, subtle grain, and casual handheld framing
- Lighting
- natural ambient lighting including neon night glow, golden evening train light, warm cinematic sunset, and humidity haze
- Composition
- authentic iPhone collage of exactly 13 frames with natural imperfections, shaky capture, and chaotic urban energy
- Mood
- carefree adventurous personality expressed through laughing karaoke, mid-laugh arcade reactions, and messy authentic travel memories
Remix ideas
- swap the final cat-feeding frame for a dawn Tsukiji market shot with fish crates
- increase lens fog and reflections only on the rainy alley and ramen frames
- add specific snack brands like KitKat and Boss coffee to the convenience store candid
Reference images
How to use this AI Video prompt template
1
Copy the prompt — grab this template’s prompt and negative prompt. 2
Pick a model — choose a recommended AI model for the best match. 3
Generate — open the studio with one click and create your result.
Related templates

Harajuku Street Fashion GRWM Video
Stylized Japanese Harajuku street fashion “Get Ready With Me” vertical video featuring a trendy Japanese fashion creator preparing for a day out in Shibuya. Bright colorful bedroom filled with posters, plushies, neon signs, accessories, and stacked fashion magazines. She energetically talks in Japanese while applying glitter makeup, colored eyeliner, glossy lips, and styling layered Harajuku outfits. Include fast-paced cuts of oversized jackets, fishnet sleeves, platform sneakers, rings, dyed hair streaks, kawaii handbags, and mirror selfies. Dynamic camera angles, quick zoom transitions, spinning outfit reveals, flashing photo booth effects, VHS overlays, animated Japanese text graphics, energetic J-pop inspired pacing. Neon pink and cyan lighting mixed with daylight from the window. Scenes of her checking outfits in front of a full-length mirror, taking selfies, spraying perfume, grabbing headphones, then leaving the apartment into busy Tokyo streets. Highly detailed fashion textures, youthful trendy atmosphere, anime-inspired realism, social media reel aesthetic Japan3c:T
1980s Retro Lifestyle Montage
Produce a 15-second cinematic video capturing a nostalgic 1980s-inspired day in the life of a young woman. Embrace a dreamy retro atmosphere using soft film grain, warm golden sunlight, pastel tones, and hints of subtle neon lighting. Style her in authentic 80s fashion—oversized denim jackets, high-waisted jeans, loose shirts, vintage sneakers, scrunchies, and bold accessories. Begin with a serene morning moment: she wakes in a softly sunlit bedroom, sheer curtains gently swaying in the breeze. Cut to her standing at a mirror, casually getting ready—applying light makeup, adjusting her outfit with natural, unposed expressions. Transition to a vibrant street scene lined with vintage cars and retro storefronts, where she walks with calm confidence. Shift to an intimate moment of her listening to music on a cassette player, headphones on, eyes closed, fully absorbed in the sound. Move into a cozy diner setting where she sits quietly with a drink, lost in thought. Follow with a carefree bike ride, wind flowing through her hair, bathed in golden sunlight. As evening sets in, show her laughing with friends under a warm sunset, capturing genuine, candid joy. End with a peaceful night scene—she sits by a window, city lights glowing softly with a gentle neon ambiance, reflecting a calm and introspective mood. Use fluid camera movements, occasional slow motion for emotional highlights, and a mix of wide shots, close-ups, and POV angles. Focus on authenticity, soft lighting, and a cohesive nostalgic tone throughout. Avoid any text, logos, or branding.3a:T62

Espionage-Thriller Fight Scene Prompt with Image References
Mei Tactical suit.png = Mei: East Asian woman, long straight black hair, black leather tactical suit with gold accent piping along collar, shoulders, and wrist cuffs, gold buckle utility belt, dual thigh holsters, black flat-heeled boots with gold chevron trim. Dex Rei Tactical suit.png = Dex Rei: East Asian woman, dark green twin ponytails with blunt-cut bangs, dark green leather tactical suit with black reinforced shoulder pads and knee panels, black utility belt with hip pouch, single thigh holster, black lace-up heeled combat boots, black tactical gloves. Hallway 2.PNG = Hallway environment reference. Style & Mood: Aggressive espionage-thriller intensity dialed higher. Fluorescent tubes flicker under impact vibrations, casting staccato light across polished epoxy floors. Desaturated teal-gray palette, motion blur on extremities during fast rotation, sharp focus snapping to point of contact on every strike. Sweat droplets flung from hair on spinning moves catch the overhead light. Dynamic Description: Handheld medium shot already in motion — Mei drives forward with a rapid three-punch combination, fists cutting the air. Dex Rei weaves back, parries the third strike wide, and fires a roundhouse kick that whips her green ponytails in a wide arc, boot connecting flush with Mei's raised guard. Smash cut to low-angle wide, static — Mei absorbs the impact, slides back half a step on the slick floor, then explodes forward with a spinning back kick, her black hair fanning outward as her heel drives toward Dex Rei's sternum. Cut to high-angle crane shot descending — Dex Rei catches the kick against crossed forearms, the force pushing her boots backward on polished floor, fluorescent reflections streaking beneath her. She redirects Mei's momentum, pivots, and launches a spinning jump kick — her entire body rotating airborne, green suit blurring, boot arcing toward Mei's head. Cut to ECU, static — Mei's hand snaps up, palm catching the incoming boot inches from her temple, fingers gripping leather. Whip-pan to medium handheld tracking — Mei shoves the caught leg away, immediately chains into a leaping roundhouse, her body torquing horizontal mid-air, gold wrist cuffs flashing under fluorescent light. The kick grazes Dex Rei's shoulder as she ducks. Cut to wide stabilized tracking from floor level — Dex Rei drops into a spinning leg sweep across the polished surface, one palm planted. Mei vaults over it, lands, and Dex Rei is already rising with a vertical spinning crescent kick that forces Mei to arch backward, the boot passing centimeters from her chin. Hard cut to medium close-up, handheld — both women snap back to fighting stance simultaneously, chests heaving, green ponytails and black hair still settling from the rotational force. A flicker of mutual acknowledgment passes between them — Dex Rei's smirk, Mei's narrowed eyes — before they surge forward again. Static Descriptio41:Tb8a,Mei Tactical

NBA Courtside Influencer POV
Young American Gen-Z female AI content creator, around 22 years old, maintaining a naturally attractive and trendy facial aesthetic: long voluminous dark-brown hair with soft curls, glowing warm fair skin tone, expressive hazel-brown eyes, glossy lips, subtle clean-girl makeup, soft defined jawline, and effortlessly charismatic influencer-style features. Confident yet approachable vibe, natural smile, relaxed courtside energy, authentic celebrity guest aesthetic. Wearing premium Knicks-inspired Gen-Z streetwear — oversized beige varsity-style jacket layered over a fitted blue-and-orange cropped fan top, loose baggy jeans, fashionable sneakers, minimal gold jewelry, and trendy accessories. Realistic live NBA broadcast shot during a Knicks vs 76ers Eastern Conference Semifinals game in Philadelphia. ESPN-style TV cutaway showing her seated courtside the entire time. One continuous shot, no cuts or angle changes. She naturally switches attention between the court and the camera like a real viral fan moment captured live on national television. Action flow: 0–4s: smiling casually while watching the game, fixing her hair slightly, relaxed Gen-Z influencer energy. 4–7s: notices herself on the Jumbotron and gives a playful confident wave toward the camera with a bright smile. 7–11s: cheers briefly, leans toward her friend beside her while laughing naturally, authentic courtside interaction. 11–15s: claps while smiling, subtle realistic movements only, playful facial reactions as crowd energy rises. Style: ultra-realistic sports broadcast aesthetic, viral TikTok/Instagram Gen-Z energy, telephoto camera feel, cinematic arena lighting, slight ESPN-style TV grain and compression artifacts, authentic crowd movement, shallow depth of field, realistic skin texture, cinematic sports framing, persistent unchanged playoff scorebug and lower-third graphic identifying her as an AI content creator, natural candid celebrity fan atmosphere, 16:9 aspect ratio.3b:T7c4,Yo

Convenience Store K-Pop Dance Performance
Create a creative video of Convenience Store K Pop Dance Performance. Style: photorealistic. Composition: balanced and well-framed. Lighting: natural with cinematic mood. Category: cinematic-video. Reference: convenience-store-k-pop-dance-performance-5699.

AI Tools Content Creator Growth Journey
Create a cinematic 15-second futuristic video showing a content creator using AI tools to grow on Twitter/X. Scene 1: Person sitting at laptop, struggling to write tweets and analyze trends. Scene 2: AI assistant appears on screen generating viral tweet ideas, hashtags, audience insights, and scheduling posts automatically. Scene 3: Twitter/X dashboard shows increasing followers, engagement, likes, reposts, and trending posts. Scene 4: Close-up of happy creator smiling while notifications explode on screen. Futuristic blue interface, holographic AI visuals, fast transitions, modern social media aesthetic, realistic lighting, motivational background music.
Explore more prompts
Browse more AI image and video prompts by category.
FAQ
- Which scene shows steam fogging the lens?
- The late-night ramen shop candid features steam fogging the camera lens alongside the karaoke and club moments.
- What creates the warm cinematic glow in the final frame?
- The sunset cat-feeding alley shot uses warm cinematic glow with the unexpected cat on her lap and shaky handheld capture.