Text to Video
Cinematic Cafe Barista Story — AI Video Prompt
A short-form video prompt detailing a barista making latte art, optimized for social media with high visual focus on texture and lighting. - AIPinMaker

Prompt
1Medium Shot / Eye LevelSlow Push-InA young male barista stands behind the counter of a modern industrial-style café. Morning sunlight streams through large glass windows, creating soft volumetric light rays. He wears a black turtleneck sweater with a gray apron, giving the scene a premium and calm atmosphere.Lo-fi electronic beat begins; subtle café ambience2s 2Extreme Macro / Low AngleStaticBottomless portafilter extraction begins. Rich espresso flows down like molten lava, layered with thick golden crema.Amplified espresso extraction sound1s 3Close-Up / Side ViewSlight TrackingThe steam wand froths milk inside a stainless steel pitcher. White steam surrounds the pitcher while the milk spins into a silky microfoam texture.Crisp high-pressure steam “hiss” sound1s 4Close-Up / Top-Down AnglePush-InThe barista begins pouring latte art. Smooth white milk flows precisely onto the espresso surface, gradually forming a galloping horse pattern. The coffee reflects warm amber highlights with ultra-detailed liquid. intensifies slightly; delicate pouring sound3s 5Close-Up / Eye LevelRack FocusThe finished latte is gently placed on a wooden counter. The camera first focuses on the perfect horse latte art, then slowly shifts focus to the barista’s satisfied smile in the background.Soft ceramic cup “clack” sound; music fades gently3s Pacing Simplified the workflow into the essential visual sequence: Extraction → Milk Frothing → Latte Art → Final Reveal Faster rhythm optimized for: TikTok Instagram Reels Xiaohongshu AI-generated short-form videos Visual Focus Key visual highlights: Golden espresso crema Silky milk texture Galloping horse latte art reveal Emotional rack-focus ending AI Video Generation Keywords cinematic coffee shop macro latte art espresso extraction shallow depth of field warm sunlight volumetric lighting creamy milk texture ASMR coffee sound cinematic rack focus ultra realistic cozy industrial café 85mm lens soft amber tones high-detail liquid simulation3a:T7f8,1Medium Sho
Prompt breakdown
- Subject
- Young male barista in black turtleneck sweater and gray apron preparing espresso and pouring a galloping horse latte art in a modern industrial café
- Style
- Cinematic short-form video with macro details, push-ins, tracking shots, and rack focus
- Lighting
- Morning sunlight through large glass windows creating soft volumetric light rays and warm amber highlights on the coffee
- Composition
- Sequence from medium shot eye-level push-in to extreme macro low angle static, close-up side tracking, top-down push-in, and eye-level rack focus
- Mood
- Premium calm atmosphere building to satisfaction with the finished latte reveal
Remix ideas
- Replace the galloping horse with a different latte art pattern such as a heart or leaf
- Extend the rack focus duration to linger longer on the barista's smile
- Add more steam effects during milk frothing for enhanced visual texture
Reference images
How to use this AI Video prompt template
1
Copy the prompt — grab this template’s prompt and negative prompt. 2
Pick a model — choose a recommended AI model for the best match. 3
Generate — open the studio with one click and create your result.
Related templates

Pixar Style Father Son Football Story
Pixar-style 3D animated cinematic video, heartwarming father-and-son football story inside a cozy sunlit living room. A loving father wearing a red Portugal football jersey gently shows a tiny Portugal jersey with number 7 to his adorable blonde baby. The baby has huge expressive blue eyes, chubby cheeks, and realistic skin details. Warm golden morning sunlight streams through large windows, soft depth of field, ultra-detailed character animation, emotional storytelling. Scene 1: Father proudly presents a miniature Portugal jersey to the baby. Scene 2: Baby smiles and laughs as the father helps him wear the jersey. Scene 3: Close-up of the father carefully painting football fan markings on the baby's cheeks. Scene 4: The baby transforms into a tiny football player wearing a complete Portugal kit with number 7, football boots, and socks. Scene 5: Hero shot of the baby standing confidently beside a soccer ball, determined expression, Portugal flag visible in the background. Camera movements: smooth cinematic dolly shots, close-ups, gentle push-ins, shallow depth of field. Lighting: warm golden-hour sunlight, soft shadows, cozy family atmosphere. Style: Disney Pixar, ultra-realistic 3D animation, vibrant colors, emotional storytelling, highly detailed facial expressions, professional cinematic rendering, 4K, masterpiece quality, smooth motion, cute and wholesome family moment.

Luxury Skincare Cinematic Ad
Create a 15-second ultra-realistic luxury skincare cinematic video titled “Luxury Captured in Motion: Glow Skin & Light in Perfect Harmony.” Style: high-end beauty commercial, warm champagne lighting, soft golden highlights, shallow depth of field, macro cinematography, smooth slow-motion camera movement, premium fashion advertisement aesthetic. Ultra-realistic live-action only. Scene Breakdown: 0–3s: Extreme macro shot of glowing serum droplets falling in slow motion into reflective water. Golden light refracts beautifully, creating elegant ripples and a luxury skincare mood. 3–6s: Medium close-up of a flawless woman with glowing glass skin and soft natural makeup. She slowly turns her face toward the light, calm expression, cinematic rim lighting defining facial contours. 6–9s: Hero product shot of a premium serum bottle floating and slowly rotating in mid-air. Suspended particles, soft mist, and liquid light trails surround it with luxury commercial energy. 9–12s: Close-up of skincare application on cheek. Fingertips gently spread serum, revealing hydrated luminous skin. Camera slowly pushes in to emphasize texture and glow. 12–15s: Final hero frame. The woman stands beside the serum bottle under warm golden lighting. Subtle wind motion in hair, soft bokeh background, confident gaze at camera. Elegant luxury typography fades in: “Glow in Perfect Harmony.” Constraints: ultra-realistic only, no animation look, no CGI style, no cartoon, no illustration. Premium cinematic skincare advertisement quality.3a:T609,Create a 1

The Silent Retreat Uproar (Disaster Scene)
Create a cinematic video of The Silent Retreat Uproar Disaster Scene. Style: photorealistic. Composition: balanced and well-framed. Lighting: natural with cinematic mood. Category: cinematic-video. Reference: the-silent-retreat-uproar-disaster-scene-2214.

Skincare Commercial Macro Foam Detail
Create a 15-second ultra-realistic cinematic vertical (9:16) commercial video. Scene Style: Modern skincare advertisement, clean minimal bathroom setting, soft morning natural light, premium commercial look, macro detailing of water, foam, and skin texture. Sequence: 0–3s: Extreme close-up shot of a young man’s hands. He squeezes face wash into his palm—thick gel drops in slow motion, highly detailed texture. Soft light reflects on the product. 3–6s: He rubs the face wash between his palms, forming rich creamy foam. Camera focuses on lather buildup with cinematic macro shots. 6–10s: He applies the foam to his face and gently massages in circular motion. Voiceover begins: “This face wash removes dirt, oil, and impurities…” 10–13s: Slow-motion rinse shot—water flows across his face, washing away foam. Skin appears fresh, clean, and glowing. Subtle cinematic zoom-in. 13–15s: He looks into the mirror with a refreshed, confident expression and says: …for clear, smooth, and energized skin every day. Final product pack shot appears with soft glow and clean white background.3d:T45a,Create a 15-seco

Cinematic Beach Sunset Scene
Create a cinematic video of Cinematic Beach Sunset Scene. Style: photorealistic. Composition: balanced and well-framed. Lighting: natural with cinematic mood. Category: cinematic-video. Reference: cinematic-beach-sunset-scene-3081.

Japanese School Martial Arts Sequence
Use the uploaded reference image as the only character reference. The reference image contains the school girl, the three male troublemakers, and the male hero. Preserve the exact facial features, hairstyles, identities, clothing, proportions, and appearance of every character shown in the reference image. Do not replace, redesign, or alter any character. Create an ultra-realistic Japanese high school classroom scene during warm afternoon golden hour. The classroom contains wooden desks, chairs, school bags, books, notebooks, posters, a chalkboard, fluorescent ceiling lights, curtains, sliding windows, and realistic school details. Warm sunlight streams through the windows, creating natural shadows and visible dust particles. 0s–3s The school girl sits alone at her desk studying. The three male students surround her desk, teasing her and making her uncomfortable. The camera slowly pushes toward her worried expression. Warm sunlight fills the classroom. 3s–5s The male hero notices the situation and walks toward them. He calmly tells the boys to stop bothering her. The boys laugh at him. One suddenly kicks him. The camera whips dramatically with the impact. 5s–12s An intense one-take martial arts sequence begins. The hero fights all three boys at once using realistic choreography. He dodges attacks between desks, vaults over tables, slides across tabletops, blocks strikes with school bags, redirects attackers into desks, and lands powerful combinations. Chairs move, papers fly, desks slide, and curtains react naturally. The camera constantly follows, circles, ducks, and moves through the classroom while staying close to the action. 12s–15s Epic cinematic slow-motion finale. The hero lands a decisive final strike. The three attackers are knocked backward in dramatic slow motion. Papers float through golden sunlight. The camera circles around the hero standing protectively in front of the girl while the defeated boys fall behind him. Powerful cinematic ending. Style: ultra-realistic live action, Japanese school drama, high-budget action film, realistic body physics, authentic martial arts choreography, natural lighting, grounded textures, immersive handheld cinematography, movie-quality visuals, one-take camera feel, 8K detail. Negative prompts: different faces, changed identities, face swap, extra characters, cartoon, anime style, CGI look, Unreal Engine look, glossy AI finish, distorted anatomy, duplicated limbs, weak choreography, poor continuity, plastic skin, unrealistic physics, oversaturated colors, excessive VFX, blurry faces, low detail.3d:Ta2f,
Explore more prompts
Browse more AI image and video prompts by category.
FAQ
- What is the total runtime optimized for social media?
- The five scenes are paced for a fast 10-second video suitable for TikTok, Instagram Reels, and Xiaohongshu.
- How does the prompt handle sound design?
- It layers a lo-fi electronic beat with amplified espresso extraction, high-pressure steam hiss, delicate pouring sounds, and a soft ceramic clack before fading.