Text to Image

Minecraft Rooftop Aerial Portrait — AI Image Prompt

A creative prompt blending a photorealistic human subject with a blocky, authentic Minecraft environment from a high-angle drone perspective. - AIPinMaker

GPT Image 2PortraitText to Image
Minecraft Rooftop Aerial Portrait
Text to Image

Prompt

Use the provided face as the primary identity. Preserve exact facial structure, proportions, eyes, nose, lips, and natural skin texture. Do not alter identity.

A highly realistic photo of a {argument name="subject" default="young woman"} sitting on top of a {argument name="structure" default="Minecraft-style house roof"}, viewed from a high aerial perspective. The environment is a vast Minecraft world with blocky forests, rivers, and villages far below, rendered in authentic Minecraft style (pixelated textures, cubic geometry).

Subject:
- Female sitting casually on the edge of the roof
- Natural relaxed pose, slightly leaning back with one hand supporting body
- Looking up toward the camera (top-down drone angle)
- Expression slightly open mouth / candid / playful

Outfit:
- White sleeveless top
- Long sleeve flannel shirt tied around the waist
- Black oversized pants
- Casual streetwear vibe

Style & Rendering:
- Face and body must be photorealistic (real human skin, natural lighting)
- Environment fully Minecraft style (blocky, pixel textures)
- Seamless blend between realistic subject and game world
- Soft daylight, slightly hazy distance, cinematic depth
- High detail, sharp focus on subject, background slightly blurred (depth of field)

Composition:
- Subject centered on rooftop
- Roof clearly Minecraft block style
- Extreme height feeling (ground very far below)
- Slight wide-angle lens effect

Quality:
ultra realistic, 4k, high detail, cinematic lighting, photorealistic skin texture, no cartoon face, no distortion, no extra limbs

Prompt breakdown

Subject
Young woman in white sleeveless top, flannel tied at waist, black oversized pants, sitting casually on roof edge with one hand supporting, looking up playfully with slightly open mouth
Style
Photorealistic skin and body details blended seamlessly with authentic Minecraft blocky geometry and pixelated textures throughout the environment
Lighting
Soft daylight with slight haze in distance and cinematic depth of field keeping sharp focus on the subject
Composition
Extreme high aerial top-down drone angle centered on rooftop with wide-angle lens effect and visible vast drop to ground
Mood
Candid playful expression against the expansive blocky Minecraft landscape

Remix ideas

  • Change the roof to a village blacksmith building while keeping the same top-down pose and outfit
  • Shift to golden hour lighting so the realistic skin picks up warmer tones against cooler pixel blocks
  • Add faint Minecraft chickens and trees rendered in the mid-ground to reinforce scale without distracting from the subject

Reference images

Minecraft Rooftop Aerial Portrait reference
Text to Image

How to use this AI Image prompt template

  1. AiVideo Maker stepOne1
    Copy the prompt — grab this template’s prompt and negative prompt.
  2. iVideo Maker stepTwo2
    Pick a model — choose a recommended AI model for the best match.
  3. AiVideo Maker stepThree3
    Generate — open the studio with one click and create your result.

Related templates

Woman Riding Giant Drone Above Mountains
GPT Image 2

Woman Riding Giant Drone Above Mountains

Create a photorealistic cinematic portrait of a Pakistani woman (character description) sitting cross-legged on top of an enormous professional flying camera drone high above a dramatic mountain range and a sea of clouds at golden-hour sunrise. Keep the subject centered and full-body from the knees up, with a strict face-reference portrait look if a reference is provided, natural realistic proportions, and confident calm posture. She wears a fully black modest outfit: long flowing tunic, loose trousers with subtle lace-trim cuffs, draped black scarf/shawl, black sandals, and a black wide-brim hat; long brown hair falls over her shoulders. She holds a drone remote controller with a screen in both hands, looking forward. The drone is oversized, matte black, rugged and futuristic, with a central octagonal body, visible front gimbal camera, landing struts, and exactly four visible arms with four propeller assemblies extending around her. Background: epic alpine peaks, valleys, thick cloud layers below, warm sunlight from the left, orange and peach sky, atmospheric haze, strong depth of field, cinematic contrast. Style: ultra-realistic photography, high detail fabric texture, realistic hands and feet, sharp subject, softly blurred distant mountains, premium adventure-fashion editorial mood. Add a small elegant cursive watermark in the bottom-right corner reading Made by Mr. Tariq (watermark text). No extra people, no city, no text besides the watermark, no cartoon style, no distorted limbs, no extra propellers, no duplicate controllers.

womandronegolden-hour
Text to Image
Cinematic Meadow Portrait
GPT Image 2

Cinematic Meadow Portrait

Ultra-realistic cinematic photo of a hijabi woman. Do not change her face. She has a tall and slim body posture. She is sitting calmly in the middle of a vast wild poppy flower field during the afternoon. The entire area is filled with natural dark green grass and hundreds of red-orange poppy flowers scattered randomly and organically across almost the entire endless frame. The flower distribution must look natural and realistic like a real flower meadow. The woman wears a soft cream-colored hijab with realistic fabric folds naturally draping around her head and shoulders. Outfit consists of a plain white t-shirt layered with a light yellow knitted outer cardigan, and loose light blue jeans. Relaxed and emotional pose: she sits casually facing directly toward the camera, one hand holding a single poppy flower extended toward the lens. She smiles cheerfully. Camera angle is taken slightly from the side at eye level using a cinematic perspective. Composition is a medium shot — close enough for the face to appear clear while still showing many poppy flowers surrounding her. In the foreground near the lens, several blurry poppy flowers create natural depth and cinematic framing. No shoes and no bag around the subject. Background shows a wide bright blue daytime sky with calm atmosphere. Afternoon sunlight naturally illuminates the side of her face and hijab with soft subtle shadows. Rich dark green tones, vivid yet realistic red-orange flowers, extremely sharp grass texture details, cinematic color grading, subtle film grain, realistic depth of field, dreamy and immersive atmosphere, ultra realism like a National Geographic cinematic editorial photo, HD. Ultra-realistic cinematic photo of a hijabi woman. Do not change her face. She has a tall and slim body posture. She is sitting casually in the middle of a vast wild poppy flower field, captured from a perfectly top-down aerial drone perspective. Her body is positioned exactly in the center of the frame and remains clearly visible while still appearing smaller compared to the massive field surrounding her, creating a cinematic sense of scale and aesthetic solitude. The composition is slightly closer than before so her face remains visible, but not too close-up, allowing the flower field to still dominate the frame. The entire area is filled with natural dark green grass and hundreds of red-orange poppy flowers scattered randomly and organically across almost the entire frame. No pathways or large empty areas. Flower distribution must appear realistic and natural like a real poppy meadow viewed from high drone altitude. The woman wears a soft cream-colored hijab with realistic fabric folds naturally draping around her head and shoulders. Outfit consists of a plain white t-shirt layered with a light yellow knitted outer cardigan, loose light blue jeans, and clean white sneakers. Relaxed and natural pose: both hands resting beside her body supporting herself on the grass, one leg bent casually while the other leg stretches naturally. Her head is tilted slightly sideways. She looks directly toward the camera with a soft gentle smile and calm expression. Near her shoulder lies a small white mini backpack placed on the grass. Camera perspective must feel very high like a professional drone with cinematic aerial lens. The camera faces perfectly straight downward, allowing the flower and grass patterns to create immersive visual textures. The woman appears blended naturally into the vast landscape surrounding her. Natural golden hour lighting with soft shadows, earthy and slightly moody tones, rich dark green colors, vivid yet natural red-orange flowers, extremely sharp grass textures, cinematic color grading, subtle film grain, dreamy and peaceful atmosphere, ultra-realistic like a professional National Geographic drone photo, HD.38

womandronephotorealistic
Text to Image
Crowd Mosaic Best Friends Selfie
GPT Image 2

Crowd Mosaic Best Friends Selfie

Create a surreal aerial-view photomosaic portrait of two (number of friends) young women posing for a close selfie, where the entire image is constructed from thousands of tiny pedestrians seen from above on a vast pale concrete plaza. From far away, the crowd forms a realistic selfie of two female friends with long dark brown hair; from close up, every strand of hair, clothing shadow, skin tone, handbag, and background texture is made of individual walking people casting small shadows. Count exactly two main figures: the left woman wears a dark short-sleeve top, carries a glossy black handbag on her shoulder, tilts inward, and makes a peace sign with one hand; the right woman is larger in the foreground, wears an off-shoulder light gray knit sweater, has long wavy hair over one shoulder, and carries a brown shoulder strap. Cover each face with a plain soft-edged square privacy block in muted taupe-gray, exactly two face-covering squares total, one smaller on the left figure and one larger on the right figure. Use a high-angle drone perspective, clean off-white plaza background, scattered isolated pedestrians around the portrait edges, realistic long shadows, extreme crowd-density detail, loneliness mixed with collective scale, muted neutral color palette, and a hyperrealistic miniature-people mosaic illusion. The image should read as a casual best-friends selfie at first glance and as thousands of unrelated passersby when zoomed in. Use dark brown top and light gray off-shoulder sweater (clothing colors), glossy black handbag and brown shoulder strap (bag styles), pale off-white plaza (background color), and lonely yet intimate crowd-mosaic feeling (mood).

dronewomanphotorealistic
Text to Image
Early 2000s High Fashion Studio Shoot - Nano Banana Pro AI Prompt for Product Marketing
Nano Banana 2

Early 2000s High Fashion Studio Shoot - Nano Banana Pro AI Prompt for Product Marketing

{ "shot": { "composition": "full body dual shot, 50mm lens, static angle, high fashion composition", "camera_motion": "static studio setup", "frame_rate": "still photography", "film_grain": "light vintage fashion grain, mimicking early 2000s campaigns" }, "subject": { "description": "Two female fashion models in bold editorial poses. One kneeling with long platinum pink hair, the other behind her with short black hair. Both wear mini Dior-style outfits and newspaper-printed thigh-high boots.", "wardrobe": "designer print bikinis, newspaper-print thigh-high boots, large statement accessories, small shoulder bags", "makeup": "bold eyeliner, overlined lips, high-gloss highlight on cheeks", "hair": "model 1: platinum pink straight, long; model 2: short black wavy bob" }, "scene": { "location": "studio with white seamless backdrop", "time_of_day": "artificial lighting, studio session", "environment": "clean background, shadow below subjects from soft directional lighting" }, "visual_details": { "action": "model 1 kneeling and arching her back with attitude, model 2 seated close behind with hand on hip", "props": "designer handbags, large earrings, over-the-knee boots with printed patterns" }, "cinematography": { "lighting": "flat studio light, front-lit with soft diffused fill, subtle shadows for contour", "tone": "high-gloss, bold, provocative fashion magazine look" }, "audio": { "ambient": "none (photo)", "music": "implied editorial fashion shoot silence" }, "color_palette": "desaturated whites, chrome, denim blue, muted skin tones, with pops of print", "dialogue": { "character": "N/A", "line": "", "subtitles": false } }

womanstudiophotorealistic
Text to Image
Rainy Train Schoolgirl Portrait
GPT Image 2

Rainy Train Schoolgirl Portrait

Create a moody cinematic square portrait of a young Japanese schoolgirl (character description) sitting alone on a train or bus on a rainy day, leaning her head against a fogged window covered in raindrops and condensation. She has long wet-looking black hair falling over her shoulders and wears a white sailor-style school uniform with a dark navy collar and ribbon tie. The composition is close and intimate, with the window occupying the left half of the frame and the girl on the right, her posture tired and melancholic as she looks downward toward the glass. Add exactly one large opaque square censor block over the center of her face, colored muted dark brown-gray, hiding all facial features. Outside the window, show a soft blurred green countryside landscape under overcast gray weather. Use natural low-light photography, shallow depth of field, muted green and gray tones, realistic rain droplets on glass, soft film grain, emotional rainy-day atmosphere, and a 1:1 aspect ratio. No text, no watermark, no extra characters.

womanmoodybokeh
Text to Image
High Fashion Editorial Portrait Prompt
Nano Banana 2

High Fashion Editorial Portrait Prompt

Create a portrait image of High Fashion Editorial Portrait Prompt. Style: photorealistic. Composition: balanced and well-framed. Lighting: natural with cinematic mood. Category: portrait. Reference: high-fashion-editorial-portrait-prompt-10473.

womanstudiophotorealistic
Text to Image

Explore more prompts

Browse more AI image and video prompts by category.

FAQ

Why does the face remain unchanged across generations?
The prompt opens by directing the model to treat the supplied face as primary identity and explicitly preserve exact proportions, eyes, nose, lips, and skin texture without alteration.
How is the photorealistic subject kept from looking out of place?
Only the human figure receives real skin, natural lighting, and fabric details while the entire environment stays locked to cubic Minecraft geometry and pixel textures for deliberate contrast.