Text to Image

Three-Image Fusion and Integration — AI Image Prompt

A multi-reference prompt that merges a background image, a character design, and an outfit reference into a single cohesive scene with realistic lighting and grounding. - AIPinMaker

GPT Image 2PhotographyText to Image
Three-Image Fusion and Integration
Text to Image

Prompt

Use {argument name="background image" default="reference image 1"} as the main background environment.
Use {argument name="character image" default="reference image 2"} as the character design reference, including face, body balance, hairstyle, and overall impression.
Use {argument name="outfit image" default="reference image 3"} as the outfit design reference.
Create one full-body character standing upright in the scene.
Keep the character naturally integrated into the background, with matching perspective, lighting direction, shadow, color temperature, and atmosphere.
Do not change the character’s identity, body proportions, facial impression, or outfit structure.
The character should stand calmly and clearly, with a natural straight posture, both feet on the ground, and a stable body balance.
Make the result look like one unified finished image, not a collage.
High-quality, clean, detailed, polished image generation.
Avoid extra limbs, broken fingers, distorted hands, missing body parts, ghosting, duplicated faces, unnatural anatomy, messy clothing, and visual noise.39

Prompt breakdown

Subject
full-body character standing upright drawn from reference image 2 with outfit details from reference image 3 placed inside background from reference image 1
Lighting
direction shadows and color temperature matched exactly to the chosen background environment
Composition
natural straight posture both feet on the ground stable body balance and perspective alignment with the scene
Mood
calm unified atmosphere with no visual seams or collage artifacts

Remix ideas

  • Replace only the background reference image to relocate the identical character into a new setting while keeping lighting rules active
  • Swap the outfit reference image to test alternate clothing on the same preserved face and body proportions
  • Add a short follow-up instruction such as "soft breeze moving hair" to enhance environmental interaction after the initial fusion

Reference images

Three-Image Fusion and Integration reference
Text to Image

How to use this AI Image prompt template

  1. AiVideo Maker stepOne1
    Copy the prompt — grab this template’s prompt and negative prompt.
  2. iVideo Maker stepTwo2
    Pick a model — choose a recommended AI model for the best match.
  3. AiVideo Maker stepThree3
    Generate — open the studio with one click and create your result.

Related templates

Video Generation Workflow Prompt (Text/LLM)
Nano Banana 2

Video Generation Workflow Prompt (Text/LLM)

Create a creative image of Video Generation Workflow Prompt Textllm. Style: photorealistic. Composition: balanced and well-framed. Lighting: natural with cinematic mood. Category: photography. Reference: video-generation-workflow-prompt-textllm-6914.

photorealisticwomancontrolnet
Text to Image
Photorealistic Candid Lifestyle Photo with ControlNet - Nano Banana Pro AI Prompt for Social Media Post
Nano Banana 2

Photorealistic Candid Lifestyle Photo with ControlNet - Nano Banana Pro AI Prompt for Social Media Post

"subject": "Young woman, long brown hair, wearing a yellow string bikini with blue trim and logos. Tattoos on right arm and fingers. Wearing a delicate necklace.", "pose": "Sitting in a wicker chair, leaning slightly forward. Hands resting on outer hips, touching bikini bottoms. Direct gaze.", "environment": "Semi-outdoor bar area. Wicker chair. White counter and slush machine to the right. Man sitting facing away in background left. Blue sky.", "camera": "Mid-shot, slightly elevated angle. Shallow depth of field with blurred background.", "lighting": "Bright, natural daylight. Even illumination with soft shadows under chin and chest.", "mood_and_expression": "Neutral expression, direct eye contact.", "style_and_realism": "Photorealistic candid lifestyle photograph.", "colors_and_tone": "Vibrant palette. Dominant yellow and blue, warm skin tones, bright blue sky.", "quality_and_technical_details": "Sharp focus on subject, natural skin texture, standard digital camera resolution.", "aspect_ratio_and_output": "3:4", "controlnet": { "pose_control": { "model_type": "OpenPose", "purpose": "Match sitting posture and hand placement on hips.", "constraints": "Maintain torso angle and arm positioning exactly.", "recommended_weight": 0.8 }, "depth_control": { "model_type": "MiDaS", "purpose": "Maintain spatial separation between subject, bar counter, and background.", "constraints": "Preserve 3D volume of body and chair.", "recommended_weight": 0.75 } }, "negative_prompt": "stylization, distortion, proportion changes, depth flattening, beautification filters, cartoon, painting, unnatural anatomy, weird hands"

photorealisticwomancontrolnet
Text to Image
Photorealistic Nighttime Flash Photography Prompt with ControlNet (JSON format) - Nano Banana Pro AI Prompt for Social Media Post
Nano Banana 2

Photorealistic Nighttime Flash Photography Prompt with ControlNet (JSON format) - Nano Banana Pro AI Prompt for Social Media Post

{ "subject": "Young female, light skin, shoulder-length brown hair. Wearing a light pink string bikini (bikini color). Glossy skin texture on chest and shoulders.", "pose": "Standing, torso slightly angled, facing forward. Left hand holding the side tie of the bikini bottom. Right arm resting straight down.", "environment": "Nighttime resort setting. Dark wooden railing immediately behind subject. Large swimming pool illuminated with bright blue underwater lights. Distant silhouetted palm trees and pitched-roof buildings.", "camera": "Medium shot, eye level, framed from the hips up. Slight background blur.", "lighting": "Harsh frontal flash lighting on the subject creating high contrast and specular highlights. Background lit by vibrant blue pool lights and dim ambient resort lighting.", "mood_and_expression": "Neutral expression, direct gaze. Candid nighttime snapshot.", "style_and_realism": "Photorealistic, raw smartphone flash photography.", "colors_and_tone": "Dominant light pink, warm flash-lit skin tones, vibrant deep blue water, and heavy black shadows.", "quality_and_technical_details": "Sharp foreground focus, natural skin imperfections and oil specularities visible, slight noise in dark background areas.", "aspect_ratio_and_output": "3:4", "controlnet": { "pose_control": { "model_type": "openpose", "purpose": "maintain torso angle and specific arm placement", "constraints": "anchor left hand exactly at the hip tie", "recommended_weight": 1.0 }, "depth_control": { "model_type": "depth", "purpose": "preserve spatial separation between subject, railing, and pool", "constraints": "maintain stark foreground-to-background depth transition", "recommended_weight": 0.8 } }, "negative_prompt": "stylization, distortion, proportion changes, depth flattening, beautification filters, illustration, painting, unnatural anatomy, altered body structure" }

womancontrolnetphotorealistic
Text to Image
Photorealistic Beach Pose with ControlNet Constraints - Nano Banana Pro AI Prompt for Social Media Post
Nano Banana 2

Photorealistic Beach Pose with ControlNet Constraints - Nano Banana Pro AI Prompt for Social Media Post

{   "subject": "Young adult female, tanned skin with water droplets and sand. Wet, wavy blonde hair gathered up. Wearing a black string bikini with thong bottom and thin straps. Gold watch on left wrist, small earring.",   "pose": "Standing in waist-deep water, body facing away. Torso twisted right, head turned left looking down over shoulder in profile. Left arm raised, hand touching forehead. Right hand behind neck. Hips angled toward camera.",   "environment": "Shallow turquoise ocean water over sandy bottom with sparse seagrass. Clear bright blue sky. Distinct horizon line. Distant parasail speck on right.",   "camera": "Medium shot from mid-thigh up. Slightly above water level. Clear focus on subject, slight water refraction at bottom.",   "lighting": "Bright, direct daylight. Strong specular highlights on wet skin, shoulders, and buttocks. Natural, high-contrast shadows defining physical form.",   "mood_and_expression": "Relaxed, candid beach setting. Neutral, downward gaze.",   "style_and_realism": "High-fidelity photograph, raw snapshot realism.",   "colors_and_tone": "Vibrant natural colors. Deep blue sky, cyan water, bronze skin tone, black swimwear.",   "quality_and_technical_details": "Sharp focus on subject. Visible texture of sand and water droplets on skin. Natural lighting and shadows.",   "aspect_ratio_and_output": "3:4",   "controlnet": {     "pose_control": {       "model_type": "openpose_full",       "purpose": "Strictly lock skeletal alignment, anatomical proportions, torso orientation, and prevent structural or volumetric alteration",       "constraints": "No limb length adjustment, no torso compression, no chest volume reduction, no posture change, no symmetry correction, no ribcage narrowing",       "recommended_weight": 1.2     },     "depth_control": {       "model_type": "depth_midas",       "purpose": "Enforce exact volumetric projection, silhouette area equivalence, curvature depth integrity, and spatial layering",       "constraints": "No depth flattening, no volumetric shrinkage, no perspective reinterpretation, no shadow softening, no foreground-background collapse",       "recommended_weight": 1.25     }   },   "negative_prompt": "stylization, aesthetic correction, body normalization, shape refinement, geometric reinterpretation, anatomical distortion, breast reduction, volume compression, torso slimming, ribcage narrowing, symmetry correction, proportion changes, depth flattening, shadow softening, beautification filters, perspective alteration" }3f:Ta0a,{   "subject": "Young adult female, t

womanphotorealisticcontrolnet
Text to Image
Surreal Cinematic Subway Studio
GPT Image 2

Surreal Cinematic Subway Studio

Cinematic 35mm photography of a young woman sitting backwards on a modern chrome chair inside an enormous underground subway station transformed into a surreal creative studio, uploaded face (reference image) used 100% as reference. She is wearing a dark graphite oversized shirt (top), loose white trousers (bottom), and white slip-on sandals. She has long, silky dark hair, flawless skin, elegant feminine features, and a confident, creative expression. Around her, the station walls are covered with layered poster collages, ripped magazine spreads, handwritten typography experiments, floating transparent UI windows, and glowing directional signs displaying phrases like “Design Is Everywhere,” “Create the Unexpected,” and “Stay Original.” Abandoned train tracks are filled with scattered sketchbooks, camera lenses, open laptops showing editing software, rolls of film, and oversized blueprint papers blowing dramatically through the station from incoming wind. Cinematic fluorescent lights flicker overhead while distant train headlights create atmospheric haze, volumetric lighting, and reflective highlights across the polished floor. Muted monochrome tones dominate the image, enhanced by faded cyan highlights and subtle warm orange accents from glowing neon signs. The composition feels like a luxury streetwear campaign fused with a futuristic graphic design exhibition, blending modern creative culture with urban industrial aesthetics. Highly detailed textures, cinematic depth, realistic skin texture, soft analog film grain, slightly desaturated editorial color grading, premium fashion photography, immersive environmental storytelling, ultra-sharp focus, masterpiece-quality detail, luxury Instagram campaign aesthetic, visually rich and emotionally cinematic composition.3a:T75d,Cine

womanmoodyphotorealistic
Text to Image
Multi-Angle, Face-Obscured Shot Generation Prompt for Nano Banana Pro
Nano Banana 2

Multi-Angle, Face-Obscured Shot Generation Prompt for Nano Banana Pro

Create a creative image of Multi Angle Face Obscured Shot Generation Prompt For Nano Banana Pro. Style: photorealistic. Composition: balanced and well-framed. Lighting: natural with cinematic mood. Category: photography. Reference: multi-angle-face-obscured-shot-generation-prompt-for-nano-banana-pro-11386.

photorealisticmoodycontrolnet
Text to Image

Explore more prompts

Browse more AI image and video prompts by category.

FAQ

How does the prompt stop the character from appearing pasted in?
By requiring identical perspective lighting direction shadows and color temperature between the character and the background reference so everything renders as one cohesive image.
What happens if my three reference images have very different art styles?
The fusion still works but you may need to emphasize the "high-quality clean detailed polished" instruction and add "in the style of the background" to reduce style clashes.