Text to Image
Grok Imagine R2V Multi-Image Reference Tutorial — AI Image Prompt
A detailed tutorial and explanation in Japanese about the new Grok Imagine R2V (Reference-to-Video) feature, which allows referencing up to 7 images for video generation. The tweet provides examples of how to use image references for character details, scene interpretation, cut transitions, and scene changes, including the prompt structure for each example. - AIPinMaker

Prompt
▼動画①↖︎:7枚参照してキャラの詳細設定を記述 プロンプト:キャラの詳細設定記述(※約1000文字のYAML) ・特にカット切り替えがない、7枚の画像を総合的に解釈したような1シーン ――――― ▼動画②↗︎:7枚参照してプロンプトなし プロンプト:「.」1文字(※画像2枚以上の時は完全ノープロンプトはできないようです) ・①とあまりかわらない ――――― ▼動画③↙︎:7枚参照して各画像参照記述でカット切り替えを指示 プロンプト:「 @ image1→ @ image2→(省略)→ @ image7の順番でカット切り替えする」 ・カット切り替えはできている ・全部は使われてない(6秒でギリギリ5カット) ・参照画像そのままではなく、似たような構図で再描画されている感じ ――――― ▼動画④↘︎:キャラクターシート・開始フレーム画像・背景画像の3つ参照で場面切り替え プロンプト:「 @ image1 のキャラが @ image2 のようにうたた寝していると @ image3 の世界に迷い込む。ドリーミーなトランジション。ダイナミックなカメラワーク」
Prompt breakdown
- Subject
- seven reference images interpreted as one cohesive character scene or sequenced cuts, including a character sheet, sleeping pose start frame and background world
- Composition
- @ image1→ @ image2→ @ image7 cut order or three-image blend of character sheet plus sleeping frame entering dream world
- Mood
- dreamy transition with dynamic camera work
Remix ideas
- shorten the YAML to 400 characters and retest whether more of the seven images appear in the cuts
- insert timing notes like 1s per @ tag to push closer to six distinct cuts
- swap the sleeping pose reference for a standing frame to change how the character enters the background world
Reference images
How to use this AI Image prompt template
1
Copy the prompt — grab this template’s prompt and negative prompt. 2
Pick a model — choose a recommended AI model for the best match. 3
Generate — open the studio with one click and create your result.
Related templates

Storyboard Keyframe Video Reference
Create a game video of Storyboard Keyframe Video Reference. Style: photorealistic. Composition: balanced and well-framed. Lighting: natural with cinematic mood. Category: illustration. Reference: storyboard-keyframe-video-reference-4939.

Storyboarding and Multi-Perspective Video Generation
Create a game video of Storyboarding And Multi Perspective Video Generation. Style: photorealistic. Composition: balanced and well-framed. Lighting: natural with cinematic mood. Category: illustration. Reference: storyboarding-and-multi-perspective-video-generation-46.

AI Discourse Meme Summary Prompt (Chinese & English)
Create a creative video of Ai Discourse Meme Summary Prompt Chinese English. Style: photorealistic. Composition: balanced and well-framed. Lighting: natural with cinematic mood. Category: cinematic-video. Reference: ai-discourse-meme-summary-prompt-chinese-english-423.

Sgt. Pepe Animated Sitcom Prompt
Create a creative video of Sgt Pepe Animated Sitcom Prompt. Style: photorealistic. Composition: balanced and well-framed. Lighting: natural with cinematic mood. Category: cinematic-video. Reference: sgt-pepe-animated-sitcom-prompt-386.

Pencak Silat Martial Arts Sequence
A clean instructional poster in a 4x4 grid (16 panels) showing a male martial artist demonstrating Pencak Silat choreography. The subject is a fit man with medium-length dark hair and a beard, wearing a red button-up shirt, beige pants, and white sneakers. Style is semi-realistic illustration mixed with hand-drawn sketch lines, soft shadows, and a light textured paper background. Each panel includes subtle motion arrows and small captions. Minimalist, balanced layout. शीर्ष title at top: 'Pencak Silat Choreography – 16 Counts – 10 Seconds – Smooth Flowy Chill'. Panel 1: wide ready stance, knees bent, one hand guarding forward, focused expression. Panel 2: step forward with outward block, arm extended defensively. Panel 3: inside block sweeping upward diagonally, torso rotating. Panel 4: strong straight punch forward, aligned hips. Panel 5: side step into low stance, both hands pushing outward. Panel 6: knee lifted high in chamber position, balanced posture. Panel 7: front kick extended forward, opposite arm guarding. Panel 8: step backward into defensive cover, arms crossing. Panel 9: low block, sweeping arm downward in deep stance. Panel 10: hook punch across body with hip rotation. Panel 11: turning body with pivoting feet, circular motion. Panel 12: forward palm strike, stable stance. Panel 13: low sweeping motion near ground in squat stance. Panel 14: rising smoothly from low stance, upward motion. Panel 15: controlled locking pose, one arm raised defensively. Panel 16: closing salute, feet together, hands pressed at chest, calm posture. Include hand-drawn arrows (blue and purple accents), clean infographic layout, evenly spaced panels, consistent character design across all frames. A focused male martial artist performs a smooth, flowing Pencak Silat sequence in a minimalist studio. He is in his late 20s, athletic build, medium height, warm tan skin, with thick wavy dark hair and a short, well-groomed beard. He wears a fitted deep red button-up shirt with sleeves rolled to the forearms, beige slim-fit pants with slight wear at the knees, and clean white sneakers. His expression is calm, controlled, and intent. The setting is a softly lit neutral studio with an off-white textured backdrop and a slightly worn wooden floor. Lighting is diffused and cinematic, creating gentle shadows and emphasizing fluid motion. The choreography is continuous and rhythmic, with no abrupt cuts: 0–2s: يبدأ in a grounded ready stance, knees bent, one palm forward in guard, eyes locked ahead 2–4s: steps forward into an outward block, transitioning into an inside sweeping block upward 4–6s: rotates hips into a straight punch, then shifts weight into a side step with a pushing motion 6–8s: lifts knee smoothly and extends into a controlled front kick, maintaining balance 8–10s: steps back into a guarded cover, drops into a low block, then rises into a hook punch 10–12s: pivots the body with a clean turn, transitioning into a forward palm strike 12–14s: lowers into a sweeping motion, then rises fluidly into a locking control pose 14–15s: finishes upright with a respectful closing salute, hands together at chest level Movement style is soft, flowing, controlled, with precise martial intention rather than aggression. Emphasis on balance, breath, and continuity. Camera is a steady medium-wide shot with subtle slow tracking, no cuts.3c:Td41,A clean instructional poster

Image-to-Video facial preservation prompt
Image-to-Video: Use EXACTLY the same face from the reference image without modifying absolutely anything: same eyes, eyebrows, nose, lips, expression, hair, makeup, and facial proportions 100%. Under no circumstances change any part of the face, hair...
Explore more prompts
Browse more AI image and video prompts by category.
FAQ
- Why does the single '.' prompt still produce output close to the full YAML version?
- With seven images the model leans heavily on visual references so the minimal prompt mainly removes text guidance without changing the core scene much.
- How many cuts actually appear when using the @ image sequencing instruction?
- In a six-second clip the model typically renders about five cuts even when all seven @ tags are listed, and it redraws similar compositions rather than using the exact reference frames.