2026-05-08

Hermes AI Image Generation — Master Template

Reusable master prompt template for generating locked character references and scene images for AI short films using Hermes + gpt-image-2.

Hermes AI Image Generation — Master Template

A reusable production template for generating locked character reference images and scene stills for AI-generated short films using Hermes + gpt-image-2.


=== HERMES AI IMAGE GENERATION TEMPLATE START ===

════════════════════════════════════════════════════ PART 1 — CHARACTER REFERENCE IMAGES ════════════════════════════════════════════════════

TASK: Generate locked character reference images for a short film.

STYLE: Photorealistic cinematic realism. Grounded [COUNTRY/REGION] [GENRE] drama. Natural skin texture, realistic proportions. No beauty filter, no fantasy, no anime, no cartoon. No exaggerated emotion. Plain white background for reference sheets. Full body. Front view + side profile + back view. Consistent face identity across all views. Cinematic soft lighting, high detail, natural motion, no distortion.

ASPECT RATIO: Vertical 9:16. Note: gpt-image-2 defaults to square — embed “vertical portrait composition” in prompt.

GLOBAL NEGATIVE PROMPT: Avoid anime, cartoon, fantasy, glossy beauty skin, perfect model face, luxury house, modern rich apartment, exaggerated crying, open mouth screaming, horror face, distorted hands, extra fingers, extra limbs, deformed face, wrong ethnicity, heavy makeup, colorful fashion outfit, unrealistic lighting, over-smiling, action pose, crowd, children, text, watermark, logo, blur, low resolution, duplicate characters, inconsistent face, distorted eyes.

API CALL FORMAT: POST https://api.openai.com/v1/images/generations Model: gpt-image-2 Params: {“model”: “gpt-image-2”, “prompt”: “”, “n”: 1} Returns: base64 JSON (b64_json) — decode and save as PNG Note: gpt-image-2 does NOT support negative_prompt or aspect_ratio params. Embed negative terms and aspect ratio description directly in the prompt text.

FOR EACH CHARACTER, OUTPUT:

  1. Character name and role
  2. Character lock description (identity traits to preserve)
  3. Reference sheet generation prompt (3-view turnaround)
  4. Expression variation prompts (5 expressions)

════════════════════════════════════════════════════ PART 2 — OBJECT / PROP REFERENCE IMAGES ════════════════════════════════════════════════════

TASK: Generate locked symbolic object reference images.

STYLE: Close-up product-style reference. Plain white background. Photorealistic, high detail. Cinematic soft lighting. Same object identity across all uses.

OUTPUT:

  1. Object name and symbolic meaning
  2. Object lock description
  3. Object reference generation prompt
  4. Variation prompts (different states: screen on/off, held, on table, etc.)

════════════════════════════════════════════════════ PART 3 — LOCATION / BACKGROUND REFERENCE IMAGES ════════════════════════════════════════════════════

TASK: Generate locked location reference images.

STYLE: No people in reference image. Modest [COUNTRY/REGION] setting. Grounded realism, clean but poor living space. Dim warm lighting, soft evening shadows. Photorealistic cinematic realism.

OUTPUT:

  1. Location name and description
  2. Location lock description (props, materials, mood)
  3. Location reference generation prompt
  4. Variation prompts (wide shot, close-up details, doorway angle, wall angle, light variations)

════════════════════════════════════════════════════ PART 4 — SCENE IMAGES ════════════════════════════════════════════════════

TASK: Generate scene stills for each clip using locked character/object/location references.

RULE: Do NOT redesign characters. Use only the existing reference images as identity guides.

REFERENCE FORMAT IN PROMPTS: @daughter_reference = young [COUNTRY] woman character image @father_reference = [COUNTRY] father character image @phone_reference = old smartphone object reference image @home_reference = modest [COUNTRY] village home interior reference image

STYLE: Photorealistic cinematic realism. Grounded [COUNTRY/REGION] [GENRE] drama. Quiet emotional atmosphere. Natural acting. No exaggerated crying, no overacting. No advertisement feeling, no beauty filter. No fantasy, no anime, no cartoon. No luxury home, no modern rich apartment.

COLOR ARC: Scene 1: Dim and slightly cool. Scenes 2-4: Warm but heavy and shadowy. Scene 5: Slightly warmer and more emotional. Scene 6: Warmest and most hopeful.

FOR EACH SCENE IMAGE, OUTPUT:

  1. Scene number and filename
  2. Reference images to use (@daughter_reference, @father_reference, etc.)
  3. Detailed image generation prompt (self-contained, 80-130 words)
    • Full character description (locked, consistent)
    • Environment / location details
    • Lighting style and color palette
    • Mood and atmosphere
    • Composition (framing, angle, lens feel)
    • Ends with: “vertical portrait composition, cinematic lighting, natural motion, no distortion”
  4. Negative prompt (use global negative prompt)

OUTPUT FORMAT PER SCENE: ──────────────────────────────────── SCENE IMAGE XX — filename.png References: @daughter_reference, @phone_reference, @home_reference Prompt: [Full detailed prompt] Negative Prompt: [Global negative prompt] ────────────────────────────────────

════════════════════════════════════════════════════ PART 5 — COPY-PASTE REQUEST TEMPLATE ════════════════════════════════════════════════════

HERMES — IMAGE GENERATION REQUEST

TYPE: [ ] CHARACTER REFERENCES ONLY [ ] SCENE IMAGES ONLY [ ] BOTH

FILM TITLE: </p> <p>GLOBAL STYLE: <e.g., Myanmar village drama / Japanese period piece / etc.></p> <p>LOCATION: <brief description of setting></p> <p>CHARACTER LOCKS:</p> <ul> <li>Daughter: <description></li> <li>Father: <description></li> <li>Phone/Object: <description></li> <li>Home/Background: <description></li> </ul> <p>COLOR ARC: Scene 1: Scenes 2-4: Scene 5: Scene 6:</p> <p>SCENES TO GENERATE:</p> <p>Scene 01A — scene_01a.png Prompt: References: Negative:</p> <p>Scene 01B — scene_01b.png Prompt: References: Negative:</p> <p>[Continue for all scenes…]</p> <p>GENERATION RULES:</p> <ul> <li>gpt-image-2 API only</li> <li>No aspect_ratio param support — embed “vertical portrait composition” in prompt</li> <li>No negative_prompt param — embed negative terms in prompt text</li> <li>Returns b64_json — decode and save as PNG</li> <li>Do NOT redesign characters</li> <li>Do NOT generate video or audio</li> </ul> <p>════════════════════════════════════════════════════ === TEMPLATE END === ════════════════════════════════════════════════════</pre></p> </div> <hr> <h2>How to Use This Template</h2> <ol> <li>Click the <strong>double-square button</strong> in the top-right corner of the prompt block above.</li> <li>Paste it into Hermes (this CLI or Telegram bot).</li> <li>Fill in the copy-paste template with your story details — film title, characters, location, color arc, and scene prompts.</li> <li>Hermes will generate all character references first, then all scene images using gpt-image-2.</li> <li>Character identities are locked and reused consistently across all scene images.</li> </ol> <h2>Key Technical Notes</h2> <ul> <li><strong>gpt-image-2 API</strong> returns base64 JSON — images are decoded and saved as PNG files locally.</li> <li><strong>No <code>aspect_ratio</code> param</strong> — the API doesn’t support it yet. Always embed “vertical portrait composition” in the prompt.</li> <li><strong>No <code>negative_prompt</code> param</strong> — embed all negative terms directly in the prompt text.</li> <li><strong>Character identity</strong> is preserved by repeating the full character lock description in every scene prompt.</li> <li><strong>Color arc</strong> is controlled by changing lighting description keywords (dim/cool → warm/heavy → warm/emotional → warmest/hopeful).</li> </ul> <h2>Related Posts</h2> <ul> <li><a href="/blog/ai-short-film-director-prompt">AI Short Film Director — Master Prompt</a> — Full production pipeline including video prompts, audio scripts, and editing timelines.</li> </ul> </article> </main> </div> <script src="/js/site.js" defer></script> </body> </html>