Hermes AI Image Generation — Master Template

A reusable production template for generating locked character reference images and scene stills for AI-generated short films using Hermes + gpt-image-2.
=== HERMES AI IMAGE GENERATION TEMPLATE START ===
════════════════════════════════════════════════════
PART 1 — CHARACTER REFERENCE IMAGES
════════════════════════════════════════════════════
TASK: Generate locked character reference images for a short film.
STYLE:
Photorealistic cinematic realism.
Grounded [COUNTRY/REGION] [GENRE] drama.
Natural skin texture, realistic proportions.
No beauty filter, no fantasy, no anime, no cartoon.
No exaggerated emotion.
Plain white background for reference sheets.
Full body.
Front view + side profile + back view.
Consistent face identity across all views.
Cinematic soft lighting, high detail, natural motion, no distortion.
ASPECT RATIO:
Vertical 9:16.
Note: gpt-image-2 defaults to square — embed “vertical portrait composition” in prompt.
GLOBAL NEGATIVE PROMPT:
Avoid anime, cartoon, fantasy, glossy beauty skin, perfect model face, luxury house, modern rich apartment, exaggerated crying, open mouth screaming, horror face, distorted hands, extra fingers, extra limbs, deformed face, wrong ethnicity, heavy makeup, colorful fashion outfit, unrealistic lighting, over-smiling, action pose, crowd, children, text, watermark, logo, blur, low resolution, duplicate characters, inconsistent face, distorted eyes.
API CALL FORMAT:
POST https://api.openai.com/v1/images/generations
Model: gpt-image-2
Params: {"model": "gpt-image-2", "prompt": "", "n": 1}
Returns: base64 JSON (b64_json) — decode and save as PNG
Note: gpt-image-2 does NOT support negative_prompt or aspect_ratio params.
Embed negative terms and aspect ratio description directly in the prompt text.
FOR EACH CHARACTER, OUTPUT:

Character name and role
Character lock description (identity traits to preserve)
Reference sheet generation prompt (3-view turnaround)
Expression variation prompts (5 expressions)

════════════════════════════════════════════════════
PART 2 — OBJECT / PROP REFERENCE IMAGES
════════════════════════════════════════════════════
TASK: Generate locked symbolic object reference images.
STYLE:
Close-up product-style reference.
Plain white background.
Photorealistic, high detail.
Cinematic soft lighting.
Same object identity across all uses.
OUTPUT:

Object name and symbolic meaning
Object lock description
Object reference generation prompt
Variation prompts (different states: screen on/off, held, on table, etc.)

════════════════════════════════════════════════════
PART 3 — LOCATION / BACKGROUND REFERENCE IMAGES
════════════════════════════════════════════════════
TASK: Generate locked location reference images.
STYLE:
No people in reference image.
Modest [COUNTRY/REGION] setting.
Grounded realism, clean but poor living space.
Dim warm lighting, soft evening shadows.
Photorealistic cinematic realism.
OUTPUT:

Location name and description
Location lock description (props, materials, mood)
Location reference generation prompt
Variation prompts (wide shot, close-up details, doorway angle, wall angle, light variations)

════════════════════════════════════════════════════
PART 4 — SCENE IMAGES
════════════════════════════════════════════════════
TASK: Generate scene stills for each clip using locked character/object/location references.
RULE: Do NOT redesign characters. Use only the existing reference images as identity guides.
REFERENCE FORMAT IN PROMPTS:
@daughter_reference = young [COUNTRY] woman character image
@father_reference = [COUNTRY] father character image
@phone_reference = old smartphone object reference image
@home_reference = modest [COUNTRY] village home interior reference image
STYLE:
Photorealistic cinematic realism.
Grounded [COUNTRY/REGION] [GENRE] drama.
Quiet emotional atmosphere.
Natural acting.
No exaggerated crying, no overacting.
No advertisement feeling, no beauty filter.
No fantasy, no anime, no cartoon.
No luxury home, no modern rich apartment.
COLOR ARC:
Scene 1: Dim and slightly cool.
Scenes 2-4: Warm but heavy and shadowy.
Scene 5: Slightly warmer and more emotional.
Scene 6: Warmest and most hopeful.
FOR EACH SCENE IMAGE, OUTPUT:

Scene number and filename
Reference images to use (@daughter_reference, @father_reference, etc.)
Detailed image generation prompt (self-contained, 80-130 words), covering:
    Full character description (locked, consistent)
    Environment / location details
    Lighting style and color palette
    Mood and atmosphere
    Composition (framing, angle, lens feel)
    Ends with: “vertical portrait composition, cinematic lighting, natural motion, no distortion”
Negative prompt (use global negative prompt)

OUTPUT FORMAT PER SCENE:
────────────────────────────────────
SCENE IMAGE XX — filename.png
References: @daughter_reference, @phone_reference, @home_reference
Prompt:
[Full detailed prompt]
Negative Prompt:
[Global negative prompt]
────────────────────────────────────
════════════════════════════════════════════════════
PART 5 — COPY-PASTE REQUEST TEMPLATE
════════════════════════════════════════════════════
HERMES — IMAGE GENERATION REQUEST
TYPE: [ ] CHARACTER REFERENCES ONLY
[ ] SCENE IMAGES ONLY
[ ] BOTH
FILM TITLE: </p>
<p>GLOBAL STYLE: [e.g., Myanmar village drama, Japanese period piece]</p>
<p>LOCATION: [brief description of setting]</p>
<p>CHARACTER LOCKS:</p>
<ul>
<li>Daughter: [description]</li>
<li>Father: [description]</li>
<li>Phone/Object: [description]</li>
<li>Home/Background: [description]</li>
</ul>
<p>COLOR ARC:
Scene 1:
Scenes 2-4:
Scene 5:
Scene 6:</p>
<p>SCENES TO GENERATE:</p>
<p>Scene 01A — scene_01a.png
Prompt:
References:
Negative:</p>
<p>Scene 01B — scene_01b.png
Prompt:
References:
Negative:</p>
<p>[Continue for all scenes…]</p>
<p>GENERATION RULES:</p>
<ul>
<li>gpt-image-2 API only</li>
<li>No aspect_ratio param support — embed “vertical portrait composition” in prompt</li>
<li>No negative_prompt param — embed negative terms in prompt text</li>
<li>Returns b64_json — decode and save as PNG</li>
<li>Do NOT redesign characters</li>
<li>Do NOT generate video or audio</li>
</ul>
<p>════════════════════════════════════════════════════
=== TEMPLATE END ===
════════════════════════════════════════════════════</pre></p>
</div>
<hr>
<h2>How to Use This Template</h2>
<ol>
<li>Click the copy button (the <strong>double-square icon</strong>) in the top-right corner of the prompt block above to copy the whole template.</li>
<li>Paste it into Hermes (the CLI or the Telegram bot).</li>
<li>Fill in the copy-paste template with your story details — film title, characters, location, color arc, and scene prompts.</li>
<li>Hermes will generate all character references first, then all scene images using gpt-image-2.</li>
<li>Character identities are locked and reused consistently across all scene images.</li>
</ol>
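<p>For readers who want to see what step 4 involves, each image is one HTTP request followed by a base64 decode. The following is a minimal sketch, not Hermes's actual implementation. The endpoint, model name, and request fields are taken from the template; the helper names, the Bearer-token header, and the response shape (<code>data[0].b64_json</code>) are assumptions to check against the current API reference.</p>

```python
import base64
import json
import urllib.request
from pathlib import Path

# Endpoint as listed in the template's API CALL FORMAT section.
API_URL = "https://api.openai.com/v1/images/generations"


def build_payload(prompt: str) -> dict:
    """Request body with the three fields the template lists."""
    return {"model": "gpt-image-2", "prompt": prompt, "n": 1}


def save_b64_png(b64_json: str, out_path: str) -> Path:
    """Decode a b64_json string and write the raw bytes to out_path."""
    path = Path(out_path)
    path.write_bytes(base64.b64decode(b64_json))
    return path


def generate_image(prompt: str, api_key: str, out_path: str) -> Path:
    """Hypothetical helper: POST one prompt and save the first image."""
    request = urllib.request.Request(
        API_URL,
        data=json.dumps(build_payload(prompt)).encode("utf-8"),
        headers={
            # Assumption: standard Bearer-token authentication.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    # Assumption: the response looks like {"data": [{"b64_json": "..."}]}.
    return save_b64_png(body["data"][0]["b64_json"], out_path)
```

<p>Calling <code>generate_image(prompt, api_key, "scene_01a.png")</code> once per prompt, character references first and scene images second, mirrors the order described in the steps above.</p>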
<h2>Key Technical Notes</h2>
<ul>
<li><strong>gpt-image-2 API</strong> returns base64 JSON — images are decoded and saved as PNG files locally.</li>
<li><strong>No <code>aspect_ratio</code> param</strong> — the API doesn’t support it yet. Always embed “vertical portrait composition” in the prompt.</li>
<li><strong>No <code>negative_prompt</code> param</strong> — embed all negative terms directly in the prompt text.</li>
<li><strong>Character identity</strong> is preserved by repeating the full character lock description in every scene prompt.</li>
<li><strong>Color arc</strong> is controlled by changing lighting description keywords (dim/cool → warm/heavy → warm/emotional → warmest/hopeful).</li>
</ul>
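<p>The notes above all come down to how the prompt string is assembled: the character lock is repeated verbatim, the lighting keywords follow the color arc, the negative terms are written into the text, and the prompt ends with the composition phrase. One way to sketch that in Python is shown below. The function names and the "Lighting:" / "Avoid" wording are illustrative assumptions; only the color arc phrases and the closing phrase come from the template.</p>

```python
# Closing phrase the template requires at the end of every scene prompt.
REQUIRED_SUFFIX = (
    "vertical portrait composition, cinematic lighting, "
    "natural motion, no distortion"
)

# Lighting keywords per scene, following the template's color arc.
COLOR_ARC = {
    1: "dim and slightly cool",
    2: "warm but heavy and shadowy",
    3: "warm but heavy and shadowy",
    4: "warm but heavy and shadowy",
    5: "slightly warmer and more emotional",
    6: "warmest and most hopeful",
}


def lighting_for_scene(scene_number: int) -> str:
    """Look up the lighting keywords for a scene in the color arc."""
    try:
        return COLOR_ARC[scene_number]
    except KeyError:
        raise ValueError(f"no color arc entry for scene {scene_number}") from None


def build_scene_prompt(scene_number, character_lock, scene_description, negative_terms):
    """Join lock, scene, lighting, negative terms, and suffix into one prompt."""
    parts = [
        character_lock.strip(),      # repeated unchanged in every scene
        scene_description.strip(),   # what happens in this still
        f"Lighting: {lighting_for_scene(scene_number)}.",
        f"Avoid {', '.join(negative_terms)}.",  # negatives embedded in the text
        REQUIRED_SUFFIX,
    ]
    return " ".join(parts)


if __name__ == "__main__":
    # Placeholder lock and scene text, for illustration only.
    print(build_scene_prompt(
        1,
        "A young woman in her early twenties with a tired, gentle face.",
        "She sits by a wooden window holding an old smartphone.",
        ["anime", "cartoon", "beauty filter"],
    ))
```

<p>Keeping the lock text in one variable and passing it unchanged to every call is what makes the repetition reliable; editing the description per scene is the easiest way to lose face consistency.</p>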
<h2>Related Posts</h2>
<ul>
<li><a href="/blog/ai-short-film-director-prompt">AI Short Film Director — Master Prompt</a> — Full production pipeline including video prompts, audio scripts, and editing timelines.</li>
</ul>
</article>
    </main>
  </div>
  <script src="/js/site.js" defer></script>
</body>
</html>