The 9:16 "Safe Zone" Prompt Kit: Stop Cropping Heads & Captions When You Turn Veo3Gen Clips into Reels/Shorts

TL;DR

If your AI clips look fine in preview but get ruined in Reels/Shorts, it’s often not “bad generation”—it’s bad vertical composition: faces too close to the top edge, no intentional negative space for captions, and critical elements living where platform UI overlays land. Fix it by prompting 9:16 safe-zone framing on purpose (headroom + eyeline + reserved caption space), then doing a fast 60-second safe-zone test on a screenshot before you batch-generate.

Key takeaways

Use a 9:16 safe zone prompt: explicitly request headroom, eyeline placement, and “keep lower third clean.”
Design negative space (lower third and/or a side column) so captions and UI don’t cover your face or product.
Keep the “read” instant: recognizable subject in the first second + one clear motion; don’t rely on subtle movement that disappears on phones (https://www.vidu.com/blog/social-media-video-ai).
Keep camera motion limited and contrast high to help clips survive compression (https://www.vidu.com/blog/social-media-video-ai).
Generate a single draft first, screenshot it, overlay rough UI/caption bands, revise the prompt, then scale variants.

Why vertical AI videos get “ruined” after export (it’s a phone problem)

Vidu’s social-video testing points to a common trap: AI clips can look interesting on a desktop preview, then fall apart on a phone because the subject is too small or the motion is too subtle to register while scrolling (https://www.vidu.com/blog/social-media-video-ai).

On top of that, platforms enforce their own aspect ratios and file constraints; get even one parameter wrong and you invite automatic cropping or quality loss (https://www.vidu.com/blog/social-media-video-ai). You don’t fix that by memorizing specs—you fix it by generating layout-aware vertical masters that tolerate UI overlays and compression.

What “ruined” usually looks like:

Cropped heads (forehead clipped by a top UI band).
Captions covering the mouth/product (lower third collision).
CTA text unreadable (thin type + busy background + compression).
Hook doesn’t read instantly; Vidu notes most AI clips fail the “no second watch required” test (https://www.vidu.com/blog/social-media-video-ai).

The 9:16 safe-zone concept (creator version)

Treat safe zones as priorities, not pixels. Your goal is to generate a layout that can absorb platform overlays.

Safe-zone priorities

Protect the hero: eyes, mouth, hands, product—keep them away from top/bottom edges.
Reserve overlay space: captions (lower third), UI (top/bottom bands), and any edit-layer text you’ll add later.
Make it readable instantly: recognizable subject in the first second, one clear motion/transition, and end before the clip starts to degrade (these were common traits in Vidu’s repeated social-post tests; https://www.vidu.com/blog/social-media-video-ai).

What to protect (non-negotiable)

Eyes/forehead (top crop looks accidental immediately)
Mouth (caption zone loves to sit where mouths end up)
Hands/product (product demos often dip into the lower third)

What to keep empty (on purpose)

Lower third: subtitle + hook text zone
Upper band: UI + breathing room
Optional side column: a clean “caption rail” for CTAs or a logo bug

The 9:16 “Safe Zone” Prompt Kit (copy/paste snippets)

Use these as add-on lines at the end of any scene prompt.

How to combine:

Pick 1 framing snippet + 1 negative-space snippet + (optional) 1 text rule.
Keep them consistent across variations so your batch outputs frame similarly.

A) Framing snippets (choose one)

Centered talking head (headroom + eyeline)

“Vertical 9:16 composition. Head-and-shoulders framing. Subject centered. Eyes on the upper third. Extra headroom above hair. Keep chin fully visible.”

Waist-up demo (caption-safe hands/product)

“Vertical 9:16. Waist-up shot. Subject centered. Hold product at chest height. Keep hands and product above the caption area. Stable framing, minimal camera movement.”

Full-body (safe margins)

“Vertical 9:16. Full-body shot. Keep subject fully in frame with extra space above head and below feet. Locked camera. Simple background.”

Product close-up (center lock)

“Vertical 9:16. Tight product close-up. Product centered with safe margins. Smooth single rotation. Minimal motion besides the product.”

Vidu reports a rotating-product prompt produced centered, smooth rotation in some tests but also showed mid-clip jitter in others—so keep motion simple and inspect early (https://www.vidu.com/blog/social-media-video-ai).

B) Negative-space snippets (choose one)

Lower-third reserved for captions (most common)

“Leave generous clean negative space in the lower third for captions; keep face, mouth, hands, and product out of that area.”

Top-negative-space hook

“Leave large clean negative space at the top for hook text; keep subject lower-middle; avoid top-edge cropping.”

Side ‘caption column’

“Leave a clean vertical column on the right for captions/CTA; place subject on the left third; keep background simple and high contrast.”

All-sides safe margins

“Safe margins on all sides; no important elements near edges.”

C) Text rules (use sparingly)

If you must bake text in

“If any on-screen text appears, keep it large, bold, high-contrast, and away from edges; place it only inside the reserved negative space.”

If you want zero text conflicts

“No embedded text, no subtitles, no logos.”

D) Quick “avoid” line (add when outputs are too tight)

“Avoid extreme close-up; avoid cropping head; avoid subject drifting off-center; avoid important details near edges.”

Worked example (before/after): fixing the Reel that crops the head

This is the exact rewrite pattern most creators need.

Before (tight crop + caption collision)

“A creator talks to camera in a bright kitchen, holding a skincare bottle, energetic, modern, high quality, realistic.”

Why it fails:

No instruction for headroom → face often rides high.
No reserved lower third → hands/product drift into caption space.
Variants frame differently because layout wasn’t specified.

After (a layout-aware 9:16 safe zone prompt)

Copy/paste and customize:

“Vertical 9:16 UGC-style video in a bright kitchen. Waist-up shot of a creator holding a skincare bottle at chest height. Subject centered. Eyes on the upper third. Extra headroom above hair. Leave generous clean negative space in the lower third for captions; keep mouth, hands, and product above the caption area. Stable framing, minimal camera movement. High contrast between subject and background. One clear motion: raise the bottle slightly and point to it.”

What this fixes (specifically):

Headroom protects the forehead from top UI.
Lower-third negative space prevents captions from covering mouth/product.
One clear motion matches what Vidu saw in repeatable social-clip wins (recognizable fast + one motion) (https://www.vidu.com/blog/social-media-video-ai).

Make it faster inside Veo3Gen (when you’re ready to scale)

Veo3Gen supports 9:16 output (and 16:9) and supports text-to-video and image-to-video, plus first-and-last-frame control on Veo 3.1. It also generates native synchronized audio (dialogue, SFX, music) in a single pass—so your Reel draft can come out with audio without a separate step.

When your prompt is stable and you’re testing multiple hooks, backgrounds, or product angles, use Veo3Gen’s developer API to generate variants programmatically. CTA: If you’re doing structured variant testing, start with a single safe-zone master and then automate batches via /api.

Two generation recipes (pick one on purpose)

Recipe A: Vertical-first master (best for Reels/Shorts)

Use this when most views are on phones.

Generate native 9:16 and include safe-zone instructions.
Make the subject readable instantly; Vidu notes phone-scale failures happen when the subject is too small or motion too subtle (https://www.vidu.com/blog/social-media-video-ai).
Favor close framing, limited camera movement, and high contrast for compression survival (https://www.vidu.com/blog/social-media-video-ai).
Keep clips short; Vidu observed consistency breaks starting to appear around second five in many runs (https://www.vidu.com/blog/social-media-video-ai).

Recipe B: One master that repurposes to 16:9 (use with caution)

Use this when you must deliver both vertical and widescreen from the same concept.

Compose for a center-safe column (imagine 9:16 inside 16:9).
Keep the hero centered; avoid edge storytelling.
Avoid embedded text; add text per format in the edit.

Trade-off: the more you future-proof for widescreen, the smaller your subject becomes—which matches Vidu’s phone-scale failure mode (https://www.vidu.com/blog/social-media-video-ai).

The 60-second safe-zone test (do this before batch generation)

This prevents wasting generations on the wrong framing.

Generate one draft (9:16) with your best prompt.
Screenshot a representative frame (usually 0–1s when the hook must read).
Draw three rough bands on the screenshot:
- top UI band
- bottom UI band
- a thicker “caption block” where subtitles typically sit
Check collisions: do the bands overlap eyes, mouth, product, or hands?
Revise prompt using one framing snippet + one negative-space snippet.
Only then batch-generate (new hooks, outfits, backgrounds).

This aligns with Vidu’s guidance that a good social clip should be intentional on a phone screen and not require a second watch to understand (https://www.vidu.com/blog/social-media-video-ai).

Common failure modes (and the exact prompt fix)

Failure: forehead/chin clipped

Fix lines to add:

“Extra headroom above hair. Keep chin fully visible. Safe margins on all sides. Avoid extreme close-up.”

Failure: captions cover mouth/product

Fix lines to add:

“Leave generous clean negative space in the lower third for captions; keep mouth, hands, and product above the caption area. Hold product at chest height.”

Failure: subject drifts off-center mid-clip

Fix lines to add:

“Stable framing, minimal camera movement. Avoid subject drifting off-center.”

If you have a strong reference frame, consider image-to-video: Vidu observed reduced drift when pinning the first frame with an uploaded image (https://www.vidu.com/blog/social-media-video-ai). (Veo3Gen supports image-to-video and first-and-last-frame control on Veo 3.1.)

Failure: motion doesn’t register while scrolling

Fix lines to add:

“One clear motion: (turn / pour / point / reveal). High contrast subject/background.”

Vidu lists close framing, limited camera movement, and high contrast as compression-survival traits (https://www.vidu.com/blog/social-media-video-ai).

Failure: clip degrades near the end

Fix:

Make the action complete earlier. Vidu observed consistency breaks around second five in many runs (https://www.vidu.com/blog/social-media-video-ai).

Checklist

I’m generating native 9:16 (not relying on auto-crop).
The subject is recognizable within the first second. (https://www.vidu.com/blog/social-media-video-ai)
I prompted eyes on the upper third + extra headroom.
I reserved clean negative space (usually lower third) and kept mouth/hands/product out of it.
I kept motion to one clear action/transition. (https://www.vidu.com/blog/social-media-video-ai)
I used limited camera movement and high contrast for compression survival. (https://www.vidu.com/blog/social-media-video-ai)
I ran the 60-second safe-zone test before generating variants.
I avoided embedding critical text unless it lives safely inside reserved negative space.

FAQ

How do I stop Reels from cropping the top of the head in 9:16?

Add “eyes on the upper third” + “extra headroom above hair” + “safe margins on all sides,” then validate with a screenshot overlay before you generate variants.

How do I leave space for captions without making the shot feel empty?

Reserve a clean lower third, but keep the subject large (waist-up or close portrait) and include one clear motion so the frame still feels active while scrolling (https://www.vidu.com/blog/social-media-video-ai).

How do I repurpose a 16:9 AI video to 9:16 without losing faces?

Compose for a centered “safe column” from the start: keep faces and key objects centered, avoid edge storytelling, and don’t embed text near the sides.

How do I keep the subject from drifting off-center during the clip?

Prompt “stable framing, minimal camera movement.” If you can, use image-to-video with a strong first frame—Vidu observed reduced drift when pinning the first frame with an uploaded image (https://www.vidu.com/blog/social-media-video-ai).

Should I bake on-screen text into the generation?

Only when the text is non-critical or scene-native and you’ve reserved clean, high-contrast negative space. Otherwise keep the generation clean and add text per platform in editing.

Use this at the end of any prompt and replace the brackets:

“Vertical 9:16 composition. [SHOT TYPE: head-and-shoulders / waist-up / full-body / product close-up]. Subject clearly recognizable within the first second. Subject [centered / left third / right third]. Eyes on the upper third. Extra headroom above hair. Leave generous clean negative space in the lower third for captions; keep mouth/hands/product out of the caption area. High contrast between subject and background. Stable framing, minimal camera movement. One clear motion: [single action]. Avoid extreme close-up; avoid cropping head; avoid important elements near edges; avoid subject drifting off-center.”

Generate vertical-safe variants faster with Veo3Gen

Once your safe-zone layout is consistent, scaling becomes a variants problem (new hooks, products, backgrounds)—not a framing problem. Veo3Gen provides an affordable way to access Google’s Veo 3.1 video models without Google’s enterprise pricing, with three modes (Veo 3.1 Fast, Quality, Lite), plus 9:16 support and native synchronized audio in one pass.

CTA: If you want to iterate without “use it or lose it” pressure, Veo3Gen offers pay-as-you-go credits plus optional monthly plans, and purchased credits do not expire. Check /pricing and use the free credits for your first safe-zone test run.

Start creating with Veo3Gen

Veo3Gen gives you affordable Veo 3.1 video generation with native audio, up to 4K, and credits that never expire — with free credits to start.

Generate your first video now: Get started
Compare plans and pay-as-you-go pricing: See pricing

The 9:16 "Safe Zone" Prompt Kit: Stop Cropping Heads & Captions When You Turn Veo3Gen Clips into Reels/Shorts

TL;DR

Key takeaways

Why vertical AI videos get “ruined” after export (it’s a phone problem)

The 9:16 safe-zone concept (creator version)

Safe-zone priorities

What to protect (non-negotiable)

What to keep empty (on purpose)

The 9:16 “Safe Zone” Prompt Kit (copy/paste snippets)

A) Framing snippets (choose one)

B) Negative-space snippets (choose one)

C) Text rules (use sparingly)

D) Quick “avoid” line (add when outputs are too tight)

Worked example (before/after): fixing the Reel that crops the head

Before (tight crop + caption collision)

After (a layout-aware 9:16 safe zone prompt)

Make it faster inside Veo3Gen (when you’re ready to scale)

Two generation recipes (pick one on purpose)

Recipe A: Vertical-first master (best for Reels/Shorts)

Recipe B: One master that repurposes to 16:9 (use with caution)

The 60-second safe-zone test (do this before batch generation)

Common failure modes (and the exact prompt fix)

Failure: forehead/chin clipped

Failure: captions cover mouth/product

Failure: subject drifts off-center mid-clip

Failure: motion doesn’t register while scrolling

Failure: clip degrades near the end

Checklist

FAQ

How do I stop Reels from cropping the top of the head in 9:16?

How do I leave space for captions without making the shot feel empty?

How do I repurpose a 16:9 AI video to 9:16 without losing faces?

How do I keep the subject from drifting off-center during the clip?

Should I bake on-screen text into the generation?

Copy/paste: the Veo3Gen 9:16 safe zone prompt footer

Generate vertical-safe variants faster with Veo3Gen

Start creating with Veo3Gen

Sources

Try Veo 3 & Veo 3 API for Free