Producing 9:16 Video Ads: Framing, Pacing, Mistakes
A practical guide to vertical 9:16 video ad production—safe zones, motion pacing, and the framing mistakes that quietly kill performance on TikTok, Reels, and Shorts.
You shot a widescreen ad, cropped it to a tall rectangle, and shipped it to TikTok. The headline now sits behind the caption bar. The product is half out of frame. The first second is your logo. The ad spends $40 before you notice.
Vertical video is not landscape rotated ninety degrees. The aspect ratio changes how people hold the phone, how the platform stacks its own interface on top of your creative, and how much time you get before a thumb flicks the whole thing away. The production rules are different, and most of them are about geometry and pacing rather than taste.
The frame is smaller than you think
A 9:16 video is 1080 wide by 1920 tall. But the platform does not give you all of it. TikTok, Reels, and Shorts each overlay a column of buttons on the right (like, comment, share, the spinning audio disc) and stack text at the bottom: the caption, the username, the music ticker, the "Sponsored" label, and the call-to-action button.
The usable area—where you can put anything that must be seen—is roughly the center column with generous margins. Treat the outer edges as decoration only.
The safe zones, in numbers
Work from the 1080x1920 canvas and reserve these margins for nothing-important content:
- Top: ~250px. The status bar and, on some placements, the account header live here.
- Bottom: ~450-700px. This is the big one. Captions, CTA buttons, and the music ticker eat the lower third to lower 40%. Meta's Reels CTA sits especially high.
- Right: ~120-200px. The action rail (the icons) covers the right edge.
- Left: ~60px. A smaller margin, but text flush-left still gets clipped on some devices.
That leaves a vertical band, slightly left of center, as your reliable real estate. Put your subject, your product, and any burned-in text inside it. Anything outside is a gamble that pays off only on the placements that happen not to cover it.
Framing the subject for a tall format
Landscape framing wants headroom and lead room—space around a subject. Vertical framing wants the opposite. The screen is tall and narrow, so the subject should fill the width and you should fight the urge to leave air on the sides.
Three rules that carry most of the weight:
- Frame for the width, not the height. A talking-head should fill the horizontal frame from roughly chin height up. If you can see the speaker's shoulders and a foot of wall on either side, you shot it for the wrong ratio.
- Keep the action above the fold. The "fold" is the top of the caption bar—around the 60% mark from the top. Hands demonstrating a product, the moment a label appears, the before/after reveal: all of it belongs in the top two-thirds.
- Stack, don't spread. Vertical rewards stacking elements top-to-bottom (text above, product below) rather than placing them side by side. Side-by-side comparisons that work in 16:9 turn into two tiny columns in 9:16.
Pacing: the first second and the cut rhythm
Paid social feeds are a thumb-driven medium. The viewer is not committed; they are deciding. Most of that decision happens in the first second, before the platform even logs a "view."
Two pacing rules separate vertical ads that hold attention from ones that bleed it:
Open on motion or a face, never a logo
The first frame is the thumbnail and the first impression at once. A static logo card is the most expensive half-second in the ad—it teaches the algorithm and the viewer that nothing is happening. Open on a moving subject, a face mid-sentence, or the product already in use. Brand reveals belong at the end, after you have earned the attention.
Cut faster than feels comfortable
Vertical ad editing runs faster than narrative video. A useful starting rhythm:
- 0.0-1.0s: the hook—one clear visual idea, one line of text on screen.
- 1-3s: the problem or context, a fresh cut roughly every 1-1.5 seconds.
- 3-12s: the demonstration or proof, cutting every 1.5-2.5 seconds, with a visual change (angle, zoom, text) on every beat.
- Final 2-3s: the offer and CTA, held slightly longer so it can be read.
The point is not speed for its own sake. It is that every cut is a small re-hook. A held shot with no change invites the thumb. Even a subtle push-in or a new caption line resets the viewer's "should I stay" timer.
Captions and on-screen text
Sound-off viewing is the default, not the exception. A large share of feed watching happens muted, so the ad has to work as a silent film with subtitles and still make sense if the sound comes on.
Practical text rules for 9:16:
- Burn captions in. Platform auto-captions are unreliable and sit in the danger zone at the bottom. Render your own and place them in the safe band, usually the upper-middle, not pinned to the bottom edge.
- One idea per card. Three to six words on screen at a time. Full sentences in small type do not get read on a phone held at arm's length.
- High contrast, heavy weight. Bold sans-serif with a solid or semi-opaque backing. Thin text over busy footage is invisible on a sunny-day screen.
- Sync text to speech. Word-by-word or phrase-by-phrase captions that match the voiceover hold attention better than a static block.
A pre-export checklist
Run this before you push any 9:16 ad live. It catches the mistakes that survive the edit and only show up in the live placement.
- Open the preview at phone size, not desktop. Shrink it to a phone's footprint. Half the framing problems are only visible small.
- Overlay a safe-zone template. Confirm no critical text or product sits in the bottom 700px or behind the right rail.
- Watch the first second muted. Does it communicate without sound and without the title? If not, the hook is too slow.
- Check the last frame. The final held frame is what lingers. It should carry the offer and a legible CTA.
- Confirm the export is true 1080x1920. Not a 720x1280 upscale, not a letterboxed landscape with bars.
- Verify captions don't collide with the platform UI. Test against the actual placement—Reels stacks differently than TikTok.
- Hold the CTA long enough to read. A two-frame flash of a URL is not a call to action.
Common mistakes, and what to do instead
The recurring failures in vertical ad production fall into a short list:
- Cropping landscape footage to fill. You lose the sides and the composition. Reframe or reshoot; if you must crop, crop to the subject's face, not the geometric center.
- Letterboxing instead of filling. Black bars top and bottom waste the format's only advantage—the tall frame—and read as lazy. Fill the canvas or blur-extend, but do not ship bars.
- Text pinned to the bottom edge. It sits behind the caption and CTA. Move it up into the safe band.
- A slow, branded intro. Three seconds of logo animation is three seconds of viewers leaving. Lead with the hook.
- One ad, many platforms, no adjustment. The safe zones differ. A TikTok-safe layout can clip on Reels. At minimum, nudge text up for Meta placements.
- Testing one variant. Vertical creative fatigues fast. The hook that wins this week loses next week. Plan for many variants of the same offer, not one polished hero ad.
FAQ
What aspect ratio should I use for TikTok, Reels, and Shorts?
9:16 at 1080x1920 for all three. It is the native full-screen ratio across TikTok, Instagram Reels, and YouTube Shorts. You can run 1:1 (square) as a fallback for in-feed Meta placements, but vertical is the primary format for paid social and gets the full screen.
Where do I put text so it isn't covered by the platform UI?
Keep important text out of the bottom ~700px and away from the right edge ~150px, since that is where captions, the CTA button, and the action rail sit. The reliable zone is the upper-middle of the frame, slightly left of center. Always preview against the real placement, because Reels and TikTok stack their interfaces differently.
How long should a 9:16 video ad be?
For paid social, most direct-response vertical ads land in the 9-21 second range, with the hook in the first second and the CTA held at the end. Longer can work for a strong demonstration, but the burden of holding attention rises with every second. Cut hard and test shorter variants alongside it.
Producing many of these by hand is slow—reframing, captioning, and exporting per platform for every variant. Aitachyon takes a website URL and returns a finished, captioned vertical ad in about two minutes, exported in 9:16, 16:9, or 1:1 with the text already inside the safe zone—so you can ship the volume of variants paid social actually demands. Plans start at $29/mo with a 14-day money-back guarantee.
Related articles
AI UGC Ads: How to Make Them Look Like Real Content
A practical guide to scripting, avatar, and post-processing choices that make AI-generated UGC ads pass as organic creator content on paid social.
TutorialsFrom URL to finished video ad: the two-minute pipeline
Paste a URL, get a finished video ad in about two minutes. A step-by-step look at the pipeline: brand scrape, three scripts, voice, avatar, captions.
TutorialsTestimonial Video Ads When You Have No Testimonials Yet
How to produce testimonial-style video ads at launch using beta quotes, founder narration, and honest social proof framing—without faking customers.