Tips & Ideas

How to Film a Talking Head Video (That Edits Well)

The complete guide to filming talking head videos for Instagram Reels, TikTok and YouTube Shorts. Camera settings, lighting, framing and footage delivery tips from the editing team at VX Workflow.

Jakob Quinn
10 min read
Table of Contents
  1. What Is a Talking Head Video?
  2. Equipment You Actually Need (and What You Don't)
  3. Lighting Setup for Talking Head Videos
  4. Camera Settings for Talking Head Video
  5. How to Frame Yourself for Vertical Video (9:16)
  6. How to Deliver Footage That Edits Well
  7. Talking Head Video Examples: What Good Looks Like in Practice

A talking head video is a direct-to-camera shot where one person speaks to the audience with no cutaways. It's the main format for short-form business content, and the reason most of it underperforms is that the footage was bad before the editor ever saw it.

This guide covers what the format actually requires, what equipment matters and what doesn't, and how to hand off footage that gives your editor something to work with. Everything here comes from the team that cuts these videos every day.

What Is a Talking Head Video?

A talking head video is any piece of content where the subject speaks directly to camera as the primary visual element. No cutaways to other footage, no screen recordings, no slides. Just a person, a frame, and a point to make.

The name comes from broadcast television, where news segments and interview pieces were called "talking heads" because the shot was tight on the speaker's face and shoulders. In short-form video, the format has taken over personal brand and business content because it needs almost no production setup and you can film 10 of them in an afternoon.

Why it works for business content

When it's filmed well, talking head content communicates expertise and builds familiarity faster than most other formats. The viewer is looking at a face. Faces hold attention. When the delivery is confident and the frame is clean, the format does its job.

The problem most business owners hit isn't the format. It's inconsistency. One reel looks polished, the next looks like a car park. That back-and-forth destroys the trust the format is supposed to build.

Talking head video examples: what good looks like

The stuff that actually performs shares a few things. The speaker is at eye level or slightly above, never looking up into the lens. The background is chosen, not accidental. The audio is clean enough that a viewer can follow without captions doing all the heavy lifting.

What separates the top 10% from the rest is almost always footage quality, not performance. A confident speaker filmed badly loses. An average speaker filmed well, clean exposure, sharp focus, decent audio, works.

Equipment You Actually Need (and What You Don't)

Most business owners overcomplicate this. Here's what actually matters.

Camera: iPhone, Samsung, DJI, or mirrorless?

Your iPhone is enough. iPhone 14 and later shoots 4K with optical image stabilization that outperforms cameras costing ten times more. The same goes for Samsung Galaxy S22 and newer. Any current Android flagship rear camera will produce strong results.

A DJI Osmo Pocket 3 is worth it if you move while filming. The gimbal stabilization removes the kind of shake a phone tripod can't. For a static desk setup, it offers nothing meaningful over a phone.

A mirrorless camera is not necessary. If you own one, use it. If you don't, don't buy one for this.

The camera matters far less than the lighting and audio. We deal with this constantly: phone footage shot in good light with a decent mic is easier to work with than mirrorless footage lit badly with built-in audio. Don't let gear be the reason you don't start.

Microphone: what actually moves the needle

Phone audio is the most common reason business content feels cheap even when the visuals are fine. Viewers tolerate imperfect video. They don't tolerate audio that makes them work to follow what's being said.

You don't need an expensive mic to fix it. A Rode Wireless GO II runs around $300 and clips to your collar. A DJI Mic 2 is similar. Either one produces broadcast-quality audio from a standard talking head setup.

Not ready to buy a wireless mic? Your AirPods or wired EarPods in your pocket, with the mic close to your mouth, will beat your phone's built-in microphone in any untreated room.

Record a 30-second test clip and listen back through headphones before your full take. If you wouldn't want to listen to it for 60 seconds, your audience won't either.

Tripod and stability

A stable frame is the baseline for talking head content. Handheld is not the same as intentional movement. It reads as amateurish and gives the editor a harder job than they should have.

A basic desk tripod with a phone mount costs under $30 and solves the problem entirely. A full-height tripod if you film standing. A Gorillapod for improvised setups where you need to clamp or wrap around something.

The other benefit of a tripod is consistency. When your frame is locked, every reel you film at that setup looks the same. That compounds over time into something recognizable.

Lighting Setup for Talking Head Videos

Lighting is where most talking head content falls apart. The gap between a $500 camera in good light and a $3,000 camera in bad light is not close. Get the light right first.

Natural light window setup

A large window with indirect natural light to your side, roughly 45 degrees to your face, is the best light source available for talking head content. It gives you soft, directional light with natural depth.

Position yourself so the window is to your left or right. Not directly in front of you, not behind you. The light falls on one side of your face, the other side sits in slight shadow. This is Rembrandt-adjacent lighting, and it's what makes professional interview content look the way it does.

What to avoid: a window behind you. The camera exposes for the bright background and your face goes dark. Turning to face the window directly flattens the image. Side light is the target.

Natural light changes throughout the day. If your window setup worked at 10am, check it again before filming at 2pm.

Ring light setup

A ring light at eye level, two to three feet from your face, is the standard fix for controlled indoor filming. Size matters here. Smaller ring lights produce harsh light that shows in the eyes as a tight, hard reflection. Go 18 inches minimum.

Position it at eye level or very slightly above. Tilting it down from above your head creates shadows under the eyes and nose. If you're getting dark patches on your face, move it closer to eye level before adjusting anything else.

Ring lights give you even, flat illumination that reads as clean and professional in vertical video. The trade-off is that it lacks the directional depth of a window setup. For short-form content, the consistency is usually worth it.

What to avoid

Overhead room lighting as your primary source creates what editors call "raccoon eyes": deep shadows under the brows and eyes that make the speaker look tired or washed out. It's the most common lighting mistake in talking head content filmed at home or in offices.

Colored lights behind you can work for gaming or entertainment content. For business content, they read as unprofessional and pull focus away from what's being said.

Mixed color temperatures, a warm room light and a cold window at the same time, create an uneven color cast that's hard to correct in post. Pick one light source and commit to it.

Camera Settings for Talking Head Video

iPhone settings

Open the Camera app and go to Settings before you start filming.

Set format to 4K at 30fps. That gives you enough resolution to crop in post without losing quality, at a frame rate that feels natural for direct-to-camera content. 24fps is more cinematic but can look juddery in a fast-moving social feed. 60fps looks too smooth for someone talking straight to camera.

Turn Cinematic Mode off. It hunts for focus continuously and will pull to your background mid-sentence. Tap and hold on your face to lock focus and exposure before recording. The yellow square with a lock icon confirms both are locked.

Turn Action Mode off. It crops the image significantly and adds processing delay.

For most setups, the front camera is fine. The rear camera has a better sensor but requires a mirror or monitor to frame yourself. Most people don't want the added setup. If you're on an iPhone 15 Pro or later and delivering to an editor, ProRes is worth using if your storage allows it. Standard HEVC is perfectly workable.

Samsung settings

Open the Camera app and switch to Pro Video mode, or use Expert RAW if it's installed.

Set to 4K at 30fps. Turn the Scene Optimizer off. It applies AI adjustments in real time that alter exposure and color in ways that are difficult to undo in editing.

Samsung's front cameras have improved significantly on the S22 and newer. For a desk setup, the front camera is convenient and produces strong results. If image quality is the priority, rear camera with a tripod and a mirror to check framing is the better option.

Lock focus by tapping and holding your face. Turn Video Stabilization to Basic. The higher stabilization modes crop the image and introduce noticeable warping when you move laterally.

Frame rate and resolution

4K at 30fps is the standard for short-form content going to an editor, across all devices. It gives enough resolution to reframe if needed.

Don't film in slow motion unless the shot specifically calls for it. Don't film in landscape and crop to vertical. Film natively in 9:16 from the start. Horizontal footage cropped to vertical loses resolution and almost always cuts off parts of the frame that were intentional.

How to Frame Yourself for Vertical Video (9:16)

Vertical framing needs different instincts than landscape. The rules aren't complicated.

Your eyes should sit at the upper third of the frame, not the center. Center-framing in vertical video looks static and leaves dead space below you. Upper-third framing feels natural and leaves room at the bottom for captions and text overlays that don't cover your face.

Frame from mid-chest up. Cutting at the shoulders feels cramped. Lower than the waist wastes the frame. Leave three to four inches of headroom between the top of your head and the top of the frame. No headroom looks like a mug shot.

Background: it should be chosen, not accidental. A plain wall works. A bookshelf works. An open-plan office with people walking behind you doesn't. If you can't control the background, move closer to a wall and let it fall out of focus.

How to Deliver Footage That Edits Well

What follows is the stuff that makes the difference between a brief that produces a great cut on the first pass and one that needs two rounds of revisions to fix problems that were already in the footage.

File naming and handoff

Name your files before you upload. Date, topic, take number. Something like 2026-05-19_pricing-reel_take3.mp4. It means the editor can find the right clip without watching all of them first.

Upload via Dropbox or Google Drive. Not WhatsApp, iMessage, or email. All three compress the video. AirDrop to a Mac then upload is fine. USB transfer is fine. Anything that adds compression is not.

If you filmed multiple takes, upload all of them. Don't pre-select. The editor will find the best performance. What looks like a mistake to you is often recoverable. What looks fine to you might have a focus pull you didn't notice.

The mistakes that slow things down most

Stopping mid-take. When you mess up, keep the camera rolling and restart the sentence. Stopping and starting creates dozens of short clips instead of a few long takes, and the editor has to stitch the best version together from fragments. It adds time and usually reduces the quality of the cut.

Filming in a room with echo. Hard surfaces, bare walls, uncarpeted floors, no soft furnishings, all create reverb that's noticeable in the mix and difficult to clean up in post. If your room echoes, hang a blanket behind you, sit in a closet, or get the mic closer to your mouth. Any of those helps more than a plugin.

Moving out of your locked focus zone. Lock focus on your face and then lean forward or back significantly, and the focus will follow to wherever you are now. Stay within about 30cm of your starting position once it's locked.

Filming against a window in the afternoon when the light has shifted. Natural light is inconsistent across the day. Your 10am setup and your 2pm setup will look different. Check before you film.

What a good brief looks like

When you submit footage via the VX Workflow portal, the brief is where you tell the editor what you're trying to achieve, not just what you filmed.

A good brief covers: the main point you want the viewer to take away, the platform and format, any specific moments in the footage you liked, and any music or caption preferences that aren't already in your brand profile. A brief that says "make it good" produces a generic cut. A brief that says "the moment at 1:23 where I talk about pricing is the hook, build it around that" gives the editor something specific to work toward.

The brand profile handles the defaults: colors, fonts, tone, music style. The brief handles what's different about this particular reel.

> Film it. We'll cut it. Get your first 2 reels edited by a senior editor, QC'd by Hayden Brinkley before you see it. No card needed. Start free →

Talking Head Video Examples: What Good Looks Like in Practice

The reels that perform best for business follow a pretty consistent structure. A hook in the first two seconds, a single core point in the 30 to 45 seconds after that, and an end that gives the viewer somewhere to go.

The hook doesn't have to be dramatic. "Here's the one thing I changed that doubled my inquiry rate" works because it's specific and promises a payoff. "I want to talk about something important" doesn't, because it asks the viewer to trust that the next 45 seconds are worth their time without giving them any reason to.

One point per reel. Not a list, not multiple takeaways. One thing. Reels that try to cover too much almost always lose the viewer before the payoff lands.

The ending is more important than most people think. A reel that just stops leaves the viewer's energy nowhere to go. A question ("what would you add to this?") or a direction ("link in bio") gives them something to do with it.

Frequently asked questions

What is the difference between a talking head video and a vlog?
A vlog follows a person through an experience or day using a mix of footage types: walking shots, location changes, cutaways. A talking head video is a single, static direct-to-camera setup. For business content, talking head is more effective because it's faster to produce, easier to edit, and creates a more direct connection with the viewer.
How long should a talking head video be for Instagram Reels?
For business content, 30 to 60 seconds is the range that works. Under 30 seconds usually isn't enough time to build credibility and land a complete point. Over 60 seconds loses a significant portion of the audience before the payoff. The Reels algorithm favors completions, so a 45-second video most people finish outperforms a 90-second one half the audience leaves early.
Do I need a professional camera for talking head videos?
No. An iPhone 14 or any current Samsung Galaxy flagship produces footage that meets professional standards for short-form content. Lighting, audio, and a stable frame matter more than camera quality. A well-lit phone video with clean audio beats poorly lit mirrorless footage every time.
What background should I use for a talking head video?
A plain wall, a bookshelf, or a clean branded environment all work. The rule is that it should be chosen, not accidental. What doesn't work: cluttered spaces, people moving behind you, or a bright window directly behind you that blows out the exposure. If in doubt, move closer to a wall and let the background go soft.
Can I film multiple reels in one session?
Yes, and it's one of the smarter ways to stay consistent. Film five to eight takes across two or three topics in one setup. Submit all the footage as separate briefs. The brand profile autofills the defaults for each one. A 90-minute filming session can produce a full month of content if you batch it properly.
What is the best microphone for talking head video on iPhone?
The Rode Wireless GO II and DJI Mic 2 are both solid at around $300. If you're not ready to buy dedicated audio gear, wired EarPods with the mic near your collar will outperform the iPhone's built-in mic in most rooms. Test it: record 30 seconds in the space you plan to film, listen back through headphones, and decide if you'd sit through 60 seconds of it.
Share this article