How to Film Talking Head Videos on iPhone
Practical setup for filming talking head videos on iPhone. Framing, lighting, background, audio, and delivery — from a team that edits talking head content for clients every day.
Table of Contents
- What Is a Talking Head Video and Why Is It the Most Powerful Format for Most Creators?
- How Do You Frame a Talking Head Shot on iPhone Without It Looking Off?
- Where Should You Position Your iPhone for a Talking Head Video?
- What's the Best Lighting Setup for a Talking Head Video Filmed at Home or in an Office?
- How Do You Handle Audio for a Talking Head Video on iPhone?
- What Background Should You Use for a Talking Head Video?
- How Do You Actually Perform Naturally on Camera When You're Filming Yourself?
- What Does Well-Filmed Talking Head Footage Let an Editor Do That Bad Footage Does Not?
Film a talking head on iPhone by positioning the lens at eye level two to three feet from your face, placing your eyes in the upper third of the frame, facing a window or light source, mounting the phone on a tripod, and clipping a mic to your collar. That is the whole setup.
---
What Is a Talking Head Video and Why Is It the Most Powerful Format for Most Creators?
A talking head video is a single-camera shot of one person speaking directly to the lens. No b-roll. No cutaways. Just a person, a frame, and a point of view.
It is the dominant format in short-form video right now because it is the most direct. When someone watches a talking head, they are in a one-on-one conversation. The creator is looking at them. The creator is speaking to them specifically. No format closes the distance between a brand and an audience faster.
I have worked with clients across fitness, real estate, ecommerce, and consulting. The ones who commit to talking head content consistently are almost always the ones who see the most compounding results. Narellan Pools Ipswich grew their followers 133 percent and hit an 8.54 percent engagement rate. They doubled their revenue from social. The content driving that was not fancy production. It was consistent talking head video, posted on schedule, from someone their audience recognised.
Talking head video also scales well. Once you have a working setup, you can batch-record. Once you have a reliable format, your editor can work faster. The output becomes predictable in the best sense.
The format also ages well in the algorithm. Platforms reward watch time and completion, and a well-delivered talking head that hooks the viewer in the first two seconds and delivers real information holds attention better than most over-produced alternatives.
This is the format most people should start with and most established creators should return to more than they do.
---
How Do You Frame a Talking Head Shot on iPhone Without It Looking Off?
The most common framing mistake is centering the face in the frame. It looks passive and leaves wasted space above the head. The fix is simple: place your eyes in the upper third of the frame, not the middle.
From there, you want a head-and-shoulders composition with a small gap between the top of your head and the top of the frame. Not a lot of headroom, just enough that the frame does not clip your hair or feel cramped. Do not cut off the chin. Ever. The chin is structural. Cut it and the composition immediately looks accidental.
Camera height matters as much as framing. The lens should sit at eye level or very slightly above. Slightly above is forgiving and flattering. Below eye level is almost never the right call for this format. When the camera is below you, it changes the visual dynamic in a way that reads as unflattering rather than authoritative.
Distance from lens to face should be two to three feet for a standard head-and-shoulders frame. Step further back and the shot widens, which can feel less personal and more like a presentation than a conversation. Step closer and you create intimacy, but you also risk distortion from the iPhone's wide-angle lens, which is not a flattering distortion.
One useful test: frame yourself, then take a screenshot and look at it as an image rather than as a live view. Problems that are invisible when you are concentrating on delivery become obvious the moment you look at a still frame.
For more on composing for vertical formats, how to film yourself on iPhone for reels covers the mechanics in more depth.
---
Where Should You Position Your iPhone for a Talking Head Video?
A tripod. Not a stack of books, not a friend holding the camera, not your laptop propped up at an angle. A tripod.
Handheld talking head video has a place, and that place is very specific content where movement is part of the energy. For most talking head content, especially educational or authority-building video, a locked-off frame is the right call. It is professional. It is easy to edit. It does not create motion sickness in the viewer.
A proper phone mount costs between $15 and $30. A Gorilla Pod is useful if you need to grip around furniture or get a precise angle without a full tripod. Either works. There is no version of this where handholding a phone and trying to speak at the same time produces better footage than a stable mount.
Position the tripod so the phone is genuinely at eye level. Most people set it too low because they are standing and the tripod is not fully extended, or they are sitting and the tripod is at standing height. Check the camera position in the live view before you start, not after you finish recording.
Where in the room you position yourself relative to the iPhone is a separate question from where you place the tripod, and it is connected to your lighting setup, which is the next section.
---
What's the Best Lighting Setup for a Talking Head Video Filmed at Home or in an Office?
Window light is the best option you are not paying for.
Sit facing the window. Not with the window behind you, not beside you at a 90-degree angle. Facing it. The light should land on your face from the front, wrapping slightly to one side. That is what produces an even, natural-looking image without expensive equipment.
If the window creates too much contrast between the lit side of your face and the shadow side, a white piece of card or a reflector opposite the window will bounce enough fill light to soften it. This is a $0 to $15 solution that solves a problem people spend hundreds on gear to fix.
If natural light is not reliable or you are filming in the evening, a ring light or a small LED panel is the practical alternative. The ring light should be at eye level, slightly to one side rather than dead centre in front of you. Dead centre ring lights produce a flat look that is fine but forgettable. Slightly off-axis light has more dimension.
The thing to actively avoid is overhead lighting. Standard ceiling lights create shadows under the eyes, under the nose, under the chin. They make people look tired or unwell on camera even when they are not. If overhead lights are the only source in the room, turn them off and use a ring light or a desk lamp pointed at your face instead.
Whatever your lighting source, the camera needs to maintain a consistent exposure throughout the take. On iPhone, tap and hold on your face in the live view to lock focus and exposure before you start recording. If you do not do this, the camera will automatically adjust exposure as you move, and the brightness of the image will drift mid-take, which creates a headache in post.
For more control over the image, best iPhone camera settings for video walks through what to change and why.
---
How Do You Handle Audio for a Talking Head Video on iPhone?
The built-in iPhone microphone is not good enough for most talking head content. It picks up room noise, air conditioning units, the hum of appliances, traffic. It also picks up a lot of room reverb, which makes your voice sound hollow and echoey. You can get away with it in a well-treated room if you are close to the phone and the room is quiet, but you are working against the tool.
AirPods Pro are a genuine step up in certain conditions. The microphone is closer to your mouth, which helps with clarity, and the noise cancellation on the input side reduces ambient noise. They are not a professional solution, but they are meaningfully better than the phone mic and most people already own them.
For production-quality audio, the Rode Wireless GO II or the DJI Mic 2 are the two I recommend most often. Both are clip mic systems. You clip the transmitter to your collar or lapel, close to your mouth, and it transmits wirelessly to a receiver that plugs into the iPhone. Hide the transmitter under a collar or behind a lapel. The audio these systems produce is noticeably cleaner than anything you can get from a phone mic, and that difference is immediately audible in the final edit.
Good audio is not optional. Viewers will tolerate average video quality with excellent audio. They will not tolerate the reverse. The audio is what makes the viewing experience bearable or unbearable.
One practical thing to do before every take: record five seconds of silence in the room before you start speaking. This gives your editor a clean sample of the room noise, which they can use to clean the audio in post. It costs you five seconds. It saves the editor real work.
---
What Background Should You Use for a Talking Head Video?
The background should be slightly out of focus. Not blurred into oblivion like portrait mode, but soft enough that the viewer's attention stays on you rather than reading the spines of every book on your shelf.
The way to achieve this without using portrait mode is distance. Put two to three metres between yourself and the wall or the shelf behind you. The iPhone's camera will naturally throw the background slightly out of focus when you are far enough from it. Portrait mode blurs aggressively and creates an artificial-looking separation that reads as cheap on a laptop screen even if it looks fine on a phone. Physical distance is the better solution.
Plain white or grey walls are not a good choice. They compress badly on Instagram and TikTok, and they look like you are filming in a storage room. A bookshelf, a plant, a slightly out-of-focus room with some depth, these work well because they have visual interest without demanding attention.
The most important thing the background should not do is distract. A cluttered desk behind you, laundry visible in a door frame, a window with blown-out light directly behind you, these pull the viewer's eye away from you. Check the background before you film, not after. Turn around and look at what the camera is going to see. Remove anything that reads as clutter.
If you are filming in an open-plan office, close the door or find a spot with a wall behind you. Fewer moving parts in the background is always better.
---
How Do You Actually Perform Naturally on Camera When You're Filming Yourself?
Pick one point in the camera lens and talk to it. One point. Not the screen, not the little red recording indicator, not your own reflection in the glass if you are using a selfie setup. The lens. Treat it like a person you are talking to, not like a device you are recording into.
This sounds obvious, but most people look at their own face on the screen while filming, which means they are looking slightly off to the side or down, depending on where the screen sits relative to the lens. It is a small difference on camera, but it reads as disconnected. The viewer can feel when someone is not quite looking at them.
Write bullet points of what you want to cover on a piece of paper and position it right next to the lens. Not on your phone, not on a monitor across the room, not in your head as a memorised script. A physical piece of paper next to the lens that you can glance at without visibly shifting your gaze. This is not teleprompter territory. It is permission to not remember everything word-for-word, which is what makes delivery feel natural instead of rehearsed.
Film in short takes by topic rather than trying to nail one long continuous recording. Decide you are going to cover three points and record each point as a separate clip. Take one. Review it. Take two if needed. Then move to the next point. Editors can assemble multiple short takes into a tight argument. You do not need to deliver a perfect seven-minute monologue in one go.
The best on-camera delivery I have seen from business owners and consultants who were not natural on-camera people came from giving them permission to stop and start. The pressure of the long continuous take is what creates the stiffness. When you know you can stop after thirty seconds, you relax. And relaxed is what you want to look.
---
What Does Well-Filmed Talking Head Footage Let an Editor Do That Bad Footage Does Not?
When you deliver ten clean, short takes, each covering one point with a clear start and a clear end, the editor can find the best version of every idea and assemble the tightest possible argument. They are selecting from options. That is a creative and editorial job.
When you deliver one eight-minute rambling take where you circle back to the same point three times and trail off twice and recover, the editor is doing archaeology. They are excavating your best moments from material that was not designed for assembly. Both are workable. One consistently produces a better final cut.
The specific things that make footage easy to edit: clean in-points where you start talking without a false start, clean out-points where you finish a sentence and stop rather than trailing into "so, yeah, um," a pause between takes rather than talking through the mistake, and no sudden changes in position, distance, or lighting mid-take that create jarring jump cuts.
Audio consistency is the other thing Hayden flags most often when reviewing footage before we cut it. If you are close to the mic for two takes and then step back for the third, the audio level changes and the cut between those takes sounds wrong. Stay at the same distance from the mic for every take in a session.
The difference between footage that takes three hours to edit and footage that takes ninety minutes is almost entirely in how it was recorded. The more disciplined you are about clean takes and consistent setup, the better the edit, and the faster it comes back to you.
If you want to understand how to set up your footage for the edit before it goes to an editor, how to brief a video editor covers what context helps most.