What Is MP4 Format? The Complete Guide (2026)

MP4 ISO 14496-14 · MPEG-4 Part 14
H.264 + AAC Most common codec pair inside MP4
4B+devices record in MP4
91%video sites use H.264
ISO 14496-14official standard
40%smaller with H.265

Open the camera app on any smartphone and start recording. On an iPhone, you're creating an MP4. On a Samsung Galaxy, you're creating an MP4. On a Pixel, you're creating an MP4. Pull up YouTube, Netflix, TikTok, or Instagram — the video streams you receive are packaged in MP4. Record your screen with OBS or Loom: MP4. Export from DaVinci Resolve or Adobe Premiere for web delivery: MP4.

MP4 is the most pervasive video container format in existence, and yet most people who use it daily have no idea what it actually is, how it works internally, or why it matters which codec is packed inside it. The difference between an MP4 with H.264 video and an MP4 with H.265 video is enormous — roughly 40% of the file size — even though both files have the exact same extension. This guide explains the internals, the history, the codec choices, and everything practical you need to know to work with MP4 files effectively.

1. What Is MP4?

MP4 — formally known as MPEG-4 Part 14 and standardized as ISO/IEC 14496-14 — is a digital container format for storing video, audio, subtitles, chapter markers, and metadata in a single file. Files use the .mp4 extension and the MIME type video/mp4. The format was first formally standardized in 2003 and is now maintained by the ISO/IEC Joint Technical Committee 1.

The critical concept: MP4 is a container, not a codec. A container is a file format that wraps one or more streams of encoded media together with timing information, metadata, and an index of where everything is. The codec is the algorithm that actually compresses the video or audio data stored inside that container. An MP4 file can contain many different video codecs — H.264, H.265/HEVC, AV1, VP9, MPEG-4 Part 2 — and many different audio codecs — AAC, AC-3/Dolby, MP3, FLAC, Opus. The container is the box; the codec is the language the content is written in.

This distinction matters enormously in practice. When someone says "MP4 is higher quality than AVI," they are almost certainly talking about the codecs typically used inside each container, not the containers themselves. An MP4 with H.264 and an AVI with H.264 carry the same compressed video — the container differences are in metadata structure, seeking behavior, and software support, not the video quality.

Inside every MP4 file, data is organized into hierarchical boxes (also called atoms), a structure inherited from Apple's QuickTime format. The three most important boxes at the top level of any MP4 are:

  • ftyp — File Type Box. The very first box in any MP4 file. Contains the brand identifier (usually isom or mp42), the version, and a list of compatible brands. It allows media players to quickly identify what kind of MP4 this is and whether they can play it.
  • moov — Movie Box. Contains the complete structural description of all streams: codec parameters, frame timing, stream dimensions, track metadata, and crucially, the location (byte offset) of every chunk of media data in the file. The moov atom is the index — a player cannot seek to a point in the video without it.
  • mdat — Media Data Box. The actual raw encoded video frames and audio samples. This is where the bulk of the file size lives.

The position of the moov atom relative to the mdat atom is what determines whether an MP4 can stream progressively — a topic covered in detail in the container section below.

ISO 14496-14
Official MP4 standard (MPEG-4 Part 14)
91%
Video websites using H.264/AVC in MP4
1998
MPEG-4 Part 2 video codec first released
40%
Smaller files with H.265 vs H.264

MP4 derives from Apple's QuickTime File Format (.mov), which Apple shared with the ISO working group in the late 1990s as the basis for the standardized ISO Base Media File Format (ISOBMFF, ISO 14496-12). MP4 is one specific profile of ISOBMFF — alongside other formats like HEIF (used for iPhone photos), AVIF (next-generation images), and CMAF (Common Media Application Format for streaming). Understanding this shared heritage explains why MP4 and MOV files are structurally very similar and often interchangeable with a simple container remux.

2. A Brief History of MP4

The MP4 format did not emerge fully formed — it is the result of three decades of compression research, standardization battles, and the relentless pressure of the internet on file sizes. Understanding the timeline explains why certain codecs dominate and where the format is heading.

1993
MPEG group begins MPEG-4 development — The ISO/IEC Moving Picture Experts Group begins work on MPEG-4, an ambitious standard intended to cover visual and audio coding for interactive multimedia applications. The scope went far beyond simple video compression, encompassing scene description, synthetic media, and object-based video.
1998
MPEG-4 Part 2 video codec released — ISO/IEC 14496-2, the first MPEG-4 video codec (now called MPEG-4 Visual or MPEG-4 Part 2), is published. DivX and Xvid are based on this codec. It offers significantly better compression than MPEG-2 (DVD standard) and becomes popular for internet video distribution on early CD-ROM collections and peer-to-peer networks.
2001
ISO 14496-12: Base Media File Format derived from QuickTime — Apple provides the QuickTime File Format specification to the ISO working group. ISO 14496-12 is published as the ISO Base Media File Format — the foundation that MP4, HEIF, AVIF, and other modern formats build on. The atom/box structure of QuickTime becomes the structural basis for the entire family of formats.
2003
ISO 14496-14 (MP4 file format) formally standardized — The MP4 file format specification is published as an ISO standard, defining it as a specific profile of the ISO Base Media File Format optimized for MPEG-4 content. The .mp4 extension and video/mp4 MIME type are established. This is the birth of MP4 as we know it today.
2006
H.264/AVC becomes the dominant codec — YouTube adopts it — H.264 (MPEG-4 Part 10 / AVC), finalized in 2003 and refined in subsequent profiles, achieves dominant adoption. YouTube begins H.264 streaming. The combination of H.264 video and AAC audio inside an MP4 container becomes the de facto standard for web video, remaining so for the next decade and beyond. Internet Explorer 9, released in 2011, drops Flash and adds native H.264/MP4 support, completing the browser transition.
2009
iPhone 3GS records in MP4 — smartphone standard established — The iPhone 3GS becomes the first iPhone with video recording capability, outputting H.264/AAC in an MP4 container. This single product ships video recording to tens of millions of consumers and establishes MP4 as the default format for user-generated video content worldwide.
2013
H.265/HEVC approved — same quality at half the bitrate — ISO/IEC 23008-2 (H.265/HEVC, High Efficiency Video Coding) is approved. It achieves approximately 40–50% better compression than H.264 at equivalent perceptual quality — or equivalently, the same quality at half the bitrate. 4K streaming on Netflix and Apple TV+ relies on H.265 in MP4 containers to keep bitrates manageable on consumer internet connections.
2018
AV1 codec released — royalty-free compression — The Alliance for Open Media (founded by Google, Mozilla, Netflix, Microsoft, Amazon, Apple, and others) releases AV1. It achieves approximately 30% better compression than H.265 and is entirely royalty-free, removing the licensing barrier that slowed H.265 adoption. AV1 streams can be delivered inside MP4 containers as well as WebM.
2023
Netflix begins AV1 streaming in MP4 containers — Netflix rolls out AV1 streaming on smart TVs and connected devices with hardware AV1 decoding, delivering the same video quality at lower bitrates. YouTube and Vimeo have been streaming AV1 since 2018–2019. AV1 in MP4 becomes a production reality for major streaming platforms.
2026
H.266/VVC emerging — 50% smaller than H.265 — H.266 (Versatile Video Coding), finalized in 2020 and now seeing hardware support begin shipping in consumer devices, promises approximately 50% compression improvement over H.265 — meaning a 4K H.266 stream can match the quality of a 4K H.265 stream at half the bitrate. Hardware encoding and decoding support is gradually rolling out. MP4 containers will carry H.266 as adoption grows.

3. MP4 as a Container: What That Actually Means

Understanding MP4 as a container — not a codec — is the most important conceptual shift for anyone working seriously with video. It changes how you diagnose playback problems, understand file sizes, and decide on export settings.

The Box/Atom Structure

Every MP4 file is built from a hierarchy of boxes (the ISO term) or atoms (the QuickTime term — both words describe the same thing). Each box has a 4-byte size field followed by a 4-byte type identifier (like ftyp, moov, mdat). Boxes can contain other boxes, creating a tree structure. The most important boxes in a typical MP4 are:

  • ftypFile Type Box. Always the first box. Identifies the file as an MP4 and lists compatible specification versions. A player checks this first to know how to parse the rest of the file.
  • moovMovie Box. Contains all the structural metadata: stream definitions, codec parameters, frame timing tables (every frame's size and byte offset within mdat), edit lists, and track-level metadata. The moov box is the table of contents for the entire file. A player cannot seek or begin playback without it.
  • trakTrack Box (inside moov). One trak box per stream — typically one for video, one for audio, and optionally one for subtitles. Contains the track's specific codec parameters, sample tables, and timing data.
  • mdatMedia Data Box. The raw encoded video frames and audio samples. The actual bulk of the file. The moov box tells the player exactly which byte offset in mdat corresponds to which frame at which timestamp.
  • udtaUser Data Box. Optional metadata: title, artist, copyright, encoder software, creation date, GPS location. Not required for playback.
  • free / skip — Optional padding boxes that can be filled with zeros. Sometimes used to reserve space in the file for later editing without requiring the entire file to be rewritten.

Faststart: Why Moov Position Matters for Streaming

The single most practically important structural property of an MP4 file for web use is the position of the moov atom. Here's the problem: when a video encoder processes a file, it doesn't know the final structure of the moov atom until encoding is complete — because moov contains the byte offset of every frame in mdat, and those offsets can only be known after all the frame data has been written. So most encoders write mdat first (as frames are encoded) and then write moov at the end of the file.

This creates a problem for web streaming. When a browser requests an MP4, it starts downloading the file from the beginning. If moov is at the end — which is the default for most encoders — the browser must download the entire file before it can parse the structure and begin decoding. For a 500MB movie, this means waiting for all 500MB to download before a single frame plays. The video will not buffer progressively: it will just sit there loading.

The solution is faststart — moving the moov atom to the front of the file, before mdat. With moov at the start, the browser can parse the structure immediately, begin decoding, and display the first frames while the rest of the file continues downloading in the background. Progressive streaming works correctly.

In FFmpeg, faststart is applied with the -movflags +faststart flag:

# Apply faststart to an existing MP4 (remux only, no re-encoding)
ffmpeg -i input.mp4 -c copy -movflags +faststart output.mp4

# Encode and apply faststart in one step
ffmpeg -i input.mp4 -c:v libx264 -c:a aac -movflags +faststart output.mp4

The -c copy flag in the first command tells FFmpeg to copy the streams without re-encoding — it's just reorganizing the atom structure, which is fast and does not reduce quality. This is something you should do to every MP4 file you serve on the web.

How to check if an MP4 has faststart: Run ffprobe -v quiet -print_format json -show_format input.mp4 and look at the output. Alternatively, open the file in a hex editor and check whether the moov atom appears before the mdat atom. In ExifTool, the "MPEG4 Item" or "Movie Atom" position is shown in the output.

Fragmented MP4 (fMP4) for Adaptive Streaming

Standard MP4 uses a single mdat box containing all media data, with a single moov box indexing it all. Fragmented MP4 (fMP4) breaks the file into small segments, each with its own moof (Movie Fragment) and mdat pair. This structure is used by HLS (HTTP Live Streaming) and DASH (Dynamic Adaptive Streaming over HTTP) — the adaptive streaming protocols used by Netflix, YouTube, Hulu, and every major streaming service. Each fragment can be independently decoded without the rest of the file, enabling adaptive bitrate switching and live streaming.

4. Video Codecs Inside MP4

The codec inside an MP4 determines the actual video quality, compression ratio, encoding speed, and device compatibility. The .mp4 extension tells you almost nothing about the quality — a 4K H.265 MP4 and a 480p H.264 MP4 are both .mp4 files.

H.264 / AVC (Most Common)

H.264 (also known as AVC, Advanced Video Coding, or MPEG-4 Part 10) is the overwhelmingly dominant video codec in 2026, used by approximately 91% of video websites according to usage data. It was finalized by the ITU-T and ISO/IEC in 2003 and has been refined through numerous subsequent amendments. H.264 achieves excellent quality at manageable file sizes and, critically, has hardware decoding support on every device sold in the last decade — from iPhones to Android phones to smart TVs to game consoles to laptops.

H.264's success is the result of the right combination of compression efficiency, hardware support, and institutional adoption arriving at the same time. YouTube adopted it in 2010. Apple made it the default for iPhone recording in 2009. The HTML5 video standard supports it natively. If you need a video file to play on any device without any software installation, H.264 in an MP4 container is your choice.

H.264 profiles control the tradeoff between compression efficiency and decoder complexity. The most important for web use are Baseline (lowest complexity, older devices), Main (general use), and High (best compression, modern devices). For web delivery, High Profile at Level 4.1 or 5.1 is the standard target.

H.265 / HEVC (40% Smaller, Growing Support)

H.265 (HEVC, High Efficiency Video Coding) was approved in 2013 and achieves approximately 40–50% better compression than H.264 at equivalent perceptual quality — meaning a 4K H.265 stream at 10 Mbps looks roughly equivalent to a 4K H.264 stream at 18–20 Mbps. This reduction is why Netflix and Apple TV+ use H.265 for 4K HDR content: it makes 4K streaming practical on consumer internet connections.

The obstacle to universal H.265 adoption is licensing. H.265 involves multiple patent pools administered by different organizations (MPEG LA, HEVC Advance, and Velos Media), and the licensing costs and complexity have caused some major platforms and browser vendors to hesitate. Safari supports H.265 playback in MP4 containers. Chrome and Firefox have limited H.265 support, depending on platform hardware decoding capabilities. For controlled environments (Apple devices, smart TVs with hardware HEVC, Windows with HEVC codec installed), H.265 in MP4 is an excellent choice. For maximum web browser compatibility, H.264 remains safer.

AV1 (Royalty-Free, Best Compression)

AV1 was released in 2018 by the Alliance for Open Media — a consortium including Google, Mozilla, Netflix, Microsoft, Amazon, Apple, and Intel. It achieves approximately 30% better compression than H.265 (and ~50% better than H.264) and is completely royalty-free, eliminating the patent licensing concerns that complicated H.265 adoption.

AV1 can be stored in both WebM containers (Google's open container format) and MP4 containers. For streaming platforms, AV1 in MP4 is the preferred delivery format. Netflix, YouTube, Vimeo, and Facebook all stream AV1. Chrome, Firefox, and Edge support AV1 decoding in software and via hardware decoders on modern GPUs. Safari added AV1 support in macOS 13 and iOS 16 for devices with hardware AV1 decoding. The main limitation is encoding speed — AV1 encoding is computationally intensive and can be 10–50× slower than H.264 encoding at comparable quality settings, making it impractical for real-time encoding and slow for large batch conversions.

VP9 in MP4 (Uncommon)

VP9 is Google's predecessor to AV1, achieving better compression than H.264 and released in 2013. It is most commonly used in WebM containers (YouTube served VP9 in WebM extensively before transitioning to AV1), but VP9 can technically be stored in MP4 containers. In practice, VP9 in MP4 is uncommon — if you're using VP9, you're probably using WebM. For MP4, H.264, H.265, or AV1 are the practical choices.

H.266 / VVC (Emerging in 2026)

H.266 (VVC, Versatile Video Coding) was finalized by the ITU-T in 2020 and represents the next generation after H.265. It achieves approximately 50% better compression than H.265 — meaning a 4K H.266 stream at 5 Mbps would match the visual quality of a 4K H.265 stream at 10 Mbps. Hardware encoder and decoder chips with H.266 support began appearing in 2024–2025. MP4 containers support H.266 via the vvc1 codec tag. Consumer adoption is in its earliest stages as of 2026.

Codec Year Compression vs H.264 Browser Support Royalty-Free Best For
H.264 / AVC 2003 Baseline Universal No (MPEG LA) Maximum compatibility
H.265 / HEVC 2013 ~40% smaller Partial No (multiple pools) 4K, Apple devices, storage
AV1 2018 ~50% smaller Good, growing Yes Streaming platforms, web
VP9 2013 ~20% smaller Good Yes WebM (rarely used in MP4)
H.266 / VVC 2020 ~70% smaller Emerging No Future streaming, 8K

5. Audio Codecs Inside MP4

The audio codec inside an MP4 is just as important as the video codec — bad audio quality is often more immediately noticeable to viewers than mild video compression artifacts. MP4 supports a wide range of audio codecs, each with different compression efficiency, quality characteristics, and device support.

AAC (Default — Successor to MP3)

AAC (Advanced Audio Coding, MPEG-4 Part 3) is the default audio codec for MP4 and the most widely supported. It was designed as the successor to MP3 — achieving the same perceived audio quality at roughly half the bitrate. At 128 kbps, AAC sounds noticeably better than MP3 at 128 kbps. At 192 kbps AAC, most listeners cannot distinguish the result from CD-quality audio in blind tests.

AAC in MP4 is universal: supported on every iPhone, Android device, Mac, Windows PC, browser, smart TV, and streaming platform. When in doubt, use AAC. The two most common profiles are AAC-LC (Low Complexity — the most compatible, used by iTunes, iPhone recordings, and most streaming) and HE-AAC (High Efficiency — uses spectral band replication for better quality at very low bitrates, used in radio streaming and podcasts below 64 kbps).

AC-3 / Dolby Digital (Broadcast and Blu-ray)

AC-3 (Dolby Digital) and its successor E-AC-3 (Dolby Digital Plus) are professional audio codecs supporting up to 7.1 surround sound channels. They are the standard audio codecs for Blu-ray, broadcast television (ATSC, DVB), and streaming services delivering surround sound. Netflix, Amazon Prime, and Apple TV+ use Dolby Digital Plus (E-AC-3) in MP4 containers for surround sound tracks. AC-3 is supported by most Blu-ray players, streaming devices, and smart TVs, though some older Android devices require an app like VLC for AC-3 playback.

MP3 in MP4 (Legacy)

MP3 (MPEG-1 Audio Layer III) can technically be stored in MP4 containers, though this is increasingly rare. Earlier versions of certain encoders and some legacy workflows produced MP4 files with MP3 audio rather than AAC. MP3 in MP4 is playable by most modern players including VLC and FFmpeg, but some devices and platforms have limited support. If you encounter an MP4 with MP3 audio and need broad compatibility, converting the audio stream to AAC (without re-encoding the video) with FFmpeg is straightforward and lossless for the video stream.

FLAC (Lossless Audio in MP4)

FLAC (Free Lossless Audio Codec) can be stored in MP4 containers, providing completely lossless audio alongside video. This is used in archival and audiophile contexts where bit-perfect audio reproduction is required alongside the video. An MP4 file with H.264 video and FLAC audio is larger than the same file with AAC audio, but the audio is indistinguishable from the original uncompressed PCM source in a blind test. VLC, FFmpeg, and most media players support FLAC in MP4, though consumer streaming platforms transcode FLAC to AAC for delivery. To learn more about lossless audio formats, see our lossless audio formats guide.

Opus (Modern Streaming Audio)

Opus is an open, royalty-free audio codec standardized by the IETF (RFC 6716) and developed by the Xiph.Org Foundation and Mozilla. It achieves better audio quality than AAC at the same bitrate, particularly at low bitrates (below 64 kbps). Opus in MP4 is supported by Chrome, Firefox, and Edge, and is increasingly used for web-based video communication (WebRTC calls) and web audio. For broad consumer device compatibility however, AAC remains the safer choice for MP4 audio.

6. MP4 vs MKV vs MOV vs AVI vs WebM

All five of these are container formats, and each has a distinct position in the ecosystem based on its codec support, adoption, and intended use cases. The choice of container rarely affects video quality — what matters most is the codec inside — but it significantly affects compatibility and workflow.

Format Container Type Best Codec Browser Support Streaming Editing Support Use Case
MP4 ISO BMFF (ISO 14496-14) H.264, H.265, AV1 Universal Excellent (faststart, fMP4) Universal Web, mobile, sharing, streaming
MKV Matroska (open) H.264, H.265, AV1, VP9 Partial (Firefox, Chrome) Moderate Good (most NLEs) Archiving, Blu-ray rips, home theater
MOV QuickTime (Apple) H.264, H.265, ProRes Good (Safari, Chrome) Moderate Excellent (Apple ecosystem) Apple editing workflows, Final Cut Pro
AVI RIFF (Microsoft, 1992) H.264 (via Xvid, DivX) Poor Poor Good (legacy) Legacy Windows, old camcorders
WebM Matroska subset (Google) VP8, VP9, AV1 Good (Chrome, Firefox) Excellent Moderate Web-native video (Chrome/Firefox)

MP4 vs MKV

This is the most common comparison because both formats support the same major codecs (H.264, H.265, AV1) and both are used for high-quality video. The key differences are:

  • Compatibility: MP4 plays natively on virtually every device. MKV requires VLC or a codec pack on many Windows systems, and is not natively supported in Safari or many smart TVs without a media player app.
  • Codec support: MKV supports a wider range of audio and subtitle formats, including PGS (Blu-ray image-based subtitles), TrueHD/Atmos audio, and dozens of exotic codecs. MP4's codec support is narrower but covers 99% of use cases.
  • Archiving: For ripping Blu-rays or archiving media with multiple audio tracks, multiple subtitle tracks, and chapter markers, MKV is often preferred due to its flexibility.
  • Web use: MP4 is unambiguously better for web streaming. MKV has no faststart equivalent and weaker browser support.

MP4 vs MOV

MP4 and MOV share the same underlying ISO Base Media File Format — structurally they are very similar. In most cases, an MOV file can be remuxed to MP4 (or vice versa) without re-encoding the video or audio, simply by rewriting the container wrapper. The key differences are practical rather than technical:

  • MOV supports Apple ProRes, a high-quality intermediate codec used in professional editing workflows. ProRes in MP4 is technically possible but uncommon.
  • MOV files are the native output of DJI drones, iPhones recording ProRes or HEVC, and Final Cut Pro timelines. They open natively in Final Cut Pro and other Apple software.
  • For web delivery, MOV files should be remuxed to MP4 — browser support for MOV (particularly in non-Safari browsers) is less reliable.

MP4 vs AVI

AVI (Audio Video Interleave) was the dominant Windows video format from 1992 through the early 2000s. It uses the RIFF container structure (the same structure WebP images use) and is fundamentally limited in its ability to handle modern codecs and streaming. AVI does not support variable frame rate video natively, has poor seeking performance, and lacks any streaming optimization equivalent to faststart. The only reason to use AVI today is if you are working with legacy camcorder footage or very old software that does not accept anything else. Converting AVI to MP4 is one of the most common video conversion tasks — and one of the most straightforward.

MP4 vs WebM

WebM is Google's open container format, a subset of Matroska, designed for web video. It supports VP8, VP9, and AV1 video codecs with Vorbis and Opus audio. WebM is natively supported in Chrome and Firefox but has limited support in Safari and many consumer devices. MP4 has far broader native support. For a converter page comparing these formats, see our MP4 to WebM converter. The practical rule: serve MP4 with H.264 as your primary web video, optionally serving WebM (VP9 or AV1) to Chrome/Firefox users who would benefit from the better compression. Our video format guide covers this in more detail.

7. Where MP4 Is Used

MP4's ubiquity is no accident — it genuinely fits a wider range of use cases than any alternative format. Here is where it dominates in 2026:

📱

Smartphones

iPhone, Samsung Galaxy, Google Pixel — every major smartphone records video as H.264 or H.265 inside an MP4 container. iOS uses HEVC (H.265) for storage efficiency on devices with hardware HEVC encoding. Android manufacturers vary by device but MP4 is universal.

🎬

Streaming Services

Netflix, YouTube, TikTok, Instagram, Vimeo, and Disney+ all deliver video in MP4 (or fragmented MP4) containers. YouTube uses H.264 MP4 as the baseline with AV1 MP4 for supported clients. Netflix uses H.265 MP4 for 4K HDR content.

✂️

Video Editing

Adobe Premiere Pro, DaVinci Resolve, Final Cut Pro, and CapCut all accept MP4 as a primary input format. H.264 MP4 is the most common format for bringing footage into an edit, and H.264 or H.265 MP4 is the standard export format for web delivery from all major NLEs.

🎮

Game Cutscenes

Unity and Unreal Engine both use H.264 MP4 for in-game video and cinematics. The VP8/VP9 in WebM is also used in some engines, but MP4/H.264 is the primary format for prerendered cutscenes on console and PC due to hardware decoder availability.

🖥️

Screencasting

OBS Studio, Loom, Camtasia, and Screenflow all default to H.264 MP4 output. It is the standard format for tutorial videos, product demos, and software walkthroughs because of its universal compatibility and reasonable file sizes.

📡

Broadcast

ATSC 3.0 (the NextGen TV broadcast standard in the United States) uses MPEG-4 transport streams with H.265 video. Digital terrestrial TV in much of the world (DVB-T2) also uses H.264 and H.265. The MP4 container is a cornerstone of modern broadcast infrastructure.

MP4's position is self-reinforcing: because every device supports it, software defaults to it; because software defaults to it, every user produces MP4 files; because every user produces MP4 files, every platform supports it. Breaking out of this feedback loop requires a compelling technical advantage — as AV1 is demonstrating — plus royalty-free status to avoid the patent barriers that slowed H.265.

8. How to Extract Audio From MP4

One of the most common MP4 file operations is extracting the audio track — pulling out the AAC audio from an MP4 and saving it as an MP3 file. This is done for podcasts, backing tracks, YouTube music, recorded lectures, and countless other use cases. The good news: it's fast, free, and no quality is lost in the video stream (since you're discarding it).

Option 1: Convertlo Free Browser-Based Converter

Our free MP4 to MP3 converter uses FFmpeg.wasm to extract the audio directly in your browser. No files are uploaded to any server. No account is required. No waiting for a queue.

1
Open the converter

Go to convertlo.pro/mp4-to-mp3. The page loads FFmpeg.wasm — a WebAssembly build of FFmpeg that runs entirely in your browser.

2
Drop your MP4 file

Drag and drop your MP4 file onto the upload zone, or click to browse. Files are processed locally — they never leave your device.

3
Convert and download

The audio track is extracted and converted to MP3. Download the result. For large files, processing may take 30–60 seconds depending on your device speed.

For a detailed walkthrough of the process including quality settings, see our step-by-step MP4 to MP3 conversion guide. If you want to understand the MP3 format itself, our MP3 format guide covers the history and compression internals.

Option 2: FFmpeg (Command Line)

For batch processing or when you need precise control over output quality, FFmpeg is the standard tool. The fastest approach is a direct stream copy — copying the AAC audio stream from the MP4 and remuxing it into an MP3 container without any re-encoding. However, because MP3 containers cannot hold AAC, you typically re-encode to MP3 (libmp3lame):

# Extract audio and convert to MP3 (re-encodes to MP3)
ffmpeg -i input.mp4 -vn -c:a libmp3lame -q:a 2 output.mp3

# -vn = no video (drop the video stream)
# -c:a libmp3lame = encode audio as MP3
# -q:a 2 = VBR quality 2 (~190 kbps average — high quality)

# Extract as M4A (AAC in MP4 container — preserves full quality)
ffmpeg -i input.mp4 -vn -c:a copy output.m4a

# Batch convert all MP4 files in a directory to MP3
for f in *.mp4; do
  ffmpeg -i "$f" -vn -c:a libmp3lame -q:a 2 "${f%.mp4}.mp3"
done

The -c:a copy command for .m4a output is a true lossless extraction — no re-encoding occurs, so no quality is lost. For MP3 output, some re-encoding is necessary because MP3 and AAC use different compression algorithms.

Extract Audio From Your MP4 Files Free

Drag in any MP4 — our browser-based converter uses FFmpeg.wasm to extract the audio track and save it as an MP3. No upload, no sign-up, no waiting.

9. How to Open MP4 Files

A standard MP4 file with H.264 video and AAC audio will open with no additional software on virtually every modern operating system and browser. Here's what to use on each platform:

Windows

Windows Media Player and the Movies & TV app (also called Films & TV in some regions) on Windows 10 and 11 both support H.264 MP4 natively. For H.265/HEVC MP4, you need the HEVC Video Extensions codec from the Microsoft Store (free from some OEM installs, or $0.99 for direct install). VLC Media Player (free, open-source) supports virtually every MP4 variant without any additional codecs.

macOS

QuickTime Player on macOS supports H.264 and H.265 MP4 natively. For AV1 MP4, the IINA media player is a popular free alternative with full AV1 support. iMovie and Final Cut Pro accept MP4 directly.

iOS / iPadOS

The built-in Photos app and Files app both open MP4 files. The Files app can preview MP4 inline. The VLC app is available on iOS for any codec that QuickTime does not support natively.

Android

The default Gallery or Photos app on Android devices opens MP4 files. For H.265 MP4, hardware support varies by device — most flagships from 2016 onward have hardware HEVC decoding. VLC for Android handles software decoding for any codec.

Web Browsers

All major browsers (Chrome, Firefox, Safari, Edge, Opera) support H.264 MP4 playback in the HTML5 <video> element. For H.265/HEVC, Safari on macOS and iOS has native support; Chrome and Edge support it when hardware decoding is available; Firefox has limited support. AV1 is supported in Chrome, Firefox, and Edge; Safari supports it on hardware with AV1 decoders.

When an MP4 Won't Open

If an MP4 file won't open or plays only audio (no video), the most common causes are:

  • The video codec is not supported by the player (e.g., H.265 without the HEVC codec on Windows)
  • The audio codec is unusual (e.g., DTS or TrueHD in an MP4)
  • The file is corrupted or incomplete (common with incomplete downloads)
  • The file is actually in a different format with an .mp4 extension (some cameras mislabel .mov files)

Installing VLC Media Player resolves the first two issues on any desktop platform. For corrupted files, FFmpeg's -err_detect ignore_err flag can sometimes recover playable content from a damaged MP4.

Checking what's inside an MP4: To see exactly which video and audio codecs an MP4 contains, right-click the file in VLC and choose "Media Information", or run ffprobe -v quiet -print_format json -show_streams input.mp4 in a terminal. The codec_name field in the output shows the exact codec for each stream.

10. Frequently Asked Questions

What is an MP4 file?
An MP4 file is a digital container format standardized as ISO 14496-14 (MPEG-4 Part 14). It stores video, audio, subtitles, chapter markers, and metadata in a single file with the .mp4 extension and the MIME type video/mp4. MP4 is not itself a codec — it is a wrapper that can contain many different video codecs (H.264, H.265, AV1, VP9) and audio codecs (AAC, AC-3, FLAC, Opus). The internal structure is based on hierarchical boxes including ftyp, moov, and mdat.
Is MP4 a video or audio format?
MP4 is primarily known as a video container format, but it can contain audio-only streams as well. When an MP4 contains only an audio track — typically AAC audio — it is often renamed with the .m4a extension. iPhone voice memos, Apple Music downloads, and many podcast exports use .m4a files, which are technically MP4 containers with audio-only content. The full .mp4 extension is used when at least one video stream is present.
What is the difference between MP4 and H.264?
MP4 is a container format and H.264 is a video codec — they are fundamentally different things that work together. MP4 is the box that holds the data; H.264 (also called AVC, Advanced Video Coding) is the algorithm that compresses the video stream stored inside that box. Most MP4 files use H.264 video, but an MP4 can also contain H.265/HEVC, AV1, VP9, or other video codecs. Think of it this way: H.264 is the language the video is written in; MP4 is the book it is printed in.
Why is MP4 the most popular video format?
MP4 became the dominant video format because of a combination of factors: it is backed by the ISO standard (ISO 14496-14), the H.264 codec it usually contains achieves excellent quality at manageable file sizes, and it is supported natively on virtually every device and platform — Windows, macOS, iOS, Android, browsers, smart TVs, game consoles, and streaming services. The iPhone 3GS recording video in MP4/H.264 in 2009 was a key inflection point that drove consumer adoption. By 2010, YouTube had adopted H.264/MP4 as its primary format, cementing its dominance. The format's success is self-reinforcing: because every device supports it, software defaults to it; because software defaults to it, every platform must support it.
Can MP4 contain lossless audio?
Yes. MP4 containers support FLAC (Free Lossless Audio Codec) as an audio stream, which is completely lossless. FLAC in MP4 is supported by VLC, FFmpeg, and most modern players. However, mainstream streaming services and consumer devices predominantly use AAC, which is lossy. For archival or audiophile use cases requiring bit-perfect audio alongside video, Matroska (MKV) is often preferred over MP4 because it has broader codec support and is more commonly used in that context. See our lossless audio formats guide for more on FLAC vs WAV vs MP3.
What is faststart in MP4?
Faststart (also called "web optimization" or "moov atom relocation") is an MP4 file structure where the moov atom — which contains the index of all streams, their locations, and their timing — is placed at the beginning of the file rather than the end. By default, most video encoders write the moov atom at the end of the file because it can only be finalized once encoding is complete. For web streaming, a file with moov at the end cannot begin playback until the entire file is downloaded. Moving moov to the front allows browsers and players to begin buffering and playing immediately. In FFmpeg, this is achieved with the -movflags +faststart flag.
Is MP4 the same as MPEG-4?
No, though they are related. MPEG-4 is a broad ISO standard (ISO/IEC 14496) with 33 parts covering video compression, audio compression, scene description, and more. MP4 specifically refers to MPEG-4 Part 14, which defines the file format (the container). So MP4 is a specific part of the MPEG-4 standard, not the entire standard. The H.264 video codec is MPEG-4 Part 10, and AAC audio is MPEG-4 Part 3 — both commonly found inside MP4 containers, but separate specifications from the container format itself.
Can I play MP4 files on any device?
An MP4 file with H.264 video and AAC audio will play on virtually any modern device without installing additional software: iPhone, Android phones and tablets, Windows 10/11 (built-in), macOS, all major browsers (Chrome, Firefox, Safari, Edge), smart TVs, PlayStation, Xbox, and most media players. Compatibility can break down if the MP4 contains a less common codec — for example, an MP4 with H.265/HEVC video requires hardware HEVC decoding support, which is absent on some older Android devices and some older Windows PCs without the HEVC Video Extensions codec.
How do I convert an MP4 to MP3?
The easiest way is to use our free MP4 to MP3 converter at convertlo.pro/mp4-to-mp3.html. It uses FFmpeg.wasm to extract the audio track directly in your browser — no upload required, no sign-up, and it works on any device. Drop the MP4 file onto the converter and download the resulting MP3. For a detailed walkthrough, see our step-by-step conversion guide.
What is the best codec to use inside MP4?
For maximum compatibility in 2026, H.264 (AVC) with AAC audio is the best choice — it plays on 99%+ of devices without issues. For smaller file sizes with good compatibility (modern devices and browsers), H.265/HEVC with AAC is excellent — files are 35–40% smaller than H.264 at equivalent quality. For streaming platforms and progressive use cases, AV1 in MP4 is increasingly popular because it is royalty-free and achieves 30% better compression than H.265, though encoding is much slower and older devices lack hardware decoding support.
Is H.265 MP4 better than H.264 MP4?
H.265 (HEVC) produces files that are approximately 35–50% smaller than H.264 at equivalent visual quality, making it significantly more efficient for storage and streaming. However, H.265 has licensing costs for encoders and decoders, which has slowed adoption on some platforms. H.265 MP4 also requires hardware decoding support that is absent on many older Android devices and PCs without updated codec support. For end-to-end compatibility across all devices, H.264 MP4 remains the safer choice. For storage efficiency when you control the full playback environment — such as Apple devices or modern smart TVs — H.265 is considerably better.
What is the difference between MP4 and MKV?
Both MP4 and MKV (Matroska) are container formats that can hold H.264, H.265, AV1, and other codecs. The main differences: MKV supports a wider range of codecs and subtitle formats (including PGS/Blu-ray subtitles, multiple audio tracks, chapters); MP4 has far broader native device and browser support. MKV is the preferred container for media archiving, Blu-ray rips, and anime distribution because of its flexibility. MP4 is preferred for streaming, sharing, and compatibility with consumer devices and platforms. If you have an MKV file you want to play on a device or upload to a platform that does not support MKV, converting it to MP4 (usually just remuxing without re-encoding) solves the compatibility problem.
✍️
Convertlo Editorial Team
We write practical guides on video formats, file conversion, and media workflows. All tools mentioned are browser-based and free to use.
About Convertlo →