How to Convert MP3 to MP4 (Add Album Art for YouTube, TikTok & Instagram)

MP3 MP4 YouTube FFmpeg TikTok Album Art
MP3 to MP4 — combining audio with a static image
Free Toolbrowser-based, no install
5 methodsFFmpeg, Kapwing, iMovie, CapCut + browser
8–15 MBtypical 4-minute output
6 platformswith correct image dimensions

You've finished a track, a podcast episode, or an ambient soundscape, and you want to upload it to YouTube. The problem: YouTube doesn't accept MP3 files. Neither does TikTok or Instagram. All of them require video. The workaround — used by millions of musicians, podcasters, and content creators — is to combine your audio with a single static image to produce an MP4 video file. Every frame of that "video" is the same image. The conversion takes a few minutes and the resulting file is surprisingly small.

1. TL;DR — What This Actually Does

1
image + audio = MP4 video
0%
audio quality loss (if copied)
8–15 MB
typical 4-min output with FFmpeg
free
FFmpeg, iMovie, CapCut all free

The short answer: "Converting MP3 to MP4" means combining your audio file with a static image to produce a video file that platforms like YouTube and TikTok can accept. The audio is unchanged; the video track is simply your image displayed for the full duration of the audio. The best free method is FFmpeg: ffmpeg -loop 1 -i cover.jpg -i audio.mp3 -c:v libx264 -tune stillimage -c:a aac -b:a 192k -pix_fmt yuv420p -shortest output.mp4

New: Convertlo now has a free browser-based MP3 to MP4 converter — drop your audio + cover image and get an MP4 instantly, no install. This guide also covers FFmpeg, Kapwing, iMovie, and CapCut for more control. To go the other direction, use Convertlo's MP4 to MP3 converter.

Quick Reference — MP3 to MP4 Conversion

Fastest (no install): Convertlo's browser-based converter — drop audio + image, done.

Best quality (offline): FFmpeg: ffmpeg -loop 1 -i cover.jpg -i audio.mp3 -c:v libx264 -tune stillimage -c:a aac -b:a 192k -pix_fmt yuv420p -shortest output.mp4

Best for mobile: CapCut (iOS/Android) — free, no watermark, TikTok-ready vertical output.

Best for Mac: iMovie — free, pre-installed, no command line needed.

What it does: Wraps your audio in a video container with a static image as the visual track. The image loops for the full audio duration. Output is a standard H.264/AAC MP4 file accepted by YouTube, TikTok, Instagram, and all video platforms.

2. Why Video Platforms Require a Video File

YouTube, TikTok, Instagram, Facebook, and most social platforms are built around video playback infrastructure. Their upload pipelines transcode everything to video formats optimized for streaming. They don't have audio-only playback paths on the main feed — audio is always attached to a video track, even if that track never changes.

The practical consequence: a 320kbps MP3 can't be uploaded to YouTube directly. But an MP4 where every frame is your album cover art — even though that "video" never moves — is accepted immediately, transcoded, and available to stream globally within minutes. Artists like Radiohead, The Beatles (remastered), and countless independent musicians have millions of YouTube plays on exactly this format: a static image with audio.

3. Image Dimensions by Platform

The image you use determines the video resolution. Use the wrong dimensions and you get letterboxing, pillarboxing, or cropped content depending on how the platform handles aspect ratio mismatches.

Platform Dimensions Aspect Ratio Notes
YouTube Standard 1920×1080 16:9 Best for most music/podcast uploads
YouTube Shorts 1080×1920 9:16 Under 60 seconds; vertical format
TikTok 1080×1920 9:16 Vertical only; square crops weirdly
Instagram Feed (Square) 1080×1080 1:1 Album art is usually square — ideal
Instagram Reels 1080×1920 9:16 Same as TikTok vertical
Facebook Video 1280×720 min 16:9 1920×1080 recommended
Twitter / X 1280×720 min 16:9 or 1:1 Max 512 MB, 2 min 20 sec

The blurred background trick: If your album art is square (1:1) and you need 16:9 for YouTube, place the square art centred on a 1920×1080 canvas with a blurred, darkened version of the same image filling the letterbox areas. This looks intentional rather than like a technical compromise. Canva and Photoshop both have this as a simple template.

4. How to Prepare Your Image

The image quality determines the output video quality. A few things to get right before conversion:

  • Resolution: Export your image at 1920×1080 (or whatever the target dimensions are) at minimum. Higher is better — up to 3840×2160 for 4K. FFmpeg scales down to your target dimensions if needed, but it can't add detail that wasn't there.
  • Format: Use PNG for artwork with text, logos, or sharp edges. JPG for photographic images. Both work fine with FFmpeg.
  • Text in the image: Add song title, artist name, and album name as text in the image before converting — you can't easily edit the "video" after conversion.
  • Even dimensions: Make sure width and height are even numbers (divisible by 2). H.264 requires this. Odd dimensions will throw an error unless you add FFmpeg's scale filter.

5. Method 1: FFmpeg (Best Quality, Free)

FFmpeg is a free, open-source tool that runs on Windows, macOS, and Linux. For MP3-to-MP4 conversion it produces the highest quality output at the smallest file size, and it runs completely offline — nothing is uploaded anywhere.

Installation

  • macOS: brew install ffmpeg
  • Linux: sudo apt install ffmpeg or sudo dnf install ffmpeg
  • Windows: Download from ffmpeg.org, or winget install ffmpeg in PowerShell

The Standard Command

ffmpeg -loop 1 -i cover.jpg -i audio.mp3 -c:v libx264 -tune stillimage -c:a aac -b:a 192k -pix_fmt yuv420p -shortest output.mp4

Each flag explained:

  • -loop 1 -i cover.jpg — loops the image file infinitely (it's a single frame, so "looping" just holds it)
  • -i audio.mp3 — the audio input
  • -c:v libx264 — encode video with H.264 (universally supported)
  • -tune stillimage — tells H.264 to optimise for static frames; dramatically reduces file size by not encoding changes that don't exist
  • -c:a aac -b:a 192k — re-encode audio to AAC at 192kbps (required for broad compatibility with MP4 players)
  • -pix_fmt yuv420p — required for compatibility with iOS, QuickTime, and older players
  • -shortest — stop when the audio ends (without this, the image loop would continue indefinitely)

With Odd-Dimension Images

If your image has odd-numbered dimensions, add the scale filter:

ffmpeg -loop 1 -i cover.jpg -i audio.mp3 -c:v libx264 -tune stillimage -c:a aac -b:a 192k -pix_fmt yuv420p -vf "scale=trunc(iw/2)*2:trunc(ih/2)*2" -shortest output.mp4

Expected Output Size

A 4-minute song with the -tune stillimage flag is typically 8–15 MB for a 1920×1080 output. Without that flag, the same file might be 50–100 MB. The stillimage tune tells H.264 that nearly all information is in the first keyframe and subsequent frames have zero changes — the encoder can then represent those frames with almost no data.

Free Browser-Based Converters — No Install Required

Both directions available on Convertlo — combine audio + image into MP4, or extract audio from any video. 100% browser-based, nothing uploaded.

6. Method 2: Kapwing (Browser-Based, No Install)

Kapwing is a browser-based video editor with a free tier. It requires no software install and works on any device with a browser, making it the best option if you're not comfortable with command-line tools.

1
Create a new project

Go to Kapwing.com → click Start Editing → Create New Project. Set canvas size to 1920×1080 for YouTube, or 1080×1080 for Instagram.

2
Add your background image

Click Add Media → upload your image. Drag it to fill the canvas. This becomes your background video track.

3
Add your MP3

Click Add Media again → upload your MP3. It appears as an audio layer. Trim or adjust timing if needed.

4
Extend the image clip

Drag the right edge of the image clip to match the audio length. Both tracks should end at the same time.

5
Export as MP4

Click Export Project → choose MP4 at 1080p. Free tier adds a small Kapwing watermark in the corner. Kapwing Pro removes it (~$16/month).

Privacy note: Kapwing uploads your files to their servers for processing. For unreleased music or private content, use FFmpeg locally instead — your files never leave your computer.

7. Method 3: iMovie (Mac and iPhone, Free)

iMovie is Apple's free video editor, pre-installed on macOS and available for free on iOS. It's slower than FFmpeg for this task but requires no command-line knowledge.

On Mac

  1. Open iMovie → File → New Movie
  2. Import your background image (File → Import Media)
  3. Drag the image to the timeline. Right-click it → adjust the clip duration to match your audio length (e.g. 3:47 for a 3 minute 47 second song)
  4. Import your MP3 and drag it to the audio track below the timeline
  5. File → Share → File → set format to MP4, quality to Best (ProRes exports to a different format)

On iPhone / iPad

  1. Open iMovie → Create Project → Movie
  2. Select your image from Photos
  3. Tap the image in the timeline → use the Ken Burns tool to disable movement (set start and end frames the same)
  4. Tap the plus icon → Audio → My Music to add your MP3
  5. Trim if needed → tap Share → Save Video

iMovie outputs at a fixed quality that is good enough for most YouTube uploads. File sizes are larger than FFmpeg output because iMovie doesn't use the stillimage optimization.

8. Method 4: CapCut (Mobile, Best for TikTok & Reels)

CapCut is a free mobile video editor made by ByteDance (same company as TikTok). It's designed specifically for short-form vertical video and is the easiest option for creating TikTok and Instagram Reels audio posts.

  1. Open CapCut → New Project → select your image
  2. Set canvas ratio to 9:16 (tap the ratio icon in the toolbar)
  3. Tap Audio → Sounds → add your MP3 from your music library or import it
  4. Trim the video clip to match the audio length
  5. Export → 1080p. CapCut's free export includes no watermark for personal use.

CapCut's free tier is genuinely useful — no watermark on standard exports, unlike Kapwing. It also has built-in text templates, lyric caption tools, and equalizer visualizer effects if you want more than a static image.

9. Audio Quality: Copy vs Re-encode

This is a technical detail that matters if you care about audio fidelity:

Approach FFmpeg flag Quality Compatibility Best for
Copy MP3 stream -c:a copy Identical to source Some players don't support MP3-in-MP4 Personal archiving
Re-encode to AAC 192k -c:a aac -b:a 192k Perceptually identical Universal YouTube, social media uploads
Re-encode to AAC 320k -c:a aac -b:a 320k Slightly larger, marginal benefit Universal Archiving high-quality audio

For YouTube and social media uploads, re-encoding to AAC 192kbps is the right choice. AAC is the standard audio codec for MP4 containers, and every platform's player handles it natively. YouTube's own encoding pipeline will re-encode your audio again on upload anyway — at that point the bitrate of your source file matters very little as long as you're not going below 128kbps.

10. Making a YouTube-Ready Music Upload

These are the details that separate a professional-looking YouTube music upload from an amateur one — none of them require additional software.

The Image

Export your album art at 3000×3000px (streaming services require this resolution for cover art). Scale it to 1920×1080 for YouTube by placing it centred on a matching background. Adding the song title, artist name, and release year as text in the image is optional but improves discoverability in thumbnails.

The Custom Thumbnail

Upload your 1280×720 album art as a custom YouTube thumbnail. This image shows in search results and suggested video previews — it's seen far more than the video itself. A well-designed thumbnail significantly affects click-through rate.

Metadata

YouTube's music upload workflow (available via YouTube Studio) lets you designate videos as music content, link them to a song record, and add artist attribution. This connects your upload to YouTube Music and makes it discoverable through the music tab.

Description Template

A minimal but effective description: song title → artist → album → release year → streaming links → lyrics (if you have them). YouTube's automatic chapter detection reads timestamps in descriptions — useful for longer recordings like albums or mixes.

11. Adding a Waveform or Audio Visualizer

Static image + audio is the simplest approach. A waveform or animated equalizer visualizer is more engaging but requires more work. Here are the realistic free options:

VEED.io

Browser-based. Has a free audio visualizer template — upload MP3, choose a visualizer style, export. Free tier adds a watermark.

Kdenlive

Free, open-source desktop editor. Has an Audio Spectrum visualizer effect. Complex to set up but fully free and offline.

Adobe Express

Has audio visualizer templates. Adobe account required; free tier available with limited exports.

CapCut (mobile)

Free visualizer effects in the audio panel. Works for short clips. No watermark on personal exports.

FFmpeg + Python

MoviePy library can generate waveform overlays programmatically. Full control, no watermarks, steep learning curve.

Canva Video

Canva has animated audio visualizer templates. Free tier limited; Pro unlocks all templates.

12. The Reverse: Extract MP3 from MP4

If you have a video file and need to extract just the audio as an MP3, the process is simpler than creating a video — it's just demuxing the audio stream without re-encoding.

With FFmpeg:

ffmpeg -i input.mp4 -q:a 0 -map a output.mp3

The -q:a 0 flag uses the highest VBR quality setting; -map a selects only the audio stream and ignores the video track. This runs nearly instantly and produces zero quality loss from the source audio.

Or use Convertlo's free browser-based MP4 to MP3 converter — drag in a video, click convert, download the audio. Nothing is uploaded to a server.

13. Frequently Asked Questions

What does converting MP3 to MP4 actually do?
It creates a video file where every frame is a static image, with your MP3 audio playing behind it. The output is an MP4 container holding both a video track (your image, repeated for the audio duration) and an audio track (your MP3, usually re-encoded to AAC for compatibility). YouTube, TikTok, and Instagram accept this because they require video files — they don't support audio-only uploads to the main feed.
What image dimensions should I use for YouTube?
1920×1080 (16:9) for standard YouTube videos. 1080×1920 (9:16) for YouTube Shorts. The image resolution sets the video resolution, so use the highest quality version of your image available. Make sure width and height are even numbers — H.264 requires this.
Does converting MP3 to MP4 reduce audio quality?
If you copy the audio stream without re-encoding (-c:a copy in FFmpeg), quality is bit-for-bit identical. If you re-encode to AAC at 192kbps (-c:a aac -b:a 192k), the difference is perceptually inaudible in blind listening tests. Avoid re-encoding to a lower bitrate than your source MP3 — that causes audible quality loss.
What is the best free tool to convert MP3 to MP4?
FFmpeg (command-line) gives the smallest file sizes and highest quality, is completely free, and runs offline. Kapwing is the best browser-based option if you don't want to use a terminal. iMovie (Mac/iOS) is free and works well for one-off conversions. CapCut (mobile) is best for creating TikTok and Instagram Reels content.
Why is my output MP4 file so large?
You're probably missing the -tune stillimage flag in FFmpeg. Without it, H.264 wastes data encoding inter-frame differences that don't exist in a static image. With -tune stillimage, a 4-minute video is typically 8–15 MB. Without it, the same video can be 50–100 MB. Also verify you're using libx264 and not a lossless codec.
Can I convert MP3 to MP4 on iPhone for free?
Yes. iMovie (free, pre-installed on newer iPhones): create a Movie project, add your image, add your MP3 as audio, match durations, export. CapCut (free): even simpler, and designed specifically for vertical content. Both apps export MP4 with no watermark for personal use.
Does Convertlo convert MP3 to MP4?
Yes — Convertlo now has a free browser-based MP3 to MP4 converter. Drop your audio file and an optional cover image and it creates the MP4 in your browser using FFmpeg.wasm — no install, no upload. For more advanced workflows (custom visualizers, chapter markers, batch encoding), FFmpeg, Kapwing, iMovie, and CapCut are covered in this guide.
Can I add a waveform animation instead of a static image?
Yes, but it requires more complex tools. VEED.io (browser-based) has free audio visualizer templates. CapCut on mobile has built-in visualizer effects. For full control, Kdenlive (free desktop editor) has an Audio Spectrum effect, and Python's MoviePy library can generate waveform overlays programmatically.
How long does FFmpeg take to convert MP3 to MP4?
With -tune stillimage, FFmpeg processes static image video extremely fast — a 4-minute song typically converts in under 30 seconds on any modern computer. The stillimage tune reduces the encoding workload dramatically because there's almost nothing to encode between frames. The audio re-encoding is the only significant CPU work.
How do I upload a song to YouTube without a music video?
The standard approach: combine your audio file with a static image (album art, a logo, or any background) to create an MP4 video. YouTube requires a video file — it doesn't accept MP3 uploads directly. Use FFmpeg's ffmpeg -loop 1 -i cover.jpg -i audio.mp3 -c:v libx264 -tune stillimage -c:a aac -b:a 192k -pix_fmt yuv420p -shortest output.mp4 for the best result. Or use Convertlo's free browser-based MP3 to MP4 converter if you want to skip the command line entirely.
What FFmpeg command converts MP3 to MP4 with a static image?
The standard FFmpeg command to create an MP4 from an MP3 and a cover image is: ffmpeg -loop 1 -i cover.jpg -i audio.mp3 -c:v libx264 -tune stillimage -c:a aac -b:a 192k -pix_fmt yuv420p -shortest output.mp4. Key flags: -loop 1 holds the image for the video duration; -tune stillimage dramatically reduces file size by optimizing for static frames; -shortest stops the video when the audio ends. A 4-minute song outputs to roughly 8–15 MB.

YouTube's requirement for video files has been a constant since the platform launched. The static-image MP4 workaround is two decades old and still the most straightforward way to put audio on the platform. If you want to skip the command line entirely, Convertlo's MP3 to MP4 converter does it directly in your browser — drop audio + image, click convert, done. To go the other direction and extract audio from a video, Convertlo's MP4 to MP3 converter handles that too.

Convertlo Editorial Team
We test file conversion tools, formats, and workflows so you don't have to. All guides are written from hands-on testing with real files.
More articles →