Step 1:Split the video into 3 segments

Intro (0–30 s)

Bash
ffmpeg -i input.mp4 -t 30 -c copy intro.mp4

Speech (30 s–end-30 s ≈ 5340 s)

Bash
ffmpeg -i input.mp4 -ss 30 -t 5340 -c copy speech.mp4

Outro (last 30 s)

Bash
ffmpeg -i input.mp4 -ss 5370 -c copy outro.mp4

Step 2: Encode each segment with proper audio

Intro (stereo music, 360p)

Bash
ffmpeg -i intro.mp4 -c:v libx265 -preset medium -crf 28 -c:a aac -b:a 96k -ac 2 -vf "scale=640:360" intro_enc.mp4

Speech (mono, 360p)

Bash
ffmpeg -i speech.mp4 -c:v libx265 -preset medium -crf 28 -c:a aac -b:a 96k -ac 1 -vf "scale=640:360" speech_enc.mp4

Outro (stereo music, 360p)

Bash
ffmpeg -i outro.mp4 -c:v libx265 -preset medium -crf 28 -c:a aac -b:a 96k -ac 2 -vf "scale=640:360" outro_enc.mp4

Step 3: Concatenate all 3 segments into final video

Create a text file concat_list.txt with:

file 'intro_enc.mp4'
file 'speech_enc.mp4'
file 'outro_enc.mp4'

Run FFmpeg to merge:

Bash
ffmpeg -f concat -safe 0 -i concat_list.txt -c copy final_podcast.mp4

Set a title in metadata

Bash
ffmpeg -i final_podcast.mp4 -metadata title="My Podcast Episode 1" -c copy final_podcast_meta.mp4

Copyright info in FFmpeg