What sort of visuals?
I overlay videos on my podcast instead of ever bothering just with ffmpeg.
Clipping is more annoying as the audio- point is never as clean as you want, looking at a few different solutions to this but will probably home brew something