You say you demux the audio, but what do you do with the video? Do you remux the audio into some other container along with the video? Or do you play the video from the original MPEG container while playing the audio externally? I'm asking because this could be a problem related to the source filters you're using for playback.