Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion.

Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules.

Domains: forum.doom9.org / forum.doom9.net / forum.doom9.se

 

Go Back   Doom9's Forum > General > Subtitles

Reply
 
Thread Tools Display Modes
Old 10th January 2026, 03:50   #2141  |  Link
TR-9970X
Registered User
 
TR-9970X's Avatar
 
Join Date: Jan 2025
Posts: 233
Quote:
Originally Posted by VoodooFX View Post
I don't need srt, just write the times where something is missing.
No need to touch audio.
Hello again,

OK, done more testing, now it's over to you to see if anything else can be done.

I have added the audio, a couple of .srt's (they're small for comparison), and the error msg that Waveform pops up...

And a read me.txt, so please read thru that, saves me adding it here.

https://www.mediafire.com/file/yahfa...folder.7z/file
__________________
Main Systems:-
9970X on Gigabyte TRX50 AERO D
7970X on Asus Pro WS TRX50-Sage WiFi
9950X3D on MSI Carbon X670E
7950X on Gigabyte Aorus Elite B650
i9-13900KF on MSI Tomahawk B660
TR-9970X is offline   Reply With Quote
Old 10th January 2026, 12:41   #2142  |  Link
VoodooFX
Video damager
 
VoodooFX's Avatar
 
Join Date: Sep 2008
Posts: 1,270
Quote:
Originally Posted by TR-9970X View Post
Those missing phrases/words were not identified as voice by vad, so there was no attempt to transcribe those areas.

Here is srt produced with Pro version: https://pastecode.io/s/nds33xaw

Code:
faster-whisper-xxl.exe boot.flac -l en -m large-v2 --vad_method pyannote_v3 -o source --standard --max_gap 1 -hst 2 -ct float16 --ff_vocal_extract mb-roformer --realign


BTW, that CPP srt is weird, 00:00:00-00:00:30 is not in audio, timestamps are off by up to dozens seconds.

Last edited by VoodooFX; 10th January 2026 at 12:47.
VoodooFX is offline   Reply With Quote
Old 10th January 2026, 13:04   #2143  |  Link
TR-9970X
Registered User
 
TR-9970X's Avatar
 
Join Date: Jan 2025
Posts: 233
Quote:
Originally Posted by VoodooFX View Post
Those missing phrases/words were not identified as voice by vad, so there was no attempt to transcribe those areas.

That's interesting, I'm sure that other models did pickup some of that "missing" audio/text.

I will compare them tomorrow.


Here is srt produced with Pro version: https://pastecode.io/s/nds33xaw

Code:
faster-whisper-xxl.exe boot.flac -l en -m large-v2 --vad_method pyannote_v3 -o source --standard --max_gap 1 -hst 2 -ct float16 --ff_vocal_extract mb-roformer --realign
I should be able to use some of this command line in non Pro, there's only a little bit more than the previous.

BTW, that CPP srt is weird, 00:00:00-00:00:30 is not in audio, timestamps are off by up to dozens seconds.

Well, I'm glad that I sent them too
From the quick look I've had of the subs you sent, it looks like it grabbed a LOT of extra dialogue

BTW, did you bother to check if my files opened Waveform ??
__________________
Main Systems:-
9970X on Gigabyte TRX50 AERO D
7970X on Asus Pro WS TRX50-Sage WiFi
9950X3D on MSI Carbon X670E
7950X on Gigabyte Aorus Elite B650
i9-13900KF on MSI Tomahawk B660
TR-9970X is offline   Reply With Quote
Old 10th January 2026, 13:55   #2144  |  Link
VoodooFX
Video damager
 
VoodooFX's Avatar
 
Join Date: Sep 2008
Posts: 1,270
Quote:
Originally Posted by TR-9970X View Post
That's interesting, I'm sure that other models did pickup some of that "missing" audio/text.
Not "did pickup", as they don't try to pickup anything. They never checked that it's actually a speech what they are trying to transcribe. That's why you get there those hallucinations when there is no speech at all.


Quote:
Originally Posted by TR-9970X View Post
I should be able to use some of this command line in non Pro
You wont be able to use that as those are only Pro features.

Quote:
Originally Posted by TR-9970X View Post
From the quick look I've had of the subs you sent, it looks like it grabbed a LOT of extra dialogue
Yes, because cpp is weird, instead of transcribing stuff, cpp outputs "(speaking in foreign language)" bs.

Quote:
Originally Posted by TR-9970X View Post
BTW, did you bother to check if my files opened Waveform ??
Yes, opened after I named your files properly.
VoodooFX is offline   Reply With Quote
Old 10th January 2026, 14:29   #2145  |  Link
TR-9970X
Registered User
 
TR-9970X's Avatar
 
Join Date: Jan 2025
Posts: 233
Quote:
Originally Posted by VoodooFX View Post
Not "did pickup", as they don't try to pickup anything. They never checked that it's actually a speech what they are trying to transcribe. That's why you get there those hallucinations when there is no speech at all.




You wont be able to use that as those are only Pro features.

I used this :-
Code:
--vad_method pyannote_v3 -o source --standard --max_gap 1 --ff_sync -hst 2 -ct float16
and all I can see that's different is this :-
Code:
--ff_vocal_extract mb-roformer
Yes, because cpp is weird, instead of transcribing stuff, cpp outputs "(speaking in foreign language)" bs.



Yes, opened after I named your files properly.

OK, so what did you rename them to, so I can check if it's my different version of FFMPEG that is the problem. I'm guessin' just a really simple name, then.
So one question about Pro version, is it a standalone command line, or can it be used within SE ??

Thanks
__________________
Main Systems:-
9970X on Gigabyte TRX50 AERO D
7970X on Asus Pro WS TRX50-Sage WiFi
9950X3D on MSI Carbon X670E
7950X on Gigabyte Aorus Elite B650
i9-13900KF on MSI Tomahawk B660
TR-9970X is offline   Reply With Quote
Old 10th January 2026, 14:40   #2146  |  Link
VoodooFX
Video damager
 
VoodooFX's Avatar
 
Join Date: Sep 2008
Posts: 1,270
Quote:
Originally Posted by TR-9970X View Post
OK, so what did you rename them to, so I can check if it's my different version of FFMPEG that is the problem. I'm guessin' just a really simple name, then.
To the same name as audio. BTW, I use older SE version - 4.0.12.


Quote:
Originally Posted by TR-9970X View Post
So one question about Pro version, is it a standalone command line, or can it be used within SE ??
Yes to both.
VoodooFX is offline   Reply With Quote
Old 15th January 2026, 00:27   #2147  |  Link
markfilipak
Registered User
 
markfilipak's Avatar
 
Join Date: Jul 2016
Location: America
Posts: 467
Audio Sync

Above this message there are a lot of pleas for help regarding audio sync and loss of sync and clipped audio. I'm sorry that I don't have time to respond but I hope what I write here will help.

ISOM containers like MP4 and MOV can contain edit lists. I believe that's also true of MKV but I don't use MKV. If you edit audio without recoding it, that will likely result in an edit list being generated and attached to the AV. If you then audition the AV without recoding the audio, and if there is an edit list, then the player/utility may or may not appear to have synchronized audio depending upon whether it honors edit lists and how well it honors edit lists. You can detect and view edit lists via FFprobe, but it's complicated.

SE's waveform rendering -- specifically, the rendered pictures of the timing -- seems to have some issues with edit lists. What I do is recode the audio in an MP4 so that there's no edit lists. The audio must be cleaned of edit lists. For example, don't just 'FFmpeg -ss...' or 'FFmpeg -to...' or 'FFmpeg --itsoffset...' or 'FFmpeg -itsscale:a...' without recoding the audio -- for the final AV and any temporary AV that you're going to use in SE.

If I'm going to be working on the audio extensively, I recode it to frac so that it's a separate .FRAC stream and then edit to my heart's content. If I need to see it in an AV, I make a temporary AV just to do the viewing. Then, in the final MP4, I recode the frac to whatever audio I want. By keeping the frac audio separate, it stays pristine and the final MP4 has no edit lists. All that takes planning and it is tedious to make temporary AVs just to see some result I need to see, but it's worth it in the end.

All that I've written above is in addition to some SE waveform rendering issues that appear to be unrelated to edit lists. I'm still trying to figure it out.

I've written extensively about edit lists during the past month and my explorations are here in the SE forum. I promised to clean up my documentation and I reiterate that promise. But that has to wait until the time when I've figured it all out.

I hope my experiences help you. In the mean time I'm positioning subtitles based on the audio that MPV plays, not on the positions of the waveforms that SE renders.

Last edited by markfilipak; 15th January 2026 at 00:30.
markfilipak is offline   Reply With Quote
Old 17th January 2026, 18:37   #2148  |  Link
jay123210599
Registered User
 
Join Date: Apr 2024
Posts: 517
Quote:
Originally Posted by Janusz View Post
Let me put it this way... Something hasn't been fully implemented in the keyboard shortcuts in SubtitleEdit and its window for importing *.sub subtitles for editing. A workaround is to use [Video engine]=MPC-MC as the "Video player" and use the video player's shortcuts. For example, CTRL+Right Arrow moves the video forward by one frame, and CTRL+Left Arrow moves the video back by one frame. The spacebar starts video playback. These are the default settings for this video player, and you don't need to change anything. For this to work, in both cases, you need to click on the video window to make it active.

You can delete your settings in SubtitleEdit's "Settings/Shortcuts/Video" menu.
Quote:
Originally Posted by jay123210599 View Post
It worked, but whenever I move a frame forward or backward, this doesn't move along with the video. It always remained the same. How do I fix that?
@Janusz Did you get this? How do I solve this problem?
jay123210599 is online now   Reply With Quote
Old 22nd January 2026, 02:09   #2149  |  Link
TR-9970X
Registered User
 
TR-9970X's Avatar
 
Join Date: Jan 2025
Posts: 233
@ VoodooFX,

Me again, I have a very short clip (35 seconds) that is in German or French, and it's singing (sort of), and I just can't get all the words.

Any suggestions ??, could I maybe send it to you, to have a play ??

Regards
__________________
Main Systems:-
9970X on Gigabyte TRX50 AERO D
7970X on Asus Pro WS TRX50-Sage WiFi
9950X3D on MSI Carbon X670E
7950X on Gigabyte Aorus Elite B650
i9-13900KF on MSI Tomahawk B660
TR-9970X is offline   Reply With Quote
Old 22nd January 2026, 07:29   #2150  |  Link
VoodooFX
Video damager
 
VoodooFX's Avatar
 
Join Date: Sep 2008
Posts: 1,270
Quote:
Originally Posted by TR-9970X View Post
could I maybe send it to you, to have a play ??
Sure.
To stop offtopic, continue in the right thread: https://forum.doom9.org/showthread.php?t=184859
VoodooFX is offline   Reply With Quote
Old 22nd January 2026, 08:42   #2151  |  Link
TR-9970X
Registered User
 
TR-9970X's Avatar
 
Join Date: Jan 2025
Posts: 233
Quote:
Originally Posted by VoodooFX View Post
Sure.
To stop offtopic, continue in the right thread: https://forum.doom9.org/showthread.php?t=184859
Sure, no worries, I was not aware of "your" thread.

Thanks, will send it off shortly.
__________________
Main Systems:-
9970X on Gigabyte TRX50 AERO D
7970X on Asus Pro WS TRX50-Sage WiFi
9950X3D on MSI Carbon X670E
7950X on Gigabyte Aorus Elite B650
i9-13900KF on MSI Tomahawk B660
TR-9970X is offline   Reply With Quote
Old 20th February 2026, 21:08   #2152  |  Link
markfilipak
Registered User
 
markfilipak's Avatar
 
Join Date: Jul 2016
Location: America
Posts: 467
Audio Sync

I have some solid information.

I found a situation that 'shows' speech (waveform) on one side of a scene change but 'says' speech (MPV) on the other side of the scene change.

The hh:mm:ss.mmm of the scene change is 100% correct.

If I put the subtitle where the speech is shown (waveform), it's wrong (external MPV). If I put the subtitle where the speech is said (internal MPV), it's right (external MPV).

Background:
- Waveform was extracted and drawn by internal FFmpeg.
- What waveform shows gets earlier and earlier.
- I had captured the audio via MKVToolNix.
- I had to use -itsscale:a in (external) FFmpeg to sync the audio.
- There is no edit list in the resulting audio stream.

_No_ edit list, yet waveform is being drawn by internal FFmpeg as though there _is_ an edit list that internal FFmpeg is _ignoring_. That would mean that there actually is an edit list which internal MPV, external MPV, and external FFmpeg are all honoring but internal FFmpeg (waveform) is not.

But FFreport shows no edit list. I am totally baffled. The 'error' in waveform is too specific and too accurate and too persistent (video-to-video-to-video) to be accidental. Yet FFreport shows _no_ edit list. ...Is there an edit list or isn't there an edit list? Hell, I don't know anymore.

I need someone to help, to work with me. Nik is busy re-inventing the wheel.

Last edited by markfilipak; 20th February 2026 at 21:12.
markfilipak is offline   Reply With Quote
Old 28th February 2026, 18:29   #2153  |  Link
Nikse555
Registered User
 
Join Date: Feb 2004
Location: Mars
Posts: 438
Subtitle Edit 5 beta-1 is now out
https://github.com/SubtitleEdit/subtitleedit/releases
If you're on mac, be sure to follow the installation instructions: https://github.com/SubtitleEdit/subt...-ov-file#macos

SE 4 was getting increasingly hard to maintain — especially on Linux. SE 5 has rewritten UI in Avalonia UI and works much better on Linux now. Works on mac as well (cross platform with single codebase).
Note that SE 4 will still be maintained for a while, and you can have both SE 4 and 5 installed (use portable versions).

Please give the beta a spin and let me know if you run into trouble.
Note: SE 5 requires Win10 or newer.

Yeah, re-inventing the wheel surely takes a lot of time
Nikse555 is offline   Reply With Quote
Old 5th March 2026, 04:57   #2154  |  Link
jay123210599
Registered User
 
Join Date: Apr 2024
Posts: 517
Quote:
Originally Posted by Nikse555 View Post
Subtitle Edit 5 beta-1 is now out
https://github.com/SubtitleEdit/subtitleedit/releases
If you're on mac, be sure to follow the installation instructions: https://github.com/SubtitleEdit/subt...-ov-file#macos

SE 4 was getting increasingly hard to maintain — especially on Linux. SE 5 has rewritten UI in Avalonia UI and works much better on Linux now. Works on mac as well (cross platform with single codebase).
Note that SE 4 will still be maintained for a while, and you can have both SE 4 and 5 installed (use portable versions).

Please give the beta a spin and let me know if you run into trouble.
Note: SE 5 requires Win10 or newer.

Yeah, re-inventing the wheel surely takes a lot of time
Can it solve this problem?

Quote:
Originally Posted by Janusz View Post
Let me put it this way... Something hasn't been fully implemented in the keyboard shortcuts in SubtitleEdit and its window for importing *.sub subtitles for editing. A workaround is to use [Video engine]=MPC-MC as the "Video player" and use the video player's shortcuts. For example, CTRL+Right Arrow moves the video forward by one frame, and CTRL+Left Arrow moves the video back by one frame. The spacebar starts video playback. These are the default settings for this video player, and you don't need to change anything. For this to work, in both cases, you need to click on the video window to make it active.

You can delete your settings in SubtitleEdit's "Settings/Shortcuts/Video" menu.
Quote:
Originally Posted by jay123210599 View Post
It worked, but whenever I move a frame forward or backward, this doesn't move along with the video. It always remained the same. How do I fix that?
jay123210599 is online now   Reply With Quote
Old 5th March 2026, 20:38   #2155  |  Link
Nikse555
Registered User
 
Join Date: Feb 2004
Location: Mars
Posts: 438
Quote:
Originally Posted by jay123210599 View Post
Can it solve this problem?
I'm not really sure what you mean, but the window with edit of image based subtitles does support the video movement shortcuts of the main window (in SE 5 beta 3+).
Nikse555 is offline   Reply With Quote
Old Yesterday, 22:12   #2156  |  Link
jay123210599
Registered User
 
Join Date: Apr 2024
Posts: 517
Quote:
Originally Posted by Nikse555 View Post
I'm not really sure what you mean, but the window with edit of image based subtitles does support the video movement shortcuts of the main window (in SE 5 beta 3+).
The beta does move the video and time when doing frame-by-frame when editing PGS subtitles, but it doesn't move the subtitles alongside them when I selected an image. It also doesn't show the end times for the subtitles and the PGS subtitle edit menu doesn't show the video in this format: 00:00:00:00
jay123210599 is online now   Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 19:34.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2026, vBulletin Solutions Inc.