Log in

View Full Version : Subtitle Edit


Pages : 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 [33] 34 35 36 37 38 39 40 41 42 43 44

Atlantis
27th July 2022, 14:01
For sync via other subtitle, is it possible to do every line automatically in the same language? For example you have an English subtitle rapidly done with lots of errors but perfect timecodes and another English subtitles that is correct but not synchronized. Unfortunately sometimes it is not possible to do it with 20-30 points and you have to do as many points as possible but doing 1600 points by hand is hard.

Atlantis
31st July 2022, 21:27
We need 2 things.

1- First, access to bookmarks. There is no menu, no icon to access bookmarks. Only if you create shortcuts. We need access by mouse and right clicking on a bookmark doesn't count because sometimes you are not on one.

2- In Remove Text for HI, in found results, when right clicking we need an option of either jumping to that line so we can edit it or adding a bookmark to it so we can edit later.

Atlantis
1st August 2022, 23:17
I have major problems with combined words with a -
For example good-looking is OCRed as good. looking

And it's very hard to detect these afterwards. Using nOCR. I don't understand because the line - is long enough not to be confused with . dot.

von Suppé
2nd August 2022, 11:28
For example good-looking is OCRed as good. looking

And it's very hard to detect these afterwards.

I'm surprised that a Words spellcheck doesn't pop-up with these. However, in the "Fix Common Errors" window there is a function called "Start with uppercase letter after period inside paragraph" that should at least detect (not repair, of course) these occasions. You can manually edit the found lines directly in their text-boxes.
It isn't ideal, of course. Preventing during OCR is the way to go.

BTW to run just one of FCE functions, I often find myself wishing for bang-on-ready buttons next to them. Which would instantly run FCE with only that function enabled and without clearing/altering the current selections. IMO FCE-checking in multiple instances often proves valuable.

Maybe when Nikse would be asked kindly? licking

von Suppé
2nd August 2022, 13:08
@Nikse

MPV mediaplayer is able to playback Dolby Vision profile 5 with corrected colors. This specific profile uses a DV proprietary colorspace; on regular players they show purple & green.
To achieve the proper colors, with portable and installed versions it's possible to edit a "mpv.conf" file and alter output driver by adding "vo = gpu-next". Which would trigger libplacebo algorithms kicking in and correct - to an extend - P5 colors.

I've been busy trying to get this to work with latest mpv extracted in SE's roaming folder. I didn't succeed. I don't see any mpv.conf file in SE roaming folder, and adding one didn't help either. Do you know if/where it's possible to edit mpv's output configuration?

At Settings --> Videoplayer, I did notice "--vo=" (greyed out, next to the "Download mpv lib" button). Which is hopeful to see...

Atlantis
2nd August 2022, 19:43
I agree, with Fix Common Errors we need better handling. Saved presets. You have to fix errors in several passes and not all at once.
Having be able to save the selection into different presets helps a lot.

To detect errors, now I do OCR 2 times with 2 different methods. Once Binary Image and once nOCR. Then I compare the 2 results and I catch most of the problems. It gives a 99% accurate OCR at the end but there is still a very small amount that is impossible to detect. They are mostly dots . " :
: goes to ..
". to ...

which bugs me a lot! I have seen already OCRed subtitles that are 100% accurate! Even the dots. I don't know how they do it!

loninapleton
11th August 2022, 21:34
Have others actively used the audio/video to text for projects Subtitle Edit?

Atlantis
12th August 2022, 23:56
I don't even understand your question!

von Suppé
13th August 2022, 10:05
Me too. Maybe loninapleton is referring to something like speech-recognition (something similar to OCR, but then for sound)?

mhe
13th August 2022, 10:12
Is it possible to get text centered on the screen but aligned left, so if there is two lines they are aligned left.
This is a common way to subtitle in sweden i have noticed, and i like it, because that's how you read virtually everything other than subtitles.

Example:
https://imgur.com/SOsdbQr

von Suppé
13th August 2022, 11:29
I know it's possible when exporting as image-based formats like SUP, XML/PNG and other.
In the export window you can set "Align" to "Center, left justify" for all or dialogue.

mhe
14th August 2022, 07:06
I know it's possible when exporting as image-based formats like SUP, XML/PNG and other.
In the export window you can set "Align" to "Center, left justify" for all or dialogue.

I see, that's too bad, i don't think I'm ready to start using image subtitles.

On a completely different note, i try to use "adjust durations" via recalculate with batch convert, but every time it forgets that i ticked it, is it a bug or by design?
Also, there seems to be no way to do that from CLI either, so no great way to really automate a general purpose cleanup and adjust durations, unless I'm missing something.
I basically just want to run a command and have otherwise workable subtitles get slightly better, especially adding some time to subtitles if there is room for it.

Atlantis
14th August 2022, 21:03
Alignment tags are definitely possible in text based srt subtitles.

Top-left: {\an7} Top-center: {\an8} Top-right: {\an9}
Middle-left: {\an4} Middle-center: {\an5} Middle-right: {\an6}
Bottom-left: {\an1} Bottom-right: {\an3}

mhe
15th August 2022, 06:15
Alignment tags are definitely possible in text based srt subtitles.

Top-left: {\an7} Top-center: {\an8} Top-right: {\an9}
Middle-left: {\an4} Middle-center: {\an5} Middle-right: {\an6}
Bottom-left: {\an1} Bottom-right: {\an3}

That's not what i meant, look at the screenshot, the text as a block is centered, but if there are multiple lines each line starts from the same position, left aligned.

von Suppé
15th August 2022, 09:12
For simple textbased subs I doubt if it could be done. Even if, I reckon not all players are compatible.
Have you explored the possibilities of ASS format?

mhe
15th August 2022, 10:03
I know you can put text on certain coordinates with ASS, i think that would work badly with different resolutions.
The more i think of it, this would probably be best to have as a setting in the subtitle filter of the player.
Turns out someone else asked the same question on stackexchange
https://video.stackexchange.com/questions/33795/ass-subtitle-different-alignment-at-block-and-inner-level

Atlantis
15th August 2022, 10:08
That's not what i meant, look at the screenshot, the text as a block is centered, but if there are multiple lines each line starts from the same position, left aligned.
As you can see yourself it says Attachments Pending Approval. We cannot see your screenshot to understand. You have to provide a proper link and screenshot that is outside of the forum.

von Suppé
16th August 2022, 14:53
I know you can put text on certain coordinates with ASS, i think that would work badly with different resolutions.
The more i think of it, this would probably be best to have as a setting in the subtitle filter of the player.
Screensize is metadata that's carried in the ASS file.
If left/right justifying would be possible in the format, it's worth it to give it a try. Maybe your player is "ASS metadata aware" and smart enough to scale size and positioning coordinates according to the set output resolution.

I would sooner doubt if there's a player with the ability to justify plain text. But, never say never.

mhe
18th August 2022, 07:52
I looked through the spec of ASS and fiddled around with subtitle edit, don't think this exist at all. You can place any individual line anywhere on screen, but you can't get a two line subtitle to be both center bottom and within it's own block be left aligned.
Even if you break out a double line entry into singles, then try to align them with ASS position feature, they will be aligned differently if they are of different lengths, because they use the center of that text as the positioning point, so you would have to manually offset that, fun times.
That would be an insane way to solve it anyway, it's not that important to me.

tormento
18th August 2022, 12:52
Subtitle OCR badly needs to split screen at least in two parts to properly recognize contemporary subtitles with different timings on upper and lower part.

It's a must for proper anime subtitles.

von Suppé
18th August 2022, 15:57
Even if you break out a double line entry into singles, then try to align them with ASS position feature, they will be aligned differently if they are of different lengths, because they use the center of that text as the positioning point, so you would have to manually offset that, fun times.
As X coordinates count from the left, imagine the fun increasing for right justifying :D

Atlantis
25th August 2022, 13:51
I'm trying to learn more how the dictionaries work. I'm completely baffled why I can't find one word for example.
plaît in French

I know the soft recognizes this. It proposes this when there is plait. But I can't find it in any dictionaries. Is it not stored in plain text?
I have used 2 software to search in the folder for text and they can't find it! Where is it?!

darksen
17th September 2022, 04:14
Please, add an option to auto set an adjusted encoding when saving any file.
Say you open File A but it's encoded as ANSI, you modify it then save it or you just save it, then SE should set the encoding to what we set in the options.

Nikse555
20th September 2022, 06:06
Please, add an option to auto set an adjusted encoding when saving any file.
Say you open File A but it's encoded as ANSI, you modify it then save it or you just save it, then SE should set the encoding to what we set in the options.

SE has a setting in "Settings.xml"... you can set "AutoConvertToUtf8" to true.

tormento
21st September 2022, 16:16
SE
Welcome back ;)

Please please please, do something to have proper OCR on both upper and lower parts of the screen, having different timings, such as in anime.

darksen
24th September 2022, 11:04
SE has a setting in "Settings.xml"... you can set "AutoConvertToUtf8" to true.

Thank you.

I want to report something, I'm using "Fix common errors" with Spanish, but for some reason words like "Atella" or "Amana" are "fixed" into "A tell a" and "A man a", can you look into this?

Nikse555
24th September 2022, 12:26
@tormento: thx. Detecting position would require ASSA format and a whole lot of coding...


@darksen: You can turn off the auto-word-split of unknown words in Options - Settings - Tools - Fix common errors - Use word split list.
If you're using a whole Spanish subtitle, then the words should probably be added to the user-word or names list (or use a better spell check dictionary).

darksen
24th September 2022, 22:51
@tormento: thx. Detecting position would require ASSA format and a whole lot of coding...


@darksen: You can turn off the auto-word-split of unknown words in Options - Settings - Tools - Fix common errors - Use word split lit.
If you're using a whole Spanish subtitle, then the words should probably be added to the user-word or names list (or use a better spell check dictionary).

Aah, thanks, the reason is that I'm not using the built-in spell checker, I use the Word plugin.

Atlantis
27th September 2022, 10:58
Aah, thanks, the reason is that I'm not using the built-in spell checker, I use the Word plugin.
I didn't know there is a Word plugin. Is it better? How is it better?

tormento
27th September 2022, 16:03
@tormento: thx. Detecting position would require ASSA format and a whole lot of coding...
It would be more than enough an option to split the screen in upper and lower parts and ocr separately then combine sorting by time.

darksen
21st October 2022, 02:06
Ok, following my issues from before I'm having a problem now that SE is applying "fixes" to words that are inside the name list when using "Fix common errors", can this be disabled? I can't find an option in the configuration, also I thought that was a function of the spell check tool, not the OCR fixes. Words like "kung-fu" are getting "fixed" to "Kung-fu" everywhere, to give an example, there are several words that are used also as names in the Spanish language. Please, help.

Zetti
24th October 2022, 10:26
Thanks for new release:
https://github.com/SubtitleEdit/subtitleedit/releases/tag/3.6.8

VoodooFX
7th November 2022, 12:05
Download problem is back again [Win7 / v3.6.8]:

https://i.imgur.com/PHd3bed.png

Zetti
20th November 2022, 21:24
Thanks for new release:
https://github.com/SubtitleEdit/subtitleedit/releases/tag/3.6.9

VoodooFX
20th November 2022, 23:05
Downloads still doesn't work in v3.6.9.

Zetti
20th November 2022, 23:43
Try report it on github.

oniiz86
21st November 2022, 08:39
@Nikse555 Is there any chance of altering the thread's title to simply read Subtitle Edit as the version number is no longer updated it seems?

VoodooFX
21st November 2022, 08:56
Better is not to add the version numbers to the thread tiltles, because ~soon it will "freeze", if it's not frozen already.

Zetti
21st November 2022, 22:04
Thanks for new release:
https://github.com/SubtitleEdit/subtitleedit/releases/tag/3.6.10

Matt Kirby
5th January 2023, 18:04
Why isn't it possible to set linebreaks as shown in video preview of an ass -file "WrapStyle 0"

F.E. I have a long line and the preview in videoplayer shows it with line breaks, but if I want to create really this breaks with auto break the breaks are set on another position. Why does no tool (aegisub also) have a function to set line breaks as seen in the preview? (I want to save the subs as SRT and so I need hard breaks)

Nikse555
6th January 2023, 18:10
Why isn't it possible to set linebreaks as shown in video preview of an ass -file "WrapStyle 0"

IMO this is a poor choice.
Wrap style 0 gives way too long lines that are difficult to read and it also does not split lines at periods or commas which also improves readability.

You can get pretty close by this:
In Options - Settings - General - Rules: Set "Single line max length" to 80 (depending on font/style)
In Options - Settings - Tools - Auto break: keep only "Break by pixel width"

(When "Advanced Sub Station Alpha" was born and monitors/TVs were small, WrapStyle 0 was probably okay... around 1996...)

Matt Kirby
6th January 2023, 18:50
Ah ok, I understand you. My workflow is (when I translate subs) that I stretch the subs in X-Scale to 134 % at a 16:9 film. So an item becomes no more width as 4:3 in a display. After finished work I set the x-factor back to 100%. So I thought I could save the line breaks (I made in 134 % modus).

Nikse555
6th January 2023, 21:08
OK, zooming in temporary with wrap style 0 makes sense. Still, you can make better breaks by taking comma/periods into account.
Sorry, SE does not have a single-line-max-pixel-width which is probably what wrap style 0 uses.

Atlantis
15th January 2023, 19:29
I have OCRed and got an incorrect word LI. The problem is, Spell Check doesn't catch it. It's in French and LI is not in the Names List or User word list.
Where is it? I want to remove it. We need a search function that can find a word in any list.

Nikse555
23rd January 2023, 07:09
@Atlantis: Sorry, the French spell check dictionary allows this... perhaps you can find another French dictionary? (SE requires Hunspell/OpenOffice dictionaries. The .dic/.aff files needs to be placed in the "Dictionaries" folder)

Atlantis
27th January 2023, 00:28
I don't understand. The French spell check dictionary allows what exactly? Do you mean LI is a word in the dictionary?

Amstramgram
29th January 2023, 14:38
Hello everybody !
In 3.6 version, is there a way to permanently change default style for ssa subtitles ?
It was quite easy in 3.5 but I can't find how to do it in the new version.
Thank you in advance for your answers.

Nikse555
29th January 2023, 15:53
I don't understand. The French spell check dictionary allows what exactly? Do you mean LI is a word in the dictionary?

Yes, "LI" is accepted by the French spell check dictionary.

Nikse555
29th January 2023, 18:00
Hello everybody !
In 3.6 version, is there a way to permanently change default style for ssa subtitles ?
It was quite easy in 3.5 but I can't find how to do it in the new version.
Thank you in advance for your answers.

SE has more features for "Advanced Sub Station Alpha" (.ass). For ASSA you can set a default style in the "ASSA styles windows" which is the big red "A" in the toolbar when an ASSA file is open.
I think the default SSA style, just uses the default ASSA style...

Amstramgram
30th January 2023, 17:35
SE has more features for "Advanced Sub Station Alpha" (.ass). For ASSA you can set a default style in the "ASSA styles windows" which is the big red "A" in the toolbar when an ASSA file is open.
I think the default SSA style, just uses the default ASSA style...

Thanks for your answer !
It almost works !
New default settings registered for ass files are passed to ssa but with some random strange differences in margins (left margin passes from 10 to 0, right margin from 10 to 2) or alignment (from center to left) !!!.