View Full Version : Subtitle Edit
Zetti
3rd December 2018, 01:12
Hello.
Both version 3.5.6 and 3.5.7 wont auto translate with Google Translate.
The lastest beta can only translate about 37 lines with Google Translate but without API Key.
And the Microsoft Translator wont working at all without API Key.
I have tried to get a free API Key to both Google Translate and Microsoft Translator, but it seems to hard for me.
Nikse555
4th December 2018, 19:18
@DMD: Where is this font dialog?
@Zetti: I've tried to squeeze a few more lines through (by using larger chunks) in latest beta: https://github.com/SubtitleEdit/subtitleedit/releases/download/3.5.7/SubtitleEditBeta.zip
How many lines do you get now?
Zetti
4th December 2018, 21:04
I getting a full subtitle now.
Edit: The subtitle have 2099 lines.
kenzofabio
9th December 2018, 07:20
For everybody in using subtitle edit 3.57 i dont know what happen what this program.i just can use one translate for subtitle.and it said that api quota acess.thx
Zetti
9th December 2018, 15:15
@kenzofabio
Check this github link: https://github.com/SubtitleEdit/subtitleedit/issues/3209
orion44
17th December 2018, 17:41
Thanks to the developer for this excellent program.
A feature request: When exporting subtitles to VobSub (sub/idx) format, could you add the 'character spacing' option,
to set in pixels the space/distance between characters?
Preferably, under the "Line height" option.
It is a really needed option when setting the look of the subtitles, especially when viewing them on TV.
http://oi63.tinypic.com/19ovsw.jpg
Nikse555
20th December 2018, 17:26
@orion44: The default libraries with c#/winforms don't have any 'character spacing' options as far as I know... anybody got any ideas?
Also, SE 3.5.8 is out - Now again includes Tesseract 3.02 per default and an in-program option to download the new Tesseract 4 (Tesseract 4 is pretty slow and do not have italic detection - but it has more languages + works better for small difficult fonts).
Also the translate works somewhat again...
Link to release page on github: https://github.com/SubtitleEdit/subtitleedit/releases
FLX90
7th January 2019, 15:43
Thank you for the great program. I used it for many BluRays and it worked like a charm.
Now I have a strange problem with the german Oldboy (2003) BluRay
I extracted idx&sub and the srt output would be:
1
00:00:50,800 --> 00:00:51,005
Was soll das?
2
00:00:51,009 --> 00:00:51,214
Was soll das?
3
00:00:51,217 --> 00:00:52,378
Was soll das?
4
00:00:52,802 --> 00:00:53,007
Hast du nicht verstanden?
Ich will nur mit dir reden.
5
00:00:53,011 --> 00:00:53,216
Hast du nicht verstanden?
Ich will nur mit dir reden.
6
00:00:53,219 --> 00:00:53,424
Hast du nicht verstanden?
Ich will nur mit dir reden.
7
00:00:53,428 --> 00:00:53,633
Hast du nicht verstanden?
Ich will nur mit dir reden.
8
00:00:53,636 --> 00:00:53,841
Hast du nicht verstanden?
Ich will nur mit dir reden.
9
00:00:53,845 --> 00:00:54,050
Hast du nicht verstanden?
Ich will nur mit dir reden.
10
00:00:54,054 --> 00:00:54,259
Hast du nicht verstanden?
Ich will nur mit dir reden.
11
00:00:54,262 --> 00:00:54,467
Hast du nicht verstanden?
Ich will nur mit dir reden.
12
00:00:54,471 --> 00:00:54,676
Hast du nicht verstanden?
Ich will nur mit dir reden.
13
00:00:54,679 --> 00:00:54,884
Hast du nicht verstanden?
Ich will nur mit dir reden.
14
00:00:54,888 --> 00:00:55,093
Hast du nicht verstanden?
Ich will nur mit dir reden.
15
00:00:55,096 --> 00:00:55,301
Hast du nicht verstanden?
Ich will nur mit dir reden.
16
00:00:55,305 --> 00:00:55,510
Hast du nicht verstanden?
Ich will nur mit dir reden.
17
00:00:55,513 --> 00:00:55,718
Hast du nicht verstanden?
Ich will nur mit dir reden.
18
00:00:55,722 --> 00:00:55,927
Hast du nicht verstanden?
Ich will nur mit dir reden.
19
00:00:55,930 --> 00:00:56,135
Hast du nicht verstanden?
Ich will nur mit dir reden.
20
00:00:56,139 --> 00:00:56,344
Hast du nicht verstanden?
Ich will nur mit dir reden.
21
00:00:56,347 --> 00:00:56,552
Hast du nicht verstanden?
Ich will nur mit dir reden.
22
00:00:56,556 --> 00:00:57,671
Hast du nicht verstanden?
Ich will nur mit dir reden.
Is this a bug on the BluRay?
I think it would be the best to do something like this:
1
00:00:50,800 --> 00:00:52,378
Was soll das?
2
00:00:52,802 --> 00:00:57,671
Hast du nicht verstanden?
Ich will nur mit dir reden.
But the output has 38564 lines.
Would be kind of sisyphean challenge.
I had to write a script.
Is there another workaround?
sneaker_ger
7th January 2019, 15:51
Tools->Merge lines with same text
FLX90
7th January 2019, 21:44
Oh, don’t saw that.
Thank you.
Awesome tool.
FLX90
7th January 2019, 22:12
Currently I'm using Tesseract 4.00 with engine mode Tesseract + LSTM.
Unfortunately I have to do massive post work.
'I' is sometimes a 'L' ,'l' or '!'.
Music symbol isn't recognized right (it's a P or D).
And I don't know what I have overlooked.
Is there anything I could improve in my settings?
EDIT:
I'm using binary image compare now and creating new database for every BluRay.
Works.
johner23
14th January 2019, 02:04
Hello, dear all.
I got the newest version from Subtitle Edit and try to convert a subtitle format to other.
The original subtitle is *.vtt fomat. The output is to be *.srt format.
But after the convertion the subtitle in srt show some strange errors.
Eg:
00:01:04,562 --> 00:01:08,908
{\an3}>> O que mais lhe
interessa na história?
The symbols {\an3} and >> are not correct.
Do you know how to fix that?
I can make manual correction after the convertion to srt, but I would prefer that the program could do it at the first attempt, without any "errors" or strange "symbols" after the task.
Thanks for your time.
Best regards.
Hendi
27th January 2019, 19:12
Hi
If I convert from sub to srt on batch-mode (/convert) the resulting srt file contains all timeframes but only the first timeframe contains a text. If I do this with the GUI, the whole text is converted and all timeframes contains text.
What do I wrong?
Thanks
Hendi
Nikse555
31st January 2019, 07:57
@Hendi: That's a bug, sorry - should be fixed in latest beta: https://github.com/SubtitleEdit/subtitleedit/releases/download/3.5.8/SubtitleEditBeta.zip
Next final version should be out soon...
asarian
3rd February 2019, 01:55
Subtitle Edit 3.5 is now out
(...)
but SE can also [B]import and ocr vobsub and blu-ray image based subtitles (even from matroska/mp4 files), and DVB sub from .ts files
Subtitle Edit (the latest) actually immediately crashes for me when I import a .sup file ("Object reference not set to an instance of an object." and many other fatal errors).
Btw, why OCR-ing?! That's so 1985! :)
Nikse555
4th February 2019, 13:34
Subtitle Edit (the latest) actually immediately crashes for me when I import a .sup file ("Object reference not set to an instance of an object." and many other fatal errors).
Could you try latest beta?
https://github.com/SubtitleEdit/subtitleedit/releases/download/3.5.8/SubtitleEditBeta.zip
SE 3.5.9 should be out soon...
dngnt
10th February 2019, 08:20
SE 3.5.9 should be out soon...
I'm using SE to convert from bitmap Chinese/Japanese/Korean subs (sup, idx/sub, DVDsub).
There is a possibility to save one sub picture at a time (with right click) while importing , but could you add a function of saving all the sub pictures during OCR?
For example, just a new option in the "export" (during OCR): export all the pictures as *.png? (just the images, no xml because it would be more complicated with "dirty" idx/subs (i.e with errors).)
It's because after finishing the OCR and saving the OCRed srt, I have to correct the (many) mistakes "offline" , but there is no way to compare the OCRed text to the original pictures.
Thanks for your great work!
Nikse555
10th February 2019, 13:40
@dngnt: In the OCR window you can right click in the list view... and choose "Save all images with HTML index..." or "Export -> BDN xml/png"
Nikse555
10th February 2019, 15:37
And SE 3.5.9 is out: https://github.com/SubtitleEdit/subtitleedit/releases
Some of the changes (mostly related to OCR) listed below:
* NEW:
* Bookmarks - thx OmrSi/marb99
* Image export - option to have single lines top justified - thx joedmartin
* IMPROVED:
* Improve Binary OCR of comma / apostrophe - thx Tuukka
* Improve quote/italic detection in binary OCR - thx Miggu
* Add context menu to OCR spell check
* Binary OCR auto detect best DB - thx Mr. Rage
* FIXED:
* Fix missing/bad html tags after "Auto br" - thx iromafia111
* Fix crash in OCR window when closing - thx spetragl
* Fix OCR in batch convert - thx danstraughn
* Fix crash parsing empty word in OCR via Tesseract - thx Barry
varekai
10th February 2019, 18:58
@ Nikse555
Thanks for the update, much appreciated!
Looking in the portable folders I see Tesseract302.
Is that version also included in SubtitleEdit-3.5.9-Setup.exe?
Kind regards
Nikse555
10th February 2019, 19:41
@varekai: Yes, Tesseract302 is included in the installer version as well.
Tesseract 4 is available as an in-program download, but note that T4 does not support italic detection and is a lot slower, but T4 is available in more languages and may work better for small unclear fonts.
varekai
10th February 2019, 20:57
@ Nikse555
Great! I use italics and only swedish and english languages so Tesseract302 works perfectly for my needs! Thanks!
dngnt
11th February 2019, 22:16
@dngnt: In the OCR window you can right click in the list view... and choose "Save all images with HTML index..." or "Export -> BDN xml/png"
Thanks for your indications!
Subtitle Edit is the best for its import/export versatility for srt, and with these 2 export abilities for sup,idx/sub and DVDsub, it's truly the Swiss knife for subtitling!
johner23
25th February 2019, 23:34
Hi, dear all.
@Nikse555
Hi, Nikse.
When I tested some files *.vtt I still get some errors. Different ones, I mean. Please, can you check it?
---> https://github.com/SubtitleEdit/subtitleedit/issues/3290
---> https://github.com/SubtitleEdit/subtitleedit/files/2873430/VTT.-.ERRORS.-.SAMPLES.zip
I can fix it manually using text editor, but I think it will be better if your program could do it automatically for us.
Thanks for your time.
Best regards.
Nikse555
28th February 2019, 22:04
@johner23: thx for the files - latest beta handles this better: https://github.com/SubtitleEdit/subtitleedit/releases/download/3.5.9/SubtitleEditBeta.zip
SE adds/keeps the alignment though, so that why you'll see lines starting with e.g. "[\an8}". If you want to remove them, just select all lines in the list view (ctrl+a), right click, choose "Remove formatting -> Remove all formattings / Remove alignment".
locotus
1st March 2019, 00:23
Is there any way of sort lines by beginning capital or smal letters or
by lines beginning with small letters fallowing lines ending
with comma, space or other punctuation mark except period?
Thanks.
sneaker_ger
2nd March 2019, 11:36
Batch converting vobsub in mkv via cli /convert (lots of "Tesseract returned with code 1" messages, timestamps are there but lines are empty) as well as GUI Tools->Batch (lots of "Tesseract returned with code 1" messages, almost empty files) convert isn't working for me. I would also be nice to have a stream selector for mkv/mp4 input on the cli.
Nikse555
2nd March 2019, 16:38
@locotus: sorry, no - what exactly do you want to do?
@sneaker_ger: that should be fixed in last beta I hope - could you verify?
https://github.com/SubtitleEdit/subtitleedit/releases/download/3.5.9/SubtitleEditBeta.zip (track-number parameter also added to cmd line)
locotus
2nd March 2019, 17:15
@locotus: sorry, no - what exactly do you want to do?
My main purpuse is to join dialogs lines that are
divided but with duration time and number of characters
below maximun range for both.
Actually I'm trying to do that sorting first for duration,
looking for very short lines with characters number greater
than 15. But can't do that with lines with longer time and
characters.
Thanks anyhow.
sneaker_ger
2nd March 2019, 17:28
Thx. I think there's a small typo: /? says "/trac-number:<track number>" (missing "k")
It still showed "Tesseract returned with code 1" over CLI but I figured out why: in the GUI OCR method was set to "Binary image compare". I changed it to "Tesseract 3.02" and the messages disappeared, the text appeared. I didn't expect the GUI setting to influence the CLI. Also it's not obvious why I would get Tesseract error messages if Tesseract isn't selected.
If you drag&drop mkv files into the GUI batch converter the GUI hangs for a long time, btw.
Nikse555
2nd March 2019, 20:10
@locotus: Could you use Tools -> Merge short lines?
@sneaker_ger: thx for testing :)
Beta updated: https://github.com/SubtitleEdit/subtitleedit/releases/download/3.5.9/SubtitleEditBeta.zip
(I've not looked at freezing GUI in batch convert)
sneaker_ger
2nd March 2019, 20:21
Thx. Any changes besides the track/trac typo?
locotus
2nd March 2019, 20:49
@locotus: Could you use Tools -> Merge short lines?
The workflow I normally use is after running that step and after
lines unbreaker to make sure that when sorted for time duration the lines can be easy to visualize and fix after.
Rousseau
4th April 2019, 13:28
Is there any way to set the OCR language/dictionary during batch processing? It appears to use the last used settings instead of detecting language like it does in other program features.
18fps
16th April 2019, 09:36
Thank you for your great program! It has become my main tool for ripping subtitles.
I'd like to know if it's possible to (somewhat) keep the color information when ripping subtitles from blurays or dvds. Because I have many french blurays, and they don't use caps for HoH descriptions of noise or music, they use colors. Red subtitles are the ones that should be deleted for normal subtitles, yellow subtitles are off screen voices and sometimes should be converted to italics,and so on. So i'd like to be able to have a mark that identifies those colors for later processing. Maybe something like *r* for red subtitles or something like that...
nekrovski
5th June 2019, 07:34
Hi
I cannot seem to find a way to remove </span> tags with SubtitleEdit
https://i.imgur.com/kzPUIzp.jpg
I tried right clicking the line and choosing normal (remove formatting) and doesnt seem to work.
sneaker_ger
5th June 2019, 15:57
If there's no simpler option try regex via Edit->Multiple replace:
Replace what: <span.*?>(.*?)</span>
Replace with: $1
Select "Regular expression".
nekrovski
5th June 2019, 16:26
If there's no simpler option try regex via Edit->Multiple replace:
Replace what: <span.*?>(.*?)</span>
Replace with: $1
Select "Regular expression".
Thanks, that worked.
I've looked several times at all the options under "Tools" but cannot find a simpler method. It's weird, for a program like this to have every other little thing in tools, but not this. That's why I think I'm missing it somewhere.
kenzofabio
23rd June 2019, 15:34
hi everyone can u help me,i just translate with subtitle edit beta zip.but the result is some alphabet not a words,is there something wrong with sub edit.thx i appreciate for some information
mikmik68
8th July 2019, 07:49
Hi everyone
i have a problem with subtitle edit , when i translate from english to danish , there is a bunch of numbers and letters in a long string , before the correct translation ...
i have tried to delete everything including settings.
and reinstall the latest version.
nothing seems to work.
same result everytime, with the long string of numbers and letters in a long unbroken line instead of only the translation.
hope someone has a solution.
in advance , thankyou for you help :thanks:
Regards Kim
Nikse555
16th August 2019, 13:37
mikmik68: The issue with random numbers/letters in GT (Google Translate) should be fixed in latest beta: https://github.com/SubtitleEdit/subtitleedit/releases/download/3.5.9/SubtitleEditBeta.zip
Also, SE 3.5.10 should be out soon, so please give the above beta a test run :)
sneaker_ger
18th August 2019, 18:01
Could you make ASS/SSA more compatible with existing software/"standards"?
1. some software can't read ASS/SSA when there isn't a Newline at the very end of the file (like Aegisub writes)
2. you use the field "Actor" instead of "Name" for ASS. The "spec" and Aegisub use "Name". I think some user requested "Actor" but using "Name" can serve the same purpose just more compatible.
Nikse555
19th August 2019, 16:44
@sneaker_ger: thx for the info - I've tried to add 1+2 in latest beta: https://github.com/SubtitleEdit/subtitleedit/releases/download/3.5.10/SubtitleEditBeta.zip
Let me know how it works.
Also, SE 3.5.10 is out :)
sneaker_ger
19th August 2019, 17:50
Beta is looking great. Thank you for your continued work!
sneaker_ger
5th September 2019, 22:33
(Latest Beta regex:) Why is ".*" replaced by "aa", not "a"?
https://i.imgur.com/Fn5V2or.png
Nikse555
8th September 2019, 08:17
It seems that ".*" also matches an empty string. So first "line1" will be replaced by "a" and then "" will also be replaced by "a" too giving "aa".
See https://stackoverflow.com/questions/7326679/net-regex-matching-matches-empty-strings
So try ".+" instead to avoid empty strings... I don't know why it works like that ;)
sneaker_ger
8th September 2019, 09:18
Ah, I see. It's different from other software I use (e.g. Sublime Text). Thank you.
Matt Kirby
17th September 2019, 19:17
Hi,
I have a problem with OCR and TS-Files.
https://www.dropbox.com/sh/vxsz74el242qw5t/AABYB-56jzODmDPgZlVrxHIaa?dl=0
Here is a part with exact one subtitle-item. When I load it in SE the length is 3.5 seconds.
That's too short. When I convert the part.ts with mkvtoolnix to part.mkv
and I load it to SE the length is correct (about 10.xxx seconds).
Please could you change SE so that the correct length is given with TS-files without conversion?
shag00
18th September 2019, 11:52
I know this request has been going on since 2011 but it really is time this software was able to be easily used by linux users. If an appimage, containing VLC (https://www.videolan.org/legal.html) could be made available it would make life very much easier for linux users. The link on your download page for linux users is 8 years old and out of date. Although someone, somewhere may have cracked it, as far as I am aware you cannot get a windows quality experience with SE using either wine or mono. Either the OCR or the video always has some problem, and stability is also an issue. Of the mastering that I do, sound is now usable decoding, editing and reassembling in linux, video is still outstanding as is subtitles. There is really no native linux alternatives, what is available seems to be no longer supported and certainly is not as fully featured as SE.
Ghitulescu
18th September 2019, 13:58
With so many text-processors, sinking this software into an infinite sea of requests is a bit counterproductive. I mean, when two requirements of treating the semicolon or the hyphen exists, why these people do not use their specially-tuned text-processors, adapted for their language, and do all the text OCR/proc and let SE do only the actual subtitle editing.
I've seen this syndrome eg with satellite receivers whose owners asked the manufacturers or the hackers to fine-tune the AVI-playing section.
Is there any improvement in the bitmap handling of SE in the last 5 years?
vBulletin® v3.8.11, Copyright ©2000-2026, vBulletin Solutions Inc.