Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
|
8th October 2011, 12:19 | #1 | Link |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 432
|
Subtitle Edit
Subtitle Edit 4.0.7 is now out
https://github.com/SubtitleEdit/subtitleedit SE is an open source (C#) subtitle editor with main focus on creating/editing/sync'ing/adjusting/fixing subtitles, but SE can also import and ocr vobsub and blu-ray image based subtitles (even from matroska/mp4 files), and DVB sub + teletext from .ts files. SE supports 300+ subtitle formats - let me know if you need more Can create/edit blu-ray sup and bdn xml files. Available for Windows and Linux Last edited by tebasuna51; 30th August 2024 at 11:10. Reason: Updated version number |
10th October 2011, 06:59 | #3 | Link |
Registered User
Join Date: Mar 2009
Location: Germany
Posts: 5,772
|
It looks promising.
I never used it before, that's why I would ask you how well manages SE32 to work with DVD subtitles (SUP), like retiming, synching (to other/preexistent SUP), bitmap editing etc?
__________________
Born in the USB (not USA) |
10th October 2011, 20:09 | #4 | Link | |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 432
|
Quote:
SE can read and ocr dvds/vobsub/blu-ray sup + a few more image based formats - but the only image based format SE can write is bdn xml/png. (the blu-ray sup code is converted from 0xdeadbeef's java code for BDSup2Sub) |
|
16th October 2011, 03:36 | #5 | Link |
HeartlessS Usurer
Join Date: Dec 2009
Location: Over the rainbow
Posts: 11,018
|
Crash during "Fix Common Errors".
Crash Report & Mi2.srt here:- EDIT: Link removed.
__________________
I sometimes post sober. StainlessS@MediaFire ::: AND/OR ::: StainlessS@SendSpace "Some infinities are bigger than other infinities", but how many of them are infinitely bigger ??? Last edited by StainlessS; 20th October 2011 at 10:43. |
16th October 2011, 20:33 | #6 | Link | |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 432
|
Quote:
Thx for reporting this Fixed here: http://www.nikse.dk/SubtitleEdit.zip Last edited by Nikse555; 16th October 2011 at 21:28. |
|
20th October 2011, 10:46 | #7 | Link |
HeartlessS Usurer
Join Date: Dec 2009
Location: Over the rainbow
Posts: 11,018
|
Thanks Nikse555, got something else to keep you busy.
SubTitle Edit 3.2.2, Build 25663 Crash during Spell Check (HunSpell, dont know if same error as previously reported in other thread) Crash Report & DWL.srt here:- http://www.mediafire.com/?6pltpi72a82lz52
__________________
I sometimes post sober. StainlessS@MediaFire ::: AND/OR ::: StainlessS@SendSpace "Some infinities are bigger than other infinities", but how many of them are infinitely bigger ??? |
22nd October 2011, 07:42 | #8 | Link | |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 432
|
Quote:
I could not re-create this error on my Win7 machine, but I've tried to fix it here (by running suggestions in a separate thread): http://www.nikse.dk/SubtitleEdit.zip Any better? |
|
21st October 2011, 02:19 | #9 | Link |
Registered User
Join Date: Mar 2010
Posts: 52
|
Thanks Nikse555 for new version of SE.
I have some problem with OCR ...plzz help me...when i use 3.2 OCR of VobSub & Blu-ray sup files are working perfectly but when uninstall it and install new version 3.2.2 my OCR not working now..it only shows orange lines no text. Sample of *.sup subtitles...can u plzz check these subtitles. http://www.mediafire.com/?zqml8hbrcy6jqgt Last edited by MajorX; 21st October 2011 at 07:20. |
22nd October 2011, 18:52 | #10 | Link | |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 432
|
Quote:
New version upped: http://www.nikse.dk/SubtitleEdit.zip Any better? |
|
23rd October 2011, 19:30 | #11 | Link |
HeartlessS Usurer
Join Date: Dec 2009
Location: Over the rainbow
Posts: 11,018
|
Sorry for the delay, Nikse555,
Before I tried your update, I had to download the srt from MediaFire as I did not keep a verbatim copy. I tried it with the original faulting build 25663, and it did not fault. Tried this several times, no fault. Got the version srt that I kept, (probably spell checked via other means) and checked that, same thing, no fault. Have not ripped any other subs since then (I think) and made no changes to the setup. I guess it will have to remain a mystery. EDIT: Also tried with build 13726, no fault (earlier build No ???).
__________________
I sometimes post sober. StainlessS@MediaFire ::: AND/OR ::: StainlessS@SendSpace "Some infinities are bigger than other infinities", but how many of them are infinitely bigger ??? Last edited by StainlessS; 23rd October 2011 at 19:37. |
24th October 2011, 06:51 | #12 | Link |
Registered User
Join Date: Nov 2001
Posts: 1,104
|
What do I do to OCR German subs? I've downloaded a German Tesseract package and unpacked it to the program dir but to no avail. I pretty much have to type every word?
__________________
MultiMakeMKV: MakeMKV batch processing (Win) MultiShrink: DVD Shrink batch processing Offizieller Übersetzer von DVD Shrink deutsch |
24th October 2011, 07:36 | #13 | Link | ||
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 432
|
Quote:
Quote:
Do choose Tesseract as OCR method (not image compare) And if you're lazy just get this version with German dictionaries included: http://subtitleedit.googlecode.com/files/SE322DE.zip |
||
25th October 2011, 07:55 | #14 | Link |
Registered User
Join Date: Nov 2001
Posts: 1,104
|
Mm, I had unpacked it to Subtitle Edit\tesseract\tessdata but ok, your de package is fine, thanks. It also works pretty good, however some events described in parenthesis for the hearing impaired are recognized with mixed case, like
(KEucH†) instead of (KEUCHT) (I_AcH†) instead of (LACHT). Also, the small t is recognized as a small l which messes up a lot of items and can only be fixed manually. These new words don't even exit in the German language so shouldn't spellchecking kick in with "prompt for unkown words" being checked? Then the distance between two words ending with r and starting with j is not recognized. Instead of "aber jetzt" it reads "aberjetzt". What can I do to improve this? Thanks.
__________________
MultiMakeMKV: MakeMKV batch processing (Win) MultiShrink: DVD Shrink batch processing Offizieller Übersetzer von DVD Shrink deutsch Last edited by Chetwood; 25th October 2011 at 08:20. |
25th October 2011, 18:04 | #15 | Link | |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 432
|
Quote:
(Tesseract should also be a bit faster now) When you press "change all" or "use always" the correction is remembered in OcrFixReplacelist.xml... |
|
26th October 2011, 14:22 | #16 | Link | |
Registered User
Join Date: Nov 2001
Posts: 1,104
|
Quote:
The German Umlaut ü (u with two dots above) is often recognized as two i's: ii. Since it's common in several words, how do I replace it for all of them? Thx.
__________________
MultiMakeMKV: MakeMKV batch processing (Win) MultiShrink: DVD Shrink batch processing Offizieller Übersetzer von DVD Shrink deutsch Last edited by Chetwood; 26th October 2011 at 14:25. |
|
26th October 2011, 14:52 | #17 | Link | |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 432
|
Quote:
<PartialWords> ... <WordPart from="ii" to="ü" /> </PartialWords> SE will now look for correct spelled words, where "ii" is replaced with "ü". You can also take a look at "eng_OCRFixReplaceList.xml". |
|
25th October 2011, 22:00 | #18 | Link |
Registered User
Join Date: Jul 2011
Posts: 224
|
This actually works really good! almost all of the text is right on, and the GUI guides your through smoothly when it needs a fix.
I have a rather strange auto fix though (some kind of error or bug): http://i1208.photobucket.com/albums/...kon/weirds.png If you want the .SUP that caused this error to occur give me an email address I can send the file to. (Upon further testing this weird error only happens if the "Try MS MODI OCR for unknown words" checkbox is checked, If I un-check it then this strange substitution does not happen.) The only OCR error that I get that does not get automatically corrected is the letter "k" being detected as "l<" and not being auto corrected: http://i1208.photobucket.com/albums/cc361/xekon/k.png I notice similar errors in OCR but they DO get auto corrected like "I\/ly" -> "My" is there a way I can add "l<" to be autocorrected to "k" ? also a setting in the options panel to disable "Try MS MODI OCR for unknown words" by default would be handy, then I wouldn't have to uncheck it every subtitle I load I have also had "d" been detected as "ol" pretty often, then the spell checker dont recognize the word so I edit it manually and change the "ol" to "d" like the word worried, gets detected as worrieol Last edited by xekon; 25th October 2011 at 22:57. |
26th October 2011, 10:03 | #19 | Link | |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 432
|
Quote:
Tesseract (new 3.01 version) now runs in it's own thread, so it should be a bit faster too. Link to new version: http://www.nikse.dk/SubtitleEdit.zip How is it working? |
|
Thread Tools | Search this Thread |
Display Modes | |
|
|