Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion.

Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules.

 

Go Back   Doom9's Forum > General > Subtitles

Reply
 
Thread Tools Search this Thread Display Modes
Old 27th April 2005, 21:01   #1  |  Link
zuggy
Registered User
 
Join Date: May 2002
Location: Czech Republic
Posts: 171
ANN: SubRip 1.50 b4

ai4spam continues the developing of subrip, he added support for ripping subs out of avi files that contain them hardsubbed. You can download and test it - http://zuggy.wz.cz

beta 3a: LRN of TeMa=) team added experimental 3GPP Timed Text (*.ttxt) support (http://gpac.sourceforge.net/auth_text.php), without color support though.

beta 4: Bugfix release only (crash when editing a Character Matrix file)

Last edited by zuggy; 7th November 2006 at 08:57. Reason: update
zuggy is offline   Reply With Quote
Old 27th April 2005, 22:44   #2  |  Link
ai4spam
Programmer
 
ai4spam's Avatar
 
Join Date: Sep 2003
Posts: 378
Last version: SubRipAvi 1.50 Beta 3, January 5 2006

Changes in 1.50:
Beta 1: Added whole words formatting. Added support for text formatting in subtitle window. Lots of other GUI changes. Moved to Delphi 2005 (but Delphi 7 is still supported). Speedup in processing videos when the frames are empty. Experimental Spanish language correction.
Beta 2: Fixed the detached chars problem. Fixed UniCode support to preview CodePage changes. Other bugfixes. Some changes in the video GUI. Reverted to Delphi 7, but Delphi 2005 is still supported.
Beta 3: Fixed saving bug for split subtitles. Fixed video no longer working bug. Added a progress bar for loading char matrices. Updated ortography dictionaries to UniCode.

Changes in 1.40:
Beta 1: large GUI overhaul and bugfixes; changed the format of the language and char map files to UniCode; filling a char matrix with UniCode ranges; opening bitmap sequences that DVDSupDecode outputs from converting DVB-T broadcasts saved as .sup by ProjectX; made Modify and Delete work faster for characters in the Char Matrix
Beta 2: fixed a few bugs and added an option to automatically fill the best guess; updated Portuguese language by Vítor Vieira
Beta 3: added support for negative timestamps and using the file offsets from .idx files, and fixed a few bugs

Changes in 1.30:
Beta 1: added experimental UniCode support
Beta 2: fixed a few bugs
Beta 3: fixed AviSynth support (no UniCode file names in .avs scritps)
Beta 4: fixed old character matrix format import (now the correct diacritics are selected if the CodePage is set properly)
Beta 5: fixed "black on black" bug; added experimental support for delays in .idx files (multiple delays are compounded)
Beta 6: fixed a bug when saving subtitles as UniCode instead of ANSI for some languages; GUI improvement when processing videos: show last frame of the subtitle when doing OCR, instead of the current frame
Beta 7: large overhaul of the video GUI (now tolerances are per channel); changed the detection algorithm; added text fattening to help with irregular characters
Beta 8: improved the detection algorithm: now the frame times should be correct in all but the most difficult cases; fixed a bug in the disjoint characters routine; fixed DVD Maestro saving bug
Beta 9: fixed video not showing bug; added UniCode font support (by default, Arial Unicode MS is used, if available); fixed several other bugs
Beta 10: added a Font button in the subtitle window; fixed several bugs in hard-subbed video ripping
Beta 11: fixed some GUI bugs, updated with new French language file by Kurtnoise

Changes in 1.20:
Beta 9: AVI file support; fill char matrix from matched font; best guess so far
Beta 10: speedup in avi files; moved all avi stuff to a new window, changed some things in the GUI; added Ctrl-Enter for using best guess in OCR; added Pause/Abort button in color change dialog; solved the duplicate subs problem; Zuggy solved the P4 crash problem.
Beta 11: speedup and bug fixes for video files; several GUI improvements.
Beta 12: extend selection for disjoint characters; draw blue lines between text lines
Beta 13: fill areas that touch the blue lines between text lines
Beta 14: automatic support for other video files using AviSynth.
Beta 15: support for saving subtitles as bitmaps for later subtitle removal
Beta 16: extend more than 1 character, save bitmaps as .pgm
Beta 17: minor bugfixes

A guide for ripping subtitles from videos is now available:
http://zuggy.wz.cz/redir.php?co=101

I've posted my char matrix: http://sr1.mytempdir.com/132702 or http://foxyshadis.slightlydark.com/r...CharMatrix.rar (thanks to foxyshadis)

It works for some 90% of the DVDs I've seen. Be sure to set the OCR sensitivity high (1000) when using it. See additional instructions in my post: http://forum.doom9.org/showthread.ph...590#post679590

Last edited by ai4spam; 6th January 2006 at 06:09.
ai4spam is offline   Reply With Quote
Old 27th April 2005, 23:33   #3  |  Link
unskinnyboy
Registered User
 
unskinnyboy's Avatar
 
Join Date: Feb 2004
Location: NTSC R1
Posts: 2,047
Hardcoded subs from AVI? Wow. This is great news. Thanks! Installing 1.2b now.
unskinnyboy is offline   Reply With Quote
Old 28th April 2005, 02:22   #4  |  Link
smiller667
Registered User
 
Join Date: Oct 2001
Posts: 1,121
Thank you for the great news ... I'll try the hardsub-ripping soon.

Steve
smiller667 is offline   Reply With Quote
Old 28th April 2005, 07:32   #5  |  Link
niamh
Dismembered
 
niamh's Avatar
 
Join Date: Nov 2003
Location: Craggy Island
Posts: 873
This is great!
__________________
Allen's Axiom: When all else fails, read the instructions.
niamh is offline   Reply With Quote
Old 28th April 2005, 07:48   #6  |  Link
E-Male
mad computer-scientist
 
Join Date: Mar 2002
Posts: 1,375
gotta try this one
E-Male is offline   Reply With Quote
Old 28th April 2005, 19:08   #7  |  Link
ai4spam
Programmer
 
ai4spam's Avatar
 
Join Date: Sep 2003
Posts: 378
Thanks, guys, feedback and bug reports will be appreciated.
I'm currently working on it, so check back to the website often, there may be new betas for download.
I'll try to make a HowTo guide for it also.
ai4spam is offline   Reply With Quote
Old 28th April 2005, 19:25   #8  |  Link
ai4spam
Programmer
 
ai4spam's Avatar
 
Join Date: Sep 2003
Posts: 378
Oh, and ti will work with anything MediaPlayer can play (mpg, ogm, you name it), except that in MediaPlayer mode (as oppposed to simple Avi mode) the area where the movie is displayed has to be on top, with no other windows over it, and there is no way it can tell the frame rate.
ai4spam is offline   Reply With Quote
Old 29th April 2005, 01:18   #9  |  Link
ai4spam
Programmer
 
ai4spam's Avatar
 
Join Date: Sep 2003
Posts: 378
Instructions

Here's a first try at a "manual" for the new features, extracted from the updated documentation:


II - OCR Process

...
If SubRip already has a character that it thinks is close to the
current one, it will display it on top of the window, as the "best
guess". To use it (including the font style), just press the "Use"
button. A new character will be inserted in the character matrix,
just as if you typed it manually.
...


5) If you think you recognize the font that the subtitles use, you can
try to generate all the characters automatically, instead of typing
them one by one. Press the "Fill Matrix From Text" button. In the
window that appears, first type one of the lines of text in the
image, then use the buttons to adjust the text position, size and
angle so that it overlaps exactly with the text in the image. Try
changing the font size and style to improve the match (if you select
italics or bold, the styles will also show up in the character
matrix). When done, press "Done" to continue. If the match is not
perfect, or the OCR tolerance is too high, SubRip may not recognize
the newly-inserted characters, and just show you a best guess. If it
matches, press the "Use" button, and the current character will be
introduced in the character matrix.

...

IX - Ripping SubTitles from video files

Some video files have SubTitles "burned into them". SubRip can be used
to extract subtitles from video files as follows.

1) Open the video file by selecting "Open Hard Subbed Video Files" from
the "File" menu. SubRip will try to open any file that MediaPlayer can
play, but it can only detect the frame rate of .avi files.

2) The movie will start playing in the video window. Wait until you see
a subtitle, or use the track bar in the "Video control panel" or the
edit box to advance to another frame. Press the "Pause" button to stop
the movie from playing when you see a subtitle.

3) In the movie window, click INSIDE a character (the white area) so
that SubRip can detect the subtitle color and width. Look in the panel
to confirm: the detected colors should match what you see in the video
window. If not, click again. The "text line width" box should show
values between 1 and 5. Anything larger would probably mean that
detection was unsuccessful. If after several tries, the detection still
does not seem to work, try lowering the "text color tolerance".

4) In the movie window, right-click and drag to define the region inside
which the detection takes place. The smaller the region, the faster it
will work. The region should completely enclose the subtitles, and have
a generous margin of at least the width of the text line and outline
combined. Be careful for cases when there are more lines in the
subtitle. You can always stop processing and redefine the region by
right-clicking and dragging.

5) Check the "View" check box. In the main window, a rectangle the size
of the selected region should show up, with the subtitles in white and
the outline in red. If the subtitles have outlines (they mostly do),
also check the "Use" check box.

7) If the subtitles do not show up properly (the lines are too thin),
try playing with the text line and outline widths and increasing the
tolerances. Ideally, even on a bright background, you should only see
the text and the outline. If large bright areas also show up, try
checking the "Try to fill open areas" and "Try to fill large areas"
check boxes. Do not use them unless you have to, they slow down the
recognition process.

8) When you are satisfied, press the "Rewind" button to go to the start
of the movie, then the "Run" button to start the OCR process. The OCR
will be similar to what you see when ripping SubTitles from DVDs. If at
any time you need to change any parameters, press the "Pause/Abort"
button.

9) If you want to change the last subtitle, press the "Back" button to
go the first frame of that subtitle in the movie.
ai4spam is offline   Reply With Quote
Old 29th April 2005, 03:26   #10  |  Link
E-Male
mad computer-scientist
 
Join Date: Mar 2002
Posts: 1,375
had a quick look, and all seems ok (as always with subrip)
the only thing i don't like is this:
if a windows is on top of the video it get's captured instead
so far not so bad, i guess it has a technical reason, but:
1. the control windows (with the options for the ocr from video) hardly fits between the video and the edge of the window, maybe one solution would be to allow the window to be set to fullscreen
2. the text window (showing the sub file), it always appears at the same place (partially over the video), could it be changed to remember it's last position

an other (low priority) request:
since subrip recognises the subs in the video
could it output their frame times, coordinates and bitmaps of them
if that format could be fine-tuned together with an avisynth-logo-removal-plug-in-developer [what a word] it would be possible to do high quality subtitle removing
(later a translation of the OCRed subs could be put back as hard- or soft-subs)
E-Male is offline   Reply With Quote
Old 29th April 2005, 06:37   #11  |  Link
ai4spam
Programmer
 
ai4spam's Avatar
 
Join Date: Sep 2003
Posts: 378
Thanks for the feedback.

For issue 1, indeed the reason is technical: in MediaPlayer mode it can open pretty much any file that MediaPlayer opens, but it can't draw it double-buffered. I'm changing this so that the video is in a separate window that is always on top. That will take care of issue 2 also.

The low-priority request can be done, and will be if such a developer adresses me with a request.
ai4spam is offline   Reply With Quote
Old 1st May 2005, 16:35   #12  |  Link
ai4spam
Programmer
 
ai4spam's Avatar
 
Join Date: Sep 2003
Posts: 378
About the VirtualDub Plugin: there is already a plugin that pulls the subtitles in "standard" SubRip files (.sub and .idx) from video files. I haven't tested it (a friend says it only works with VirtualDub 1.5.4) and I don't know how well it deals with tough cases like white subs on white background (it seems rather simplistic, from the interface), but it may be worthwhile talking to its developer about your request:
http://www.sharemation.com/PodPis/PodPis.html
ai4spam is offline   Reply With Quote
Old 2nd May 2005, 22:01   #13  |  Link
masken
uhm... ?
 
Join Date: Oct 2001
Location: Gothenburg, Sweden
Posts: 280
Great news!

Still any need for "hard cases" vobsubs as reference material?

Taking my chance for a feature suggestion:
Accelerator indicators for selecting formatting wanted (Italics, Underlined, Bold, Sticky (or whatever the no-named one's named )).

Also pause/abort in the 4-color selection dialogue.

Last edited by masken; 2nd May 2005 at 22:07.
masken is offline   Reply With Quote
Old 2nd May 2005, 23:30   #14  |  Link
ai4spam
Programmer
 
ai4spam's Avatar
 
Join Date: Sep 2003
Posts: 378
Thanks for the reply. I don't quite understand what you mean by "hard cases". About the feature suggestion: in what window? The pause/abort can be done, no problem.

I'll resume work at the end of this week. Meanwhile, if he has time, Zuggy will try to solve the crash bug on P4.
ai4spam is offline   Reply With Quote
Old 4th May 2005, 08:58   #15  |  Link
Endzeit
Registered User
 
Join Date: Mar 2005
Location: Vienna, Austria
Posts: 1
ai4spam,

you're definitely a genius!
Never heard of this program before, d/l'ed it and gave it a try - OCRing hardcoded subs is an amazing approach!

Thanks a bunch for all the effort you put into this great piece of software.
Endzeit is offline   Reply With Quote
Old 4th May 2005, 09:53   #16  |  Link
niamh
Dismembered
 
niamh's Avatar
 
Join Date: Nov 2003
Location: Craggy Island
Posts: 873
I have a feature request if that's possible .. I love the best guess button, but it gets it wrong half the time, so I end up moving the mouse up and down to fill the letter in or click best guess (in the end, I just fill the letters in). Would it be possible to make an option saying "press enter on the empty box will automatically enter best guess in the matrix" (if that's english). This way it would be even faster and easier

I have an idx here that won't be ocred at all with any app, I tried fiddling with the detection thresholds with no joy, if anyone wants it to improve ocr detection I can upload it.
I think masken meant those, idx files that won't be ocred properly no matter what.
__________________
Allen's Axiom: When all else fails, read the instructions.
niamh is offline   Reply With Quote
Old 4th May 2005, 15:31   #17  |  Link
smiller667
Registered User
 
Join Date: Oct 2001
Posts: 1,121
If you can up it somewhere, I'd like to take a look ... just out of curiosity.

Steve
smiller667 is offline   Reply With Quote
Old 4th May 2005, 16:25   #18  |  Link
castellanos
Member
 
castellanos's Avatar
 
Join Date: Jan 2004
Location: Argentina and Germany
Posts: 687
Hi!
I'll give a try, but I would like to know if an old bug is fixed it:
When saving subs in srt text mode, SubRip makes no difference between "I" and "l" then, for example, the word "Golf" comes like this: "GoIf" and so on with the rest of the "l"s...
Somebody noticed that problem?
Greetings!
castellanos is offline   Reply With Quote
Old 4th May 2005, 16:44   #19  |  Link
niamh
Dismembered
 
niamh's Avatar
 
Join Date: Nov 2003
Location: Craggy Island
Posts: 873
subs

Have fun
__________________
Allen's Axiom: When all else fails, read the instructions.
niamh is offline   Reply With Quote
Old 4th May 2005, 16:45   #20  |  Link
E-Male
mad computer-scientist
 
Join Date: Mar 2002
Posts: 1,375
Quote:
Originally posted by castellanos
Hi!
I'll give a try, but I would like to know if an old bug is fixed it:
When saving subs in srt text mode, SubRip makes no difference between "I" and "l" then, for example, the word "Golf" comes like this: "GoIf" and so on with the rest of the "l"s...
Somebody noticed that problem?
Greetings!
have you tried the 'postproccessing'? [don't remember the exact name of the feature]
E-Male is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 16:03.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2017, vBulletin Solutions Inc.