Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
![]() |
#1 | Link |
Registered User
Join Date: Mar 2010
Posts: 106
|
subtitle conversion from DVD to text without OCR?
Hi. I couldn't find a lot of info on the net on subtitles. But from what I understand, DVD subtitles don't exist as text streams but as bitmap (pixel) images ONLY, correct? (... although MediaInfo displays my .vob files as having text streams)
I found a lot of threads here on doom9 pointing to programs which demux the subtitles from the .vob files. Depending on the program, this results in .sup files or in a .sub/.idx combination. But none of them are human readable, isn't it? My guess is that the files still contain a stream with bitmap graphics. My question is: Is there any program which would decode the data in those files directly to text without the intermediate OCR step? A .sup to .srt converter? Or a .sub to .srt converter? Or something like that? ![]() Last edited by lovelove; 20th November 2010 at 21:57. |
![]() |
![]() |
![]() |
#2 | Link | |
Registered User
Join Date: May 2008
Posts: 1,618
|
Quote:
|
|
![]() |
![]() |
![]() |
#3 | Link | |
Registered User
Join Date: Mar 2010
Posts: 106
|
Quote:
On the other hand, there are programs which manipulate the JPEG bitstream directly and rotate the image without ever bringing it on the screen (not even "hidden", because the jpeg bitstream is never decoded to x,y image pixels but manipulated directly, without decoding). When translating this situation to subtitles, I was hoping that *somehow* the step of bringing the subtitles on the screen (hidden or unhidden) for OCR could be avoided. Now I admit that even after pondering this for quite a bit, I fail to see how this could possibly be done. But my hope was that the experts here, who have seen a lot in their life, would maybe know of *any* other way than OCR... Is the analogy more or less understandable? Last edited by lovelove; 21st November 2010 at 01:10. |
|
![]() |
![]() |
![]() |
#4 | Link | ||
Registered User
Join Date: Mar 2010
Posts: 106
|
Quote:
The MediaInfo output of one of my .vob files (the second 1 GB vob file of six) looks as follows: Code:
Text #1 ID : 224 (0xE0)-DVD-1 Format : EIA-608 Muxing mode : MPEG Video / DVD-Video Muxing mode, more info : Muxed in Video #1 Stream size : 0.00 Byte (0%) Text #2 ID : 32 (0x20) Format : RLE Format/Info : Run-length encoding Text #3 ID : 33 (0x21) Format : RLE Format/Info : Run-length encoding Text #4 ID : 34 (0x22) Format : RLE Format/Info : Run-length encoding When playing this .vob file in VLC, I have this in my subtitle menu: Quote:
Code:
Track 1 - [Russian] Track 2 - [English] Track 3 - [Espagnol] Track 4 - [Esperanto] closed captions 1 closed captions 2 closed captions 3 closed captions 4 hm...this seems so complicated and I just don't know where to start to better understand this ... Last edited by lovelove; 21st November 2010 at 01:13. |
||
![]() |
![]() |
![]() |
Tags |
converstion, idx, ocr, subtitle |
Thread Tools | Search this Thread |
Display Modes | |
|
|