DataLore
24th August 2004, 06:18
I have done serious research on subrip and it seems pretty simple..
I ran subrip on a 2 season dvd boxset and it went thru the first season without any problems.. (That basics i's l's mixed up and the ocr correction almost removed every one of those errors)
Once I started subrip on the second season. (with a new matrix or the old saved matrix) I was getting about 90% error on reads.. It seemed to be confused by the letter u (pulling the left half of the letter then the right half of the letter)
M and N were misunderstood as 2 letters as well.. I tried to subrip it over and over.. there seems to be a visual difference in the font used on the actual images stored in the .VOB's
I have tried to tweak the advanced OCR features but I am at best guessing on what I'm playing with.. There seems to be almost no documentation on the advanced settings..
I can subrip and pull the 10% good and manually adjust the 90% horrible errors and load them into substation and manually watch the show and make each line correct.. but this it VERY time consuming..
Running spell check on words that even next to other words make no sense, is impossible.
Has anyone else encountered this almost total inability to read a particular font style?? If so was it correctable... is there a chance that OCR tweaking will help??
if there is anything you can suggest I would be very greatful. I want to get the subs out to someone that requested them.. As I know he cannot enjoy the audio.. but I do not have the time required to fix 20+ episodes.
Thanks for your time!
DataLore
I ran subrip on a 2 season dvd boxset and it went thru the first season without any problems.. (That basics i's l's mixed up and the ocr correction almost removed every one of those errors)
Once I started subrip on the second season. (with a new matrix or the old saved matrix) I was getting about 90% error on reads.. It seemed to be confused by the letter u (pulling the left half of the letter then the right half of the letter)
M and N were misunderstood as 2 letters as well.. I tried to subrip it over and over.. there seems to be a visual difference in the font used on the actual images stored in the .VOB's
I have tried to tweak the advanced OCR features but I am at best guessing on what I'm playing with.. There seems to be almost no documentation on the advanced settings..
I can subrip and pull the 10% good and manually adjust the 90% horrible errors and load them into substation and manually watch the show and make each line correct.. but this it VERY time consuming..
Running spell check on words that even next to other words make no sense, is impossible.
Has anyone else encountered this almost total inability to read a particular font style?? If so was it correctable... is there a chance that OCR tweaking will help??
if there is anything you can suggest I would be very greatful. I want to get the subs out to someone that requested them.. As I know he cannot enjoy the audio.. but I do not have the time required to fix 20+ episodes.
Thanks for your time!
DataLore