PDA

View Full Version : bug: Subrip are detection cedillas as stand alone charater


Sven Bent
19th October 2002, 11:56
i just encodet some movies with french subtitles, witch uses C with cedilla alot "ç".

Many time sburipper think this is an normla "C" ignoreing he cedilla.
then i jump a line down and now find te cedillas as if it wa e letter en a row below

E.G the line:
D'où tu tiens ça ?

becomes
D'où tu tiens ca ?
¤

¤ is the standalone cedillas, i uses this so i can seach the text letter end correct the cedillas manually.

I know this might be hard to fix. but i just wanted to tell

Sven Bent
19th October 2002, 12:39
i just looked throuhg the french subtitles, and it seems that its only the word "ça" that the cediallas are going wrong.

it happens with both capitals and small C with cedillas.

it even happens between to lines

e.g
- Ça xxxxxx
- xxxxxxxx

becomes
- Ca xxxxxx
¤
- xxxxxxxx

hope this can help zuggy

zuggy
22nd October 2002, 08:44
The cedillas are not hard to detect, but in some fonts, as you can see, the cedillas are not "joined" with the C with font color on border, but with outline or antialiasing color - that is the problem, subrip takes it as 2 characters. It may help to increase the "Minimal Inter Line Height" in advanced ocr setup.