Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
19th March 2020, 22:34 | #841 | Link |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 434
|
@GCRaistlin: Editing the text in "Remove text for HI could be possible - how does this work: https://github.com/SubtitleEdit/subt...leEditBeta.zip
|
19th March 2020, 22:52 | #842 | Link | |
Registered User
Join Date: Oct 2003
Posts: 158
|
Quote:
My problem is just when I open an existing SRT from File / Reopen menu. Before the update, it opened both the SRT and the associated video file. Now, it doesn't open the video. I have to do it again each time I open the SRT. It concerns any codec / container. |
|
20th March 2020, 07:08 | #843 | Link |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 434
|
@Lucius Snow: Do you still have problems in latest beta: https://github.com/SubtitleEdit/subt...leEditBeta.zip ?
If yes, what are the steps for re-creating this in detail (drag-n-drop or file open or shortcuts)? |
20th March 2020, 11:54 | #844 | Link | |||
Registered User
Join Date: Jun 2006
Posts: 361
|
Quote:
Code:
92 00:07:54,641 --> 00:07:56,559 - l would take the idea to its extreme - - [ Whispering, lndistinct ] 93 00:07:56,643 --> 00:08:00,188 and draw parallels between reproduction in art. . . 577 00:38:09,245 --> 00:38:13,875 - Well, he said that, uh - - lt is actually as beautiful as the original. 578 00:38:14,000 --> 00:38:17,295 - that they thought it was an original for many, many centuries - - [ Man, ln ltalian ] When was it made? 1091 01:12:28,344 --> 01:12:32,014 - That impression is quite right, but. . . - [ Crowd Cheering ] 1092 01:12:32,181 --> 01:12:33,808 how can l say. . . Quote:
Quote:
It does, thanks again.
__________________
Windows 8.1 x64 Magically yours Raistlin |
|||
20th March 2020, 13:08 | #845 | Link | |
Registered User
Join Date: Oct 2003
Posts: 158
|
Quote:
|
|
20th March 2020, 14:29 | #847 | Link |
Registered User
Join Date: Oct 2003
Posts: 158
|
That's what I described earlier:
My problem is just when I open an existing SRT from File / Reopen menu. Before the update, it opened both the SRT and the associated video file. Now, it doesn't open the video. I have to do it again each time I open the SRT. Difficult to explain more |
20th March 2020, 22:28 | #848 | Link | |
Registered User
Join Date: Jun 2006
Posts: 361
|
Nikse555
My current Latin.db - maybe you'll find my additions useful for all.
__________________
Windows 8.1 x64 Magically yours Raistlin |
|
29th March 2020, 11:31 | #850 | Link |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,850
|
@Nikse555
I am finding some encoding giving problems to your editor. Here you can find some. I hope the names are self explanatory enough. The only ones I can open with no problems on accented vowels and symbols are UTF16-LE ones. With UTF8 it is a mess
__________________
@turment on Telegram |
29th March 2020, 12:08 | #851 | Link |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 434
|
@tormento: I don't think those files are correctly encoded, sorry.
EDIT: The UTF-8 files have UTF-8 BOM (EF BB BF) but they are not using UTF-8 encoding, they are ANSI encoded! Yes, really a mess Last edited by Nikse555; 29th March 2020 at 13:34. |
30th March 2020, 12:28 | #852 | Link | |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,850
|
Quote:
What about the other message about "L" OCR?
__________________
@turment on Telegram |
|
30th March 2020, 14:41 | #853 | Link |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 434
|
I've updated latest beta somewhat: https://github.com/SubtitleEdit/subt...leEditBeta.zip
Your subtitle runs very well through the OCR using "Binary image compare" with number-of-pixels-is-space=7 and max-error-pct=1. I did not have any problems with "L". What OCR method are you using and what lines are problematic? Last edited by Nikse555; 31st March 2020 at 05:50. |
31st March 2020, 12:57 | #854 | Link | |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,850
|
Quote:
Many many "L": 570 00:51:39,520 --> 00:51:42,478 L know the book is tough, but l liked it. 571 00:51:42,600 --> 00:51:43,476 L know. Tried with a fresh installation and "untrained" OCR database?
__________________
@turment on Telegram |
|
31st March 2020, 14:32 | #855 | Link | |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 434
|
Quote:
Result here, starting with lowercase "L": Code:
l know the book is tough, but l liked it. |
|
31st March 2020, 15:20 | #856 | Link | |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,850
|
Quote:
I get capital L!
__________________
@turment on Telegram |
|
2nd April 2020, 18:15 | #857 | Link |
Pig on the wing
Join Date: Mar 2002
Location: Finland
Posts: 5,812
|
I've added the OCR fix list pair (Options -> Settings -> Word lists) l --> I to fix this, if I remember correctly. You can also do that after the OCR run.
That subtitle example was very straightforward to OCR and the characters look like most DVDs, so you get a lot of good matches for future subs. I uploaded my dictionary files and latin.db in case someone finds them useful (Nikse555 can freely use the content with SE if he wants to): https://drive.google.com/open?id=1Bo...8tAwXRn-IWRgnz https://drive.google.com/open?id=1Nz...CBOW9K93DZVxHb
__________________
And if the band you're in starts playing different tunes I'll see you on the dark side of the Moon... |
3rd April 2020, 11:07 | #858 | Link |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,850
|
One of the best things of SubRip was the possibility to save different matrixes and automatically scan for the most effective one on OCR when loading the sub bitmap file.
That gives the possibility not to pollute the good trained ones with some unusual subtitle, plus the possibility to organize them effectively. Moreover, Subtitle Edit could come with pretrained ones like SupRip did for the most commonly used fonts. @nikse would you, please?
__________________
@turment on Telegram |
3rd April 2020, 11:14 | #859 | Link |
Pig on the wing
Join Date: Mar 2002
Location: Finland
Posts: 5,812
|
SE does have the ability to use different databases for OCR and the scanning could be useful, I agree.
__________________
And if the band you're in starts playing different tunes I'll see you on the dark side of the Moon... |
3rd April 2020, 11:21 | #860 | Link | |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,850
|
Quote:
Another missing thing is the possibility to expand selection, such as for % that sometimes goes wrong on OCR. A point and click expansion thing would be even better.
__________________
@turment on Telegram |
|
Thread Tools | Search this Thread |
Display Modes | |
|
|