Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
12th September 2005, 17:08 | #202 | Link |
Registered User
Join Date: Jun 2005
Location: Denmark
Posts: 11
|
@ai4spam: Thank you very much for the reply.
ABOUT THE 44 LIMIT. The 44 limit was indeed one of the first barriers I met when "developing" my method. I did spend some time wondering why it was there. Now I know.

ABOUT THE SCALING AND PROCESSING. The scaling factor can be set in the .idx file for the vertical/horizontal resolution, as a percentage, say 50% to reduce or 155% to enlarge, to suit your needs and liking. The number is used by the Avisynth VSFilter.dll when the file is frameserved, so no scaling is performed when the file is opened directly in SubRip. Implementing that would be a whole new world, or rather a whole new program! So, being just a little realistic, it's doable, but in my opinion way outside the scope of SubRip, since there are ways of doing this without rewriting SubRip.

ABOUT THE BEST GUESS. I'm sorry to hear that you don't like the idea of a best-guess auto mode. It's true that it will introduce errors when the best guess is wrong or missing, but looking at the pros and cons, I find it easier to wipe out some errors using a search function than to be tied to the PC for hours. But I do respect that this isn't a matter of science, but a matter of opinion. I still think it would be a big enhancement, though. For now I will use your suggestions and do some more tweaking of the parameters to optimize the OCR.

ANOTHER REQ. Would it be possible to allow editing the output file in SubRip?

LAST BUT NOT LEAST. I fully respect your work, and know that SubRip isn't your whole life, and thus there are natural limits to fulfilling the never-ending craving for features and options. So I want to point out that I find it more important to appreciate what you HAVE done for us than what will or could be done. All those hours and hours and hours... just for free. That's the truly amazing thing here.

Best regards. |
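As a rough illustration of the .idx scaling described in post #202, here is a minimal Python sketch. It assumes the .idx file carries a line of the form `scale: 100%, 100%` (horizontal, vertical); the function name `set_idx_scale` and the sample text are made up for the example and are not part of SubRip or VSFilter.

```python
import re

# Assumed shape of the scaling line in an .idx file: "scale: 100%, 100%"
SCALE_LINE = re.compile(r"^(\s*scale:\s*)\d+%\s*,\s*\d+%\s*$", re.IGNORECASE)


def set_idx_scale(idx_text, horizontal, vertical):
    """Return idx_text with its first 'scale:' line set to the given percentages."""
    out = []
    replaced = False
    for line in idx_text.splitlines():
        match = SCALE_LINE.match(line)
        if match and not replaced:
            # Keep the original "scale: " prefix, swap in the new percentages.
            line = f"{match.group(1)}{horizontal}%, {vertical}%"
            replaced = True
        out.append(line)
    if not replaced:
        raise ValueError("no 'scale:' line found")
    return "\n".join(out) + "\n"


# Hypothetical excerpt of an .idx file, for illustration only.
sample = "size: 720x576\norg: 0, 0\nscale: 100%, 100%\nalpha: 100%\n"
scaled = set_idx_scale(sample, 155, 155)
print(scaled)
```

As the post notes, a change like this would only take effect when the subtitles are rendered through VSFilter, not when the file is opened directly in SubRip.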
12th September 2005, 19:23 | #203 | Link |
Registered User
Join Date: Sep 2005
Posts: 6
|
Well, 50% success...
Of the two programs, one converted as it should (code name italiaa); the other (code name prisma) did not when given the .txt as input. Both worked when given the first .bmp, but of course the timing was screwed up. Samples (a few hundred bmps...) are in the same place as last time. Happy bug hunt (well, not a bug really, rather a bad initial specification). |
12th September 2005, 22:42 | #204 | Link |
Programmer
Join Date: Sep 2003
Posts: 382
|
@Saligia: Actually, the best guess request would be trivial to implement compared to the resampling, which can be done without "rewriting SubRip" (but with some substantial effort). So, how about I make the best guess an option, and you help me with the resampling? I need the original subs and a couple of screenshots from the .avi that show exactly what the rescaling does.
Editing the output is not easy, because the list is re-filled from the internal structure each time some operation is performed, so I'd have to allow editing that structure.

@chkp45: Did you check the file names? I mean, do the .bmp files in the directory actually coincide with the file names in the .txt file? I actually ran DVDSupDecode on an example and it worked on the output, so there must be something wrong with the file names. |
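The file-name check suggested to chkp45 can be sketched in Python. This is only a sketch under assumptions: the layout of the .txt file is not described in the thread, so the code simply treats any whitespace-delimited token ending in `.bmp` as a referenced file (names containing spaces would be missed). The helper `missing_bitmaps` and the sample listing are hypothetical.

```python
import re
import tempfile
from pathlib import Path, PureWindowsPath

# Assumption: bitmap names appear in the .txt as tokens ending in ".bmp".
BMP_NAME = re.compile(r"[^\s\"']+\.bmp", re.IGNORECASE)


def missing_bitmaps(index_text, directory):
    """List .bmp names mentioned in index_text that are absent from directory."""
    present = {p.name.lower() for p in Path(directory).iterdir() if p.is_file()}
    missing = []
    for token in BMP_NAME.findall(index_text):
        # PureWindowsPath strips both "\" and "/" directory prefixes.
        name = PureWindowsPath(token).name
        if name.lower() not in present:
            missing.append(name)
    return missing


# Small self-contained demo with a made-up listing.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "sub0001.bmp").write_bytes(b"")
    listing = "sub0001.bmp 00:00:01\nsub0002.bmp 00:00:05\n"
    missing = missing_bitmaps(listing, tmp)
print(missing)
```

The comparison is case-insensitive because the files come from a Windows tool; drop the `.lower()` calls if exact case matters.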
13th September 2005, 00:19 | #205 | Link |
Registered User
Join Date: Jun 2005
Location: Denmark
Posts: 11
|
@ai4spam: Seems like a pretty good deal to me!
ABOUT DOCUMENTATION. I'll be more than happy to put together some documentation, with screenshots and all, of what I'm fiddling around with and what it looks like. I'll do that by tomorrow. PM or here? But like I said, I know it will be a huuuge task to implement, if possible at all. That's why I wasn't too pushy about it. But I'll let you judge for yourself whether it's any good and worth the effort.

ABOUT BEST GUESS OPTION. After I read your reply, I did some tweaking here and there, and I did get some minor improvements, but not even close to what a best-guess auto option would be. I don't think I was clear enough on this point in my first message: the fact is that when OCR'ing this way, the best guess is correct 99.99% of the time. So cleaning out a couple of errors afterwards is a small price to pay for the ability to let SubRip do its magic on its own overnight! Therefore I'm absolutely excited that you are willing to give it a try! You're too cool to be true.

ABOUT THE SAVE OPTION IN VIDEO FILE VIEWER WINDOW. I've tried ticking this option on. I see that it creates a subfolder with .pgm files, but I haven't been able to find any documentation about it. At first I thought it would save some images for later processing, but it keeps on OCR'ing AND saving .pgm files. Sorry if it seems like I've been living under a rock, but how is it supposed to be used? And what is a .pgm file?

ABOUT EDITING OUTPUT FILE. It's no biggie that it can't be done. But as you see, most of the time we have absolutely no idea what we are REALLY asking for. At least it's not because we aren't grateful for the work being done. If we didn't find SubRip important and useful, we probably wouldn't bother asking. It's just that most of us don't realize the huge amount of effort that goes into even changes that seem barely noticeable to us.

Best regards. Last edited by Saligia; 13th September 2005 at 01:01. |
13th September 2005, 00:33 | #206 | Link |
Registered User
Join Date: Jan 2005
Posts: 74
|
Hey, ai4spam! Thanks for keeping working on it.
I opened the new version and the toolbar buttons look kind of strange, as if the pictures do not fit the buttons and go over the edge. A screenshot of 3 of them is attached to this message. |
13th September 2005, 00:55 | #207 | Link |
Programmer
Join Date: Sep 2003
Posts: 382
|
@Saligia: No rush with the documentation, take your time - I won't be able to do anything until the weekend anyway. Don't go into too much detail either; I just need some examples where the "low-res" detection fails but the upsampled version works. PM or email is fine (look up the address in the SubRip manual). As for the .pgm files, they were designed for subtitle removal using VirtualDub and a special version of DeLogo that I was working on. Unfortunately, debugging it is a royal pain and I simply don't have the time. I've sent the source to one of the members here; maybe he can work on it. This option will probably be replaced with saving regular .idx/.sub pairs, just like PodPis does, if we can get the removal part grafted onto VSFilter to remove subs instead of putting them in, yet still use the .idx/.sub loading routines.

@Esc: I've changed the button components because the old ones did not have hints in Unicode, so they didn't work with the new Unicode language format. I'll make them a bit larger if needed. Also, I've shortened a couple of captions in the "Enter new char" form to make all the buttons fit. All the languages need updating. Last edited by ai4spam; 13th September 2005 at 01:27. |
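On the "what is a .pgm file?" question from post #205: the thread does not say which flavour SubRip writes, but assuming these are ordinary Netpbm Portable GrayMap files, a binary one (magic number `P5`) is a short text header followed by one byte per pixel. The reader below is a minimal sketch under that assumption; `parse_pgm` is a hypothetical helper, and it handles only `P5` files with a maximum gray value of 255 or less.

```python
def parse_pgm(data):
    """Parse a binary (P5) PGM; returns (width, height, rows of gray values)."""
    tokens = []
    pos = 0
    # The header is four whitespace-separated tokens: magic, width, height, maxval.
    while len(tokens) < 4:
        while data[pos:pos + 1].isspace():
            pos += 1
        if data[pos:pos + 1] == b"#":
            # Comment: skip to the end of the line.
            while data[pos:pos + 1] not in (b"\n", b""):
                pos += 1
            continue
        start = pos
        while pos < len(data) and not data[pos:pos + 1].isspace():
            pos += 1
        tokens.append(data[start:pos])
    pos += 1  # exactly one whitespace byte separates maxval from the pixels
    magic = tokens[0]
    width, height, maxval = (int(t) for t in tokens[1:])
    if magic != b"P5" or maxval > 255:
        raise ValueError("only 8-bit binary PGM (P5) is handled in this sketch")
    pixels = data[pos:pos + width * height]
    if len(pixels) != width * height:
        raise ValueError("truncated pixel data")
    rows = [list(pixels[r * width:(r + 1) * width]) for r in range(height)]
    return width, height, rows


# A hand-made 2x2 image: black, dark gray, light gray, white.
sample = b"P5\n# comment\n2 2\n255\n" + bytes([0, 85, 170, 255])
width, height, rows = parse_pgm(sample)
print(width, height, rows)
```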
13th September 2005, 16:27 | #208 | Link |
Registered User
Join Date: Sep 2005
Posts: 6
|
@ai4spam: This is really something different.
The 'prisma' set initially shows the bitmaps as black on black and processes nothing. (All bitmaps seem to be read and displayed, but nothing is offered for recognition.) When I go to 'change text color' I see text in red with a blue outline on a black background. Then I uncheck black and blue and conversion starts. Trouble is, it shows only the character to be recognized (blue on black, in a thin red outline box). Other characters are black on black??? (This is in the main window, where the whole bitmap is shown.) You can actually do the conversion, but you get errors because the whole text is not visible. Funny that it does not behave like this on the other sample?? Another minor detail: as these recordings have ridiculously long file names, they spoil the appearance of the status line where the file name is shown. Best regards |
13th September 2005, 16:58 | #209 | Link |
Programmer
Join Date: Sep 2003
Posts: 382
|
@chkp45: I've looked into it. It seems like I overdid it: the bitmaps are always 4 colors (0..3) even though the palette may have more, and they always refer to shades of gray. I guess I'll just ignore the "Palette=" lines and be done with it.
Last edited by ai4spam; 14th September 2005 at 16:36. |
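The fix described in post #209 (treat the pixel values as indices 0..3 into shades of gray and ignore the "Palette=" lines) could look roughly like the sketch below. The thread does not give the actual gray levels, so the evenly spaced values in `GRAY_LEVELS` are an assumption, and `indices_to_gray` is a hypothetical name, not SubRip code.

```python
# Assumption: four evenly spaced gray levels; the real values are not given in the thread.
GRAY_LEVELS = (0, 85, 170, 255)


def indices_to_gray(rows):
    """Map rows of 2-bit color indices (0..3) to 8-bit gray values."""
    gray_rows = []
    for row in rows:
        gray_row = []
        for index in row:
            if not 0 <= index <= 3:
                raise ValueError(f"unexpected color index {index}")
            gray_row.append(GRAY_LEVELS[index])
        gray_rows.append(gray_row)
    return gray_rows


gray = indices_to_gray([[0, 3], [1, 2]])
print(gray)
```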
15th September 2005, 05:39 | #210 | Link |
Programmer
Join Date: Sep 2003
Posts: 382
|
SubRip 1.40 Beta 2 is now up on the home site.
ChangeLog:
- Fixed a few bugs.
- Added an option to automatically fill in the best guess (look in the global options window for how to turn it on).
- Updated Portuguese language by Vítor Vieira

Future plans are to implement something to automatically look through all your fonts for a match, instead of asking you to choose the font. It may take a long time to do (got tons of other things to worry about in my own life).

@Saligia: now it's your turn to hold up your end of the bargain. Last edited by ai4spam; 15th September 2005 at 06:10. |
15th September 2005, 11:03 | #211 | Link |
uhm... ?
Join Date: Oct 2001
Location: Gothenburg, Sweden
Posts: 281
|
@ai4spam, how did it go with the suggestion to allow only whole words to be formatted? (italics, bold etc, the issue where some characters are formatted with italics and some not, causing hard-to-read and large files)?
|
15th September 2005, 19:33 | #215 | Link | |
Programmer
Join Date: Sep 2003
Posts: 382
|
Quote:
And remember: I am to blame for only part of the mess in the code. Last edited by ai4spam; 15th September 2005 at 23:29. |
|
17th September 2005, 20:59 | #216 | Link |
Registered User
Join Date: Nov 2004
Posts: 30
|
Hi,
I opened an .srt file in the 1.40 Beta 1, and I tried to use the "Post OCR Spelling Correction", but this option was not available. I think it's available from a fresh rip but not for an old, previously ripped .srt file. |
17th September 2005, 23:22 | #217 | Link |
Programmer
Join Date: Sep 2003
Posts: 382
|
The first thing that happens when you open an old file is that SubRip asks you what CodePage/CharSet to use to convert to Unicode. Choose the values you want, then press "Done". Then all the other options will be enabled again.
The whole purpose of this is to deal with non-standard CharSets. For example, for Eastern European, if you don't choose that CharSet, some Unicode characters (the diacritics) will be converted to the wrong values. The reason the text is shown is so that you can try different CharSets until you are satisfied with the conversion. Anyway, I recommend other tools for better OCR and spelling correction, like SubtitleWorkshop. |
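To see why the CodePage/CharSet choice in post #217 matters, here is a small Python sketch (not SubRip code) that decodes the same three bytes with two Windows code pages. `cp1250` is the Central/Eastern European code page and `cp1252` the Western European one; the byte values are picked only for illustration.

```python
# The same bytes from an old, non-Unicode subtitle file...
raw = bytes([0xE6, 0xE8, 0xF8])

# ...decode to different characters depending on the code page chosen.
decoded = {codec: raw.decode(codec) for codec in ("cp1250", "cp1252")}

for codec, text in decoded.items():
    print(codec, text)
```

With `cp1250` the bytes come out as the diacritics ć, č, ř; with `cp1252` they become æ, è, ø, which is the kind of wrong conversion the post warns about.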
20th September 2005, 21:17 | #219 | Link |
Registered User
Join Date: Aug 2004
Posts: 24
|
Does anyone have a problem with SubRip not recognising whole lines?
I tried changing colors etc.; even when it looks perfect, it doesn't recognise the line. Some subs have this problem more than others... sometimes I have to type the whole line of dialogue 20-30 times... Is there any way to solve that? Maybe a config setting? Thanks |
20th September 2005, 22:17 | #220 | Link |
Registered User
Join Date: Sep 2004
Location: France
Posts: 367
|
On French Windows, with the default XP theme, when I save subs I can't see the end of the last charsets, so I don't know if they are UTF-8 or UTF-16 or something else; I just see up to "UTF".
And with the French language selected, I can't see the char proposed by the program; it appears under the words "meilleure réponse" :| |