View Full Version : Subtitle Edit
Nikse555
3rd October 2014, 14:18
@Mole: In next update the OCR fix list and the names etc list will have a "user" file.
@minhjirachi: thx, should be fixed here: http://www.nikse.dk/SubtitleEdit.zip (on src on github + next release)
@Music Fan:
1) the crash should be fixed here: http://www.nikse.dk/SubtitleEdit.zip (on src on github + next release)
Could you verify?
2) Could I check the file (email/msg)? Perhaps VLC/MPC-HC scales the subs etc?
3) yes, small fonts don't work too well in Tesseract.
4) English vs US? Try another dictionary.
Music Fan
4th October 2014, 13:43
Thanks Nikse for all responses.
@Music Fan:
1) the crash should be fixed here: http://www.nikse.dk/SubtitleEdit.zip (on src on github + next release)
Could you verify?
That's a little bit better : files that crashed yesterday can now be parsed but the crash happens after : when I export in Blu-ray sup (without OCR) or when launching OCR.:o
2) Could I check the file (email/msg)?
Here is a 1080i file that can be opened and "OCRed" ;
http://www72.zippyshare.com/v/94029699/file.html
You should be able to play it in VLC and MPC-HC to see the subtitles size (activate the french sub if not done automatically).
When exported in 1080p Blu-ray sup (whatever the bottom line setting) and remuxed in TS with TSmuxer, the subtitles size is much more little.
By the way, could you add the possibility to avoid the bottom line setting to have exactly the same position than in the original sub (if you solve the resize problem, because currently the position changes anyway because of the resize) ?
Perhaps VLC/MPC-HC scales the subs etc?
I don't believe so, they are displayed at the same size than with my DVB recorder.
3) yes, small fonts don't work too well in Tesseract.
Mmmh, could the other method work ?
4) English vs US? Try another dictionary.
That's french, I didn't realize I had to change the language :rolleyes: :D, that's ok now (I tried with the old 3.4.2 version, not the one you linked yesterday) :)
But a lot of t and q are considered as (
:confused:
And I had to download the french dictionnary while there was already a fr-FR.xml file in the language folder. Or does it only concern the language of the program ? If yes, where are stored the dictionnaries ?
Anyway, there are still two problems ; crash with some ts and resize in Blu-ray sup.
Music Fan
5th October 2014, 20:21
That's french, I didn't realize I had to change the language :rolleyes: :D, that's ok now (I tried with the old 3.4.2 version, not the one you linked yesterday) :)
But a lot of t and q are considered as (
:confused:
I found a trick that seems to solve this problem due to little characters size : after exporting in Blu-ray sup with SE (without OCR), I resize the sup with BDSup2Sub ("scale" setting). The text becomes bigger inside the sup which keeps its resolution (1080p sup remains in 1080p).
I chose 2 for x and y (twice the size for width and height to keep text's proportions).
Then I open this sup in SE and launch the OCR which is better done :)
Nikse555
5th October 2014, 20:42
@Music Fan: OK, nice - in the meantime I've hopefully fixed the last TS crash here: http://www.nikse.dk/SubtitleEdit.zip (also in latest src on GitHub)
Music Fan
5th October 2014, 21:02
Thanks, it doesn't crash anymore :)
I recall my other questions ;
1)did you try my ts file to see why it's resized when converted in Blu-ray sup ?
2)could you add the possibility to avoid the bottom line setting to have exactly the same position than in the original sub ?
3)in which folder are stored the dictionnaries ?
;)
minhjirachi
6th October 2014, 03:29
Can you add the subtitle preview, which like as BDSup2Sub? I think it's better when re-check all the subtitle to correct position. With current preview, I can't see nothing. Just the color, style without pos.
mantis2892
12th October 2014, 20:00
I have a problem when converting from SUP to SRT.
When there are two sets of subtitles shown, one at the top of the screen and one at the bottom at the same time, the timings of the bottom get screwed up :(
I mean lets say a subtitle appears at the top of the screen from 15:30:20 - 18:20:12, and lets say something appears at the bottom from 17:15:15 - 19:10:12. With the OCR method, the SRT file generates timecodes of 15:30:20 - 18:20:12 for the top subtitle and 18:20:12 - 19:10:12 for the bottom one.
Is there anyway to fix this (except for manually setting the timecodes of course).
minhjirachi
13th October 2014, 02:25
I have a alignment problem with this script:
Phim thuyết minh độc quyền tại
<font color=yellow><b> WWW.ABC.NET </b></font>
And with the .ass subtitle, when I export to .sup, it has lost everything like subtitle position, color, and the subtitle layers.
Still have error with time stamp when exporting as xml/png.
Subtitle Edit cannot re-load the previous setting.
Music Fan
21st October 2014, 14:01
v3.4.3 has been released 8 days ago, I don't understand why Nikse555 didn't tell us this ;
https://github.com/SubtitleEdit/subtitleedit/releases
Nikse555
21st October 2014, 19:32
He, I only updated the title of this thread.
SE 3.4.3 has some general performance improvements + some fixed memory leaks (especially in batch convert).
Also, "Compare" + "Measurement converter" are broken... not on purpose though ;)
Music Fan
21st October 2014, 22:00
1)did you try my ts file to see why it's resized when converted in Blu-ray sup ?
2)could you add the possibility to avoid the bottom line setting to have exactly the same position than in the original sub ?
3)in which folder are stored the dictionnaries ?
I answer my 3rd question ; on Windows 7, the dictionnaries for Tesseract are stored in ;
C:\Users\user name\AppData\Roaming\Subtitle Edit\Tesseract\tessdata
I needed to know this because as it is optional and installed by download when SE is running, I had to copy this folder for a pc not connected to the web.
Nikse, could you answer the 2 first questions please ? You asked me a TS file a few weeks ago, I guess you tried it.
Thunderbolt8
2nd November 2014, 13:55
again regarding .ass subs and hearing impaired removal, could you please implement some more stuff:
I'm talking about such lines as e.g.
1. {\an4\pos(691,748)}(Chuckles) Yes, ok.
2. {\an4\pos(691,748)}(radio noise)
3. {\an4\pos(691,748)}SPEAKER: text blabla.
1. if a line begins with {\ could you then please implement that "remove text between" '(' and ')' only affects that part of the line which comes after the closing bracket } ? otherwise subtitle screen position information gets deleted as well. (atm the result looks like this: {\an4\pos}Yes, ok. )
1 & 2. and also, if a line begins with {\ and is completely empty after the closing bracket } could you please change it that only such a line gets removed entirely when remove text between '{' and '}' is ticked and no such lines which still include information after the closing bracket } ?
3. and also that "remove text before a colon" does not remove text before a colon which is included in between the {\ and } part of a line. (atm the result looks like this: text blabla)
thanks!
edit: in short, ultimately its about excluding the {\an4\pos(691,748)} stuff in .ass subs from HSD removal.
Nikse555
2nd November 2014, 21:31
@minhjirachi:
I've tried to fix the center issue here: http://www.nikse.dk/SubtitleEdit.zip (beta, portable version)
What settings is not re-loaded in the above version?
Only some settings works for ass/ssa... font size, color, name etc should be used from the style.
@Music fan:
1) I cannot find other subtitles... tried a few different tools. I do believe that it's resized.
2) Not sure where you mean?
3) You can also use the menu: Spell check -> Get dictionaries -> Open dictionary folder
@Thunderbolt8: could you give some examples of the most important issue regarding remove text for HI?
Thunderbolt8
2nd November 2014, 23:42
@Nikse555 check my last posts up to august 18th. some of the former stuff might be redundant taking my last post into regard. implementing what I wrote in my last post might be the best idea atm for .ass subs.
Nikse555
3rd November 2014, 18:56
@Thunderbolt8: thx for the clarification - could you test this a bit (just updated - also in latest src on github): http://www.nikse.dk/SubtitleEdit.zip (beta, portable version)
Music Fan
3rd November 2014, 22:38
@Music fan:
1) I cannot find other subtitles... tried a few different tools. I do believe that it's resized.
2) Not sure where you mean?
3) You can also use the menu: Spell check -> Get dictionaries -> Open dictionary folder
1)Resized by players ? I don't believe so, I explained why.
Anywyay, do you know why DVB-SUB are resized by SE when converted in Blu-ray sup without OCR ?
Other possibility : DVB-SUB may have a real size different than its display size which would be stored in the header of the sup track (or may be linked to the video resolution) ; if SE ignores this header, that could explain why the sup appears more little when converted in Blu-ray sup (which would be its real size).
Otherwise, I don't understand why it becomes more little only when converted in Blu-ray sup with SE.
2)in the Blu-ray sup window, when avoiding OCR (right click in OCR window), otherwise there is a little position change and one has to test several bottom line settings to find the original subtitles position.
3)I was talking about the dictionaries used for OCR, but I found where they are located (C:\Users\user name\AppData\Roaming\Subtitle Edit\Tesseract\tessdata).
Thunderbolt8
3rd November 2014, 23:13
from what I can see that last implementation looks good so far. theres no wrong deletion of positional information within the {\ } brackets any more :)
two more things I noticed though:
1. deletion of residual hyphens '-' could still be improved (refers to both .ass and .srt subs). sometimes after SHD removal, a single or two hyphens remain (as only parts of a line), this should then be deleted as well. e.g.
{\an4\pos(782,897)}<i>- [Grunting Continues]</i> --> {\an4\pos(782,897)}<i>-</i>
works correctly:
- Hello? --> - Hello?
- [ Man ] Paul? --> - Paul?
doesnt work correctly:
- I insist.
<i>- [ Woman Laughing]</i> --> - I insist.
- Oh.
<i>- Yeah.</i> --> - <i>- Yeah.</i>
guess this might get tricky because you'd need to prevent the deletion of hyphens which are part of speech or a sentence in contrast to the deletion of hyphens as indicator of change of the speaker. could be hard in case of single lines. maybe in such a case it would be possible to distinguish between hyphens which are there from the start and such which occured only after the SHD removal process? dunno. but maybe all this could still work out without breaking the other thing :p
2. partly also refers back to the first one, is that in .ass subs lines which have the same timestamps should be regarded as belonging together in terms of SHD removal. e.g.
00:07:04,130 00:07:08,760 {\an4\pos(643,838)}[BOTH SPEAKING IN
00:07:04,130 00:07:08,760 {\an4\pos(643,904)}FOREIGN LANGUAGE]
or
00:07:08,930 00:07:12,760 {\an4\pos(521,772)}(AFU-RA FEATURING JAHDON
00:07:08,930 00:07:12,760 {\an4\pos(519,832)}& KARDINAL OFFISHALL'S
00:07:08,930 00:07:12,760 {\an4\pos(519,904)}"DEAL WIT IT" PLAYING)
get ignored because the brackets () [] dont close at the same line, but those lines are nevertheless displayed as belonging together on the screen at the same time
00:02:56,170 00:02:58,800 {\an4\pos(970,772)}MAN 1: Morning, staff sergeant.
00:02:56,170 00:02:58,800 {\an4\pos(970,653)}Morning, General.
00:02:56,170 00:02:58,800 {\an4\pos(106,905)}MAN 2: Morning, sergeant.
gets changed to:
00:02:56,170 00:02:58,800 {\an4\pos(970,772)}Morning, staff sergeant.
00:02:56,170 00:02:58,800 {\an4\pos(970,653)}Morning, General.
00:02:56,170 00:02:58,800 {\an4\pos(106,905)}Morning, sergeant.
but it should be:
00:02:56,170 00:02:58,800 {\an4\pos(970,772)}- Morning, staff sergeant.
00:02:56,170 00:02:58,800 {\an4\pos(970,653)}Morning, General.
00:02:56,170 00:02:58,800 {\an4\pos(106,905)}- Morning, sergeant.
two different persons are speaking here, but the viewer wouldnt be able to make out that difference on the screen, because adding of the hyphens (as it happens if this occurs within one single line) is missing.
00:41:51,170 00:41:53,050 {\an4\pos(708,827)}- Focus! Focus, Jean!
00:41:51,170 00:41:53,050 {\an4\pos(708,897)}- [ Rattling Stops ]
gets changed to:
00:41:51,170 00:41:53,050 {\an4\pos(708,827)}- Focus! Focus, Jean!
but it should be:
00:41:51,170 00:41:53,050 {\an4\pos(708,827)}Focus! Focus, Jean!
here we have again residual '-' because I guess both lines are not regarded as belonging together and as it seems deletion of hyphens in single lines is tricky because hyphens belonging to actual speech could be affected... notice though how in this example the last line gets deleted entirely and in the very first example at the top the hyphens still remains. could be that the line is not regarded as empty in the top example because of the italic tags (maybe a little bug).
correct SHD removal of subs, especially in case of such .ass subs can get very tricky, I know ;)
Nikse555
4th November 2014, 21:19
@Music Fan:
1) Yeah, there could surely be something SE don't know about TS DVB subtitle pictures! I cannot see anymore from the docs and the java ts parsers. Perhaps some else knows about this!?
(perhaps subtitles are added to the video - before the video player resizes the video to full screen)
The subtitles from TS files are not resized when converted to bluray sup files.
2) Hm, I guess the bottom margin could be calculated (not easy though) - but I guess it could be different for each subtitle?
@Thunderbolt8: I've tried to fix first issue (deletion of residual hyphens - thx for finding it) here: http://www.nikse.dk/SubtitleEdit.zip (beta, portable version)
Yeah, the multiple lines below each other with same time codes seems advanced...
Music Fan
5th November 2014, 10:05
@Music Fan:
1) Yeah, there could surely be something SE don't know about TS DVB subtitle pictures! I cannot see anymore from the docs and the java ts parsers. Perhaps some else knows about this!?
(perhaps subtitles are added to the video - before the video player resizes the video to full screen)
The subtitles from TS files are not resized when converted to bluray sup files.
2) Hm, I guess the bottom margin could be calculated (not easy though) - but I guess it could be different for each subtitle?
1) As the subtitles size is the same with VLC (and also MPC-HC) than with my DVB recorder, I guess VLC does not resize subtitles (or make the same resize than my DVB recorder). You could maybe look at VLC sources to see how DVB-SUB are handled ?
2) I believed all sup formats had the same resolution than videos (480, 576, 1080 ...) including a lot of transparent lines to be displayed over the video. If it's the case, there should be no need to calculate the bottom margin of original DVB-SUB because this margin is a part of subtitles (I mean lines among others ; some with text, some with only transparency).
When I convert Hd-dvd sup to Blu-ray sup with BDSup2Sub, I don't have to specify bottom margin, I believed it could be as simple for DVB-SUB (when OCR is not needed).
Thunderbolt8
5th November 2014, 17:32
@Thunderbolt8: I've tried to fix first issue (deletion of residual hyphens - thx for finding it) here: http://www.nikse.dk/SubtitleEdit.zip (beta, portable version)
Yeah, the multiple lines below each other with same time codes seems advanced...will try out the new one.
maybe it help, the multiple line stuff should only apply to .ass subs. afaik not to other formats (well, I only know .srt and .ass :D anyway)
Thunderbolt8
6th November 2014, 15:25
residual hyphen deletion looks good in most cases, but apparently there are still problems with italics:
{\an4\pos(708,897)}- [ Rattling Stops ] <-- was already fine
{\an4\pos(782,897)}<i>- [Grunting Continues]</i> --> {\an4\pos(782,897)}<i>-</i> <-- still not fine
apart from that, I found something else related to it: residual comma deletion after SHD removal :P
Oh, yes. It was, um — --> Yes. It was, —
I ended up — --> I ended up —
its working fine though in this case:
<i>- Mikey!</i> --> <i>- Mikey!</i>
<i>- Aw, man!</i> --> <i>- Man!</i>
Thunderbolt8
8th November 2014, 16:01
residual hyphen deletion looks good in most cases, but apparently there are still problems with italicsanother nice example:
- [panting] <i>Felt that one day,</i> --> - <i>Felt that one day,</i>
Thunderbolt8
14th November 2014, 18:05
and another case of a residual hyphen which should be deleted:
WOMAN: A glass of champagne, please.|- (Laughter) --> - A glass of champagne, please.
works as intended though if there was additionally actual speech information in the 2nd line instead of only SHD stuff.
Thunderbolt8
1st December 2014, 00:31
@Music Fan:
@Thunderbolt8: I've tried to fix first issue (deletion of residual hyphens - thx for finding it) here: http://www.nikse.dk/SubtitleEdit.zip (beta, portable version)
Yeah, the multiple lines below each other with same time codes seems advanced...have there been with the release of 3.4.4 (or with the commits made after) additional changes made compared to this beta above from November 4th? if so then I'd check on those subs again.
Nikse555
1st December 2014, 19:48
@Thunderbolt8: Yes, I've fixed some of the issues in 3.4.4... I hope. Please do test :)
Blueray sup reading will crash on some files (with multi image subs) in 3.4.4 but that is fixed in current c# source on github - and here also: http://www.nikse.dk/SubtitleEdit.zip (portable version, beta)
Some image export issues have also been addressed in latest beta: https://github.com/SubtitleEdit/subtitleedit/issues/343
aax
13th December 2014, 02:02
Hey nikse,
any chance you can add groupings to Multiple Replace tool? With names for each group, and checkmarks for both the whole group and individual actions inside the group.
Music Fan
13th December 2014, 12:23
There is a strange thing with export in sub/idx : the size is not the same than if I export first in Blu-ray sup then convert the sup in sub/idx, while I export both in 720p.
Is there a way to add automatically a space before ? and ! (and maybe also : and ; ) when there is no space between the last word of a sentence and these signs ?
aax
13th December 2014, 20:58
Is there a way to add automatically a space before ? and ! (and maybe also : and ; ) when there is no space between the last word of a sentence and these signs ?
Use replace, set to regular expression:
find(?<! )([:;!?])
replace with
$1
(space before dollar symbol)
You can also add this to "Tools → Multiple replace..." so you can easily run it later.
Music Fan
13th December 2014, 22:20
Thanks but that does not work.
edit : works now, I believe I forgot the space before $ ;) (multiple replace is in edit and not tools, below replace)
By the way, could you explain why you type (?<! )([:;!?]) and not only ([:;!?]) or (:;!?) (without [] ) ?
Betsy25
13th December 2014, 22:50
Thanks but that does not work.
edit : works now, I believe I forgot the space before $ ;) (multiple replace is in edit and not tools, below replace)
By the way, could you explain why you type (?<! )([:;!?]) and not only ([:;!?]) or (:;!?) (without [] ) ?
I think this is a negative lookbehind in regular expressions. the (?<! ) in front literally means "Do not match when there's (already) a space here"
http://www.regular-expressions.info/lookaround.html
The brackets [] mean "any of the following". If they would not be there, it would be seen as the string ":;!?" .
Music Fan
13th December 2014, 22:57
Ok thanks.
Music Fan
14th December 2014, 00:44
Actually I guess I can also write (?<!<:<; )([:;!?]) instead of (?<! )([:;!?]) to avoid to add space before : and ; (and not only ? and !) when there is already a space before these signs, right ?
And do you know how to do it for ... (3 points) ?
Is it something like this for find ;
(... )([...])
and this for replace with ;
$1
?
aax
14th December 2014, 02:48
Check out that Betsy's link and this cheat sheet (http://www.rexegg.com/regex-quickstart.html) for info on regular expressions.
As Betsy already said, the (?<!) is a command that will check if the characters that you put inside it come before whatever else you put next in your search string. Then it will match your search string only if they don't.
(?<!<:<; )([:;!?]) doesn't make sense in your case because it will look for :;!? that aren't precede by "<:<; ".
The expression I gave you in my first reply won't add a space before any of your characters (:;!?) if there is already a space there.
Music Fan
14th December 2014, 03:10
Ok thanks, but when you say "the characters that you put inside it", I guess you mean the characters after ?<!, thus only the space in this case.
For the 3 points, is there a solution with multiple replace or it can't work because it's 3 tree times one character and not a triple character (which doesn't even exist I think) ?
Otherwise I simply go in replace, type ..., search it and add a space before it.
aax
14th December 2014, 03:25
Ok thanks, but when you say "the characters that you put inside it", I guess you mean the characters after ?<!, thus only the space in this case.
Exactly. But check the cheat sheet, there are good examples.
For adding space before three points (a.k.a. ellipsis) when there is none this should do it:
find: (?<![\. ])\.{3}(?!\.)
replace with: space and three dots
In regular expressions dot matches any character except line breaks, so when you want it to mean an actual dot you have to "escape" it by putting a backslash before it.
Music Fan
14th December 2014, 03:51
Thanks for your explanations, I wouldn't have understood it alone, the cheat sheet is not very clear to me ;)
I guess that in your first example, replace with $1 means "let the character, no matter which of the 4, but add a space in front of it", because there are four characters and none is in the replace with line.
aax
14th December 2014, 04:25
Parentheses mark groups that can be used later. You can put them wherever it's convenient. Then you can recall the first group with $1, the second with $2 and so on. (But lookbehind and lookahead don't count as groups because parentheses are a part of their commands, that's why you're replacing with $1 and not $2.)
Music Fan
14th December 2014, 17:53
Ok, interesting.
Thunderbolt8
25th December 2014, 17:43
I Have two files .srt files attached which still show problems with SHD removal. that should make it easier for you to see for yourself first if the changes you made are doing what they are supposed to do http://www.sendspace.com/filegroup/aL1hsk990Hrz6JX%2BwXUNmg
in case of the 'halloween' one there are quite a few lines which should get removed but are not:
? My Paul ?
? I give you all ?
? No keys ?
those files remain completely untouched despite of the remove text between '?' and '?' option. maybe its because of the spaces? anyway, those lines should be removed.
the 2nd one 'safe' is much more tricky. there is still quite some stuff which is not affected correctly by SHD removal (maybe you forgot to add stuff you added for ( ) brackets to affect other types of brackets as well [ ] ?)
<i>- ♪♪[ Upbeat Dance ]</i>
- Big push. Four more.
--> if "remove text if it contains ♪" is ticked both lines get removed even though only the first one and the hyphen of the 2nd line should be. if unticked, the result is:
<i>- </i>
- Big push. Four more.
I'm sorry. I, um — --> I'm sorry. I, —
--> should be: I'm sorry. I— (afaik this already works(?) if there was a regular hyphen - or double hyphen -- instead of that special character for a hyphen)
<i>- a man who wants to make his mark...
- [ Coughing]</i>
--> <i>- a man who wants to make his mark...</i> --> the hyphen needs to be removed.
<i>- Mm-hmm.</i>
- in my spare time.
--> <i>- </i>
- in my spare time.
--> first line and hyphen of 2nd line need to be removed.
there was also a false positive in another subtitle track which was:
- And you?
- I —
--> And you?
should be: untouched, the 2nd line is fine as it is.
Music Fan
25th December 2014, 19:41
I have problems with multiple replace, I don't find the codes to replace some signs or set of signs.
I'dl ike to replace
7
(space and seven)
by
?
because some ? were interpreted as 7
And also
'?
by
?
The apostrophe shouldn't be there.
and
' ?
by
?
Same with space between ' and ?
Thanks.
edit : actually it works if I choose normal instead of regular expression. Some corrections need to choose normal to work, others need regular expression, I still have to understand differences between these settings.
Nikse555
30th December 2014, 14:37
@Thunderbolt8: thx for the info and testing :)
New beta version is here: http://www.nikse.dk/SubtitleEdit.zip
Let me know how it works.
@Music Fan: Regular expressions are not normal strings but more like a programming language... check this tutorial: http://www.codeproject.com/Articles/9099/The-Minute-Regex-Tutorial
Also, you can right-click in the "Find" textbox in SE for some common expressions (they are kinda hard to remember...)
Thunderbolt8
30th December 2014, 23:38
thanks. However, I have a problem accessing the hearing impaired removal screen with the 'safe' subtitle file now after replacing the old files with the new beta ones. but this seems only to affect this specific file as it works fine in case of other ones. I get this unhandled exception message:
See the end of this message for details on invoking
just-in-time (JIT) debugging instead of this dialog box.
************** Exception Text **************
System.ArgumentOutOfRangeException: Index and count must refer to a location within the string.
Parameter name: count
at System.String.Remove(Int32 startIndex, Int32 count)
at Nikse.SubtitleEdit.Logic.Forms.RemoveTextForHI.RemoveTextFromHearImpaired(String text)
at Nikse.SubtitleEdit.Forms.FormRemoveTextForHearImpaired.GeneratePreview()
at Nikse.SubtitleEdit.Forms.Main.RemoveTextForHearImparedToolStripMenuItemClick(Object sender, EventArgs e)
at System.Windows.Forms.ToolStripMenuItem.OnClick(EventArgs e)
at System.Windows.Forms.ToolStripItem.HandleClick(EventArgs e)
at System.Windows.Forms.ToolStripMenuItem.ProcessCmdKey(Message& m, Keys keyData)
at System.Windows.Forms.ToolStripManager.ProcessShortcut(Message& m, Keys shortcut)
at System.Windows.Forms.Form.ProcessCmdKey(Message& msg, Keys keyData)
at System.Windows.Forms.Control.ProcessCmdKey(Message& msg, Keys keyData)
at System.Windows.Forms.ContainerControl.ProcessCmdKey(Message& msg, Keys keyData)
at System.Windows.Forms.Control.ProcessCmdKey(Message& msg, Keys keyData)
at System.Windows.Forms.Control.ProcessCmdKey(Message& msg, Keys keyData)
at System.Windows.Forms.ContainerControl.ProcessCmdKey(Message& msg, Keys keyData)
at System.Windows.Forms.Control.ProcessCmdKey(Message& msg, Keys keyData)
at System.Windows.Forms.Control.ProcessCmdKey(Message& msg, Keys keyData)
at System.Windows.Forms.Control.ProcessCmdKey(Message& msg, Keys keyData)
at System.Windows.Forms.Control.ProcessCmdKey(Message& msg, Keys keyData)
at System.Windows.Forms.ContainerControl.ProcessCmdKey(Message& msg, Keys keyData)
at System.Windows.Forms.Control.ProcessCmdKey(Message& msg, Keys keyData)
at System.Windows.Forms.Control.ProcessCmdKey(Message& msg, Keys keyData)
at System.Windows.Forms.Control.PreProcessMessage(Message& msg)
at System.Windows.Forms.Control.PreProcessControlMessageInternal(Control target, Message& msg)
at System.Windows.Forms.Application.ThreadContext.PreTranslateMessage(MSG& msg)
************** Loaded Assemblies **************
mscorlib
Assembly Version: 4.0.0.0
Win32 Version: 4.0.30319.34014 built by: FX45W81RTMGDR
CodeBase: file:///C:/Windows/Microsoft.NET/Framework64/v4.0.30319/mscorlib.dll
----------------------------------------
SubtitleEdit
Assembly Version: 3.4.4.115
Win32 Version: 3.4.4.115
CodeBase: file:///C:/Program%20Files%20(x86)/Subtitle%20Edit/SubtitleEdit.exe
----------------------------------------
System.Windows.Forms
Assembly Version: 4.0.0.0
Win32 Version: 4.0.30319.33440 built by: FX45W81RTMREL
CodeBase: file:///C:/Windows/Microsoft.Net/assembly/GAC_MSIL/System.Windows.Forms/v4.0_4.0.0.0__b77a5c561934e089/System.Windows.Forms.dll
----------------------------------------
System.Drawing
Assembly Version: 4.0.0.0
Win32 Version: 4.0.30319.33440 built by: FX45W81RTMREL
CodeBase: file:///C:/Windows/Microsoft.Net/assembly/GAC_MSIL/System.Drawing/v4.0_4.0.0.0__b03f5f7f11d50a3a/System.Drawing.dll
----------------------------------------
System
Assembly Version: 4.0.0.0
Win32 Version: 4.0.30319.34239 built by: FX452RTMGDR
CodeBase: file:///C:/Windows/Microsoft.Net/assembly/GAC_MSIL/System/v4.0_4.0.0.0__b77a5c561934e089/System.dll
----------------------------------------
System.Xml
Assembly Version: 4.0.0.0
Win32 Version: 4.0.30319.34230 built by: FX452RTMGDR
CodeBase: file:///C:/Windows/Microsoft.Net/assembly/GAC_MSIL/System.Xml/v4.0_4.0.0.0__b77a5c561934e089/System.Xml.dll
----------------------------------------
System.Core
Assembly Version: 4.0.0.0
Win32 Version: 4.0.30319.33440 built by: FX45W81RTMREL
CodeBase: file:///C:/Windows/Microsoft.Net/assembly/GAC_MSIL/System.Core/v4.0_4.0.0.0__b77a5c561934e089/System.Core.dll
----------------------------------------
System.Configuration
Assembly Version: 4.0.0.0
Win32 Version: 4.0.30319.33440 built by: FX45W81RTMREL
CodeBase: file:///C:/Windows/Microsoft.Net/assembly/GAC_MSIL/System.Configuration/v4.0_4.0.0.0__b03f5f7f11d50a3a/System.Configuration.dll
----------------------------------------
Accessibility
Assembly Version: 4.0.0.0
Win32 Version: 4.0.30319.33440 built by: FX45W81RTMREL
CodeBase: file:///C:/Windows/Microsoft.Net/assembly/GAC_MSIL/Accessibility/v4.0_4.0.0.0__b03f5f7f11d50a3a/Accessibility.dll
----------------------------------------
************** JIT Debugging **************
To enable just-in-time (JIT) debugging, the .config file for this
application or computer (machine.config) must have the
jitDebugging value set in the system.windows.forms section.
The application must also be compiled with debugging
enabled.
For example:
<configuration>
<system.windows.forms jitDebugging="true" />
</configuration>
When JIT debugging is enabled, any unhandled exception
will be sent to the JIT debugger registered on the computer
rather than be handled by this dialog box.
Nikse555
31st December 2014, 00:45
@Thunderbolt8: thx again - the crash should now be fixed: http://www.nikse.dk/SubtitleEdit.zip
Thunderbolt8
31st December 2014, 15:09
thanks for fixing. I tested the 'safe' subtitle file again and some problems still persist:
line 30:
<i>- ♪♪[ Upbeat Dance ]</i>
- Big push. Four more.
when unticking the "remove text if it contains ♪" box then the result is like this:
<i></i>- Big push. Four more.
the hyphen between the brackets is gone, but the brackets are still there (which itself shouldnt be that bad as it shouldnt show during watching) and the 2nd hyphen is still there as well which needs to be removed (e.g. italic boxes with no information between them should be disregarded as textual information of a line so you would have a single hyphen in a single line which then can get removed)
then the problem with , — still exists:
line 62 (or 363 or others)
- I just, uh —
- What?
--> - I just, —
- What?
the comma and the space after that still need to go. (or at least the comma, the space is debatable. dont know if its possible to implement this but the best solution would be to check whether there always is a space in front of the - or — symbol in other examples in the corresponding subtitle track or not and then correct it following the pattern, either with removing the space or leaving it in. but this might be pretty advanced stuff :) )
line 340
- Oh — Oh, my God!
- Oh, my God.
--> - — My God!
- My God.
this one is really tricky, I dont really know if its possible to fix this without breaking anything else. in this case the 2nd hyphen would have to go because both words in front of and after it will be removed as well (while usually such a hyphen as sign of stuttering etc. shouldnt be removed). guess its not a biggie if this cannot be fixed without breaking anything else.
happy new year :)
Music Fan
31st December 2014, 16:27
the hyphen between the brackets is gone, but the brackets are still there (which itself shouldnt be that bad as it shouldnt show during watching) and the 2nd hyphen is still there as well which needs to be removed (e.g. italic boxes with no information between them should be disregarded as textual information of a line so you would have a single hyphen in a single line which then can get removed)
Did you try to remove it by the replace function ?
Thus you make it in 2 steps ;
1) do what you did (to remove text between the brackets)
2) then replace <i></i>- by - (to remove the brackets)
But I'm not sure it will work, I don't know if brackets can be considered as text.
Thunderbolt8
31st December 2014, 20:05
I already have some stuff added there, but a proper SHD removal would be preferred.
Nikse555
1st January 2015, 01:48
Happy new year :)
@Thunderbolt8: thx - new version up again - http://www.nikse.dk/SubtitleEdit.zip (portable version, beta)
Thunderbolt8
1st January 2015, 13:38
thank you!
quick feedback: example 2 and 3 of post number 294 are fixed, example 1 is still the same as before
Thunderbolt8
2nd January 2015, 16:56
would it please be possible for you to try to work at these SHD fixes again? topic was taking multiple (2-3) lines in .ass (only!) subs into account for SHD removal, because they appear at the screen at exactly the same time and can potentially appear clumbed together so you wouldn't know theres a change of speaker after current SHD removal.
01:08:40.570 1:08:42.660 {\an4\pos(556,723)}MAN1: What's wrong with you?
01:08:40.570 1:08:42.660 {\an4\pos(841,823)}MAN2: You crazy?
01:08:40.570 1:08:42.660 {\an4\pos(838,897)}You outta your mind?
which gets changed to
01:08:40.570 1:08:42.660 {\an4\pos(556,723)}What's wrong with you?
01:08:40.570 1:08:42.660 {\an4\pos(841,823)}You crazy?
01:08:40.570 1:08:42.660 {\an4\pos(838,897)}You outta your mind?
but should be:
01:08:40.570 1:08:42.660 {\an4\pos(556,723)}- What's wrong with you?
01:08:40.570 1:08:42.660 {\an4\pos(841,823)}- You crazy?
01:08:40.570 1:08:42.660 {\an4\pos(838,897)}You outta your mind?
in case this proves to be difficult I guess you could have a look at Subextractor http://subextractor.codeplex.com/ which is able to do this when you save OCRed file as .ass and you tick SHD removal. maybe there is something which can help you implementing this in SubtitleEdit ;)
Bozotheclown
14th January 2015, 23:39
Hi,
Very nice subeditor. Especially for bitmap subs. (read and ocr)
I'am looking for solution to ocr'ed subs from dvb recorded stream. CCextractor - failed, SubEdit - do the job.
I want to ask if preview for original bitmap subs (like in ocr window) is possible during spell checking (an option to load original sub in spellcheck window).
vBulletin® v3.8.11, Copyright ©2000-2026, vBulletin Solutions Inc.