Old 7th September 2011, 00:23   #421  |  Link
manolito
Registered User
Join Date: Sep 2003
Location: Berlin, Germany
Posts: 3,079
Just a thought:
Have you tried to use norm instead of gain -n?

Cheers
manolito
Old 7th September 2011, 00:56   #422  |  Link
LoRd_MuldeR
Software Developer
Join Date: Jun 2005
Location: Last House on Slunk Street
Posts: 13,248
The global "--norm" option is basically a shortcut for inserting the "gain -n" filter. Likewise, the "norm" filter is nothing but an alias for "gain -n".

And yes, I tried it, just to be sure. But same effect
__________________
Go to https://standforukraine.com/ to find legitimate Ukrainian Charities 🇺🇦✊

Last edited by LoRd_MuldeR; 7th September 2011 at 01:06.
Old 17th September 2011, 20:51   #423  |  Link
LoRd_MuldeR
Software Developer
Join Date: Jun 2005
Location: Last House on Slunk Street
Posts: 13,248
LameXP v4.03 Beta-2:
http://sourceforge.net/projects/lame...29/2011-09-17/

Quote:
Changes between v4.02 and v4.03:
* Added an option to rename the output files (based on a user-defined naming pattern)
* Added an option to enforce Stereo Downmix for Multi-Channel sources
* Added "built-in" WMA decoder (see this thread for details) and removed all remnants of "old" decoder
* Added optional support for the FHG AAC Encoder included with Winamp 5.62 (see FAQ doc for details)
* Added a menu for bookmarking "favorite" output folders to the "output folder" tab
* Updated Qt runtime libraries to v4.8.0 Beta-1 (2011-07-19), compiled with MSVC 10.0
* Updated MediaInfo to v0.7.49 (2011-09-09), compiled with MSVC 10.0
* Updated language files (big thank-you to all contributors !!!)
* Improved "downmix" filter by using explicit channel mappings for each number of input channels
* Fixed Cue Sheet import for tracks with certain characters in the title
* Fixed a problem with the "normalization" filter that sometimes caused the resulting file to be empty
* Workaround for malicious "anti-virus" programs that prevent innocent applications from functioning
* Enabled "Aero Glass" theme in installer and web-update program (Vista and Windows 7 only)
* Restored Windows 2000 support with Visual Studio 2010 builds (this is experimental!)
* The "Open File(s)" and "Open Folder" dialogs will now remember the most recent directory
* Miscellaneous bugfixes
[UPDATE] Build #687 reverted NSIS to v2.46.1, because NSIS v2.46.2 broke Win2k support. Apart from that, build #687 is identical to build #686. [/UPDATE]
Last edited by LoRd_MuldeR; 17th September 2011 at 23:31.
Old 18th September 2011, 17:56   #424  |  Link
b66pak
Registered User
Join Date: Aug 2008
Location: The Land Of Dracula (Romania - EU)
Posts: 934
Here is the source code, under the GNU General Public License, for a very fast normalizer. It only supports 8/16-bit WAV files with a simple header, up to 4 GB...
_
__________________
if you ask a question and somebody give you the correct answer don't forget to leave a "thank you" note...
Visit The Land Of Dracula (Romania - EU)!
Old 19th September 2011, 21:30   #425  |  Link
LoRd_MuldeR
Software Developer
Join Date: Jun 2005
Location: Last House on Slunk Street
Posts: 13,248
LameXP running on Windows 8:
http://img841.imageshack.us/img841/2341/lamexpwin8.jpg

Unfortunately it seems that some of the tools, including MediaInfo, will crash right away on Windows 8. Bummer!

[EDIT] Seems like only the 64-Bit tools will crash. And yes, I'm using the x64 edition of Win8. [/EDIT]

[EDIT2] It appears that MPRESS causes the issue. The original 64-Bit binary works fine, while the MPRESS-compressed one crashes right away. [/EDIT2]

[EDIT3] As a temporary workaround you can use "--force-cpu-no-64bit" with the latest build in order to make LameXP work on Win8 64-Bit. [/EDIT3]
Last edited by LoRd_MuldeR; 20th September 2011 at 00:00.
Old 21st September 2011, 04:59   #426  |  Link
MajorX
Registered User
 
Join Date: Mar 2010
Posts: 52
Nice GUI.
Does LameXP support encoding audio from video files like *.mkv or *.avi? If not, is it possible to add it to LameXP?
Old 21st September 2011, 11:10   #427  |  Link
LoRd_MuldeR
Software Developer
Join Date: Jun 2005
Location: Last House on Slunk Street
Posts: 13,248
Quote:
Originally Posted by MajorX View Post
Nice GUI.
Does LameXP support encoding audio from video files like *.mkv or *.avi?
Nope. But Avisynth-input (audio only) is supported. So you can encode the audio from your video files that way...

Use a simple script, such as:
FFAudioSource("C:\Some Path\Input.mkv")

Quote:
Originally Posted by MajorX View Post
If not, is it possible to add it to LameXP?
Possible, maybe. We would need to integrate a tool that can split the video file and "extract" the audio stream (e.g. MKVExtract for MKV files) for further processing.

But this additional "splitting" step doesn't fit into the current design of the software. So this would require bigger changes in the code...
Old 29th September 2011, 22:13   #428  |  Link
LoRd_MuldeR
Software Developer
Join Date: Jun 2005
Location: Last House on Slunk Street
Posts: 13,248
LameXP v4.03 Beta-3:

Quote:
Changes between v4.02 and v4.03:
* Added an option to rename the output files (based on a user-defined naming pattern)
* Added an option to enforce Stereo Downmix for Multi-Channel sources
* Added "built-in" WMA decoder (see this thread for details) and removed all remnants of "old" decoder
* Added optional support for the FHG AAC Encoder included with Winamp 5.62 (see FAQ doc for details)
* Added a menu for bookmarking "favorite" output folders to the "output folder" tab
* Updated Qt runtime libraries to v4.8.0 Beta-1 (2011-07-19), compiled with MSVC 10.0
* Updated MediaInfo to v0.7.49 (2011-09-09), compiled with MSVC 10.0
* Updated language files (big thank-you to all contributors !!!)
* Improved "downmix" filter by using explicit channel mappings for each number of input channels
* Fixed Cue Sheet import for tracks with certain characters in the title
* Fixed a problem with the "normalization" filter that sometimes caused the resulting file to be empty
* Workaround for malicious "anti-virus" programs that prevent innocent applications from functioning
* Enabled "Aero Glass" theme in installer and web-update program (Vista and Windows 7 only)
* Restored Windows 2000 support with Visual Studio 2010 builds (this is experimental!)
* The "Open File(s)" and "Open Folder" dialogs will now remember the most recent directory
* Miscellaneous bugfixes
Last edited by LoRd_MuldeR; 16th October 2011 at 19:01.
Old 4th October 2011, 01:07   #429  |  Link
manolito
Registered User
Join Date: Sep 2003
Location: Berlin, Germany
Posts: 3,079
Normalizing with SoX

Just noticed an unnecessary annoyance with the SoX "-ne" parameter. Whenever this parameter is used, SoX switches to 2-pass mode which of course doubles the filtering time.

While this is probably unavoidable for multichannel sources, it is complete nonsense to do 2 passes for stereo sources. SoX by itself is not smart enough to switch from "-ne" to just "-n" and does 2 passes on stereo sources which is a big waste of time.

Could you add some code to LameXP to use "-ne" conditionally only if the source has more than 2 channels?
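
A minimal sketch of that condition in Python, just to illustrate the idea. This is not LameXP's actual code: the function name is made up, and it assumes the channel count of the source is already known (e.g. from MediaInfo). Only the "gain -n" / "gain -ne" options mentioned in this thread are used.

```python
def sox_normalize_args(infile, outfile, channels, level_db=-0.5):
    """Build a SoX argument list for normalization (hypothetical helper)."""
    # Proposed policy: only sources with more than 2 channels get "-ne";
    # mono and stereo sources fall back to plain "-n".
    flag = "-ne" if channels > 2 else "-n"
    return ["sox", infile, outfile, "gain", flag, f"{level_db:.2f}"]


print(sox_normalize_args("in.wav", "out.wav", channels=2))
print(sox_normalize_args("in.wav", "out.wav", channels=6))
```

Returning a list rather than one string avoids quoting problems with paths that contain spaces.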


Cheers
manolito
Old 4th October 2011, 01:17   #430  |  Link
LoRd_MuldeR
Software Developer
Join Date: Jun 2005
Location: Last House on Slunk Street
Posts: 13,248
Well, the whole reason why we switched from "-n" to "-ne" is that "-n" is buggy. So, unfortunately, I think we cannot switch back (for now).

(Also, to my understanding, normalization always requires two passes. That's because the maximum peak in the file needs to be determined first, before the actual processing can start. Normalizing only within a local window would effectively "compress" the dynamics. IMHO that's not "normalization" any more)
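
To illustrate why two passes are needed, here is a minimal Python sketch of plain peak normalization. It works on an in-memory list of float samples and is only meant to show the principle, not how SoX implements it; the function name is made up.

```python
def peak_normalize(samples, target_peak=1.0):
    """Scale float samples so the largest magnitude equals target_peak."""
    # Pass 1: the complete signal must be scanned to find the maximum peak.
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)  # pure silence (or empty input): nothing to scale
    # Pass 2: one constant gain is applied to every sample.
    gain = target_peak / peak
    return [s * gain for s in samples]


print(peak_normalize([0.1, -0.5, 0.25]))
```

Because the gain is a single constant for the whole file, the dynamics are untouched, which is the difference to normalizing within a local window.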
Last edited by LoRd_MuldeR; 4th October 2011 at 13:16.
Old 4th October 2011, 14:35   #431  |  Link
manolito
Registered User
Join Date: Sep 2003
Location: Berlin, Germany
Posts: 3,079
Quote:
Originally Posted by LoRd_MuldeR View Post
Well, the whole reason why we switched from "-n" to "-ne" is that "-n" is buggy. So, unfortunately, I think we cannot switch back (for now).
Did you ever encounter any problems with the "-n" parameter for stereo files? I certainly didn't.

Quote:
Originally Posted by LoRd_MuldeR View Post
(Also, to my understanding, normalization always requires two passes. That's because the maximum peak in the file needs to be determined first, before the actual processing can start. Normalizing only within a local window would effectively "compress" the dynamics. IMHO that's not "normalization" any more)
Alright, point taken, but then the "-ne" parameter invokes a third pass...
How else would you explain the performance hit when using "-ne" compared to just "-n" for stereo sources? I just did a couple of tests, and the results are reproducible.

Code:
Source: 2-ch WAV 1.4 GB
Convert 2-ch WAV to normalized 2-ch WAV
Commandline: SoX.exe infile outfile gain -n(e) -0.50

Execution time for "-ne": 16min 20sec
Execution time for "-n":  12min 40sec
To make things worse, the resulting normalized files are NOT identical (which they should be).

So I would still lobby for using "-ne" only for multichannel sources...


Cheers
manolito
Old 4th October 2011, 14:46   #432  |  Link
LoRd_MuldeR
Software Developer
Join Date: Jun 2005
Location: Last House on Slunk Street
Posts: 13,248
Quote:
Originally Posted by manolito View Post
Did you ever encounter any problems with the "-n" parameter for stereo files? I certainly didn't
I did not encounter any problems with "-n" and Stereo files either. But that's probably just by chance

Quote:
Originally Posted by manolito View Post
To make things worse, the resulting normalized files are NOT identical (which they should be).
Well, "-ne" works slightly differently from "-n". As far as I understand:

The latter will normalize all samples in all channels by the same factor, thus it is limited by the loudest sample in the loudest channel.

The former, in contrast, will normalize all samples within a channel by the same factor, but may choose a different factor for each channel.

Consequently, different results are to be expected with "-ne" iff the peak values differ between your channels...
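
A small Python sketch of the two behaviours as described above. It models my understanding of "-n" versus "-ne", not the actual SoX source; the function names are made up, and it assumes no channel is completely silent.

```python
def peak(channel):
    """Largest absolute sample value in one channel."""
    return max(abs(s) for s in channel)


def normalize_common(channels):
    """One gain for all channels, limited by the loudest sample overall."""
    gain = 1.0 / max(peak(ch) for ch in channels)
    return [[s * gain for s in ch] for ch in channels]


def normalize_per_channel(channels):
    """A separate gain per channel, so that every channel peaks at 1.0."""
    return [[s / peak(ch) for s in ch] for ch in channels]


left, right = [0.5, -0.25], [0.25, 0.125]
print(normalize_common([left, right]))       # right channel stays 6 dB below left
print(normalize_per_channel([left, right]))  # both channels now peak at 1.0
```

With identical peaks in both channels the two functions return the same result; they only diverge when the peaks differ.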
Last edited by LoRd_MuldeR; 4th October 2011 at 16:21.
Old 4th October 2011, 16:39   #433  |  Link
manolito
Registered User
Join Date: Sep 2003
Location: Berlin, Germany
Posts: 3,079
Quote:
Originally Posted by LoRd_MuldeR View Post
At the same time the former will normalize all samples in a channel by the same factor, but may choose a different factor for each channel.
From the SoX manual:
Quote:
Given the -e option, the levels of the audio channels of a multi-channel file are ‘equalised’, i.e. gain is applied to all channels other than that with the highest peak level, such that all channels attain the same peak level.
The question is if the SoX guys regard a stereo file as "multi-channel". If they do then the "-ne" parameter will inevitably change the stereo balance of the source which is REAL BAD. Makes it unusable for stereo files.

Maybe the "-nb" option would be more suitable for stereo sources because it uses RMS values instead of peaks. This will result in a file with an even stereo balance (both channels have the same RMS value). This is often quite desirable, but probably not always...


Cheers
manolito
Old 4th October 2011, 16:48   #434  |  Link
LoRd_MuldeR
Software Developer
Join Date: Jun 2005
Location: Last House on Slunk Street
Posts: 13,248
Quote:
Originally Posted by manolito View Post
If they do then the "-ne" parameter will inevitably change the stereo balance of the source which is REAL BAD. Makes it unusable for stereo files.
It would do so, if the Stereo channels have different peak levels.

And I think, except for very rare cases, the peak levels of the channels of a Stereo recording should be identical (and should be equalized, if they differ).
Old 4th October 2011, 17:27   #435  |  Link
b66pak
Registered User
Join Date: Aug 2008
Location: The Land Of Dracula (Romania - EU)
Posts: 934
No, they are not. In my experience there is always a difference of 0.5 to 1 dB between the left and right channels...

For 5.1 it is even worse: center vs. front left/right differs by 3-4 dB, and center vs. side/back left/right by 4-8 dB...
Old 4th October 2011, 17:35   #436  |  Link
manolito
Registered User
Join Date: Sep 2003
Location: Berlin, Germany
Posts: 3,079
Quote:
Originally Posted by LoRd_MuldeR View Post
And I think, except for very rare cases, the peak levels of the channels of a Stereo recording should be identical (and should be equalized, if they differ).
This is not true at all (maybe for classical recordings). Most modern instruments (e.g. synthesizers, electric guitar and bass, percussion) generate high peaks which do not contribute to the perceived loudness of the track. Unless brutal compression or brickwall limiting is used to achieve maximum loudness, the peak levels for the 2 channels will almost never be identical while the acoustic balance is perfect.

I have a background as a recording engineer (many years ago in the good old analog times), but if you do not believe me, test it for yourself:

Take some of your favorite audio records (good quality please, maybe jazz), load them into WaveLab and do a "Global Analysis". This will give you the peak levels plus the RMS levels for each track (separately for each channel).


Cheers
manolito
Old 4th October 2011, 18:15   #437  |  Link
LoRd_MuldeR
Software Developer
Join Date: Jun 2005
Location: Last House on Slunk Street
Posts: 13,248
Yes, I know that "sample value" and "perceived loudness" aren't the same at all and that RMS is a better loudness indicator. And I would never suggest using the sample value to judge the loudness of a track. But we are not talking about average values here. Not even about average values for small segments. It's all about peak (maximum) values for complete tracks.

Sure, there are "high peaks which do not contribute to the perceived loudness of the track". And the number of such peaks may be very different between the channels. But each track/channel contains millions of individual sample values. So usually there is at least one of these "high" peaks in each channel (and one is sufficient for the maximum calculation). Actually, my finding is that the maximum sample value generally is identical (or pretty close) between both channels of a Stereo recording, while both the average and especially the maximum RMS can differ significantly between the channels...

Example:
http://img834.imageshack.us/img834/3...xpeakvalue.png
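
To make the peak vs. RMS distinction concrete, here is a small Python sketch. The two channels are made-up toy data (not taken from the screenshot), and the helper names are only for illustration; levels are in dB relative to a full-scale value of 1.0.

```python
import math


def peak_db(channel):
    """Peak level in dB relative to full scale (1.0)."""
    return 20 * math.log10(max(abs(s) for s in channel))


def rms_db(channel):
    """RMS level in dB relative to full scale (1.0)."""
    mean_square = sum(s * s for s in channel) / len(channel)
    return 20 * math.log10(math.sqrt(mean_square))


# Made-up channels: each has one full-scale peak, but very different energy.
left = [1.0] + [0.5] * 99
right = [1.0] + [0.05] * 99

for name, ch in (("left", left), ("right", right)):
    print(f"{name}: peak {peak_db(ch):.2f} dB, RMS {rms_db(ch):.2f} dB")
```

A single full-scale sample is enough to put both peak levels at 0 dB, while the RMS levels end up more than 13 dB apart.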
Last edited by LoRd_MuldeR; 4th October 2011 at 19:13.
Old 4th October 2011, 19:26   #438  |  Link
b66pak
Registered User
Join Date: Aug 2008
Location: The Land Of Dracula (Romania - EU)
Posts: 934
It's time for some examples...


stereo (normalized after downmixing from 5.1)

Quote:
[20:43:30.796] Commandline: sox -V0 -t raw -L -s -b 16 -c 6 -r 48000 - -n remix -m 1v0.3694,3v0.2612,5v0.3694 2v0.3694,3v0.2612,6v0.3694 gain -n stats

[20:48:21.750] Overall Left Right

[20:48:21.750] DC offset -0.000005 -0.000005 -0.000005
[20:48:21.750] Min level -0.934752 -0.934752 -0.776695
[20:48:21.750] Max level 1.000000 1.000000 0.872530

[20:48:21.750] Pk lev dB 0.00 0.00 -1.18

[20:48:21.750] RMS lev dB -26.14 -26.14 -26.15
[20:48:21.750] RMS Pk dB -10.96 -11.04 -10.96
[20:48:21.750] RMS Tr dB -1.#J -1.#J -1.#J
[20:48:21.750] Crest factor - 20.28 17.70
[20:48:21.750] Flat factor 0.00 0.00 0.00
[20:48:21.750] Pk count 2 2 2
[20:48:21.750] Bit-depth 32/32 32/32 32/32
[20:48:21.750] Num samples 124M
[20:48:21.750] Length s 2582.272
[20:48:21.750] Scale max 1.000000
[20:48:21.750] Window s 0.050
[20:48:21.765] Process terminated with code: 0
[20:48:21.765] Execution took 4 minute(s), 50 second(s).



[20:48:22.406] Commandline: sox -V0 -t raw -L -s -b 16 -c 6 -r 48000 - -n remix -m 1v0.3694,3v0.2612,5v0.3694 2v0.3694,3v0.2612,6v0.3694 gain -n stats

[20:51:12.343] Overall Left Right

[20:51:12.343] DC offset -0.000002 -0.000001 -0.000002
[20:51:12.343] Min level -1.000000 -1.000000 -0.930617
[20:51:12.343] Max level 0.958940 0.958940 0.937415

[20:51:12.343] Pk lev dB 0.00 0.00 -0.56

[20:51:12.343] RMS lev dB -25.23 -25.23 -25.23
[20:51:12.343] RMS Pk dB -9.66 -10.45 -9.66
[20:51:12.343] RMS Tr dB -1.#J -1.#J -1.#J
[20:51:12.343] Crest factor - 18.27 17.11
[20:51:12.343] Flat factor 0.00 0.00 0.00
[20:51:12.343] Pk count 2 2 2
[20:51:12.343] Bit-depth 32/32 32/32 32/32
[20:51:12.343] Num samples 126M
[20:51:12.343] Length s 2631.840
[20:51:12.343] Scale max 1.000000
[20:51:12.343] Window s 0.050
[20:51:12.343] Process terminated with code: 0
[20:51:12.343] Execution took 2 minute(s), 49 second(s).



[20:51:12.953] Commandline: sox -V0 -t raw -L -s -b 16 -c 6 -r 48000 - -n remix -m 1v0.3694,3v0.2612,5v0.3694 2v0.3694,3v0.2612,6v0.3694 gain -n stats

[20:55:57.062] Overall Left Right

[20:55:57.062] DC offset -0.000003 -0.000002 -0.000003
[20:55:57.062] Min level -0.966295 -0.940450 -0.966295
[20:55:57.062] Max level 1.000000 0.951579 1.000000

[20:55:57.062] Pk lev dB 0.00 -0.43 0.00

[20:55:57.062] RMS lev dB -24.49 -24.46 -24.52
[20:55:57.062] RMS Pk dB -8.03 -8.33 -8.03
[20:55:57.062] RMS Tr dB -133.10 -132.31 -133.10
[20:55:57.062] Crest factor - 15.90 16.82
[20:55:57.062] Flat factor 0.00 0.00 0.00
[20:55:57.062] Pk count 2 2 2
[20:55:57.062] Bit-depth 32/32 32/32 32/32
[20:55:57.062] Num samples 125M
[20:55:57.062] Length s 2595.808
[20:55:57.062] Scale max 1.000000
[20:55:57.062] Window s 0.050
[20:55:57.093] Process terminated with code: 0
[20:55:57.093] Execution took 4 minute(s), 43 second(s).



[20:55:57.640] Commandline: sox -V0 -t raw -L -s -b 16 -c 6 -r 48000 - -n remix -m 1v0.3694,3v0.2612,5v0.3694 2v0.3694,3v0.2612,6v0.3694 gain -n stats

[20:58:52.781] Overall Left Right

[20:58:52.781] DC offset -0.000003 -0.000002 -0.000003
[20:58:52.781] Min level -0.983251 -0.983251 -0.936560
[20:58:52.781] Max level 1.000000 1.000000 0.970447

[20:58:52.781] Pk lev dB 0.00 0.00 -0.26

[20:58:52.781] RMS lev dB -27.89 -27.96 -27.82
[20:58:52.781] RMS Pk dB -10.44 -10.44 -11.09
[20:58:52.781] RMS Tr dB -98.63 -98.63 -98.42
[20:58:52.781] Crest factor - 24.99 23.88
[20:58:52.781] Flat factor 0.00 0.00 0.00
[20:58:52.781] Pk count 2 2 2
[20:58:52.781] Bit-depth 32/32 32/32 32/32
[20:58:52.781] Num samples 124M
[20:58:52.781] Length s 2593.312
[20:58:52.781] Scale max 1.000000
[20:58:52.781] Window s 0.050
[20:58:52.781] Process terminated with code: 0
[20:58:52.781] Execution took 2 minute(s), 54 second(s).

original 5.1

Quote:
[20:59:52.140] Commandline: sox -V0 -t raw -L -s -b 16 -c 6 -r 48000 - -n stats

[21:01:40.687] Overall Ch1 Ch2 Ch3 Ch4 Ch5 Ch6

[21:01:40.687] DC offset -0.000003 -0.000002 -0.000002 -0.000003 -0.000001 -0.000001 -0.000000
[21:01:40.687] Min level -0.464905 -0.464905 -0.443695 -0.464386 -0.321808 -0.427521 -0.333832
[21:01:40.687] Max level 0.501587 0.464935 0.481415 0.501587 0.349060 0.424225 0.382660


[21:01:40.687] Pk lev dB -5.99 -6.65 -6.35 -5.99 -9.14 -7.38 -8.34

0.00 -0.66 -0.36 0.00 -3.15 -1.39 -2.35 < this will be after normalizing


[21:01:40.687] RMS lev dB -32.58 -31.85 -31.59 -27.83 -39.44 -39.40 -40.73
[21:01:40.687] RMS Pk dB -12.62 -12.83 -12.62 -15.78 -14.06 -13.20 -19.63
[21:01:40.687] RMS Tr dB -1.#J -1.#J -1.#J -1.#J -1.#J -1.#J -1.#J
[21:01:40.687] Crest factor - 18.19 18.28 12.35 32.72 39.88 41.64
[21:01:40.687] Flat factor 12.84 0.00 0.00 0.00 17.08 4.44 0.00
[21:01:40.687] Pk count 4.33 2 3 2 14 3 2
[21:01:40.687] Bit-depth 15/16 15/16 15/16 15/16 15/16 15/16 15/16
[21:01:40.687] Num samples 124M
[21:01:40.687] Length s 2582.272
[21:01:40.687] Scale max 1.000000
[21:01:40.687] Window s 0.050
[21:01:40.687] Process terminated with code: 0
[21:01:40.687] Execution took 1 minute(s), 48 second(s).



[21:01:40.781] Commandline: sox -V0 -t raw -L -s -b 16 -c 6 -r 48000 - -n stats

[21:03:28.468] Overall Ch1 Ch2 Ch3 Ch4 Ch5 Ch6

[21:03:28.468] DC offset -0.000001 0.000000 -0.000001 -0.000000 -0.000000 -0.000000 -0.000000
[21:03:28.468] Min level -0.608521 -0.398529 -0.406006 -0.608521 -0.022552 -0.063690 -0.074982
[21:03:28.468] Max level 0.667389 0.328278 0.462891 0.667389 0.016663 0.068573 0.065765


[21:03:28.468] Pk lev dB -3.51 -7.99 -6.69 -3.51 -32.94 -23.28 -22.50

0.00 -4.48 -3.18 0.00 -29.43 -19.77 -18.99 < this will be after normalizing


[21:03:28.468] RMS lev dB -35.90 -38.52 -38.33 -29.07 -65.58 -51.39 -51.07
[21:03:28.468] RMS Pk dB -15.47 -16.51 -15.71 -15.47 -39.79 -31.00 -30.26
[21:03:28.468] RMS Tr dB -1.#J -1.#J -1.#J -1.#J -1.#J -1.#J -1.#J
[21:03:28.468] Crest factor - 33.62 38.19 18.97 42.86 25.45 26.83
[21:03:28.468] Flat factor 18.42 0.00 0.00 0.00 21.23 0.00 0.00
[21:03:28.468] Pk count 5.50 2 2 2 23 2 2
[21:03:28.468] Bit-depth 16/16 15/16 15/16 16/16 11/16 13/16 13/16
[21:03:28.468] Num samples 126M
[21:03:28.468] Length s 2631.840
[21:03:28.468] Scale max 1.000000
[21:03:28.468] Window s 0.050
[21:03:28.468] Process terminated with code: 0
[21:03:28.468] Execution took 1 minute(s), 47 second(s).



[21:03:28.937] Commandline: sox -V0 -t raw -L -s -b 16 -c 6 -r 48000 - -n stats

[21:05:14.234] Overall Ch1 Ch2 Ch3 Ch4 Ch5 Ch6

[21:05:14.234] DC offset -0.000001 -0.000000 -0.000001 -0.000000 -0.000000 -0.000001 -0.000001
[21:05:14.234] Min level -0.805939 -0.367798 -0.404785 -0.805939 -0.203033 -0.312012 -0.278168
[21:05:14.234] Max level 0.834442 0.398376 0.410004 0.834442 0.189209 0.278931 0.304901


[21:05:14.250] Pk lev dB -1.57 -7.99 -7.74 -1.57 -13.85 -10.12 -10.32

0.00 -6.42 -6.17 0.00 -12.28 -8.55 -8.75 < this will be after normalizing


[21:05:14.250] RMS lev dB -34.19 -34.00 -34.26 -28.99 -54.86 -39.13 -38.95
[21:05:14.250] RMS Pk dB -15.26 -15.62 -15.26 -15.52 -20.48 -22.44 -22.31
[21:05:14.250] RMS Tr dB -1.#J -138.72 -139.84 -147.72 -1.#J -1.#J -1.#J
[21:05:14.250] Crest factor - 19.96 21.16 23.49 112.39 28.22 27.01
[21:05:14.250] Flat factor 7.76 0.00 0.00 0.00 12.57 0.00 0.00
[21:05:14.250] Pk count 3 2 2 2 8 2 2
[21:05:14.250] Bit-depth 16/16 15/16 15/16 16/16 14/16 15/16 15/16
[21:05:14.250] Num samples 125M
[21:05:14.250] Length s 2595.808
[21:05:14.250] Scale max 1.000000
[21:05:14.250] Window s 0.050
[21:05:14.250] Process terminated with code: 0
[21:05:14.250] Execution took 1 minute(s), 45 second(s).



[21:05:14.359] Commandline: sox -V0 -t raw -L -s -b 16 -c 6 -r 48000 - -n stats

[21:06:50.937] Overall Ch1 Ch2 Ch3 Ch4 Ch5 Ch6

[21:06:50.937] DC offset -0.000002 -0.000001 -0.000002 -0.000001 -0.000000 -0.000000 -0.000000
[21:06:50.937] Min level -0.676056 -0.536987 -0.549896 -0.676056 -0.183105 -0.209625 -0.200348
[21:06:50.937] Max level 0.750793 0.552826 0.549530 0.750793 0.181732 0.236572 0.201385


[21:06:50.937] Pk lev dB -2.49 -5.15 -5.19 -2.49 -14.75 -12.52 -13.92

0.00 -2.66 -2.70 0.00 -12.26 -10.03 -11.43 < this will be after normalizing


[21:06:50.937] RMS lev dB -34.29 -33.64 -33.28 -29.03 -53.29 -44.20 -44.04
[21:06:50.937] RMS Pk dB -12.28 -12.28 -12.86 -14.12 -22.45 -24.03 -23.55
[21:06:50.937] RMS Tr dB -1.#J -107.22 -105.87 -128.05 -1.#J -193.53 -192.30
[21:06:50.937] Crest factor - 26.59 25.36 21.24 84.59 38.39 32.05
[21:06:50.937] Flat factor 0.00 0.00 0.00 0.00 0.00 0.00 0.00
[21:06:50.937] Pk count 2 2 2 2 2 2 2
[21:06:50.937] Bit-depth 16/16 16/16 16/16 16/16 14/16 14/16 14/16
[21:06:50.937] Num samples 124M
[21:06:50.937] Length s 2593.312
[21:06:50.937] Scale max 1.000000
[21:06:50.937] Window s 0.050
[21:06:50.937] Process terminated with code: 0
[21:06:50.937] Execution took 1 minute(s), 36 second(s).

-ne for stereo will do:

0.00 -1.18 > 0.00 0.00 > meaning the right channel will be amplified by 1.18 dB

0.00 -0.56 > 0.00 0.00 > meaning the right channel will be amplified by 0.56 dB

-0.43 0.00 > 0.00 0.00 > meaning the left channel will be amplified by 0.43 dB

0.00 -0.26 > 0.00 0.00 > meaning the right channel will be amplified by 0.26 dB


-ne for original 5.1 will do:

-0.66 -0.36 0.00 -3.15 -1.39 -2.35 > 0.00 0.00 0.00 0.00 0.00 0.00 > meaning the music (left/right channels) and the surround effect (side left/right channels) will be amplified a lot!

-4.48 -3.18 0.00 -29.43 -19.77 -18.99 > 0.00 0.00 0.00 0.00 0.00 0.00 > meaning the music (left/right channels) and the surround effect (side left/right channels) will be amplified a lot!

-6.42 -6.17 0.00 -12.28 -8.55 -8.75 > 0.00 0.00 0.00 0.00 0.00 0.00 > meaning the music (left/right channels) and the surround effect (side left/right channels) will be amplified a lot!

-2.66 -2.70 0.00 -12.26 -10.03 -11.43 > 0.00 0.00 0.00 0.00 0.00 0.00 > meaning the music (left/right channels) and the surround effect (side left/right channels) will be amplified a lot!
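
The arithmetic behind these rows can be reproduced with a few lines of Python. The input values are the "Pk lev dB" figures from the first 5.1 log above; the function names are made up for illustration.

```python
def after_common_gain(peaks_db):
    """Per-channel peak levels after raising the loudest channel to 0 dB."""
    top = max(peaks_db)
    return [round(p - top, 2) for p in peaks_db]


def extra_gain_for_equal_peaks(peaks_db):
    """Additional gain (dB) each channel needs so that all peaks match."""
    top = max(peaks_db)
    return [round(top - p, 2) for p in peaks_db]


first_file = [-6.65, -6.35, -5.99, -9.14, -7.38, -8.34]  # Pk lev dB, Ch1..Ch6
print(after_common_gain(first_file))           # the "after normalizing" row
print(extra_gain_for_equal_peaks(first_file))  # what equalizing adds on top
```

The second list is the per-channel boost that changes the balance between the channels.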
Last edited by b66pak; 4th October 2011 at 19:53.
Old 6th October 2011, 23:31   #439  |  Link
LoRd_MuldeR
Software Developer
Join Date: Jun 2005
Location: Last House on Slunk Street
Posts: 13,248
After all I have decided to make the "channel equalization mode" an option.

Now you can even select "-n" again, but it will still fail with (some) multi-channel files, of course.

The new build is available via auto-update...
Old 7th October 2011, 00:31   #440  |  Link
manolito
Registered User
Join Date: Sep 2003
Location: Berlin, Germany
Posts: 3,079
Thanks very much...

Could you elaborate on how the three different normalization modes (max level, max energy, none) translate to SoX parameters? Couldn't find anything in the documentation...


Cheers
manolito
Tags
aac, aotuv, flac, lame, lamexp, mp3, mp4, ogg, oggenc, opus, vorbis
