Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
|
|||||||
| View Poll Results: Metrics. | |||
| Only SSIM is a reliable metric test. |
|
4 | 9.09% |
| Only PSNR is a reliable metric test. |
|
0 | 0% |
| Both SSIM and PSNR are reliable metric test. |
|
13 | 29.55% |
| I meassure quality only by my own visual perception. |
|
27 | 61.36% |
| Voters: 44. You may not vote on this poll | |||
![]() |
|
|
Thread Tools | Display Modes |
|
|
#1 | Link |
|
Registered User
Join Date: Apr 2004
Posts: 1,025
|
Do you beleive to metric result? or Do you agree with them?
Last time SSIM and OPSNR tests became even more popular between hobbists and devs of video compression.
Personally many times I was disagree with OPSNR metrics. When the difference was +/- 0.35-0.5 db but visually I prefered the video with lower opsnr. And the difference was noticeble. My visual perception is more related with ssim metrics. I always was agree with the ssim results when the difference was noticeble . At least +/- 0.2 ssim . Last edited by IgorC; 27th April 2006 at 05:12. |
|
|
|
|
|
#3 | Link |
|
interlace this!
Join Date: Jun 2003
Location: i'm in ur transfers, addin noise
Posts: 4,267
|
i agree with the wording that "PSNR and SSIM are reliable metrics", because that statement says nothing about perceptual quality...
i judge quality with my eyes, but numbers are a very good guide. for example, when i do MPEG-2 (which is all the time), i'll do a bitrateview scan and look for the bits where the quants visibly spike. you're unlikely to have quality problems with q2 or q1 footage. but q17 stuff requires visual inspection... so i guess i'd have to say my quality metric is quantiser rather than PSNR or SSIM, as i hardly ever use either of them. also, i read a lot of threads with metric comparisons, and PSNR and SSIM are almost always in agreement. they only differ greatly when the visual quality starts getting bad, or in special cases that i can't think of offhand.
__________________
interlace... right or wrong, just deal with it. |
|
|
|
|
|
#4 | Link |
|
ангел смерти
![]() Join Date: Nov 2004
Location: Oceanborn
Posts: 7,598
|
PSNR has a noise area, based on what I can tell it's something like 1-2 db at low values and more as you go higher... any two encodings within that area may be better, worse, or identical, only when the scores are very different are they reliable (at which point visual quality differences are very obvious anyway). SSIM is the same, but much smaller intervals, which is the only reason it's more useful. Still fine differences can throw it off, and it isn't really optimized for all the different kinds of artifacts on different kinds of video that encoders can cause.
__________________
There are four boxes to be used in defense of liberty: soap, ballot, jury, and ammo. Please use in that order. ~ Ed Howdershelt |
|
|
|
|
|
#5 | Link |
|
clueless n00b
![]() Join Date: Oct 2001
Location: somewhere over the rainbow
Posts: 10,258
|
c't performs codec test using the most complex metric of 'em all: SND. Yet, in my codec comparisons, the codec they find to be great do not do so well.. hence I trust only my eyes. No matter how much time and maths you invest to model the HVS, each and everyone of us has a different visual trace and places focus on different areas. The occasional block may be distracting, but personally if the alternative means a lack of crispness, then when I watch both clips one after another, I'll still rate the crisper but blockier one higher than the washed out one. But I think Joe Average would rate them just the reverse.. there are enough people that are still happy with VHS quality after all and that just looks terrible. Even a lot of DVDs don't look that great and the fact that directors add arbitrary noise only makes the situation worse.
__________________
For the web's most comprehensive collection of DVD backup guides go to www.doom9.org |
|
|
|
|
|
#6 | Link |
|
Registered User
Join Date: Feb 2006
Posts: 620
|
I think for an intra-codec comparison (look for one codec what options are better) PSNR certainly is an useful value. Except for CQM's I think any option that is supposed to increase quality will also increase PSNR. Codecs seem to be mostly tuned on achieving the highest PSNR, anyway.
The problem with PSNR is that there is no useful scale that correlates with subjective judgments like good, ok, bad, etc. It is very easy to pump a PSNR score by adding letterboxing (which won't increase quality of course) and it is also very easy to decrease PSNR by changing the brightness just the slightest bit (which won't really decrease the quality of course). So if you have codec A with score X and codec B with score Y, it is very difficult to say which one is better, because we lack this scale. Except of course when one codec scores idiotically low. |
|
|
|
|
|
#7 | Link | |||
|
Testeur de codecs
Join Date: May 2003
Location: France
Posts: 2,065
|
PSNR compare simply equivalent pixels : if there are high convergence between source and video then PSNR is high
Quote:
Quote:
Quote:
MPEG4 ASP is better than MPEG2 MPEG4 AVC is better than MPEG4 ASP For make PSNR judgement you must use threshold: if codec A obtain 43.0 dB and codec B obtain 43.1 dB we can't say that "codec B is better than codec A". But if codec A obtain 43.0 dB and codec B obtain 44.0 dB we can say with very high probablity that "codec B is better than codec A". the threshold of uncertainty for PSNR is higher than the threshold of uncertainty for SSIM simply because HVS evaluation for SSIM is better than HVS evaluation for PSNR. you do not believe in the metrics : well metric test for codec done exactly the same result that the last doom9 codec test ... try explain that like you want. All the codec developper in the world use metric to improve their codec ... try explain that like you want.
__________________
Le Sagittaire ... ;-) 1- Ateme AVC or x264 2- VP7 or RV10 only for anime 3- XviD, DivX or WMV9 |
|||
|
|
|
|
|
#8 | Link | |
|
clueless n00b
![]() Join Date: Oct 2001
Location: somewhere over the rainbow
Posts: 10,258
|
Quote:
I don't have their eyes so unless claims are really outrageous, I can't really say their opinion is wrong.And I gess I got SSIM eyes then because SND doesn't agree with my eyes.
__________________
For the web's most comprehensive collection of DVD backup guides go to www.doom9.org |
|
|
|
|
|
|
#9 | Link | |
|
interlace this!
Join Date: Jun 2003
Location: i'm in ur transfers, addin noise
Posts: 4,267
|
Quote:
it's used by codec developers to improve their codecs because they are doing several small controlled tests - they'll vary one thing and check the metrics. usually it's because the differences are imperceptible that metrics are used - we can't see 1 pixel difference in luma around the edge of a man's glasses while said man is being shot out of a cannon at high shutter-speed, but an objective metric can. then there's codec optimisation techniques that would make it impractical to use visual inspection: i remember reading here Pengvado's way of finding an optimal command line for (i think) mencoder - construct a genetic algorithm with bitrate and PSNR as selection criteria, then breed the settings together. doing that with visual inspection would yeild possibly stupid and superfluous settings, but using PSNR helped remove the crap. contrast this with LAME's old "r3mix" preset which was a mixture of good sense and stupidity. however the analogy ends there because objective metrics mean even less to audio, where blind testing and statistical analysis is the only way to trust one's results. it would be difficult to compare AVC with ASP when the loop filter is turned on - perceptual quality increases and PSNR drops. and as psychovisual modelling in codecs becomes more sophisticated, we will expect to see more of this kind of thing happening - PSNR going down while perceptual quality goes up.
__________________
interlace... right or wrong, just deal with it. Last edited by Mug Funky; 27th April 2006 at 10:52. |
|
|
|
|
|
|
#10 | Link | |||
|
Testeur de codecs
Join Date: May 2003
Location: France
Posts: 2,065
|
Quote:
Quote:
Quote:
and inloop increase always PSNR: - best size for -2:-2 (constant quant encoding) - best PSNR for 0:0 (constant quant or size encoding) - best SSIM generaly for -1:-1 (constant quant or size encoding) - best default value for developper is in [-2;0] interval. x264 use 0:0, Elecard use -1:-1 and Ateme -2:-2
__________________
Le Sagittaire ... ;-) 1- Ateme AVC or x264 2- VP7 or RV10 only for anime 3- XviD, DivX or WMV9 |
|||
|
|
|
|
|
#11 | Link | |
|
....
Join Date: May 2002
Location: Australia
Posts: 2,789
|
Quote:
http://forum.doom9.org/showthread.ph...634#post698634 i give my reply a few posts down about how you misunderstand the statement you use/quote. so yes devs can and do use metrics to try and give clues or ideas where there codec/encoder may need improvement. this does not mean they suddenly think it will tell you whats the best quality all the time or to everybody because that simply is not the case. EDIT: oh and PLEASE stop saying silly things like "All the codec developper in the world" since you have absolutly no way of knowing such a thing.
__________________
Narrator: And of course, with the birth of the artist came the inevitable afterbirth - the critic. (History of the World part 1) |
|
|
|
|
|
|
#12 | Link | ||
|
Testeur de codecs
Join Date: May 2003
Location: France
Posts: 2,065
|
Quote:
- Ateme developper - Elecard developper - x264 developper - RV10 developper - XviD developper - DivX developper - VP7 developper - VC1 developper ... etc etc etc and most of them use metric to compare their codecs with the concurence. Quote:
__________________
Le Sagittaire ... ;-) 1- Ateme AVC or x264 2- VP7 or RV10 only for anime 3- XviD, DivX or WMV9 |
||
|
|
|
|
|
#13 | Link | |||||
|
....
Join Date: May 2002
Location: Australia
Posts: 2,789
|
Quote:
Quote:
Quote:
take the line Quote:
so please stop pushing that quote as some evidence that metrics will ALWAYS show you whats best quality to everybody because it simply is NOT true. Quote:
i am rather tired of having to say the same things to you about this so i wont bother anymore.
__________________
Narrator: And of course, with the birth of the artist came the inevitable afterbirth - the critic. (History of the World part 1) |
|||||
|
|
|
|
|
#14 | Link | |
|
Testeur de codecs
Join Date: May 2003
Location: France
Posts: 2,065
|
Quote:
"for very large majority with high probability if there are subtancial difference" little example : at this time impossible to say which is the best between XviD and DivX with metric simply metric test are really close. Many users find that XviD is visually the best and many users find that DivX is visually the best. -> That's the reality at this time really easy to say which is the best between ASP and AVP with metric simply metric test are very best for AVC. Large user's majority find that AVC is by far visually better than ASP. -> That's the reality Many developper use metric to compare their codec with concurence and for me too "subjective testing is also very difficult to do and is not always reliable" simply because overall sujective judgement is really really hard (very easy for me to make demonstration here) -> That's the reality
__________________
Le Sagittaire ... ;-) 1- Ateme AVC or x264 2- VP7 or RV10 only for anime 3- XviD, DivX or WMV9 |
|
|
|
|
|
|
#15 | Link | |
|
interlace this!
Join Date: Jun 2003
Location: i'm in ur transfers, addin noise
Posts: 4,267
|
i wonder what would look better between an AVC encode with +2,+2 vs an ASP encode with a higher PSNR? obviously this is strongly dependent on the amount of quantization going on, but my point is that the ASP encode would have visible blocking where the AVC wouldn't. on a lot of sources the AVC would look much better in spite of being further away from the original, metric wise.
hopefully one can see that such a metric-based comparison breaks down when it's applied to drastically different technologies. however (and i suspect this is what codec developers would be doing, though i can't make such a call and i can't be bothered sitting up all night digging up quotes), "quality" metrics are valid and very useful when tweaking settings within the same codec, on the same source. tests of this kind must be controlled or they are invalid and useless (like going xvid + fft3dfilter versus divx + conv3d in a PSNR battle would say nothing about either the codecs or the filters). also, though i will look at the numbers when doing an encode, nothing goes on a DVD master that hasn't been watched by a human in a controlled environment. though quants/metrics can tell us where the problem areas might be, they wont say whether there's other problems going on. in the case of a total A/V dropout, PSNR can go to -inf, and quants will drop to 1, but that doesn't mean the quality is good because in fact it's a total disaster and will require several hours in photoshop cursing the name of certain equipment manufacturers. [edit] Quote:
__________________
interlace... right or wrong, just deal with it. Last edited by Mug Funky; 27th April 2006 at 16:08. |
|
|
|
|
|
|
#16 | Link | ||
|
Testeur de codecs
Join Date: May 2003
Location: France
Posts: 2,065
|
Quote:
... only pre-process for that.Quote:
2) "result better than original" doesn't mean anything for video codec
__________________
Le Sagittaire ... ;-) 1- Ateme AVC or x264 2- VP7 or RV10 only for anime 3- XviD, DivX or WMV9 Last edited by Sagittaire; 27th April 2006 at 18:13. |
||
|
|
|
|
|
#17 | Link | |
|
ангел смерти
![]() Join Date: Nov 2004
Location: Oceanborn
Posts: 7,598
|
Quote:
I hope someone comes up with a quality evaluator like that someday, it'd be so useful. (For one thing, guis could use it to place certain filters in their scripts, and tweak them in certain frame ranges as necessary, extending the trend of automated filtering.)
__________________
There are four boxes to be used in defense of liberty: soap, ballot, jury, and ammo. Please use in that order. ~ Ed Howdershelt |
|
|
|
|
|
|
#18 | Link | |
|
Testeur de codecs
Join Date: May 2003
Location: France
Posts: 2,065
|
Quote:
http://www.cns.nyu.edu/~zwang/files/...demo_lena.html http://www.cns.nyu.edu/~zwang/files/...demo_blur.html http://www.cns.nyu.edu/~zwang/files/.../demo_jpg.html http://www.cns.nyu.edu/~zwang/files/...mo_couple.html you speak about metric but you don't use it or you don't know their real capacity and limitation ... ;-)
__________________
Le Sagittaire ... ;-) 1- Ateme AVC or x264 2- VP7 or RV10 only for anime 3- XviD, DivX or WMV9 |
|
|
|
|
|
|
#19 | Link |
|
Go Nero Digital
Join Date: Jan 2006
Location: Edison, NJ
Posts: 466
|
Both are reliable tests. SSIM may be better, but not that much. However, they are by no means perfect and they really need to be improved. But they generally should be good.
I am using PSNR for my upcoming codec comparison. |
|
|
|
|
|
#20 | Link |
|
Leaker!
Join Date: Apr 2003
Location: Unknown
Posts: 2,773
|
I think it always depends on... what you wanna test with the metrics, how you to do the test, and how you interpret the results... In some cases metric test can be reliable, in others not, eh!? ^^
Bye Last edited by Soulhunter; 28th April 2006 at 00:24. |
|
|
|
![]() |
| Thread Tools | |
| Display Modes | |
|
|