Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. Domains: forum.doom9.org / forum.doom9.net / forum.doom9.se |
|
|
#61 | Link | |
|
Registered Developer
Join Date: Sep 2006
Posts: 9,140
|
Quote:
![]() That said, the weights are usually only calculated once and then stored in some array. So it's not a few saved multiplies per pixel, not even per frame. |
|
|
|
|
|
|
#62 | Link |
|
Grand Fruitioner
Join Date: Mar 2004
Location: Chicago, IL
Posts: 115
|
Here's version 5. Hopefully the last of the chroma bugs. Been hitting F2 so much in Virtualdub that I've started hitting F2 instead of F5 in browsers, lol.
Download for 32-bit and 64-bit: http://sourceforge.net/projects/int6...5.zip/download Full documentation: http://svn.int64.org/viewvc/int64/re...l?revision=268 Changelog:
|
|
|
|
|
|
#64 | Link |
|
Registered User
Join Date: Nov 2009
Posts: 2,375
|
Thank you!
I can't make RGB work, either input or ouput. I get a crash (my fault?): crashinfo.txt edit: that was in avs 2.57 MT winXP SP3 edit2: It worked if dither=false, so its a dither thing in RGB.
__________________
i7-4790K@Stock::GTX 1070] AviSynth+ filters and mods on GitHub + Discussion thread Last edited by Dogway; 8th April 2011 at 06:33. |
|
|
|
|
|
#65 | Link | |
|
Grand Fruitioner
Join Date: Mar 2004
Location: Chicago, IL
Posts: 115
|
Quote:
edit: nevermind, i got it through another browser. Last edited by PhrostByte; 7th April 2011 at 13:03. |
|
|
|
|
|
|
#66 | Link |
|
Registered User
Join Date: Sep 2009
Posts: 378
|
Is there anything to gain from implementing yesgrey's high precision 3D LUT (yCMS) for colourspace conversions used in conjunction with the necessary bits of Triticals t3dlut plugin?
http://forum.doom9.org/showthread.php?t=154719 |
|
|
|
|
|
#67 | Link | |
|
Grand Fruitioner
Join Date: Mar 2004
Location: Chicago, IL
Posts: 115
|
Quote:
Actual colorspace conversion is the cheapest part of ResampleHQ right now -- resampling takes up a huge amount of time compared to it. I'm currently experimenting with OpenCL which should give a pretty nice speedup as is, though it can also use 3D LUTs so it could also help assuming you've got enough vram. |
|
|
|
|
|
|
#69 | Link |
|
Software Developer
![]() Join Date: Jun 2005
Location: Last House on Slunk Street
Posts: 13,275
|
Also "Spline32" (mentioned in the docs) does not work, but "Spline36" (undocumented) does
__________________
Go to https://standforukraine.com/ to find legitimate Ukrainian Charities 🇺🇦✊ |
|
|
|
|
|
#70 | Link | |||
|
Grand Fruitioner
Join Date: Mar 2004
Location: Chicago, IL
Posts: 115
|
Quote:
Quote:
Quote:
|
|||
|
|
|
|
|
#72 | Link |
|
Grand Fruitioner
Join Date: Mar 2004
Location: Chicago, IL
Posts: 115
|
Version 6. Fixes Dogway's crash and some other things.
Download for 32-bit and 64-bit: http://sourceforge.net/projects/int6...6.zip/download Full documentation: http://svn.int64.org/viewvc/int64/re...l?revision=272 Changelog:
|
|
|
|
|
|
#75 | Link |
|
Registered User
Join Date: Sep 2009
Posts: 378
|
Enjoying using your plugin and have a query, not a bug to report but I notice when converting to RGB that I see faint red/blue rectangular blocks along the edges of content in any chosen image frame. The source is h264AVC from a Canon DSLR. I can put up a sample but guess the description is enough. Assume this is result of chroma subsampling / moire / line skipping. They also appear to be non square even though its square pixel source
So query is, is there anything that can be done in the interpolation of chroma, assume its going 4:2:0 to 4:2:2 to RGB that could subdue the alternate red blue blocks its like chequreboard I notice this with other conversion methods to RGB including yCMS 3DLUT + Triticals t3dlut plugin Last edited by Yellow_; 13th April 2011 at 08:11. |
|
|
|
|
|
#76 | Link | |
|
Registered User
Join Date: Dec 2005
Location: Germany
Posts: 1,827
|
Quote:
__________________
AVSRepoGUI // VSRepoGUI - Package Manager for AviSynth // VapourSynth VapourSynth Portable FATPACK || VapourSynth Database |
|
|
|
|
|
|
#77 | Link | |
|
Grand Fruitioner
Join Date: Mar 2004
Location: Chicago, IL
Posts: 115
|
Quote:
Here it is. It'll create a "benchmark results.txt" once it's done, you can just paste the results here. http://sourceforge.net/projects/int6...h.zip/download |
|
|
|
|
|
|
#78 | Link |
|
Registered User
Join Date: Nov 2009
Posts: 2,375
|
I made some tests. I use an image I used once before because it has good features to test on like small details, big details, and some well defined letters.
From what I observed most of the scaling kernels show the ringing I described a few weeks back. So this is how I classify the different algorithms: blurry: principally because of this bilinear rubidoux ringing: lanczos4 spline64 spline36 good balance of both of them: lanczos (best in my opinion) blackman (2nd best) spline16 catmull I also included automttap3 for comparison, it was the one who retained most detail in my last test, but I wouldnt use it for graphics or cartoons because it also creates some ugly ringing repetition. Also added spline64. If I tell you the truth I was a bit dissappointed concerning the ringing thing, even if that was what it was supposed to do :/ Pack with the tests, .psd file and separated .png files 35Mb rar: http://www.mediafire.com/?16a31o7v0im652o
__________________
i7-4790K@Stock::GTX 1070] AviSynth+ filters and mods on GitHub + Discussion thread |
|
|
|
|
|
#79 | Link |
|
Registered User
Join Date: Dec 2005
Location: Germany
Posts: 1,827
|
Benchmark results
------------------ The AVX version seems to be very slow :/ Code:
RGB24 -> linear RGB C: (fastest/slowest/average) wall time speed is 6.59281/6.10231/6.51858 runs/second. RGB24 -> linear RGB C relative speed: 1.0x RGB24 -> linear RGB SSSE3: (fastest/slowest/average) wall time speed is 37.5498/33.94/36.5475 runs/second. RGB24 -> linear RGB SSSE3 relative speed: 5.69557x RGB24 -> linear RGB AVX: (fastest/slowest/average) wall time speed is 3.00409/2.71127/2.94455 runs/second. RGB24 -> linear RGB AVX relative speed: 0.455661x RGB32 -> linear RGB C: (fastest/slowest/average) wall time speed is 4.67475/3.83495/4.55645 runs/second. RGB32 -> linear RGB C relative speed: 1.0x RGB32 -> linear RGB SSE2: (fastest/slowest/average) wall time speed is 37.7303/35.7484/37.0366 runs/second. RGB32 -> linear RGB SSE2 relative speed: 8.07107x RGB32 -> linear RGB AVX: (fastest/slowest/average) wall time speed is 2.76705/2.44912/2.72385 runs/second. RGB32 -> linear RGB AVX relative speed: 0.591913x RGB32 -> linear RGBA C: (fastest/slowest/average) wall time speed is 4.55221/4.31145/4.49747 runs/second. RGB32 -> linear RGBA C relative speed: 1.0x RGB32 -> linear RGBA SSE2: (fastest/slowest/average) wall time speed is 36.1119/33.9868/35.8453 runs/second. RGB32 -> linear RGBA SSE2 relative speed: 7.93283x RGB32 -> linear RGBA AVX: (fastest/slowest/average) wall time speed is 2.85771/2.20617/2.79098 runs/second. RGB32 -> linear RGBA AVX relative speed: 0.627764x YV12 -> YUV C: (fastest/slowest/average) wall time speed is 165.108/157.79/163.724 runs/second. YV12 -> YUV C relative speed: 1.0x YV12 -> YUV SSE2: (fastest/slowest/average) wall time speed is 300.263/286.618/295.344 runs/second. YV12 -> YUV SSE2 relative speed: 1.81859x YV12 -> YUV AVX: (fastest/slowest/average) wall time speed is 19.5201/18.1788/19.3726 runs/second. YV12 -> YUV AVX relative speed: 0.118226x YUY2 -> YUV C: (fastest/slowest/average) wall time speed is 204.879/178.597/199.624 runs/second. YUY2 -> YUV C relative speed: 1.0x YUY2 -> YUV SSE2: (fastest/slowest/average) wall time speed is 424.132/361.432/417.757 runs/second. YUY2 -> YUV SSE2 relative speed: 2.07016x YUY2 -> YUV AVX: (fastest/slowest/average) wall time speed is 32.1951/28.7248/31.7467 runs/second. YUY2 -> YUV AVX relative speed: 0.157142x YUV -> linear RGB C: (fastest/slowest/average) wall time speed is 16.9974/12.2325/16.4998 runs/second. YUV -> linear RGB C relative speed: 1.0x YUV -> linear RGB SSE2: (fastest/slowest/average) wall time speed is 37.3566/35.0877/36.9554 runs/second. YUV -> linear RGB SSE2 relative speed: 2.19778x YUV -> linear RGB AVX: (fastest/slowest/average) wall time speed is 3.72789/3.47262/3.68511 runs/second. YUV -> linear RGB AVX relative speed: 0.219321x linear RGB -> RGB24 C: (fastest/slowest/average) wall time speed is 6.25605/5.79279/6.17731 runs/second. linear RGB -> RGB24 C relative speed: 1.0x linear RGB -> RGB24 SSSE3: (fastest/slowest/average) wall time speed is 40.4334/33.1002/39.0018 runs/second. linear RGB -> RGB24 SSSE3 relative speed: 6.46308x linear RGB -> RGB24 AVX: (fastest/slowest/average) wall time speed is 3.30754/2.9804/3.24551 runs/second. linear RGB -> RGB24 AVX relative speed: 0.528695x linear RGB -> RGB32 C: (fastest/slowest/average) wall time speed is 4.37648/4.11399/4.33744 runs/second. linear RGB -> RGB32 C relative speed: 1.0x linear RGB -> RGB32 SSE2: (fastest/slowest/average) wall time speed is 41.4194/40.0089/41.09 runs/second. linear RGB -> RGB32 SSE2 relative speed: 9.46409x linear RGB -> RGB32 AVX: (fastest/slowest/average) wall time speed is 3.57475/3.04053/3.43611 runs/second. linear RGB -> RGB32 AVX relative speed: 0.816809x linear RGBA -> RGB32 C: (fastest/slowest/average) wall time speed is 4.44825/4.36037/4.42751 runs/second. linear RGBA -> RGB32 C relative speed: 1.0x linear RGBA -> RGB32 SSE2: (fastest/slowest/average) wall time speed is 39.6152/36.8389/39.01 runs/second. linear RGBA -> RGB32 SSE2 relative speed: 8.90578x linear RGBA -> RGB32 AVX: (fastest/slowest/average) wall time speed is 3.31775/2.84886/3.2405 runs/second. linear RGBA -> RGB32 AVX relative speed: 0.745854x YUV -> YV12 C: (fastest/slowest/average) wall time speed is 155.964/138.181/153.442 runs/second. YUV -> YV12 C relative speed: 1.0x YUV -> YV12 SSE2: (fastest/slowest/average) wall time speed is 450.69/430.569/445.116 runs/second. YUV -> YV12 SSE2 relative speed: 2.88972x YUV -> YV12 AVX: (fastest/slowest/average) wall time speed is 15.0684/14.3686/14.9177 runs/second. YUV -> YV12 AVX relative speed: 0.0966147x YUV -> YUY2 C: (fastest/slowest/average) wall time speed is 201.52/167.831/199.426 runs/second. YUV -> YUY2 C relative speed: 1.0x YUV -> YUY2 SSE2: (fastest/slowest/average) wall time speed is 659.215/645.618/654.092 runs/second. YUV -> YUY2 SSE2 relative speed: 3.2712x YUV -> YUY2 AVX: (fastest/slowest/average) wall time speed is 21.4287/20.0173/21.215 runs/second. YUV -> YUY2 AVX relative speed: 0.106335x linear RGB -> YUV C: (fastest/slowest/average) wall time speed is 8.66521/8.32953/8.50003 runs/second. linear RGB -> YUV C relative speed: 1.0x linear RGB -> YUV SSE2: (fastest/slowest/average) wall time speed is 38.0747/33.7847/37.6462 runs/second. linear RGB -> YUV SSE2 relative speed: 4.39398x linear RGB -> YUV AVX: (fastest/slowest/average) wall time speed is 3.74787/3.71133/3.74012 runs/second. linear RGB -> YUV AVX relative speed: 0.432519x
__________________
AVSRepoGUI // VSRepoGUI - Package Manager for AviSynth // VapourSynth VapourSynth Portable FATPACK || VapourSynth Database |
|
|
|
|
|
#80 | Link | |
|
Grand Fruitioner
Join Date: Mar 2004
Location: Chicago, IL
Posts: 115
|
Quote:
Very bizarre results indeed. I'll have to look things over closely, because those can't be correct. |
|
|
|
|
![]() |
| Thread Tools | |
| Display Modes | |
|
|