Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
|
|
Thread Tools | Search this Thread | Display Modes |
12th April 2015, 19:06 | #81 | Link | |
Registered User
Join Date: Nov 2014
Posts: 440
|
Quote:
OpenCL 1.2 / 1.1 API are different. The plugin should be rewritten to exploit all. EDIT. OpenCL 1.2 was presented on November 15, 2011. Last edited by Khanattila; 12th April 2015 at 19:11. |
|
14th April 2015, 11:51 | #82 | Link |
RipBot264 author
Join Date: May 2006
Location: Poland
Posts: 7,812
|
What are your thoughts about opencl 2.0? AMD has just added support in latest drivers.
__________________
Windows 7 Image Updater - SkyLake\KabyLake\CoffeLake\Ryzen Threadripper |
17th April 2015, 18:17 | #83 | Link | |
Registered User
Join Date: Nov 2014
Posts: 440
|
Quote:
Is a compatibility issue. I'd like to use OpenCL 2.0, but I can't. To achieve the maximum diffusion I have to use OpenCL 1.1. NVIDIA CUDA Toolkit v7.0 can't stand even OpenCL 1.2. (http://developer.download.nvidia.com/compute/cuda/7_0/Prod/doc/CUDA_Toolkit_Release_Notes.pdf) I don't want to build two different versions. I hope so. |
|
17th April 2015, 18:24 | #84 | Link | |
Registered User
Join Date: Nov 2014
Posts: 440
|
Quote:
Probably a big "A" use too much private/local memory for yor GPU. (GPU have 3 three different memories: private memory, local memory, global memory. Where private is the faster and global is the bigger.) Please reduce the value of A. |
|
21st April 2015, 10:56 | #86 | Link |
Registered User
Join Date: Nov 2014
Posts: 440
|
NLMeansCL2b - v.0.5 - Beta 1
Code:
NLMeansCL2b ( clip src, int D (0), // Temporal windows, disabled in Beta 1 int A (4), // Search window int S (2), // Similarity neighborhood window int B (0), // Base window int wmode (1), // Weighting function float h (1.8), // Strength of the filtering string device_type ("default"), int y (3), int u (2), int v (2), bool lsb_inout (false), bool info (false) ) [wmode = 0] Cauchy weighting function has a very slow decay. It assign larger weights to dissimilar blocks than the Leclerc robust function, which will eventually lead to oversmoothing. [wmode = 1] Leclerc weighting function has a faster decay, but still assigns positive weights to dissimilar blocks. Original NLMeans weighting function. [wmode = 2] Bisquare weighting function use a soft threshold. Download: removed. ################ New:
Last edited by Khanattila; 27th April 2015 at 18:22. |
21st April 2015, 18:52 | #87 | Link |
Registered User
Join Date: Mar 2012
Location: Texas
Posts: 1,666
|
Hi Khanattila, thanks for the update! I tried using NLMeansCL2b with AviSynth+ r1576 on my work computer but I got this error message: "Plugin was designed for a later version of Avisynth (6)"
I updated to r1779 and it works with that version. Was that intentional or is there something else going on? If so will this also be true for anyone using an older version before AviSynth 2.6 R2? |
21st April 2015, 19:56 | #88 | Link | |
Registered User
Join Date: Nov 2014
Posts: 440
|
Quote:
Version 6 is 2.6.0. Version 5 is 2.6.0a1-a5. Version 4 is reserved. Version 3 is 2.5.6. Last edited by Khanattila; 21st April 2015 at 21:49. |
|
21st April 2015, 21:38 | #89 | Link |
HeartlessS Usurer
Join Date: Dec 2009
Location: Over the rainbow
Posts: 10,980
|
I think there were multiple versions of Avisynth Header VERSION 5, a change was made to the header at some point which made plugins compiled with it
require Avisynth versionv2.6 Alpha 4+. The previous to current version of ClipClop plugin crashed on Avisynth v2.6a3 and previous (as with other plugins, immediately at startup).
__________________
I sometimes post sober. StainlessS@MediaFire ::: AND/OR ::: StainlessS@SendSpace "Some infinities are bigger than other infinities", but how many of them are infinitely bigger ??? Last edited by StainlessS; 21st April 2015 at 21:42. |
21st April 2015, 22:41 | #90 | Link | |
Join Date: Mar 2006
Location: Barcelona
Posts: 5,034
|
Quote:
As for the "AVISYNTH_INTERFACE_VERSION": "3" : 2.5.x "5" : 2.6.0 < RC1 "6" : 2.6.0 >= RC1 As for AVS+, the lastest builds (r17xx) have the header updated to v6.
__________________
Groucho's Avisynth Stuff |
|
22nd April 2015, 03:18 | #91 | Link |
HeartlessS Usurer
Join Date: Dec 2009
Location: Over the rainbow
Posts: 10,980
|
Guilty as charged M'lud
__________________
I sometimes post sober. StainlessS@MediaFire ::: AND/OR ::: StainlessS@SendSpace "Some infinities are bigger than other infinities", but how many of them are infinitely bigger ??? |
27th April 2015, 18:39 | #92 | Link |
Registered User
Join Date: Nov 2014
Posts: 440
|
Code:
KNLMeansCL ( clip src, int D (0), // Temporal window int A (4), // Search window int S (2), // Similarity neighborhood window int wmode (1), // Weighting function float h (1.8), // Strength of the filtering string device_type ("default"), bool lsb_inout (false), bool info (false) )
Last edited by Khanattila; 6th May 2016 at 15:45. |
27th April 2015, 22:14 | #93 | Link | |
Join Date: Mar 2006
Location: Barcelona
Posts: 5,034
|
Quote:
I really like this filter and I think I have to consider upgrading from my old GT240. So, how about running a simple benchmark to see how various video cards fare? I'm thinking something like this: Code:
colorbars(width = 1280, height = 720, pixel_type = "yv12").killaudio().assumefps(24000, 1001) trim(0,99) fadeio(48) trim(0,99) KNLMeansCL(D = 1) Code:
[Runtime info] Frames processed: 100 (0 - 99) FPS (min | max | average): 1.421 | 1.452 | 1.448 Memory usage (phys | virt): 44 | 43 MB Thread count: 8 CPU usage (average): 25% GPU usage (average): 99% Video engine load (average): 0% GPU memory usage: 67 MB Time (elapsed): 00:01:09.051 [Graphics card info] Card name: NVIDIA GeForce GT 240 GPU name: GT215 Memory size: 512 OpenCL version: OpenCL 1.0 CUDA Driver version: 6.14.13.4052 (ForceWare 340.52) / XP
__________________
Groucho's Avisynth Stuff |
|
28th April 2015, 00:31 | #94 | Link | |||
Registered User
Join Date: Nov 2014
Posts: 440
|
D is the number of past and future frame that the filter uses for denoising the current frame.
D = 0, only current frame (n). D = 1, use n - 1, n, n + 1. D = 2, use n - 2, n - 1, n, n + 1, n + 2. etc. D = 0 also exploit the symmetry property of the weights, i.e. w(p, p + q) = w(p + q, p). But this requires an accumulation buffer, probably in an old GPU makes performance down. (Symmetry is also possible in temporal but requires too many check). Quote:
Quote:
Quote:
Computational complexity: ((2 * A + 1) * (2 * A +1) * (2 * D + 1) - 1) / (D ? 1 : 2) Last edited by Khanattila; 28th April 2015 at 00:34. |
|||
28th April 2015, 01:25 | #95 | Link |
Join Date: Mar 2006
Location: Barcelona
Posts: 5,034
|
I suppose you used the GTX760 for these measurements?
__________________
Groucho's Avisynth Stuff Last edited by Groucho2004; 28th April 2015 at 10:02. |
28th April 2015, 11:53 | #96 | Link |
Registered User
Join Date: Nov 2014
Posts: 440
|
Code:
[Graphics card info] Card name: NVIDIA GeForce GTX 760 GPU name: GK104 Memory size: 2048 OpenCL version: OpenCL 1.2 CUDA Driver version: 9.18.13.5012 WHQL (ForceWare 350.12) / Win8.1 64 Would you try this? - 720x480. KNLMeansCL(0, 2, 1) - 720x480. KNLMeansCL(0, 3, 1) - 720x480. KNLMeansCL(0, 5, 1) Original 9600 GT take: 100.00 FPS / 52.46 FPS / 18.46 FPS. (B. Goossens, H.Q. Luong, J. Aelterman, A. Pizurica, and W. Philips, "A GPU-Accelerated Real-Time NLMeans Algorithm for Denoising Color Video Sequences", in Proc. ACIVS (2), 2010, pp.46-57. ) Last edited by Khanattila; 28th April 2015 at 12:15. |
28th April 2015, 12:19 | #97 | Link | |
Join Date: Mar 2006
Location: Barcelona
Posts: 5,034
|
Quote:
Code:
colorbars(width = 720, height = 480, pixel_type = "yv12").killaudio().assumefps(24000, 1001) trim(0,99) fadeio(48) trim(0,99) KNLMeansCL(0, x, 1) #x = 2, 3, 5 How can this be? The 9600 is ancient.
__________________
Groucho's Avisynth Stuff |
|
28th April 2015, 14:15 | #98 | Link | |
Registered User
Join Date: Nov 2014
Posts: 440
|
Quote:
|
|
30th April 2015, 10:35 | #99 | Link |
Registered User
Join Date: Nov 2014
Posts: 440
|
Code:
Beta2 Beta3 S=0 59.05 FPS 52.44 FPS -11% S=1 47.35 FPS 51.82 FPS + 9% S=2 43.99 FPS 51.43 FPS +17% S=3 38.88 FPS 50.39 FPS +30% S=4 36.89 FPS 49.20 FPS +33% Last edited by Khanattila; 6th May 2016 at 15:45. |
30th April 2015, 21:18 | #100 | Link | |
Join Date: Mar 2006
Location: Barcelona
Posts: 5,034
|
Quote:
Code:
[build_programm (CL_BUILD_PROGRAM_FAILURE)] Code:
"error: macro 'V_BLOCK_Y' contains embedded newline, text after the newline is ignored."
__________________
Groucho's Avisynth Stuff |
|
Thread Tools | Search this Thread |
Display Modes | |
|
|