Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
13th January 2011, 01:54 | #62 | Link |
Usered Register
Join Date: Dec 2006
Posts: 9
|
Version 0.2.1 on i7 860 and Radeon HD 4870. When I load my avs script, I see a couple dos windows flash by quickly but I get this error from Avisynth when using cpu=true and buffer=true. I seem to be getting a little bit closer to actually seeing what this filter does
Debug log: Code:
NLMeansCL Version 0.2.1 ScriptEnvironment present. Number of OpenCL Compute Platforms = 1. Trying OpenCL Compute Platform Advanced Micro Devices, Inc.. OpenCL 1.1 ATI-Stream-v2.3 (451). OpenCL Compute Context successfully created. Number of OpenCL Devices in Platform = 2. Trying OpenCL Device ATI RV770. Device available. Trying OpenCL Device Intel(R) Core(TM) i7 CPU 860 @ 2.80GHz. Device available. Device Type Cpu. Device does not support images. Using Device Intel(R) Core(TM) i7 CPU 860 @ 2.80GHz. OpenCL Command Queue successfully created. OpenCL Program successfully built. Prog Y Build log: . Prog UV Build log: . |
13th January 2011, 11:10 | #63 | Link | |
Registered User
Join Date: Sep 2002
Location: Germany
Posts: 352
|
Quote:
the flashing of the dos windows is normal. It is caused by the AMD OpenCL driver. I will look into the remaining erro that prevents execution. I assume this is only a minor problem. |
|
14th January 2011, 15:58 | #67 | Link | |
Registered User
Join Date: Sep 2002
Location: Germany
Posts: 352
|
Quote:
When using OpenCL image objects, the video frames are stored as textures on the gfx cards main memory. GPUs have very fast texture caches. Therefore, reading and writing from/to images is fast. When using OpenCL buffers, the video frames are placed as 'arrays' into the main memory of the gfx card. Access is not cached. Therefore the performace is very low in comparison! (*) When you are using a CPU as OpenCL device, this difference doesn't matter, since there are no 'texture caches' in CPUs. (memory is cached anyway) AMD has not implemented image support in its CPU OpenCL driver yet (but it's planned!). There is no performance drawback per se as explained above. But the pity is, that you cannot execute OpenCL kernels that uses images... AMDs RV7xxx cards don't have image support either, although they are GPUs and have texture caches for sure! It's just that the image capabilities of these GPUs is not compliant to the OpenCL spec. Therefore -> no support. * You can get around this limitation and be faster than using images. It's just that you have to hand-tune to the exact hardware you are using and it also depends on the dimensions of the processed data. Both things are not known beforehand for avisynth filters typically (the used hardware (gfx card) and the data dimensions (video sizes)). Therefore i have implemented it only for my gfx card with hand-tuned values for 720x576 video. It's faster than using images. But it doesn't make sens to release it. |
|
14th January 2011, 20:09 | #68 | Link |
Registered User
Join Date: Sep 2002
Location: Germany
Posts: 352
|
I have uploaded version 0.3. See first post.
Changes
|
15th January 2011, 09:51 | #69 | Link |
Novice of AVS
Join Date: Oct 2009
Posts: 156
|
Why do I setting avs in NLMeansCL(buffer=true) and it's seem...
http://imagewtf.com/11941.png/images/11941.png
__________________
My PC Last edited by Hiritsuki; 15th January 2011 at 09:58. |
15th January 2011, 13:10 | #71 | Link | |
Registered User
Join Date: Sep 2002
Location: Germany
Posts: 352
|
Quote:
Otherwise: Please send me the XML report of GPU Caps Viewer as well as the logfile of NLMeansCL. |
|
15th January 2011, 17:49 | #72 | Link | |
Registered User
Join Date: Apr 2005
Posts: 213
|
Version 0.2.2 works fine with NLMeansCL(cpu=true, buffer=true), but with Version 0.3 an error occurred.
Here is the log file: Quote:
|
|
15th January 2011, 22:59 | #73 | Link |
Registered User
Join Date: Sep 2002
Location: Germany
Posts: 352
|
I have uploaded version 0.3.1.
This should solve Archimedes and Hiritsukis problems. Working on a theoretical basis without all the platforms for testing is really a minefield! So sorry, if it takes several iterations until it runs without hiccups! To the change of the memory allocation strategy. The different possible OpenCL memory allocation strategies are really badly documented in the OpenCL spec. There are various discussions in OpenCL forums around it. It seems no one except NVidias and AMDs core developers knows what's really going on under the hood and how they are supposed to be used... |
16th January 2011, 01:25 | #74 | Link |
Novice of AVS
Join Date: Oct 2009
Posts: 156
|
ok, this is my GPU's OpenCL XML
http://www.mediafire.com/?y04xn3m1qxfby2v
__________________
My PC |
16th January 2011, 06:14 | #76 | Link |
Registered User
Join Date: Nov 2009
Posts: 2,361
|
@Malcolm: Just wanted to say thanks again for this great filter. I saw az implementation on the todo list so besides that, and although Im going to stick to b=0 you might want to see the aliasing issues when b>0.
__________________
i7-4790K@Stock::GTX 1070] AviSynth+ filters and mods on GitHub + Discussion thread |
16th January 2011, 09:06 | #77 | Link |
Novice of AVS
Join Date: Oct 2009
Posts: 156
|
I just test 0.31 to compare with 0.22 speed.
0.22 is fatser than 0.31 5.5fps:5.0fps And 0.31 GPU using is lower than 1% ... this is cannot to use GPU?
__________________
My PC Last edited by Hiritsuki; 16th January 2011 at 09:37. |
16th January 2011, 23:21 | #78 | Link | |
Leader of Dual-Duality
Join Date: Aug 2010
Location: America
Posts: 134
|
An interesting filter, can get good results without killing details if you set the h and hc below 1 , I suggest using something like
Quote:
I forgot to mention I was having problems where I got this error message http://img510.imageshack.us/img510/8287/afag.png But then I updated my graphics card driver to the new beta (I had the most recent stable version before) and now it works fine http://www.nvidia.com/object/win7-wi...ta-driver.html
__________________
I'm Mr.Fixit and I feel good, fixin all the sources in the neighborhood My New filter is in the works, and will be out soon Last edited by TheProfileth; 17th January 2011 at 06:29. |
|
17th January 2011, 22:40 | #80 | Link |
Registered User
Join Date: Nov 2009
Posts: 2,361
|
I think it's easy enough, you'd mostly use only A and H parameters
__________________
i7-4790K@Stock::GTX 1070] AviSynth+ filters and mods on GitHub + Discussion thread |
|
|