Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
8th January 2011, 00:15 | #41 | Link |
Registered User
Join Date: Sep 2002
Location: Germany
Posts: 352
|
@Wilbert
I will do that. But i would like to work some things out first! I don't think your 6800LE will be capable of executing OpenCL. it's from around 2004 if i'm right. That's long, long ago... And even if, the performance would probably be not much better that TNLMeans. |
8th January 2011, 00:51 | #42 | Link | |
warpsharpened
Join Date: Feb 2007
Posts: 787
|
Quote:
Edit: on a side note v 0.1.1 actually works for me. I did get a green bar on the bottom of the screen with non-mod16 resolution. I also get a green bar on the right side of the screen with plane=4 at 1280x720. With plane=0 about 4-8 pixels on the right side turn black. Edit2: padding like 16 pixels (addborders) on each side then cropping later seems to work. Last edited by TheRyuu; 8th January 2011 at 01:00. |
|
8th January 2011, 18:14 | #43 | Link |
Registered User
Join Date: Oct 2009
Posts: 151
|
Well I played around with smf parameters but it still won't work.
Here's the full XML export of my GPU, radeon HD4850 http://www.mediafire.com/?f4s4ktmocfk2hot Hope that could help! |
8th January 2011, 22:14 | #45 | Link | |
Registered User
Join Date: Sep 2002
Location: Germany
Posts: 352
|
Quote:
Seems like all RV770 users are out of business at the moment. The AMD OpenCL drivers do not suppport images. Same issue as GoodzMastaJ. Please see below for a snippet of the full XML export from GPU Caps Viewer. Code:
CL_DEVICE_NAME="ATI RV770" CL_DEVICE_VENDOR="Advanced Micro Devices, Inc." CL_DRIVER_VERSION="CAL 1.4.900" CL_DEVICE_PROFILE="FULL_PROFILE" CL_DEVICE_VERSION="OpenCL 1.0 ATI-Stream-v2.3 (451)" CL_DEVICE_TYPE="GPU" CL_DEVICE_IMAGE_SUPPORT="NO" CL_DEVICE_IMAGE2D_MAX_WIDTH="0" CL_DEVICE_IMAGE2D_MAX_HEIGHT="0" Please provide the contents of the debug file and the xml export of GPU Caps Viewer. Otherwise i can't help. @all I have uploaded version 0.1.2. See the changelog in my first post for details. @Didée Version 0.1.2 should fix you error with the dots spread all over the screen! Can you give me feedback if this version works for you? Thanks. Last edited by Malcolm; 8th January 2011 at 23:28. Reason: uploaded version 0.1.2 |
|
9th January 2011, 00:51 | #46 | Link | ||
warpsharpened
Join Date: Feb 2007
Posts: 787
|
Getting a weird issue with regards to setting h = 1.0
Setting it to 1.0 will cause it to not work (gives error) but 0.9, 1.1 both work (any other value works I do believe). Some tests someone I know did (credits to mirkosp): Quote:
Quote:
Last edited by TheRyuu; 9th January 2011 at 01:16. |
||
9th January 2011, 11:35 | #48 | Link |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,580
|
Here you are. I noticed even GPU Caps Viewer crashes in CL test. An encoding is going now, I'll try to change drivers version ASAP and report you.
PHP Code:
__________________
@turment on Telegram |
9th January 2011, 12:51 | #49 | Link |
Registered User
Join Date: Nov 2009
Posts: 2,361
|
-Remember my post?
It was caused by source being non mod16. I made work a few images with mod4, but below mod16 performance. I havent checked video sources, but its something to look up. -Also there are some aliasing issues on b>0 (less than tnlmeans but still) -crashed on a 1280x688 (mod16) video source with this message. (mkv through ffmpegsource) (Edit: problem was setting h<1.1 on v. 0.1.2)
__________________
i7-4790K@Stock::GTX 1070] AviSynth+ filters and mods on GitHub + Discussion thread Last edited by Dogway; 9th January 2011 at 17:01. |
9th January 2011, 18:51 | #50 | Link |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,580
|
Ok, updated to 266.44 drivers and works.
If only I could figure how to get a good value for parameters... They are a bit obscure to me. For example, in a 1080p x264 encoding, what should I put to mimic SMDegrain (tr=3,thSAD=300,blksize=16,overlap=8) ? I suppose not... The temporal part is missing..
__________________
@turment on Telegram Last edited by tormento; 9th January 2011 at 19:22. |
10th January 2011, 20:42 | #51 | Link |
Registered User
Join Date: Sep 2010
Location: Austin, TX
Posts: 40
|
My understanding is that OpenCL programs can be compiled to run either on a GPU or on the host CPU. If this is true, I'm curious, Malcom, if you have tried this, and how fast the filter runs on the host CPU vs the original TNLmeans filter. How much penalty is there (if any) for writing it as OpenCL vs C if there is no GPU to help out.
|
11th January 2011, 10:21 | #52 | Link |
Registered User
Join Date: Feb 2003
Location: Russia, Moscow
Posts: 854
|
Malcolm for Your time.
NLMeansCL work on my 8800 GTS 512 with last Nvidia driver (260.99). Speed for SD content little faster than real time 26-27 fps. If You find time add also Az support for Az=1 and Az=2 it may be useful for motion compensating filtering. yup. |
12th January 2011, 00:04 | #53 | Link | |
Registered User
Join Date: Sep 2002
Location: Germany
Posts: 352
|
Quote:
Let me explain some aspects regarding CPU vs. GPU: - CPUs are not generally slower than CPUs. The performance does heavily depend on the algorithm that is implemented. For NLMeans, the results are clear. But that's just one example. There are algorithms that will always be slower on a GPU than on a CPU. - You can see from my performance measurements, that using buffers instead of images is much slower. But that's only half the truth. I have a version of NLMeansCL on my harddisk, that is faster on my Geforce when using buffers instead of images! So the execution speed does also heavily depend on how memory is used and accessed inside the filter. But on the other hand this version is actually even slower on the CPU that version 0.2 when using buffers! So though OpenCL executes on GPUs as well as CPUs, you typically have to code to the exact hardware to achieve peak performance! - One benefit of OpenCL is that it's naturally multithreaded. So if you throw in another CPU with double the number of cores (or a new gfx card), it will typically scale well. That's not true for TNLMeans. - My current implementation is tuned for execution on GPUs. There are some micro optimizations for individual math functions and so on. For a serious CPU version of the filter, i would go over the implementation and play around with different expressions to see if they execute faster than the existing ones. But this is - as i said -a micro optimization. @tormento Sorry, i can't help you out with suggestions to mimic SMDegrain. @yup Yeah, i will see what i can do. |
|
12th January 2011, 11:39 | #55 | Link |
Registered User
Join Date: Dec 2005
Location: Germany
Posts: 1,795
|
yep, i have problems with cpu and buffer too.
__________________
AVSRepoGUI // VSRepoGUI - Package Manager for AviSynth // VapourSynth VapourSynth Portable FATPACK || VapourSynth Database |
12th January 2011, 12:20 | #57 | Link |
Registered User
Join Date: Oct 2009
Posts: 151
|
Oh, sorry. Here it is (with buffer=true) :
Code:
NLMeansCL Version 0.2 ScriptEnvironment present. Number of OpenCL Compute Platforms = 1. Trying OpenCL Compute Platform Advanced Micro Devices, Inc.. OpenCL 1.1 ATI-Stream-v2.3 (451). OpenCL Compute Context successfully created. Number of OpenCL Devices in Platform = 2. Using first OpenCL Device. ATI RV770. Device available. OpenCL Command Queue successfully created. OpenCL error code detected: BuildProgramFailure. Thanks again. |
12th January 2011, 13:49 | #58 | Link |
Registered User
Join Date: Sep 2002
Location: Germany
Posts: 352
|
Ok, i see. There are 2 problems.
The behaviour of AMDs OpenCL platform is different than i thought. So if you have an AMD gfx card, as well as the CPU as OpenCL device, then the selection of the CPU fails and it is using the GPU. I can change the selection strategy, to overcome this. (Selecting the CPU as computation device actually works if you have an Nvidia GPU). I will provide this ASAP. The second thing is, that the compilation of the kernel fails. Since i don't own an AMD gfx card, it's nearly impossible for me to tell what the reason could be! So far i can only tell that my implementation is correct, since it runs on Nvidia GPUs as well as the CPU. The cause can only be that i'm using some functions that are not implemented by AMD and thus the AMD OpenCL driver refuses to compile! Sorry for any inconvenience i have caused, by providing something which obviously doesn't run on Radeons. But i'm just trying to give you someting.. |
12th January 2011, 14:50 | #59 | Link |
Registered User
Join Date: Oct 2009
Posts: 151
|
Hey, no worries there, in fact I'm very thankful you even try to do something about it and for making this plugin available to community for free.
I've been meaning to upgrade my GPU, and now my choice is clear, thanks to this plugin. |
12th January 2011, 18:13 | #60 | Link |
Registered User
Join Date: Dec 2005
Location: Germany
Posts: 1,795
|
NLMeansCL(cpu=true) ->
My CPU: Intel Q9550 I can't find a debug log, c:\temp is empty PHP Code:
__________________
AVSRepoGUI // VSRepoGUI - Package Manager for AviSynth // VapourSynth VapourSynth Portable FATPACK || VapourSynth Database Last edited by ChaosKing; 12th January 2011 at 18:30. |
Thread Tools | Search this Thread |
Display Modes | |
|
|