Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
20th November 2013, 06:44 | #21 | Link |
Registered User
Join Date: Feb 2003
Location: Russia, Moscow
Posts: 854
|
SEt
Test for GTX560 Code:
GPU 1: NVIDIA GeForce GTX 560 OpenCL 1.1, GeForce GTX 560 compute units:7@1620MHz FPS (min | max | average): 1.84 | 416268.26 | 7.43 CPU usage (average): 13% yup. Last edited by yup; 20th November 2013 at 09:46. Reason: add info |
21st November 2013, 12:45 | #23 | Link |
Registered User
Join Date: Oct 2001
Location: Germany
Posts: 7,277
|
using a NVIDIA GeForce GTX 660 ti and
Code:
LoadPlugin("nnedi3ocl.dll") SetMTMode(2,8) BlankClip(1000, 1280, 720, "YV12", 24000, 1001, 0) nnedi3ocl_rpow2(2, nns=2, qual=1) Code:
AVSMeter 1.7.2 [AVS2.6] by Groucho2004 AviSynth 2.60, build:Sep 28 2013 [15:09:12] Active MT Mode: 2 Number of frames: 1000 Length (hhh:mm:ss.ms): 000:00:41.708 Frame width: 2560 Frame height: 1440 Framerate: 23.976 (24000/1001) Interlaced: No Colorspace: YV12 Frames processed: 1000 (0 - 999) FPS (min | max | average): 1.93 | 419534.11 | 15.70 CPU usage (average): 13% Thread count: 22 Physical Memory usage (peak): 569 MB Virtual Memory usage (peak): 552 MB Time (elapsed): 000:01:03.714 Code:
LoadPlugin("nnedi3ocl.dll") #SetMTMode(2,8) BlankClip(1000, 1280, 720, "YV12", 24000, 1001, 0) nnedi3ocl_rpow2(2, nns=2, qual=1) Code:
Active MT Mode: 0 Number of frames: 1000 Length (hhh:mm:ss.ms): 000:00:41.708 Frame width: 2560 Frame height: 1440 Framerate: 23.976 (24000/1001) Interlaced: No Colorspace: YV12 Frames processed: 1000 (0 - 999) FPS (min | max | average): 11.83 | 14.50 | 14.24 CPU usage (average): 12% Thread count: 8 Physical Memory usage (peak): 540 MB Virtual Memory usage (peak): 535 MB Time (elapsed): 000:01:10.224 Code:
LoadPlugin("nnedi3ocl.dll") SetMTMode(2,4) BlankClip(1000, 1280, 720, "YV12", 24000, 1001, 0) nnedi3ocl_rpow2(2, nns=2, qual=1) Code:
Active MT Mode: 2 Number of frames: 1000 Length (hhh:mm:ss.ms): 000:00:41.708 Frame width: 2560 Frame height: 1440 Framerate: 23.976 (24000/1001) Interlaced: No Colorspace: YV12 Frames processed: 1000 (0 - 999) FPS (min | max | average): 3.71 | 419534.11 | 15.78 CPU usage (average): 13% Thread count: 12 Physical Memory usage (peak): 544 MB Virtual Memory usage (peak): 540 MB Time (elapsed): 000:01:03.379 |
21st November 2013, 17:18 | #24 | Link |
Registered User
Join Date: Aug 2007
Posts: 374
|
Given how many people here turned out to use Nvidia cards, I spent some effort optimizing for them. Result is pretty consistent 15% speed boost: https://www.dropbox.com/s/exz8knrygk...ate20131121.7z Don't know why I bothered though, given how Nvidia treats OpenCL (really bad, if you didn't know).
Here you can see how quite minor changes can have noticeable impact on the speed. If you want your results included in first page table: please provide all the info! Third number (the "efficiency") is computed as your avg FPS with MTMode divided by theoretical TFLOPS on your frequency (so, lookup the reference FLOPS for your card, multiply it by your core frequency and divide by reference core frequency). |
21st November 2013, 17:29 | #25 | Link |
47.952fps@71.928Hz
Join Date: Mar 2011
Posts: 940
|
Was trying to participate, but Nvidia just had an update recently and it seems to give me BSOD after benchmark completes.
Currently, siphoning through the Nvidia forum for details and provide input and then falling back to previous version which didn't give me BSOD with this plugin. And hopefully will be able to get some results up by the end of the day.
__________________
Win10 (x64) build 19041 NVIDIA GeForce GTX 1060 3GB (GP106) 3071MB/GDDR5 | (r435_95-4) NTSC | DVD: R1 | BD: A AMD Ryzen 5 2600 @3.4GHz (6c/12th, I'm on AVX2 now!)
|
21st November 2013, 17:41 | #26 | Link |
Registered User
Join Date: Aug 2007
Posts: 374
|
Yeah, that's Nvidia drivers today. That they are more stable than AMD is pure myth. I also got BSODs recently only from Nvidia ones.
Speed of nnedi3ocl really depends on how wise/stupid OpenCL compiler in driver was, so it's worth trying several driver versions and see if it changes anything. |
22nd November 2013, 05:16 | #31 | Link |
Anime addict
Join Date: Feb 2009
Location: Spain
Posts: 673
|
Hi, when I execute this code:
Code:
LoadPlugin("C:\Program Files (x86)\AviSynth 2.5\plugins\nnedi3ocl.dll") SetMTMode(2,4) BlankClip(1000, 1280, 720, "YV12", 24000, 1001, 0) nnedi3ocl_rpow2(2, nns=2, qual=1) Build used is 2013.11.18. Graphic: AMD Radeon HD 4870. What am I doing wrong??
__________________
Intel i7-6700K + Noctua NH-D15 + Z170A XPower G. Titanium + Kingston HyperX Savage DDR4 2x8GB + Radeon RX580 8GB DDR5 + ADATA SX8200 Pro 1 TB + Antec EDG750 80 Plus Gold Mod + Corsair 780T Graphite |
22nd November 2013, 05:42 | #32 | Link |
契約者
Join Date: Jun 2008
Posts: 1,576
|
Did a quick test on GTX 570 with latest stable drivers, no MT mode and default settings, but there are no speed differences between the old and new version. The difference is always within 0.03 fps plus or minus with around 14.46 in total. I checked 3 times. Maybe later I'll check once more after I'll get some sleep...
BTW I used real 720p video. blankclip is a up to 80% faster even though source filter is capable of providing input with more than 500fps Last edited by Keiyakusha; 22nd November 2013 at 05:53. |
22nd November 2013, 07:35 | #33 | Link |
Registered User
Join Date: Feb 2003
Location: Russia, Moscow
Posts: 854
|
Hi all!
I was out my working horse, testing both version Code:
GPU 1: NVIDIA GeForce GTX 560 OpenCL 1.1, GeForce GTX 560 compute units:7@1620MHz 18 November cl code SetMTMode(2,4) FPS (min | max | average): 1.84 | 416268.26 | 7.43 CPU usage (average): 13% noMT FPS (min | max | average): 6.83 | 7.29 | 7.20 CPU usage (average): 13% 20 November cl code SetMTMode(2,4) FPS (min | max | average): 2.17 | 416267.00 | 8.73 CPU usage (average): 13% noMT FPS (min | max | average): 7.95 | 8.53 | 8.44 CPU usage (average): 14% yup. |
22nd November 2013, 12:13 | #34 | Link |
Registered User
Join Date: Aug 2007
Posts: 374
|
Overdrive80, try this version: https://www.dropbox.com/s/oz1xz9k8mx...cl_fixocl10.7z Your card only supports OpenCL 1.0 while I used 1.1 feature on it. Also note that Radeon HD4xxx not fully OpenCL "capable" (their local memory isn't conformant and emulated with global memory), so "efficiency" will be less than newer Radeons.
Keiyakusha, your speed is faster than it should be on previous version but slower than it should be on new one, huh... The problem with real scripts is that even with MTMode Avisynth scheduling is pretty bad and you likely see not 100% GPU load. Try putting source in MTMode 2 and/or increasing number of threads. |
23rd November 2013, 00:00 | #35 | Link |
Anime addict
Join Date: Feb 2009
Location: Spain
Posts: 673
|
Ok, thanks SEt. Here go my results:
- Four Threads: Code:
LoadPlugin("C:\Program Files (x86)\AviSynth 2.5\plugins\nnedi3ocl.dll") SetMTMode(2,4) BlankClip(1000, 1280, 720, "YV12", 24000, 1001, 0) nnedi3ocl_rpow2(2, nns=2, qual=1) Code:
Frames processed: 1000 (0 - 999) FPS (min | max | average): 0.63 | 325290.06 | 2.55 CPU usage (average): 6% Thread count: 9 Physical Memory usage (peak): 1123 MB Virtual Memory usage (peak): 1123 MB Time (elapsed): 000:06:32.524 Code:
LoadPlugin("C:\Program Files (x86)\AviSynth 2.5\plugins\nnedi3ocl.dll") SetMTMode(2,8) BlankClip(1000, 1280, 720, "YV12", 24000, 1001, 0) nnedi3ocl_rpow2(2, nns=2, qual=1) Code:
Frames processed: 1000 (0 - 999) FPS (min | max | average): 0.31 | 325290.06 | 2.52 CPU usage (average): 6% Thread count: 17 Physical Memory usage (peak): 1169 MB Virtual Memory usage (peak): 1178 MB Time (elapsed): 000:06:37.608 Code:
Frames processed: 1000 (0 - 999) FPS (min | max | average): 1.95 | 2.50 | 2.47 CPU usage (average): 2% Thread count: 2 Physical Memory usage (peak): 298 MB Virtual Memory usage (peak): 292 MB Time (elapsed): 000:06:44.659
__________________
Intel i7-6700K + Noctua NH-D15 + Z170A XPower G. Titanium + Kingston HyperX Savage DDR4 2x8GB + Radeon RX580 8GB DDR5 + ADATA SX8200 Pro 1 TB + Antec EDG750 80 Plus Gold Mod + Corsair 780T Graphite Last edited by Overdrive80; 23rd November 2013 at 00:14. |
23rd November 2013, 00:32 | #37 | Link | |
Registered User
Join Date: Oct 2001
Location: Germany
Posts: 7,277
|
Quote:
and iirc. at least all presets use nsize 1 and up,... checked: Code:
# Very Very Super Ultra # Preset groups: Placebo Slow Slower Slow Medium Fast Faster Fast Fast Fast Draft ... EdiMode = default( EdiMode, Select( pNum, "NNEDI3", "NNEDI3", "NNEDI3", "NNEDI3", "NNEDI3", "NNEDI3", "NNEDI3", "NNEDI3", "NNEDI3", "RepYadif","Bob" ) ) NNSize = default( NNSize, Select( pNum, 1, 1, 1, 1, 5, 5, 4, 4, 4, 4, 4 ) ) Last edited by Selur; 23rd November 2013 at 00:34. |
|
23rd November 2013, 01:24 | #38 | Link |
Registered User
Join Date: Aug 2007
Posts: 374
|
nsize=0 should be better than nsize=4, so shouldn't hurt using 0 instead of it. For other nsize it's effectively quality of connecting horizontal lines, so you can use 0 instead but quality will be worse than expected.
|
|
|