Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion.

Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules.

 

Go Back   Doom9's Forum > Capturing and Editing Video > Avisynth Development

Reply
 
Thread Tools Search this Thread Display Modes
Old 20th November 2013, 06:44   #21  |  Link
yup
Registered User
 
Join Date: Feb 2003
Location: Russia, Moscow
Posts: 854
SEt
Test for GTX560
Code:
GPU 1: NVIDIA GeForce GTX 560
OpenCL 1.1, GeForce GTX 560 compute units:7@1620MHz
FPS (min | max | average):       1.84 | 416268.26 | 7.43
CPU usage (average):             13%
SetMTMode(2,4), version 18 November
yup.

Last edited by yup; 20th November 2013 at 09:46. Reason: add info
yup is offline   Reply With Quote
Old 20th November 2013, 08:18   #22  |  Link
SEt
Registered User
 
Join Date: Aug 2007
Posts: 374
yup, was it with MT or without? Need both for statistics.
SEt is offline   Reply With Quote
Old 21st November 2013, 12:45   #23  |  Link
Selur
Registered User
 
Selur's Avatar
 
Join Date: Oct 2001
Location: Germany
Posts: 7,347
using a NVIDIA GeForce GTX 660 ti and
Code:
LoadPlugin("nnedi3ocl.dll")
SetMTMode(2,8)
BlankClip(1000, 1280, 720, "YV12", 24000, 1001, 0)
nnedi3ocl_rpow2(2, nns=2, qual=1)
I got:
Code:
AVSMeter 1.7.2 [AVS2.6] by Groucho2004
AviSynth 2.60, build:Sep 28 2013 [15:09:12]
Active MT Mode: 2

Number of frames:                  1000
Length (hhh:mm:ss.ms):    000:00:41.708
Frame width:                       2560
Frame height:                      1440
Framerate:                       23.976 (24000/1001)
Interlaced:                          No
Colorspace:                        YV12

Frames processed:                1000 (0 - 999)
FPS (min | max | average):       1.93 | 419534.11 | 15.70
CPU usage (average):             13%
Thread count:                    22
Physical Memory usage (peak):    569 MB
Virtual Memory usage (peak):     552 MB
Time (elapsed):                  000:01:03.714
using:
Code:
LoadPlugin("nnedi3ocl.dll")
#SetMTMode(2,8)
BlankClip(1000, 1280, 720, "YV12", 24000, 1001, 0)
nnedi3ocl_rpow2(2, nns=2, qual=1)
I got:
Code:
Active MT Mode: 0

Number of frames:                  1000
Length (hhh:mm:ss.ms):    000:00:41.708
Frame width:                       2560
Frame height:                      1440
Framerate:                       23.976 (24000/1001)
Interlaced:                          No
Colorspace:                        YV12

Frames processed:                1000 (0 - 999)
FPS (min | max | average):       11.83 | 14.50 | 14.24
CPU usage (average):             12%
Thread count:                    8
Physical Memory usage (peak):    540 MB
Virtual Memory usage (peak):     535 MB
Time (elapsed):                  000:01:10.224
using:
Code:
LoadPlugin("nnedi3ocl.dll")
SetMTMode(2,4)
BlankClip(1000, 1280, 720, "YV12", 24000, 1001, 0)
nnedi3ocl_rpow2(2, nns=2, qual=1)
I got:
Code:
Active MT Mode: 2

Number of frames:                  1000
Length (hhh:mm:ss.ms):    000:00:41.708
Frame width:                       2560
Frame height:                      1440
Framerate:                       23.976 (24000/1001)
Interlaced:                          No
Colorspace:                        YV12

Frames processed:                1000 (0 - 999)
FPS (min | max | average):       3.71 | 419534.11 | 15.78
CPU usage (average):             13%
Thread count:                    12
Physical Memory usage (peak):    544 MB
Virtual Memory usage (peak):     540 MB
Time (elapsed):                  000:01:03.379
Cu Selur
__________________
Hybrid here in the forum, homepage
Selur is offline   Reply With Quote
Old 21st November 2013, 17:18   #24  |  Link
SEt
Registered User
 
Join Date: Aug 2007
Posts: 374
Given how many people here turned out to use Nvidia cards, I spent some effort optimizing for them. Result is pretty consistent 15% speed boost: https://www.dropbox.com/s/exz8knrygk...ate20131121.7z Don't know why I bothered though, given how Nvidia treats OpenCL (really bad, if you didn't know).

Here you can see how quite minor changes can have noticeable impact on the speed.

If you want your results included in first page table: please provide all the info! Third number (the "efficiency") is computed as your avg FPS with MTMode divided by theoretical TFLOPS on your frequency (so, lookup the reference FLOPS for your card, multiply it by your core frequency and divide by reference core frequency).
SEt is offline   Reply With Quote
Old 21st November 2013, 17:29   #25  |  Link
Sparktank
47.952fps@71.928Hz
 
Sparktank's Avatar
 
Join Date: Mar 2011
Posts: 940
Was trying to participate, but Nvidia just had an update recently and it seems to give me BSOD after benchmark completes.

Currently, siphoning through the Nvidia forum for details and provide input and then falling back to previous version which didn't give me BSOD with this plugin.
And hopefully will be able to get some results up by the end of the day.
__________________
Win10 (x64) build 19041
NVIDIA GeForce GTX 1060 3GB (GP106) 3071MB/GDDR5 | (r435_95-4)
NTSC | DVD: R1 | BD: A
AMD Ryzen 5 2600 @3.4GHz (6c/12th, I'm on AVX2 now!)
Sparktank is offline   Reply With Quote
Old 21st November 2013, 17:41   #26  |  Link
SEt
Registered User
 
Join Date: Aug 2007
Posts: 374
Yeah, that's Nvidia drivers today. That they are more stable than AMD is pure myth. I also got BSODs recently only from Nvidia ones.

Speed of nnedi3ocl really depends on how wise/stupid OpenCL compiler in driver was, so it's worth trying several driver versions and see if it changes anything.
SEt is offline   Reply With Quote
Old 21st November 2013, 18:15   #27  |  Link
Groucho2004
 
Join Date: Mar 2006
Location: Barcelona
Posts: 5,034
Code:
0.91, 0.91, 2.35, GeForce GT 240, 550, 2, i5-2500K@4GHz, 2013.11.21
I grabbed the number for FLOPS from here, I hope that's the right place.

Last edited by Groucho2004; 21st November 2013 at 18:29.
Groucho2004 is offline   Reply With Quote
Old 21st November 2013, 20:29   #28  |  Link
olcifaraga
Registered User
 
Join Date: Sep 2012
Posts: 4
5.28 5.48 GeForce GT 750M, 967, 1.1, i7-4700HQ@2.4 2013.11.21
olcifaraga is offline   Reply With Quote
Old 21st November 2013, 20:51   #29  |  Link
mikeyakame
lookin for my sanity
 
Join Date: Feb 2007
Location: it all depends on the day and which country comes to mind
Posts: 42
@SEt

I get about a 1.6% speed increase with the 2013.11.21 build and 331.82 drivers.
I'll check more later, off to work.
mikeyakame is offline   Reply With Quote
Old 22nd November 2013, 00:14   #30  |  Link
lansing
Registered User
 
Join Date: Sep 2006
Posts: 1,657
well mine went from 9.60fps to 10.69fps with mt on with the new build, 11% increase, not bad
lansing is offline   Reply With Quote
Old 22nd November 2013, 05:16   #31  |  Link
Overdrive80
Anime addict
 
Overdrive80's Avatar
 
Join Date: Feb 2009
Location: Spain
Posts: 673
Hi, when I execute this code:

Code:
LoadPlugin("C:\Program Files (x86)\AviSynth 2.5\plugins\nnedi3ocl.dll")
SetMTMode(2,4)
BlankClip(1000, 1280, 720, "YV12", 24000, 1001, 0)
nnedi3ocl_rpow2(2, nns=2, qual=1)
I get this error message:



Build used is 2013.11.18. Graphic: AMD Radeon HD 4870.

What am I doing wrong??
__________________
Intel i7-6700K + Noctua NH-D15 + Z170A XPower G. Titanium + Kingston HyperX Savage DDR4 2x8GB + Radeon RX580 8GB DDR5 + ADATA SX8200 Pro 1 TB + Antec EDG750 80 Plus Gold Mod + Corsair 780T Graphite
Overdrive80 is offline   Reply With Quote
Old 22nd November 2013, 05:42   #32  |  Link
Keiyakusha
契約者
 
Keiyakusha's Avatar
 
Join Date: Jun 2008
Posts: 1,576
Did a quick test on GTX 570 with latest stable drivers, no MT mode and default settings, but there are no speed differences between the old and new version. The difference is always within 0.03 fps plus or minus with around 14.46 in total. I checked 3 times. Maybe later I'll check once more after I'll get some sleep...

BTW I used real 720p video. blankclip is a up to 80% faster even though source filter is capable of providing input with more than 500fps

Last edited by Keiyakusha; 22nd November 2013 at 05:53.
Keiyakusha is offline   Reply With Quote
Old 22nd November 2013, 07:35   #33  |  Link
yup
Registered User
 
Join Date: Feb 2003
Location: Russia, Moscow
Posts: 854
Hi all!
I was out my working horse, testing both version
Code:
GPU 1: NVIDIA GeForce GTX 560
OpenCL 1.1, GeForce GTX 560 compute units:7@1620MHz
18 November cl code
SetMTMode(2,4)
FPS (min | max | average):       1.84 | 416268.26 | 7.43
CPU usage (average):             13%
noMT
FPS (min | max | average):       6.83 | 7.29 | 7.20
CPU usage (average):             13%
20 November cl code
SetMTMode(2,4)
FPS (min | max | average):       2.17 | 416267.00 | 8.73
CPU usage (average):             13%
noMT
FPS (min | max | average):       7.95 | 8.53 | 8.44
CPU usage (average):             14%
Last version give speed up more than 10%.
yup.
yup is offline   Reply With Quote
Old 22nd November 2013, 12:13   #34  |  Link
SEt
Registered User
 
Join Date: Aug 2007
Posts: 374
Overdrive80, try this version: https://www.dropbox.com/s/oz1xz9k8mx...cl_fixocl10.7z Your card only supports OpenCL 1.0 while I used 1.1 feature on it. Also note that Radeon HD4xxx not fully OpenCL "capable" (their local memory isn't conformant and emulated with global memory), so "efficiency" will be less than newer Radeons.

Keiyakusha, your speed is faster than it should be on previous version but slower than it should be on new one, huh...
The problem with real scripts is that even with MTMode Avisynth scheduling is pretty bad and you likely see not 100% GPU load. Try putting source in MTMode 2 and/or increasing number of threads.
SEt is offline   Reply With Quote
Old 23rd November 2013, 00:00   #35  |  Link
Overdrive80
Anime addict
 
Overdrive80's Avatar
 
Join Date: Feb 2009
Location: Spain
Posts: 673
Ok, thanks SEt. Here go my results:

- Four Threads:

Code:
LoadPlugin("C:\Program Files (x86)\AviSynth 2.5\plugins\nnedi3ocl.dll")
SetMTMode(2,4)
BlankClip(1000, 1280, 720, "YV12", 24000, 1001, 0)
nnedi3ocl_rpow2(2, nns=2, qual=1)
Code:
Frames processed:                1000 (0 - 999)
FPS (min | max | average):       0.63 | 325290.06 | 2.55
CPU usage (average):             6%
Thread count:                    9
Physical Memory usage (peak):    1123 MB
Virtual Memory usage (peak):     1123 MB
Time (elapsed):                  000:06:32.524
- Eight Threads:

Code:
LoadPlugin("C:\Program Files (x86)\AviSynth 2.5\plugins\nnedi3ocl.dll")
SetMTMode(2,8)
BlankClip(1000, 1280, 720, "YV12", 24000, 1001, 0)
nnedi3ocl_rpow2(2, nns=2, qual=1)
Code:
Frames processed:                1000 (0 - 999)
FPS (min | max | average):       0.31 | 325290.06 | 2.52
CPU usage (average):             6%
Thread count:                    17
Physical Memory usage (peak):    1169 MB
Virtual Memory usage (peak):     1178 MB
Time (elapsed):                  000:06:37.608
- None Threads:

Code:
Frames processed:                1000 (0 - 999)
FPS (min | max | average):       1.95 | 2.50 | 2.47
CPU usage (average):             2%
Thread count:                    2
Physical Memory usage (peak):    298 MB
Virtual Memory usage (peak):     292 MB
Time (elapsed):                  000:06:44.659
__________________
Intel i7-6700K + Noctua NH-D15 + Z170A XPower G. Titanium + Kingston HyperX Savage DDR4 2x8GB + Radeon RX580 8GB DDR5 + ADATA SX8200 Pro 1 TB + Antec EDG750 80 Plus Gold Mod + Corsair 780T Graphite

Last edited by Overdrive80; 23rd November 2013 at 00:14.
Overdrive80 is offline   Reply With Quote
Old 23rd November 2013, 00:22   #36  |  Link
Gser
Registered User
 
Join Date: Apr 2008
Posts: 418
Anybody tried putting this into QTGMC yet?
Gser is offline   Reply With Quote
Old 23rd November 2013, 00:32   #37  |  Link
Selur
Registered User
 
Selur's Avatar
 
Join Date: Oct 2001
Location: Germany
Posts: 7,347
Quote:
Anybody tried putting this into QTGMC yet?
"Only nsize=0 implemented, other values silently ignored."
and iirc. at least all presets use nsize 1 and up,...

checked:
Code:
	#                                                               Very                                                        Very      Super      Ultra
	# Preset groups:                                     Placebo    Slow      Slower    Slow      Medium    Fast      Faster    Fast      Fast       Fast       Draft
...
	EdiMode      = default( EdiMode,      Select( pNum, "NNEDI3",  "NNEDI3", "NNEDI3", "NNEDI3", "NNEDI3", "NNEDI3", "NNEDI3", "NNEDI3", "NNEDI3",  "RepYadif","Bob"   ) )
	NNSize       = default( NNSize,       Select( pNum,  1,         1,        1,        1,        5,        5,        4,        4,        4,         4,         4      ) )
-> atm. it's not really that interesting for QTGMC
__________________
Hybrid here in the forum, homepage

Last edited by Selur; 23rd November 2013 at 00:34.
Selur is offline   Reply With Quote
Old 23rd November 2013, 01:24   #38  |  Link
SEt
Registered User
 
Join Date: Aug 2007
Posts: 374
nsize=0 should be better than nsize=4, so shouldn't hurt using 0 instead of it. For other nsize it's effectively quality of connecting horizontal lines, so you can use 0 instead but quality will be worse than expected.
SEt is offline   Reply With Quote
Old 24th November 2013, 20:37   #39  |  Link
bcn_246
Registered User
 
bcn_246's Avatar
 
Join Date: Nov 2005
Location: UK
Posts: 117
Quote:
Originally Posted by zero9999 View Post
Thanks a million. I assume it's just the nnedi3_resize16_rpow2 part thats been modded for OCL?

Last edited by bcn_246; 24th November 2013 at 20:40.
bcn_246 is offline   Reply With Quote
Old 26th November 2013, 01:36   #40  |  Link
zero9999
Registered User
 
Join Date: Oct 2011
Posts: 52
Quote:
Originally Posted by bcn_246 View Post
Thanks a million. I assume it's just the nnedi3_resize16_rpow2 part thats been modded for OCL?
yes, ofc also calls to this function to pass on the gpu parameter.
zero9999 is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 02:39.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2024, vBulletin Solutions Inc.