Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion.

Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules.

 

Go Back   Doom9's Forum > Capturing and Editing Video > Avisynth Development

Reply
 
Thread Tools Search this Thread Display Modes
Old 28th April 2020, 00:15   #1421  |  Link
Groucho2004
 
Join Date: Mar 2006
Location: Barcelona
Posts: 5,034
Quote:
Originally Posted by Atak_Snajpera View Post
Indeed. Gtx1660 Has 5x more tflops than 750. I wonder where is bottleneck in this case.
Maybe it's my oldish mobo (P8P67) with PCI Express 2.0 slot.
__________________
Groucho's Avisynth Stuff
Groucho2004 is offline   Reply With Quote
Old 28th April 2020, 00:25   #1422  |  Link
videoh
Useful n00b
 
Join Date: Jul 2014
Posts: 1,667
Is your test stream HEVC? Frame size? Didn't read the whole thread, perhaps you already said.
videoh is offline   Reply With Quote
Old 28th April 2020, 00:42   #1423  |  Link
Groucho2004
 
Join Date: Mar 2006
Location: Barcelona
Posts: 5,034
Quote:
Originally Posted by videoh View Post
Is your test stream HEVC? Frame size? Didn't read the whole thread, perhaps you already said.
Decoding 4K HEVC (SonyCamp4K) I get 155 fps with DGDecNV. 1080p H.264 is ~ 500 fps. That seems pretty fast.
__________________
Groucho's Avisynth Stuff
Groucho2004 is offline   Reply With Quote
Old 28th April 2020, 01:05   #1424  |  Link
Sparktank
47.952fps@71.928Hz
 
Sparktank's Avatar
 
Join Date: Mar 2011
Posts: 940
Quote:
Originally Posted by Groucho2004 View Post
I updated my graphics card from GTX750 to GTX1660 a few days ago. I'm getting about twice the speed with KNLMeansCL. I was expecting a bit more but still, nice speed-up.
I need to get one of those.
The 1060 is a nice upgrade over my 750ti.
__________________
Win10 (x64) build 19041
NVIDIA GeForce GTX 1060 3GB (GP106) 3071MB/GDDR5 | (r435_95-4)
NTSC | DVD: R1 | BD: A
AMD Ryzen 5 2600 @3.4GHz (6c/12th, I'm on AVX2 now!)
Sparktank is offline   Reply With Quote
Old 28th April 2020, 02:02   #1425  |  Link
videoh
Useful n00b
 
Join Date: Jul 2014
Posts: 1,667
Quote:
Originally Posted by Groucho2004 View Post
Decoding 4K HEVC (SonyCamp4K) I get 155 fps with DGDecNV.
Pretty good. I'm getting 326 fps with my 2080 Ti.
videoh is offline   Reply With Quote
Old 28th April 2020, 02:31   #1426  |  Link
Groucho2004
 
Join Date: Mar 2006
Location: Barcelona
Posts: 5,034
Quote:
Originally Posted by videoh View Post
I'm getting 326 fps with my 2080 Ti.
Which is 6-7 times the price of the 1660.
__________________
Groucho's Avisynth Stuff
Groucho2004 is offline   Reply With Quote
Old 28th April 2020, 03:05   #1427  |  Link
videoh
Useful n00b
 
Join Date: Jul 2014
Posts: 1,667
My progress bar of life is nearing the end (probably not soon enough for a lot of you). I'm not leaving my money to anybody. Gonna live it up!
videoh is offline   Reply With Quote
Old 28th April 2020, 08:05   #1428  |  Link
LigH
German doom9/Gleitz SuMo
 
LigH's Avatar
 
Join Date: Oct 2001
Location: Germany, rural Altmark
Posts: 6,753
I still have a PC with Phenom-II X6 and GT450. I seriously need a new PC ... but to be future-proof, it would cost me a month's wages.
__________________

New German Gleitz board
MediaFire: x264 | x265 | VPx | AOM | Xvid
LigH is offline   Reply With Quote
Old 28th April 2020, 08:15   #1429  |  Link
Boulder
Pig on the wing
 
Boulder's Avatar
 
Join Date: Mar 2002
Location: Finland
Posts: 5,718
Quote:
Originally Posted by LigH View Post
I still have a PC with Phenom-II X6 and GT450. I seriously need a new PC ... but to be future-proof, it would cost me a month's wages.
An X470 board + Ryzen 3700X is a quite affordable upgrade, which gives a big boost in video encoding and will easily last several years. AVX2 performance was increased substantially compared to the first-gen Ryzens. I don't expect to see any major jumps in the near future so my 3900X will do nicely for years (the next gen will require a new mobo and DDR5 RAM so no rush).
RAM is not too expensive either, the speed won't matter much when encoding.
__________________
And if the band you're in starts playing different tunes
I'll see you on the dark side of the Moon...
Boulder is offline   Reply With Quote
Old 28th April 2020, 08:31   #1430  |  Link
MeteorRain
結城有紀
 
Join Date: Dec 2003
Location: NJ; OR; Shanghai
Posts: 894
I'm on B450 + Ryzen 3600 + GTX 1650S and I feel like it's pretty future proof.
__________________
Projects
x265 - Yuuki-Asuna-mod Download / GitHub
TS - ADTS AAC Splitter | LATM AAC Splitter | BS4K-ASS
Neo AviSynth+ filters - F3KDB | FFT3D | DFTTest | MiniDeen | Temporal Median
MeteorRain is offline   Reply With Quote
Old 28th April 2020, 08:49   #1431  |  Link
tormento
Acid fr0g
 
tormento's Avatar
 
Join Date: May 2002
Location: Italy
Posts: 2,542
Quote:
Originally Posted by Boulder View Post
AVX2 performance was increased substantially compared to the first-gen Ryzens.
Zen 2 is a terrific architecture but about AVX2, that is sort of “emulated”. There are many articles about this topic, look at them if interested.
__________________
@turment on Telegram
tormento is offline   Reply With Quote
Old 28th April 2020, 09:07   #1432  |  Link
Boulder
Pig on the wing
 
Boulder's Avatar
 
Join Date: Mar 2002
Location: Finland
Posts: 5,718
Quote:
Originally Posted by tormento View Post
Zen 2 is a terrific architecture but about AVX2, that is sort of “emulated”. There are many articles about this topic, look at them if interested.
I'm not sure I follow you. The first-gen Zen did some tricks which is why it was quite poor in AVX2. The current gen has full AVX2 support.

https://forum.doom9.org/showthread.php?t=174393&page=2 shows nicely how big the difference is. A 32-thread 1950X scored 4.26 for AVX2 and a stock 3900X (24 threads) got 5.51.
__________________
And if the band you're in starts playing different tunes
I'll see you on the dark side of the Moon...
Boulder is offline   Reply With Quote
Old 28th April 2020, 09:33   #1433  |  Link
MeteorRain
結城有紀
 
Join Date: Dec 2003
Location: NJ; OR; Shanghai
Posts: 894
Quote:
Originally Posted by tormento View Post
Zen 2 is a terrific architecture but about AVX2, that is sort of “emulated”. There are many articles about this topic, look at them if interested.
Zen2 has full AVX2 capability. Not sure where you got that "emulated" idea.
__________________
Projects
x265 - Yuuki-Asuna-mod Download / GitHub
TS - ADTS AAC Splitter | LATM AAC Splitter | BS4K-ASS
Neo AviSynth+ filters - F3KDB | FFT3D | DFTTest | MiniDeen | Temporal Median
MeteorRain is offline   Reply With Quote
Old 28th April 2020, 10:06   #1434  |  Link
tormento
Acid fr0g
 
tormento's Avatar
 
Join Date: May 2002
Location: Italy
Posts: 2,542
Quote:
Originally Posted by MeteorRain View Post
Zen2 has full AVX2 capability. Not sure where you got that "emulated" idea.
Unfortunately I can't find the page anymore, I can summarize only.

By SIMDjson benchmark, it was shown that Intel has more AVX2 IPC than Zen2.

Apart from that, the higher core count could nullify that penalty.

Zen3 should come across that and introduce AVX512 too.
__________________
@turment on Telegram

Last edited by tormento; 28th April 2020 at 10:08.
tormento is offline   Reply With Quote
Old 28th April 2020, 10:33   #1435  |  Link
MeteorRain
結城有紀
 
Join Date: Dec 2003
Location: NJ; OR; Shanghai
Posts: 894
Quote:
Originally Posted by tormento View Post
Unfortunately I can't find the page anymore, I can summarize only.

By SIMDjson benchmark, it was shown that Intel has more AVX2 IPC than Zen2.

Apart from that, the higher core count could nullify that penalty.

Zen3 should come across that and introduce AVX512 too.
I found that benchmark. It was concluded that AMD has a poor IPC performance on AVX2. That is kinda expected, and actually many factors matters including that SIMDjson was optimized against Intel processors.
CPU pipelines, instruction order, cache size, even picking one instruction over another, can make a lot difference.
__________________
Projects
x265 - Yuuki-Asuna-mod Download / GitHub
TS - ADTS AAC Splitter | LATM AAC Splitter | BS4K-ASS
Neo AviSynth+ filters - F3KDB | FFT3D | DFTTest | MiniDeen | Temporal Median
MeteorRain is offline   Reply With Quote
Old 28th April 2020, 10:42   #1436  |  Link
tormento
Acid fr0g
 
tormento's Avatar
 
Join Date: May 2002
Location: Italy
Posts: 2,542
Quote:
Originally Posted by MeteorRain View Post
That is kinda expected, and actually many factors matters including that SIMDjson was optimized against Intel processors.
Wasn't compiled using neutral flags?
__________________
@turment on Telegram
tormento is offline   Reply With Quote
Old 28th April 2020, 10:44   #1437  |  Link
Groucho2004
 
Join Date: Mar 2006
Location: Barcelona
Posts: 5,034
Conversation is getting a bit off-topic...
__________________
Groucho's Avisynth Stuff
Groucho2004 is offline   Reply With Quote
Old 28th April 2020, 23:14   #1438  |  Link
MeteorRain
結城有紀
 
Join Date: Dec 2003
Location: NJ; OR; Shanghai
Posts: 894
Quote:
Originally Posted by tormento View Post
Wasn't compiled using neutral flags?
A highly optimized library wouldn't use a neutral flag. Also there's no neutral flag. If you have Intel CPU then all you can do is make sure your program runs fastest on that Intel CPU. If later AMD CPU appears and has different characteristics, likely some of the previous optimization won't make it as fast as on Intel CPU.

One example would be Intel has avx down clocking. So to reduce the chance of that you can carefully reduce the amount of heavy avx computing, and let CPU stay on light load frequency (higher frequency). But AMD doesn't have avx down clocking so the best thing to do is give it all the heavy lifting as much as possible.

Intel compiler for example when compiling avx 512, will take the down clocking in to account, and emit code that is not too hard for the CPU, so it won't run too fast to run slow (lol) but if AMD comes with 512 some day and it does not clock down, then when it runs the same binary it would not use its 100% potential.
__________________
Projects
x265 - Yuuki-Asuna-mod Download / GitHub
TS - ADTS AAC Splitter | LATM AAC Splitter | BS4K-ASS
Neo AviSynth+ filters - F3KDB | FFT3D | DFTTest | MiniDeen | Temporal Median
MeteorRain is offline   Reply With Quote
Old 29th April 2020, 00:00   #1439  |  Link
tormento
Acid fr0g
 
tormento's Avatar
 
Join Date: May 2002
Location: Italy
Posts: 2,542
Quote:
Originally Posted by MeteorRain View Post
One example would be Intel has avx down clocking.
AFAIK it's a bios option, not mandatory.
__________________
@turment on Telegram
tormento is offline   Reply With Quote
Old 27th May 2020, 12:58   #1440  |  Link
hintlou
Registered User
 
Join Date: Nov 2019
Posts: 16
After I updated some filters (yadifmod2-0.2.4,SangNom2-0.5.0,DCTFilter-0.5.1,vsMSharpen-2.0.0,MatchHistogram-2.1.0,TMM2-0.1.4,CombMask-2.0.0 and MPEG2DecPlus-0.1.2),my QTGMC 3.364 got error and warned me that "KNLMeans.dll cannot be used as a plugin for Avisynth".
I don't know which filter caused it.
hintlou is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 14:28.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2024, vBulletin Solutions Inc.