Old 17th September 2014, 16:47   #381  |  Link
nevcairiel
Registered Developer
 
Join Date: Mar 2010
Location: Hamburg/Germany
Posts: 10,344
I'm not talking about OpenCL, but about NVIDIA's own hybrid decoder included in the driver, the same thing that's exposed through DXVA2.
It's not available through any software that I know of so far, so maybe mine will be the first.
__________________
LAV Filters - open source ffmpeg based media splitter and decoders
Old 17th September 2014, 17:05   #382  |  Link
NikosD
Registered User
 
Join Date: Aug 2010
Location: Athens, Greece
Posts: 2,901
Nice!

The reason I asked is that I thought hybrid DXVA decoders would behave the same way as OpenCL decoders, and you just confirmed that!

Is there any VP5 assistance for H.265, or is it just the SMXs and the CPU?

I ask because the 780 has a lot of shaders, and other Kepler GPUs will be a lot slower than that.

Have you tried Intel's DXVA H.265 decoder?
__________________
Win 10 x64 (19042.572) - Core i5-2400 - Radeon RX 470 (20.10.1)
HEVC decoding benchmarks
H.264 DXVA Benchmarks for all
Old 18th September 2014, 17:05   #383  |  Link
nevcairiel
Registered Developer
 
Join Date: Mar 2010
Location: Hamburg/Germany
Posts: 10,344
It's unclear which features of the GPU it really uses, and I've been busy implementing it instead of worrying about driver details.
In any case, I have both LAV's CUVID and DXVA2 working on NVIDIA. For some reason DXVA2 is still broken on Intel; there's more to do in the coming days to get to the bottom of this. QuickSync (or more specifically, Intel's MediaSDK) doesn't seem to expose HEVC support yet, as far as I can see.

Drivers only expose 8-bit modes so far.
Old 18th September 2014, 17:24   #384  |  Link
JohnLai
Registered User
 
Join Date: Mar 2008
Posts: 448
It would be great if Intel GPU-assisted HEVC DXVA worked, considering that most ordinary users tend to use an Intel Haswell CPU/iGPU these days. Too bad: I heard Intel won't implement this DXVA workaround on older Ivy Bridge iGPUs (particularly HD 2000 and 4000).
Old 18th September 2014, 18:02   #385  |  Link
NikosD
Registered User
 
Join Date: Aug 2010
Location: Athens, Greece
Posts: 2,901
Quote:
Originally Posted by nevcairiel View Post
Its unclear which features it really uses of the GPU, and I've been busy implementing it instead of worrying about driver details.
In any case, I have both LAV's CUVID and DXVA2 working on NVIDIA, for some reason DXVA2 still is broken on Intel, more to do in the coming days to get to the bottom of this. QuickSync (or more specifically, Intel's MediaSDK) doesn't seem to expose HEVC support yet, as far as i can see.

Drivers only expose 8-bit modes so far.
Thanks for the info.

It's bad for Intel to be the first to officially implement a hybrid decoder, only to ship it broken.

Did you try the latest official drivers, v.3907?

As for Nvidia, I guess they will not have implemented a hybrid decoder with fixed-function assistance.

I think it will make a difference for low-power Kepler/Maxwell cards.

It's easy to check by using a monitoring tool like GPU-Z or a gadget while playing or benchmarking with the hybrid decoder.

Nvidia offers a VPU load metric, separate from GPU load, which shows exactly that: the fixed-function load.

You definitely don't have to look inside the driver; you can look at the decoding itself instead.
Old 18th September 2014, 18:05   #386  |  Link
nevcairiel
Registered Developer
 
Join Date: Mar 2010
Location: Hamburg/Germany
Posts: 10,344
I didn't say theirs is broken. It may just as well be my code not being finished yet; it's hard to judge when you have no alternate reference implementation to test whether the hardware/drivers work.
Old 19th September 2014, 08:49   #387  |  Link
nevcairiel
Registered Developer
 
Join Date: Mar 2010
Location: Hamburg/Germany
Posts: 10,344
Maxwell v2 has a hardware HEVC encoder, but still only the hybrid decoder. I'll be getting one of those cards; maybe the hybrid decoder at least got faster on that architecture.
Old 19th September 2014, 10:39   #388  |  Link
NikosD
Registered User
 
Join Date: Aug 2010
Location: Athens, Greece
Posts: 2,901
Anandtech says it uses only shaders and fixed-function HW for H.265 decoding.

Is that accurate?

Is there an ETA for the release of your new decoder, including the HEVC hybrid mode?

I would like to test it myself on a Kepler 740M card inside my girlfriend's laptop.

It would be even better if I could test Intel's H.265 decoder too.

Thanks.
Old 9th January 2015, 13:34   #389  |  Link
Yups
Registered User
 
Join Date: Sep 2011
Posts: 362
By the way, HEVC decoding works on my Haswell iGPU with MPC 1.7.7 via DXVA2. CPU utilization is much lower than before.
Old 9th January 2015, 14:36   #390  |  Link
NikosD
Registered User
 
Join Date: Aug 2010
Location: Athens, Greece
Posts: 2,901
I have opened a new thread for HEVC decoding.

It contains a lot of discussion and figures on HEVC decoding.

See the link in my signature.
__________________
Win 10 x64 (19042.572) - Core i5-2400 - Radeon RX 470 (20.10.1)
HEVC decoding benchmarks
H.264 DXVA Benchmarks for all
Old 28th February 2015, 21:06   #391  |  Link
NikosD
Registered User
 
Join Date: Aug 2010
Location: Athens, Greece
Posts: 2,901
Now that all three companies (Intel, Nvidia, AMD) have updated their fixed-function video decoders, I wonder how they perform on H.264 1080p clips.

If someone has an Nvidia GTX 960, an AMD R9 285, or an Ivy Bridge or Broadwell iGPU, I would like to see their results using DXVA Checker x64 v3.3.2 in pure decode mode and LAV x64 v0.64 in DXVA native mode for the clips from the first post.

Clips 1 to 6 are here: ftp://helpedia.com/pub/multimedia/x2...s%20UVD%202.2/

Clips 7 to 10 are here: ftp://helpedia.com/pub/multimedia/x2....2%20vs%20VP4/

Last edited by NikosD; 6th March 2015 at 21:05.
Old 5th March 2015, 09:48   #392  |  Link
P.J
Δ
 
Join Date: Jun 2008
Posts: 535
Quote:
Originally Posted by NikosD View Post
Now that all three companies (Intel, Nvidia, AMD) have updated their fixed-function video decoders, I wonder what is their performance in H.264 1080p clips.

If someone has an Nvidia 960 GTX or AMD R9 285 or Broadwell iGPU, I would like to see their results using DXVA Checker x64 v3.3.2 in pure decode mode and LAV x64 v0.64 in DXVA native for the clips with huge bitrate from 7 to 10 from the first post.
ftp://helpedia.com/pub/multimedia/x2....2%20vs%20VP4/

Any of those clips would definitely give us an idea of H.264 decoding performance.
1. Twinpeaks-30fps
VP7 LAV NATIVE 475/475/475

2. Samsung-30fps
VP7 LAV NATIVE 293/347/404

3. Basket-60fps
VP7 LAV NATIVE 558/562/565

4. Girls-60fps
VP7 LAV NATIVE 534/540/548

5. Birds-60fps
VP7 LAV NATIVE 474/503/527

6. Cat-60fps
VP7 LAV NATIVE 510/518/524

7. Vortex-24fps
VP7 LAV NATIVE 169/171/170

8. Birds-24fps
VP7 LAV NATIVE 171/177/176

9. Ducks -30fps
VP7 LAV NATIVE 181/191/192

10. Crowd Run-25fps
VP7 LAV NATIVE 152/154/152


Just be aware of the core/memory clocks.
I set 'Repeat Count' to '8' because it's very slow in the first benchmark.
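
To put these figures in perspective, the rough Python sketch below divides each result by the clip's nominal frame rate to get a "times real time" factor. It assumes the three numbers are frames-per-second readings and uses the middle one as representative; the thread does not say what each of the three stands for. The names `RESULTS` and `realtime_factors` are made up for this illustration, and only three of the ten clips are copied in.

```python
import re

# A few lines copied from the results above (not the full list).
RESULTS = """\
1. Twinpeaks-30fps
VP7 LAV NATIVE 475/475/475
7. Vortex-24fps
VP7 LAV NATIVE 169/171/170
10. Crowd Run-25fps
VP7 LAV NATIVE 152/154/152
"""

# "7. Vortex-24fps" -> clip number, clip name, nominal frame rate
CLIP_RE = re.compile(r"^(\d+)\.\s*(.+?)\s*-(\d+)fps$")
# "VP7 LAV NATIVE 169/171/170" -> label and three figures
RESULT_RE = re.compile(r"^(.+?)\s+(\d+)/(\d+)/(\d+)$")


def realtime_factors(text):
    """Return {clip name: middle figure / nominal clip fps}, rounded to 0.1."""
    factors = {}
    clip = None
    for raw in text.splitlines():
        line = raw.strip()
        header = CLIP_RE.match(line)
        if header:
            clip = (header.group(2), int(header.group(3)))
            continue
        result = RESULT_RE.match(line)
        if result and clip:
            name, fps = clip
            # Assumption: the middle figure is the representative one.
            factors[name] = round(int(result.group(3)) / fps, 1)
    return factors


if __name__ == "__main__":
    for name, factor in realtime_factors(RESULTS).items():
        print(f"{name}: about {factor}x real time")
```

Under those assumptions the three sample clips come out at roughly 15.8x, 7.1x and 6.2x real time.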

Last edited by P.J; 6th March 2015 at 16:58. Reason: Added more tests...
Old 6th March 2015, 20:28   #393  |  Link
NikosD
Registered User
 
Join Date: Aug 2010
Location: Athens, Greece
Posts: 2,901
Pure ASIC decoding performance on 1080p H.264

Using DXVA Checker x64 v3.3.2 in pure decode mode and LAV Video x64 v0.64 in DXVA native mode, I did the tests again with modern and older ASICs for the whole collection of clips from 1 to 10.

The VP7 (Nvidia GTX 960) results are from P.J, whose GPU runs at high clocks (GPU clock = 1.47GHz).
Neet009's results, from a different GTX 960, are a little lower.


Results:


1. Twinpeaks-30fps


QS3 1070/1070/1070

CPU 4790 515/515/515

QS1 484/484/484

VP7 475/475/475

RX 470 414/419/414 [Playback] - New driver

RX 470 312/338/380 [Decode] - New driver

RX 470 288/308/326 [Playback]

RX 470 279/289/310 [Decode]

VP5 125/139/143

Intel GMA HD 85/128/142

VP4 80/84/86

UVD2.2 46/56/62

UVD+ 2/51/60



2. Samsung-30fps


QS3 767/808/872

VP7 293/347/404

QS1 293/340/376

CPU 4790 218/268/349

RX 470 218/265/328 [Playback] - New driver

RX 470 199/240/306 [Decode] - New driver

RX 470 168/209/259 [Playback]

RX 470 158/200/263 [Decode]

VP5 109/119/125

Intel GMA HD 51/74/103

VP4 34/55/82

UVD2.2 35/46/55



3. Basket-60fps


QS3 1105/1155/1184

RX 470 530/588/617 [Playback] - New driver

QS1 540/581/633

VP7 558/562/565

CPU 4790 471/539/610

RX 470 422/450/471 [Playback]

RX 470 345/362/432 [Decode] - New driver

RX 470 317/337/398 [Decode]

VP5 136/144/157

Intel GMA HD 75/103/134

VP4 71/82/104

UVD2.2 54/57/59


4. Girls-60fps


QS3 1023/1060/1091

RX 470 532/548/561 [Playback] - New driver

VP7 534/540/548

QS1 487/500/514

CPU 4790 411/427/441

RX 470 412/425/444 [Playback]

RX 470 340/346/440 [Decode] - New driver

RX 470 310/333/415 [Decode]

VP5 135/137/139

Intel GMA HD 90/105/112

VP4 74/76/79

UVD2.2 55/56/58


6. Cat-60fps


QS3 959/967/970

VP7 510/518/524

RX 470 438/471/489 [Playback] - New driver

QS1 404/424/435

CPU 4790 347/379/400

RX 470 344/368/383 [Playback]

RX 470 313/326/383 [Decode] - New driver

RX 470 285/299/328 [Decode]

VP5 131/137/143

Intel GMA HD 68/92/96

VP4 67/76/83

UVD2.2 48/52/55

UVD+ 0/53/56


7. Vortex-24fps


QS3 358/359/358

VP7 169/171/170

QS1 156/159/158

CPU 4790 97/113/119

RX 470 107/110/110 [Playback & Decode] - new driver

RX 470 83/85/85 [Playback & Decode]

VP5 72/73/74

Intel GMA HD 35/46/49

UVD2.2 0/26/58

UVD+ 0/25/29

VP4 19/22/24


8. Birds-24fps


QS3 351/360/358

VP7 171/177/176

QS1 151/160/161

RX 470 105/111/111 [Playback & Decode] - new driver

CPU 4790 100/110/113

RX 470 75/86/92 [Playback & Decode]

VP5 71/77/79

Intel GMA HD 35/42/47

UVD2.2 13/27/47

UVD+ 0/26/29

VP4 19/22/28


9. Ducks -30fps


QS3 413/413/413

VP8 249/258/249

VP7 181/191/192

QS1 168/183/183

RX 470 105/126/134 [Playback & Decode] - new driver

CPU 4790 115/125/134

RX 470 80/98/110 [Playback & Decode]

VP5 74/84/91

Intel GMA HD 25/48/58

UVD2.2 0/30/58

VP4 21/25/30


10. Crowd Run-25fps


QS3 328/330/328

VP7 152/154/152

QS1 143/145/143

RX 470 98/101/105 [Playback & Decode] - new driver

CPU 4790 87/98/102

RX 470 74/78/78 [Playback & Decode]

VP5 66/68/69

Intel GMA HD 17/35/43

UVD2.2 20/23/24

UVD+ 0/22/28

VP4 18/21/21


Comments:

CPU 4790 = Core i7 4790@3.8GHz using 16 threads (LAV properties) - Win 8.1 Pro x64

QS3 = Haswell HD 4600@1.5GHz drivers: 4080

QS1 = SandyBridge HD 2000@1.5GHz drivers: 4101

VP8 = Nvidia GTX 1060 (Zotac Mini) drivers: 368.95

VP7 = Nvidia GTX 960@1.47GHz drivers: 347.52

RX 470 = AMD Radeon RX 470 drivers: 17.2.1 / New driver >17.4

VP5 = Nvidia GT610@0.81GHz drivers: 347.52

VP4 = Nvidia GT440@0.82GHz drivers: 347.52

Intel GMA HD = Core i5 520M (Arrandale)@0.77GHz drivers: 3268

UVD2.2 = Radeon 5750@0.4GHz drivers: 14.12

UVD+ = Radeon 3650@0.72GHz drivers: 13.9


P.S.

1) For Arrandale I used the MS DS x64 decoder, because LAV Video is not compatible with Arrandale.

Arrandale uses the GPU shaders heavily, while CPU usage is only ~2%.

For clips 3, 4 and 6 the GPU load was 100% with a GPU clock of max 777MHz.
For clips 1, 2, 7, 8, 9 and 10 the GPU load was ~70% with a GPU clock of min 372MHz.

The GMA HD looks more like a hybrid decoder using both ASIC and GPU than a pure ASIC decoder.

2) Looking forward to Radeon R9 285, Ivy Bridge and Broadwell results.

3) For the RX 470, I used DXVA Checker v3.15, LAV Filters 0.69 and Win 10 x64; Playback mode used 1280x720 scaling.
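
As a rough illustration of how much the newer RX 470 driver changes the high-bitrate clips, the Python sketch below computes the percentage gain for clips 7 to 10, where Playback and Decode share one result row. The figures are the middle values of the triples listed above; treating the middle value as representative is an assumption, and the names `RX470_MIDDLE_FPS` and `driver_gain_percent` are made up for this example.

```python
# Middle figure of each "a/b/c" triple from the RX 470 rows of clips 7-10:
# (older driver 17.2.1, new driver >17.4).
RX470_MIDDLE_FPS = {
    "7. Vortex-24fps": (85, 110),
    "8. Birds-24fps": (86, 111),
    "9. Ducks -30fps": (98, 126),
    "10. Crowd Run-25fps": (78, 101),
}


def driver_gain_percent(old_fps, new_fps):
    """Percentage change from the old figure to the new one, rounded to 0.1."""
    return round((new_fps - old_fps) / old_fps * 100, 1)


GAINS = {
    clip: driver_gain_percent(old, new)
    for clip, (old, new) in RX470_MIDDLE_FPS.items()
}

if __name__ == "__main__":
    for clip, gain in GAINS.items():
        print(f"{clip}: +{gain}%")
```

With these inputs all four clips land between about 28.6% and 29.5%, so the gain from the newer driver looks fairly uniform across the high-bitrate clips.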

Last edited by NikosD; 16th July 2017 at 10:43. Reason: Added AMD RX 470 new driver results
Old 18th March 2015, 03:48   #394  |  Link
leonccyiu
Registered User
 
Join Date: Feb 2015
Posts: 45
No 4k H264 Decoding on Pentium G3258

I hope I am not posting this in the wrong place.
I recently purchased an Intel Pentium G3258, thinking the HD Graphics should be able to decode 4K H.264 video; however, the CPU is being used instead.
Running DXVA Checker confirmed my suspicions:

H264_VLD_NoFGT: DXVA2/D3D11, SD / HD / FHD

I looked through earlier posts in the thread and noticed that QFHD used to be included, but after a driver update it no longer is.
Does anyone know more about this, or whether it's still possible to revert to an older driver?

Thanks
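
The check described in this post can be sketched in Python as below. The line layout ("name: APIs, resolution classes") is inferred only from the single DXVA Checker line quoted above, and the labels "QFHD" and "4K" for the 4K class are assumptions; the function names are made up for illustration.

```python
def parse_decoder_line(line):
    """Split 'NAME: API1/API2, RES1 / RES2' into (name, apis, resolutions)."""
    name, _, rest = line.partition(":")
    apis_part, _, res_part = rest.partition(",")
    apis = [a.strip() for a in apis_part.split("/") if a.strip()]
    resolutions = [r.strip() for r in res_part.split("/") if r.strip()]
    return name.strip(), apis, resolutions


def supports_4k(line, labels=("QFHD", "4K")):
    """True if any resolution class on the line matches an assumed 4K label."""
    _, _, resolutions = parse_decoder_line(line)
    return any(r in labels for r in resolutions)


if __name__ == "__main__":
    reported = "H264_VLD_NoFGT: DXVA2/D3D11, SD / HD / FHD"
    print(parse_decoder_line(reported))
    print("4K class listed:", supports_4k(reported))
```

For the line reported here, the resolution list stops at FHD, so the check reports no 4K class.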
Old 18th March 2015, 07:50   #395  |  Link
NikosD
Registered User
 
Join Date: Aug 2010
Location: Athens, Greece
Posts: 2,901
Quote:
Originally Posted by leonccyiu View Post

H264_VLD_NoFGT: DXVA2/D3D11, SD / HD / FHD
Really?

That's a very bad move by Intel.

I remember that when I had a Pentium G3420 installed, a beta driver disabled 4K H.264 for Pentium, but that was a beta driver, and then I switched to a Core i7.

I can't believe they did that in official drivers!

If I were you, I would download and install the latest driver, v.4156, and complain on the Intel forums if 4K is still disabled.

Here is the download link and the forum:
https://communities.intel.com/thread/61436

Last edited by NikosD; 18th March 2015 at 19:59. Reason: Fixed Pentium model
Old 18th March 2015, 17:54   #396  |  Link
P.J
Δ
 
Join Date: Jun 2008
Posts: 535
Strange, my N2830 can play 4K H.264
Old 19th March 2015, 02:56   #397  |  Link
leonccyiu
Registered User
 
Join Date: Feb 2015
Posts: 45
Quote:
Originally Posted by NikosD View Post
Really ?

That's a very bad thing for Intel.

I remember when I had a Pentium G3420 installed that a beta driver disabled 4K H.264 for Pentium but that was a beta driver and then I switched to Core i7.

I don't believe that they did that on official drivers!

If I were you I would download and install latest driver v.4156 and complain to Intel forums if 4K is still disabled.

Here is the download link and the forum:
https://communities.intel.com/thread/61436
My driver version is 15.33.32.64.4061.

According to the Intel Driver Update Utility, that is the latest version. I downloaded the driver from your link, but it said it wasn't compatible with my hardware.

I believe the beta drivers you installed for your Pentium G3420 mark the point at which Intel decided to disable 4K H.264 decoding on HD Graphics.

Do you mind if I use your screenshot to complain?
And do you know where I can complain to Intel?

Thanks
Old 19th March 2015, 06:50   #398  |  Link
NikosD
Registered User
 
Join Date: Aug 2010
Location: Athens, Greece
Posts: 2,901
Quote:
Originally Posted by leonccyiu View Post
My Driver version is 15.33.32.64.4061

According to the Intel Driver update utility that is the latest version. I downloaded the driver in your link, but it said it wasn't compatible with my hardware.
You are doing something wrong.
Your Pentium G3258 belongs to the Haswell family.
First of all, you should install the new 15.36 driver family, not 15.33.
15.33 is the old family, compatible with Ivy Bridge and Haswell, and 4061 is its latest build, whereas 15.36 is the new family, which supports Haswell and Broadwell, and 4156 is its latest build.

Maybe you downloaded the wrong link (32-bit vs 64-bit).

If you can't install the 4156 drivers, ask for help at the link I gave you above.
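
The family/build distinction above can be sketched in Python as follows. It assumes that the first two fields of a version string such as 15.33.32.64.4061 give the driver family and the last field gives the build number; that reading is inferred from the posts in this thread, not from Intel documentation. The function name and the `LATEST_BUILD` table are made up for illustration, and the table holds only the builds named in this thread at the time of posting.

```python
def split_intel_driver_version(version):
    """Return (family, build), e.g. '15.33.32.64.4061' -> ('15.33', 4061)."""
    parts = version.split(".")
    family = ".".join(parts[:2])   # assumed: first two fields = driver family
    build = int(parts[-1])         # assumed: last field = build number
    return family, build


# Latest build per family as stated in the posts above.
LATEST_BUILD = {"15.33": 4061, "15.36": 4156}


def is_latest_in_family(version):
    """True if the build matches the latest one named above for its family."""
    family, build = split_intel_driver_version(version)
    return LATEST_BUILD.get(family) == build


if __name__ == "__main__":
    family, build = split_intel_driver_version("15.33.32.64.4061")
    print(f"family {family}, build {build}")
    print("latest in its family:", is_latest_in_family("15.33.32.64.4061"))
```

Read this way, 15.33.32.64.4061 is the latest build of the old 15.33 family, which would explain why the update utility reports it as current even though 15.36 builds exist.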


Quote:
Originally Posted by leonccyiu View Post
Do you mind if I use your screenshot to complain?
and Do you know where I can complain to Intel?
Don't use anything of mine; take a screenshot of your own system using DXVA Checker and post it at the link I gave you above.

That link is on the Intel forums; I don't know any other way.

Last edited by NikosD; 19th March 2015 at 06:52.
Old 26th June 2016, 21:34   #399  |  Link
HarryM
Registered User
 
Join Date: May 2002
Location: Czech rep.
Posts: 390
Quote:
Originally Posted by NikosD View Post
You are doing something wrong.
Your Pentium G3258 belongs to Haswell family.
First of all you should install new family drivers 15.36 and not 15.33.
15.33 is the old family which is compatible with Ivy and Haswell and 4061 are the latest, when 15.36 is the new family that supports Haswell and Broadwell and 4156 are the latest drivers.
The 15.40 family is the newest.
__________________
Czech DivX/XviD discussion club
Old 28th February 2017, 14:10   #400  |  Link
NikosD
Registered User
 
Join Date: Aug 2010
Location: Athens, Greece
Posts: 2,901
Today I added my new results for the AMD RX 470 card to this post: https://forum.doom9.org/showthread.p...50#post1712350

The behavior was very strange: performance in playback mode (scaling to 1280x720) was better, or even a lot better, than in decode mode, but from clip 7 onwards, where the bitrate is huge, playback performance was the same as pure decode performance.

For "normal" bitrate clips up to 30Mbps the decoder is very fast, but above 100Mbps performance drops a lot, more than with any other decoder.

Another strange thing is that 4K H.264 decoding performance looks exactly like 1080p.
Low-bitrate clips at 4K resolution decode at almost the same speed as 1080p, but when the bitrate rises to ~100Mbps, decoding performance drops a lot.

The HW H.264 decoder of Polaris cards appears not to be affected by higher resolution, in contrast to high (>100Mbps) bitrate.

The last strange result is that clips 7 to 9 show 100% utilisation of one core (25% CPU usage on my quad-core Core i5-2400), but only in "Decode" mode.
In Playback mode CPU usage is <5%, as it is in both decode and playback mode for all the other clips.

Even clip 10 shows no CPU usage problem at all in either "Decode" or "Playback" mode.
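
The relation between "one core at 100%" and "25% CPU usage" on a quad-core CPU is simple arithmetic; a small Python sketch, with function names made up for illustration:

```python
def overall_percent(busy_cores, total_cores):
    """Overall CPU usage (%) when `busy_cores` cores are fully loaded."""
    return busy_cores / total_cores * 100


def equivalent_busy_cores(overall_cpu_percent, total_cores):
    """Number of fully loaded cores that an overall CPU usage corresponds to."""
    return overall_cpu_percent / 100 * total_cores


if __name__ == "__main__":
    # One saturated core on a quad-core CPU such as the Core i5-2400.
    print(overall_percent(1, 4))
    # The <5% seen in Playback mode is under a fifth of one core.
    print(equivalent_busy_cores(5, 4))
```

So a steady 25% reading on four cores is consistent with a single thread being saturated rather than with light load spread over all cores.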

What a strange HW decoder...