Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion.

Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules.

 

Go Back   Doom9's Forum > Video Encoding > High Efficiency Video Coding (HEVC)

Reply
 
Thread Tools Search this Thread Display Modes
Old 30th August 2020, 17:12   #1  |  Link
Lyris
Registered User
 
Join Date: Sep 2007
Location: Europe
Posts: 602
New CPU - Intel or AMD?

I'm a little confused on what to buy, because most of the CPU reviews online, for obvious reasons, only give an idea of encode speed with x265 for RE-transcoding already compressed video.

For first-gen encodes from an uncompressed YUV source, the speeds are of course much slower. I'm running a bunch of AMD Threadripper 1950Xs and also one 2990WX. The typical speed for encoding content with film grain on the veryslow preset is about 0.19fps (although on the 32-core 2990WX we can run two chunks concurrently and get about 0.15fps on each).

Supposedly, whatever design issue the 2990WX had which stopped it getting better performance gains has been solved with the new chips, and I'm told that one of the newer, beefier Threadripper CPUs would hopefully show some performance gains.

Or, should I only be looking at Intel - maybe a 10980XE - so we can benefit from the AVX512 extensions? Is anyone else out there running high end CPUs for 4K content encoding, and what sort of speeds are you getting with film sources?
Lyris is offline   Reply With Quote
Old 31st August 2020, 03:43   #2  |  Link
RanmaCanada
Registered User
 
Join Date: May 2009
Posts: 325
AVX512 is only good for 2 things, heat and giving you a higher power bill. If you can wait a little bit, Zen 3 will be released, and it's rumoured to have 15% of so IPC increase over Zen 2. It will be the last to support AM4. If you need it now, I would say get a 3950 with a x570 as you could upgrade to whatever Zen 3 gives. The 3950 is faster than the 10980XE, at stock clocks.

Or if you really need to spend the money, yes the 3000 series Threadrippers destroy everything but we don't know if the current socket for 3000 series Threadripper will work with 4000 series. Maybe replace your 1950x system with a 3950, or even a 3900, which will show massive improvement, believe it or not.
RanmaCanada is offline   Reply With Quote
Old 31st August 2020, 18:54   #3  |  Link
benwaggoner
Moderator
 
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,650
Quote:
Originally Posted by Lyris View Post
Or, should I only be looking at Intel - maybe a 10980XE - so we can benefit from the AVX512 extensions? Is anyone else out there running high end CPUs for 4K content encoding, and what sort of speeds are you getting with film sources?
My workstation uses dual Xeon(R) Gold 6240 @ 2.60GHz, 2594 Mhz, 18 Core(s), 36 Logical Processor(s).

A few frames a second pinned to one socket is reasonable using balanced encoding settings. If I try to do veryslow, it can drop below 1 fps.

I've seen AVX512 offer a little speed boost doing 4K @ veryslow or with 8K. The thermal throttling triggered by using AVX512 in current CPUs is a big drag on the theoretical gains. I hope that future updates will have less AVX512 thermal throttling, the same way that... Skylake (?) made AVX2 much more effective by reducing thermal throttling when using it. The perf gain from using AVX2 went from something like 15% to 40% with the scenarios I was looking at, in my dim recollection.

More broadly, from everything I see right now, a recent AMD with lots of cores definately will maximize your content throughput per dollar.

I haven't seen detailed benchmarks of x265 with the Gen 10 Intel consumer CPUs, though. Going dual socket doesn't make a lot of economic sense these days unless you're doing 8K or want to do multiple encodes or an encode with other stuff at the same time. A fast single socket system is going to deliver a lot better throughput for dollar, and multiple sockets only make sense when getting multiple encodes out at once quickly is more valuable important than a few $K.
__________________
Ben Waggoner
Principal Video Specialist, Amazon Prime Video

My Compression Book
benwaggoner is offline   Reply With Quote
Old 1st September 2020, 04:30   #4  |  Link
H2sixty
Not a Registered User
 
H2sixty's Avatar
 
Join Date: Aug 2020
Location: earth
Posts: 24
intel is usually faster, amd has always been better quality (ie less problems/errors with poor power supplies, can run high density ram, etc) amd used to offer military grade chips back in the day... intel buys its chip designs from amd. amd gave birth to 64bit computing. if intel doesn't give you the heebee geebee's, then you should get more bang for your dollar. these trends have been the same between the two companies for decades (quality & bang4dollar)... as for individual chips, prices change all the time. there are websites to compare cpu's. personaly i want 1 core with 32 threads, for higher quality encodes at full speeds, no slices etc... but they still do not sell high thread single core cpus for decades and hyper threading does not count, thats fake cpu threads, but does help to leave enabled...
H2sixty is offline   Reply With Quote
Old 1st September 2020, 05:30   #5  |  Link
Cary Knoop
Cary Knoop
 
Cary Knoop's Avatar
 
Join Date: Feb 2017
Location: Newark CA, USA
Posts: 397
I would go AMD all the way!
Cary Knoop is offline   Reply With Quote
Old 1st September 2020, 13:45   #6  |  Link
excellentswordfight
Lost my old account :(
 
Join Date: Jul 2017
Posts: 321
Quote:
Originally Posted by benwaggoner View Post
My workstation uses dual Xeon(R) Gold 6240 @ 2.60GHz, 2594 Mhz, 18 Core(s), 36 Logical Processor(s).

A few frames a second pinned to one socket is reasonable using balanced encoding settings. If I try to do veryslow, it can drop below 1 fps.

I've seen AVX512 offer a little speed boost doing 4K @ veryslow or with 8K. The thermal throttling triggered by using AVX512 in current CPUs is a big drag on the theoretical gains. I hope that future updates will have less AVX512 thermal throttling, the same way that... Skylake (?) made AVX2 much more effective by reducing thermal throttling when using it. The perf gain from using AVX2 went from something like 15% to 40% with the scenarios I was looking at, in my dim recollection.

More broadly, from everything I see right now, a recent AMD with lots of cores definately will maximize your content throughput per dollar.

I haven't seen detailed benchmarks of x265 with the Gen 10 Intel consumer CPUs, though. Going dual socket doesn't make a lot of economic sense these days unless you're doing 8K or want to do multiple encodes or an encode with other stuff at the same time. A fast single socket system is going to deliver a lot better throughput for dollar, and multiple sockets only make sense when getting multiple encodes out at once quickly is more valuable important than a few $K.
It seems like ice lake (Sunny Cove) will fix most of the AVX512 downclocking, so it would be intresting to see AVX512 performance for Ice Lake-SP down the line.

https://i.imgur.com/S6tL29Y.png

Atm, not only does AMD Rome offer more cores per $, it also sustain much higher clockspeeds in their repetitive price bracket under multi core loads. I have been using an 7402p and it can keep an 3.3Ghz clock on 48 threads with close to 100% x265 load in an 1u an server!

My guess would be that the fastest out of the box CPU atm for single instance encoding of 4k material should be 3960X. Not only does it has more threads than 10980XE, it will run higher clockspeeds under full load. And dont think going above 48threads will give that much extra performance for single instance encoding, even at 4k.

But tbh I dont think it will get you in to what I would consider manageable speed (but it would probably be a nice gain from 1950Xs/2990WX as 1st and 2th gen Zen do have avx limitations). I would also maybe looking at dropping from 'veryslow' to 'slower'. Cause maybe we get some more performance with Ice Lake-SP/Ice Lake-X (who knows when or if that is coming) or Milan/4th gen threadripper, but I dont think it will be enough to make any major difference in speed for your use case. So it might be best at looking at chunk encoding workflows if you are going to stick with 'veryslow'.

Last edited by excellentswordfight; 1st September 2020 at 15:26.
excellentswordfight is offline   Reply With Quote
Old 1st September 2020, 18:45   #7  |  Link
benwaggoner
Moderator
 
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,650
Quote:
Originally Posted by excellentswordfight View Post
It seems like ice lake (Sunny Cove) will fix most of the AVX512 downclocking, so it would be intresting to see AVX512 performance for Ice Lake-SP down the line.

https://i.imgur.com/S6tL29Y.png
Alas, that chart's range starts pretty high. It's about at 15% improvement, which is welcome, but not revolutionary. Perhaps enough to justify having AVX512 on by default on Sunny Cove, though. But hey, 10th gen chips are already available to the consumer; I bought my fiancé a 10th gen laptop last month ("Intel Core i7-10510U CPU @ 1.80GHz, 2304 Mhz, 4 Core(s), 8 Logical Processor(s)"). Has anyone tested AVX-512 perf on these yet?

What's the difference between 512L and 512H?

Quote:
My guess would be that the fastest out of the box CPU atm for single instance encoding of 4k material should be 3960X. Not only does it has more threads than 10980XE, it will run higher clockspeeds under full load. And dont think going above 48threads will give that much extra performance for single instance encoding, even at 4k.
>48 with pmode and a bunch of frame threads for 4K is probably helpful, but perf gains will definitely be nonlinear.

Quote:
But tbh I dont think it will get you in to what I would consider manageable speed (but it would probably be a nice gain from 1950Xs/2990WX as 1st and 2th gen Zen do have avx limitations). I would also maybe looking at dropping from 'veryslow' to 'slower'. Cause maybe we get some more performance with Ice Lake-SP/Ice Lake-X (who knows when or if that is coming) or Milan/4th gen threadripper, but I dont think it will be enough to make any major difference in speed for your use case. So it might be best at looking at chunk encoding workflows if you are going to stick with 'veryslow'.
And slower is really most of the all the "good stuff" of HEVC and x265 kicks in. The "style" of encoding quality doesn't change between slower and placebo.

I'd generally prefer to add --tskip --tskip-fast to slower than move to the stock veryslow. Tskip doesn't do much for natural image content, but offers huge gains with animation and motion graphics.
__________________
Ben Waggoner
Principal Video Specialist, Amazon Prime Video

My Compression Book

Last edited by benwaggoner; 1st September 2020 at 18:56. Reason: Added 10th gen CPU details
benwaggoner is offline   Reply With Quote
Old 1st September 2020, 20:56   #8  |  Link
Lyris
Registered User
 
Join Date: Sep 2007
Location: Europe
Posts: 602
Thanks, all. It seems like AVX512 isn't all that. As a result, I've ordered a 3950X. I'll report back with the speeds I'm getting on it.

In the future I'll probably replace one of the 1950Xs with one of the beefier newer Threadrippers. The idea of being able to use the same motherboard is tempting.

Last edited by Lyris; 1st September 2020 at 21:03.
Lyris is offline   Reply With Quote
Old 1st September 2020, 22:06   #9  |  Link
RanmaCanada
Registered User
 
Join Date: May 2009
Posts: 325
Quote:
Originally Posted by Lyris View Post
Thanks, all. It seems like AVX512 isn't all that. As a result, I've ordered a 3950X. I'll report back with the speeds I'm getting on it.

In the future I'll probably replace one of the 1950Xs with one of the beefier newer Threadrippers. The idea of being able to use the same motherboard is tempting.
You can not use 3rd gen Threadrippers on 1st-2nd gen boards.

https://www.notebookcheck.net/AMD-ju....442727.0.html
RanmaCanada is offline   Reply With Quote
Old 2nd September 2020, 15:29   #10  |  Link
excellentswordfight
Lost my old account :(
 
Join Date: Jul 2017
Posts: 321
Quote:
Originally Posted by benwaggoner View Post
Alas, that chart's range starts pretty high. It's about at 15% improvement, which is welcome, but not revolutionary. Perhaps enough to justify having AVX512 on by default on Sunny Cove, though. But hey, 10th gen chips are already available to the consumer; I bought my fiancé a 10th gen laptop last month ("Intel Core i7-10510U CPU @ 1.80GHz, 2304 Mhz, 4 Core(s), 8 Logical Processor(s)"). Has anyone tested AVX-512 perf on these yet?

What's the difference between 512L and 512H?
I think it stands for "light" and "heavy".

I found this test for ice lake-U

https://travisdowns.github.io/blog/2...x512-freq.html

"The Ice Lake i5-1035 CPU exhibits only 100 MHz of licence-based downclock with 1 active core when running 512-bit instructions.
There is no downclock in any other scenario.

The all-core 512-bit turbo frequency of 3.3 GHz is 89% of the maximum single-core scalar frequency of 3.7 GHz, so within power and thermal limits this chip has a very “flat” frequency profile.

Unlike SKX, this Ice Lake chip does not distinguish between “light” and “heavy” instructions for frequency scaling purposes: FMA operations behave the same as lighter operations."

The same all core clockspeed of 3.3Ghz is achieved regardless of AVX load/type.
excellentswordfight is offline   Reply With Quote
Old 2nd September 2020, 16:59   #11  |  Link
benwaggoner
Moderator
 
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,650
Quote:
Originally Posted by excellentswordfight View Post
I think it stands for "light" and "heavy".

I found this test for ice lake-U

https://travisdowns.github.io/blog/2...x512-freq.html

"The Ice Lake i5-1035 CPU exhibits only 100 MHz of licence-based downclock with 1 active core when running 512-bit instructions.
There is no downclock in any other scenario.

The all-core 512-bit turbo frequency of 3.3 GHz is 89% of the maximum single-core scalar frequency of 3.7 GHz, so within power and thermal limits this chip has a very “flat” frequency profile.

Unlike SKX, this Ice Lake chip does not distinguish between “light” and “heavy” instructions for frequency scaling purposes: FMA operations behave the same as lighter operations."

The same all core clockspeed of 3.3Ghz is achieved regardless of AVX load/type.
Wow, that's really promising for AVX-512 value!

I'll see if I can pry my sweetie's laptop away from her for a few hours and run some tests with and without AVX-512.
__________________
Ben Waggoner
Principal Video Specialist, Amazon Prime Video

My Compression Book
benwaggoner is offline   Reply With Quote
Old 4th September 2020, 00:13   #12  |  Link
Lyris
Registered User
 
Join Date: Sep 2007
Location: Europe
Posts: 602
So, from the 0.19fps of the 1950X and sometimes the 2990WX, the 3950X is delivering me 0.33fps in identical conditions, which is obviously a huge jump. Very happy, especially since I could re-use an old power supply and use a Noctua cooler that I've had since 2012 on the AM4 platform with a new mounting kit.

Last edited by Lyris; 7th September 2020 at 00:11.
Lyris is offline   Reply With Quote
Old 7th November 2020, 09:11   #13  |  Link
hajj_3
Registered User
 
Join Date: Mar 2004
Posts: 1,099
The AMD Ryzen 5000 series (zen3) reviews are out now, they have real AVX2 256bit instead of 2x128bit and have a 19% ipc increase over ryzen 3000 (zen2): https://www.anandtech.com/show/16214...700x-tested/15

They are the faster than any intel chips at encoding video.
hajj_3 is offline   Reply With Quote
Old 8th November 2020, 13:56   #14  |  Link
birdie
Artem S. Tashkinov
 
birdie's Avatar
 
Join Date: Dec 2006
Posts: 311












birdie is offline   Reply With Quote
Old 8th November 2020, 14:01   #15  |  Link
birdie
Artem S. Tashkinov
 
birdie's Avatar
 
Join Date: Dec 2006
Posts: 311
10980XE is slower than 5900X almost all the time.
birdie is offline   Reply With Quote
Old 9th November 2020, 01:53   #16  |  Link
Asmodian
Registered User
 
Join Date: Feb 2002
Location: San Jose, California
Posts: 4,395
Quote:
Originally Posted by birdie View Post
10980XE is slower than 5900X almost all the time.
The new Zen3 CPUs make the OP's question very easy today.

The 5900X and 5950X are a very good deal for x265 encoding. A 32 or 64 core Zen3 Threadripper should be amazing.
__________________
madVR options explained
Asmodian is offline   Reply With Quote
Old 9th November 2020, 09:26   #17  |  Link
charliebaby
Registered User
 
charliebaby's Avatar
 
Join Date: Jun 2020
Posts: 37
wait for AMD 5950x
__________________
charliebaby is offline   Reply With Quote
Old 11th November 2020, 09:06   #18  |  Link
wlee15
Registered User
 
Join Date: Jun 2009
Posts: 16
Quote:
Originally Posted by hajj_3 View Post
The AMD Ryzen 5000 series (zen3) reviews are out now, they have real AVX2 256bit instead of 2x128bit and have a 19% ipc increase over ryzen 3000 (zen2): https://www.anandtech.com/show/16214...700x-tested/15

They are the faster than any intel chips at encoding video.
Zen 2 had 256-bit, it was the original Zen that had 128-bit AVX.
wlee15 is offline   Reply With Quote
Old 11th November 2020, 13:21   #19  |  Link
ShortKatz
Registered User
 
Join Date: Aug 2018
Location: Germany
Posts: 102
I am very curious to see how fast the HEVC encoding will be on Apples M1 chip.
ShortKatz is offline   Reply With Quote
Old 11th November 2020, 16:44   #20  |  Link
benwaggoner
Moderator
 
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,650
Quote:
Originally Posted by ShortKatz View Post
I am very curious to see how fast the HEVC encoding will be on Apples M1 chip.
Big parallel SIMD sustained workloads aren't really ARM's sweet spot. I imagine the M1 will be the best software encoding ARM processor ever, but would be surprised if it could beat an Intel Tiger Lake (11th gen) i7 1185G7. M1 may be able to beat that in fps/watt for the CPU itself. Intel and AMD have demonstrated the power of a whole lot of beefy x64 cores running at a high TDP for encoding operations.
__________________
Ben Waggoner
Principal Video Specialist, Amazon Prime Video

My Compression Book
benwaggoner is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 14:00.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, vBulletin Solutions Inc.