Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion.

Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules.

 

Go Back   Doom9's Forum > Video Encoding > High Efficiency Video Coding (HEVC)

Reply
 
Thread Tools Search this Thread Display Modes
Old 18th March 2023, 05:21   #21  |  Link
HD MOVIE SOURCE
Registered User
 
HD MOVIE SOURCE's Avatar
 
Join Date: Mar 2021
Location: North Carolina
Posts: 138
Quote:
Originally Posted by microchip8 View Post
Ryzen 1xxx is znver1
Ryzen 2xxx / 3xxx is znver2
Ryzen 5xxx is znver3

Or read the man page of GCC.
I appreciate it thank you.
HD MOVIE SOURCE is offline   Reply With Quote
Old 18th March 2023, 12:19   #22  |  Link
ghostshadow
Registered User
 
Join Date: Jan 2020
Posts: 34
Quote:
Originally Posted by microchip8 View Post
-mtune=znver4

it is only present in GCC 13 and above
-march=znver4 -mtune=znver4
ghostshadow is offline   Reply With Quote
Old 19th March 2023, 15:01   #23  |  Link
john33
Registered User
 
Join Date: Apr 2002
Location: UK
Posts: 68
Quote:
Originally Posted by ~ VEGETA ~ View Post
can you point me to the website page where i can download such version?
And at Winlibs, here: https://github.com/brechtsanders/win...ingw/releases/
john33 is offline   Reply With Quote
Old 19th March 2023, 20:24   #24  |  Link
~ VEGETA ~
The cult of personality
 
~ VEGETA ~'s Avatar
 
Join Date: May 2013
Location: Planet Vegeta
Posts: 155
Quote:
Originally Posted by john33 View Post
I mean best x265 gcc build suitable for ryzen 7900x.
~ VEGETA ~ is offline   Reply With Quote
Old 20th March 2023, 15:43   #25  |  Link
microchip8
ffx264/ffhevc author
 
microchip8's Avatar
 
Join Date: May 2007
Location: /dev/video0
Posts: 1,843
Quote:
Originally Posted by ~ VEGETA ~ View Post
I mean best x265 gcc build suitable for ryzen 7900x.
I don't think you'll find a build tuned for your Zen 4 CPU yet. GCC 13, that supports tuning for znver4, is not officially out yet (though that doesn't say much) and I haven't seen anything by the local people who offer builds for Windows that they offer an optimized version of x265.

I'm afraid you'll have to compile it yourself and use Zen 4 optimization flags.
__________________
ffx264 || ffhevc || ffxvid || microenc
microchip8 is offline   Reply With Quote
Old 20th March 2023, 20:54   #26  |  Link
~ VEGETA ~
The cult of personality
 
~ VEGETA ~'s Avatar
 
Join Date: May 2013
Location: Planet Vegeta
Posts: 155
Quote:
Originally Posted by microchip8 View Post
I don't think you'll find a build tuned for your Zen 4 CPU yet. GCC 13, that supports tuning for znver4, is not officially out yet (though that doesn't say much) and I haven't seen anything by the local people who offer builds for Windows that they offer an optimized version of x265.

I'm afraid you'll have to compile it yourself and use Zen 4 optimization flags.
ok fine, but i am ok with the next best choice.

this is the website I use currently: http://msystem.waw.pl/x265/

I get AVX2 GCC 12.2 version (64-bit, 10-bit).

does it offer that ryzen 4 cli command?
~ VEGETA ~ is offline   Reply With Quote
Old 21st March 2023, 04:03   #27  |  Link
HD MOVIE SOURCE
Registered User
 
HD MOVIE SOURCE's Avatar
 
Join Date: Mar 2021
Location: North Carolina
Posts: 138
When looking for a CPU for encoding, is there a core limit? Or does x265 take advantage of as many cores as you can throw at it? I've heard that it might not be as efficient with more cores, however my goal is to understand whether more cores are better, or to look at CPUs that have faster single-thread performance.

So if you had a Intel Core i9-13900KS that has great single-core performance. Would something that is slower but has double the amount of cores finish encodes faster?
HD MOVIE SOURCE is offline   Reply With Quote
Old 21st March 2023, 10:07   #28  |  Link
ghostshadow
Registered User
 
Join Date: Jan 2020
Posts: 34
Intel Core i9-13900K Review encoding
https://www.techpowerup.com/review/i...13900k/16.html
ghostshadow is offline   Reply With Quote
Old 21st March 2023, 17:14   #29  |  Link
benwaggoner
Moderator
 
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,750
Quote:
Originally Posted by HD MOVIE SOURCE View Post
When looking for a CPU for encoding, is there a core limit? Or does x265 take advantage of as many cores as you can throw at it? I've heard that it might not be as efficient with more cores, however my goal is to understand whether more cores are better, or to look at CPUs that have faster single-thread performance.

So if you had a Intel Core i9-13900KS that has great single-core performance. Would something that is slower but has double the amount of cores finish encodes faster?
Yes, absolutely. x265 can scale up to lots of cores. The max useful cores/threads depends on frame size and frame threading parameters. 4K can easily saturate 8 physical cores even with a single frame thread. I'd think that would be the absolute minimum I'd want in an encoding box.

Strong single-thread performance is really important, of course, and scales linearly while more cores are additive, but not linear, and don't help at all past a certain point. With consumer CPUs, 2x the CPUs even if each is 20% slower will still be a lot faster.
__________________
Ben Waggoner
Principal Video Specialist, Amazon Prime Video

My Compression Book
benwaggoner is offline   Reply With Quote
Old 21st March 2023, 17:15   #30  |  Link
benwaggoner
Moderator
 
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,750
Quote:
Originally Posted by ghostshadow View Post
Intel Core i9-13900K Review encoding
https://www.techpowerup.com/review/i...13900k/16.html
Impressive!

I'm really curious to see how Zen 4 will stack up.
__________________
Ben Waggoner
Principal Video Specialist, Amazon Prime Video

My Compression Book
benwaggoner is offline   Reply With Quote
Old 21st March 2023, 19:46   #31  |  Link
ghostshadow
Registered User
 
Join Date: Jan 2020
Posts: 34
Quote:
Originally Posted by benwaggoner View Post
Impressive!

I'm really curious to see how Zen 4 will stack up.
You have on it the result of the zen 4(Ryzen 9 7950x)
ghostshadow is offline   Reply With Quote
Old 22nd March 2023, 16:17   #32  |  Link
benwaggoner
Moderator
 
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,750
Quote:
Originally Posted by ghostshadow View Post
You have on it the result of the zen 4(Ryzen 9 7950x)
Oh, right. I meant the Zen 4 version of Epyc more specifically.

Has anyone toyed with --pools on Zen 4 to see how much efficiency improves by pinning to a single NUMA node? I'd think than Zen 4 chipset NUMA would have less of a perf hit than between Xeon sockets, but no idea what it actually would be.
__________________
Ben Waggoner
Principal Video Specialist, Amazon Prime Video

My Compression Book
benwaggoner is offline   Reply With Quote
Old 23rd March 2023, 23:06   #33  |  Link
~ VEGETA ~
The cult of personality
 
~ VEGETA ~'s Avatar
 
Join Date: May 2013
Location: Planet Vegeta
Posts: 155
Quote:
Originally Posted by filler56789 View Post
http://www.msystem.waw.pl/x265/

NOTICE: experimental stuff.

Play at your own risk :-/
i used the gcc13 avx2 latest one.

however, could not find such option for ryzen cpu? can you elaborate on which version is fastest for me?

thanks
~ VEGETA ~ is offline   Reply With Quote
Old 24th March 2023, 01:24   #34  |  Link
TDS
Formally known as .......
 
TDS's Avatar
 
Join Date: Sep 2021
Location: Down Under.
Posts: 965
Quote:
Originally Posted by ~ VEGETA ~ View Post
i used the gcc13 avx2 latest one.
however, could not find such option for ryzen cpu? can you elaborate on which version is fastest for me?
thanks
Well, that's new

You could also try this one :-

x265 v3.5+97

I have both a 13900KF & Ryzen 7950X, and in my experience, the 7950X is slightly faster with x265 encodes (using RipBot264).

I have tried these so called AVX2 & Ryzen Zen3 optimised builds and there is no noticeable improvement..

I also have the avx512 switch in my x265 command line...
__________________
Long term RipBot264 user.

RipBot264 modded builds..
TDS is offline   Reply With Quote
Old 21st April 2023, 19:21   #35  |  Link
HD MOVIE SOURCE
Registered User
 
HD MOVIE SOURCE's Avatar
 
Join Date: Mar 2021
Location: North Carolina
Posts: 138
Quote:
Originally Posted by benwaggoner View Post
Yes, absolutely. x265 can scale up to lots of cores. The max useful cores/threads depends on frame size and frame threading parameters. 4K can easily saturate 8 physical cores even with a single frame thread. I'd think that would be the absolute minimum I'd want in an encoding box.

Strong single-thread performance is really important, of course, and scales linearly while more cores are additive, but not linear, and don't help at all past a certain point. With consumer CPUs, 2x the CPUs even if each is 20% slower will still be a lot faster.
When looking for a PC to buy would you absolutely buy a workstation to encode on? Can you buy workstations pre-built? Because I don't know how to build a PC.
HD MOVIE SOURCE is offline   Reply With Quote
Old 21st April 2023, 21:48   #36  |  Link
benwaggoner
Moderator
 
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,750
Quote:
Originally Posted by HD MOVIE SOURCE View Post
When looking for a PC to buy would you absolutely buy a workstation to encode on? Can you buy workstations pre-built? Because I don't know how to build a PC.
Yeah, the major PC manufacturers sell high performance workstations, both pre-built and configurable.

The component prices can have a huge markup, but sometimes worth it to ensure everything works reliably together.

It is surprisingly easy to get into five figure prices.
__________________
Ben Waggoner
Principal Video Specialist, Amazon Prime Video

My Compression Book

Last edited by benwaggoner; 21st April 2023 at 21:50. Reason: Added detail
benwaggoner is offline   Reply With Quote
Old 22nd April 2023, 03:18   #37  |  Link
RanmaCanada
Registered User
 
Join Date: May 2009
Posts: 328
Quote:
Originally Posted by HD MOVIE SOURCE View Post
When looking for a PC to buy would you absolutely buy a workstation to encode on? Can you buy workstations pre-built? Because I don't know how to build a PC.
As Ben said, yes you absolutely can buy workstations. The issue is they are insanely expensive. I know Lenovo had Threadripper Pro workstations in the past, and Dell has started selling them as well. The issue is cost. As stated, they can easily get to 5 figures, and at that point you're better off to buy multiple high end desktop units and use software like Ripbot for distributed encoding. For example, Dell sells a 64 core 5995wx workstation for $21000 CDN (16 gigs of Ram and a T420 potato graphics with a 256gb SSD). Which is insane. You can easily buy 4 16 core systems for less than that and then use Ripbot. Yes you will use more power in the long run, but how long will it take you to use $8-11k in power? Assuming of course you are spending $2500-$3000 per system.
RanmaCanada is offline   Reply With Quote
Old 22nd April 2023, 03:38   #38  |  Link
HD MOVIE SOURCE
Registered User
 
HD MOVIE SOURCE's Avatar
 
Join Date: Mar 2021
Location: North Carolina
Posts: 138
Quote:
Originally Posted by RanmaCanada View Post
As Ben said, yes you absolutely can buy workstations. The issue is they are insanely expensive. I know Lenovo had Threadripper Pro workstations in the past, and Dell has started selling them as well. The issue is cost. As stated, they can easily get to 5 figures, and at that point you're better off to buy multiple high end desktop units and use software like Ripbot for distributed encoding. For example, Dell sells a 64 core 5995wx workstation for $21000 CDN (16 gigs of Ram and a T420 potato graphics with a 256gb SSD). Which is insane. You can easily buy 4 16 core systems for less than that and then use Ripbot. Yes you will use more power in the long run, but how long will it take you to use $8-11k in power? Assuming of course you are spending $2500-$3000 per system.
WOW, seriously? Thats crazy prices. So what is it like encoding on systems like this? Can you encode a 2 hour movie using Placebo on x265 in a day?

Can you rent ultra high end workstations in a cloud. Enode and perform all the tasks you need, controlled by your own PC. Get a job done super quick, and then send yourself the file once you've finished.
Because it seems that computing power really holds back encoding.

What about cpu capabilities like MMX2 and I know there's a bunch of others I see when I encode. Does this help none workstations catchup a little? How do I know which CPU's have all the CPU capabilities that x265 needs?
HD MOVIE SOURCE is offline   Reply With Quote
Old 22nd April 2023, 05:49   #39  |  Link
RanmaCanada
Registered User
 
Join Date: May 2009
Posts: 328
Quote:
Originally Posted by HD MOVIE SOURCE View Post
WOW, seriously? Thats crazy prices. So what is it like encoding on systems like this? Can you encode a 2 hour movie using Placebo on x265 in a day?

Can you rent ultra high end workstations in a cloud. Enode and perform all the tasks you need, controlled by your own PC. Get a job done super quick, and then send yourself the file once you've finished.
Because it seems that computing power really holds back encoding.

What about cpu capabilities like MMX2 and I know there's a bunch of others I see when I encode. Does this help none workstations catchup a little? How do I know which CPU's have all the CPU capabilities that x265 needs?
To answer your last question, every modern CPU will have everything that x265 "needs" with the caveat being avx-512. Some chips have it, some don't. You could always rent a VPS or other system like AWS but as I understand it the costs there are usually pinned to the amount of cores available and or time and can get very high again.

I'll be honest, most of what you are asking is way beyond my area of expertise, so hopefully someone comes in to further expand, and or correct anything I've written in error.

the last time encoding on 64 cores was tested, it was slower than a 16 core And in older tests, 32 cores was the sweet spot.

But for current results, there are some users here with access to ridonkulous hardware; if they could chime in, that would be fantastic.
RanmaCanada is offline   Reply With Quote
Old 23rd April 2023, 23:12   #40  |  Link
benwaggoner
Moderator
 
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,750
Quote:
Originally Posted by HD MOVIE SOURCE View Post
WOW, seriously? Thats crazy prices. So what is it like encoding on systems like this? Can you encode a 2 hour movie using Placebo on x265 in a day?
You're not going to get 2x faster than the fastest i9 at 4x the cost. Workstations like that make sense when the difference between a 3 and a 2 day encode is worth money, or if there's a highly paid operator using it. Spending an extra $20K to improve the efficiency of a $200K/year professional by 20% has a great ROI. Workstations often get used for other stuff, like uncompressed input/output, GPU-accelerated rendering, etc. This is a world where the I/O board can cost $4000, the GPU can cost $6500, the monitor can cost $2000-$30,000, the RAID controller and drives $3000. 128-256 GB of RAM is typical. So even if the bare workstation costs $10K more than a high-end consumer setup, that's still only a fraction of the whole system cost.

Quote:
Can you rent ultra high end workstations in a cloud. Enode and perform all the tasks you need, controlled by your own PC. Get a job done super quick, and then send yourself the file once you've finished.
Because it seems that computing power really holds back encoding.
Getting source files uploaded to the cloud can be the bottleneck here. The ProRes mezzanine for a long movie can get close to 1 TB. Can be a long upload with asymmetric consumer broadband.
The most cost-effective AWS instance for fast 4K encoding would probably be the c7g.8xlarge, with an on-demand cost of $1.16/hour. So that 2 hour movie in 2 days would cost about $56. Prices can get a lot lower as usage goes up.
Spot instances can be a lot cheaper, but aren't great for traditional encoding since each operation can take hours or days. A segmenting encoder could take good advantage of spot pricing, but full quality with VBV compliance requires overlapping segments using --chunk-start and/or --chunk-end.

It's good to test how many cores your content and settings can saturate, and then use the smallest instance that can accommodate that. More cores don't make encoding faster beyond that, but do make it more expensive.

Quote:
What about cpu capabilities like MMX2 and I know there's a bunch of others I see when I encode. Does this help none workstations catchup a little? How do I know which CPU's have all the CPU capabilities that x265 needs?
On AWS you can get the processor names and look up their capabilities. https://aws.amazon.com/ec2/instance-...pute_Optimized.

For x86-64, AVX2 is pretty much required for good perf (and universal). Whether AVX512 is useful depends on settings, resolution, and specific processor.

The ARM-based Graviton2 has received really great NEON etc. optimizations for x265, ffmpeg, and other open source encoder components over the last 18 months, and can be performance and price/performance at least as good as x86-64. Graviton3 using the current MASTER branch versions seems about the best price/performance today, and should pull away even more over time as wider SIMD optimization gets implemented for it. AWS has been checking in tons of optimized assembly for ARM.
__________________
Ben Waggoner
Principal Video Specialist, Amazon Prime Video

My Compression Book
benwaggoner is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 12:57.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2024, vBulletin Solutions Inc.