Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
19th March 2023, 15:01 | #23 | Link | |
Registered User
Join Date: Apr 2002
Location: UK
Posts: 68
|
Quote:
|
|
19th March 2023, 20:24 | #24 | Link | |
The cult of personality
Join Date: May 2013
Location: Planet Vegeta
Posts: 155
|
Quote:
|
|
20th March 2023, 15:43 | #25 | Link |
ffx264/ffhevc author
Join Date: May 2007
Location: /dev/video0
Posts: 1,843
|
I don't think you'll find a build tuned for your Zen 4 CPU yet. GCC 13, that supports tuning for znver4, is not officially out yet (though that doesn't say much) and I haven't seen anything by the local people who offer builds for Windows that they offer an optimized version of x265.
I'm afraid you'll have to compile it yourself and use Zen 4 optimization flags. |
20th March 2023, 20:54 | #26 | Link | |
The cult of personality
Join Date: May 2013
Location: Planet Vegeta
Posts: 155
|
Quote:
this is the website I use currently: http://msystem.waw.pl/x265/ I get AVX2 GCC 12.2 version (64-bit, 10-bit). does it offer that ryzen 4 cli command? |
|
21st March 2023, 04:03 | #27 | Link |
Registered User
Join Date: Mar 2021
Location: North Carolina
Posts: 138
|
When looking for a CPU for encoding, is there a core limit? Or does x265 take advantage of as many cores as you can throw at it? I've heard that it might not be as efficient with more cores, however my goal is to understand whether more cores are better, or to look at CPUs that have faster single-thread performance.
So if you had a Intel Core i9-13900KS that has great single-core performance. Would something that is slower but has double the amount of cores finish encodes faster? |
21st March 2023, 10:07 | #28 | Link |
Registered User
Join Date: Jan 2020
Posts: 34
|
Intel Core i9-13900K Review encoding
https://www.techpowerup.com/review/i...13900k/16.html |
21st March 2023, 17:14 | #29 | Link | |
Moderator
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,752
|
Quote:
Strong single-thread performance is really important, of course, and scales linearly while more cores are additive, but not linear, and don't help at all past a certain point. With consumer CPUs, 2x the CPUs even if each is 20% slower will still be a lot faster. |
|
21st March 2023, 17:15 | #30 | Link | |
Moderator
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,752
|
Quote:
I'm really curious to see how Zen 4 will stack up. |
|
22nd March 2023, 16:17 | #32 | Link |
Moderator
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,752
|
Oh, right. I meant the Zen 4 version of Epyc more specifically.
Has anyone toyed with --pools on Zen 4 to see how much efficiency improves by pinning to a single NUMA node? I'd think than Zen 4 chipset NUMA would have less of a perf hit than between Xeon sockets, but no idea what it actually would be. |
24th March 2023, 01:24 | #34 | Link | |
Formally known as .......
Join Date: Sep 2021
Location: Down Under.
Posts: 965
|
Quote:
You could also try this one :- x265 v3.5+97 I have both a 13900KF & Ryzen 7950X, and in my experience, the 7950X is slightly faster with x265 encodes (using RipBot264). I have tried these so called AVX2 & Ryzen Zen3 optimised builds and there is no noticeable improvement.. I also have the avx512 switch in my x265 command line... |
|
21st April 2023, 19:21 | #35 | Link | |
Registered User
Join Date: Mar 2021
Location: North Carolina
Posts: 138
|
Quote:
|
|
21st April 2023, 21:48 | #36 | Link | |
Moderator
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,752
|
Quote:
The component prices can have a huge markup, but sometimes worth it to ensure everything works reliably together. It is surprisingly easy to get into five figure prices. Last edited by benwaggoner; 21st April 2023 at 21:50. Reason: Added detail |
|
22nd April 2023, 03:18 | #37 | Link | |
Registered User
Join Date: May 2009
Posts: 328
|
Quote:
|
|
22nd April 2023, 03:38 | #38 | Link | |
Registered User
Join Date: Mar 2021
Location: North Carolina
Posts: 138
|
Quote:
Can you rent ultra high end workstations in a cloud. Enode and perform all the tasks you need, controlled by your own PC. Get a job done super quick, and then send yourself the file once you've finished. Because it seems that computing power really holds back encoding. What about cpu capabilities like MMX2 and I know there's a bunch of others I see when I encode. Does this help none workstations catchup a little? How do I know which CPU's have all the CPU capabilities that x265 needs? |
|
22nd April 2023, 05:49 | #39 | Link | |
Registered User
Join Date: May 2009
Posts: 328
|
Quote:
I'll be honest, most of what you are asking is way beyond my area of expertise, so hopefully someone comes in to further expand, and or correct anything I've written in error. the last time encoding on 64 cores was tested, it was slower than a 16 core And in older tests, 32 cores was the sweet spot. But for current results, there are some users here with access to ridonkulous hardware; if they could chime in, that would be fantastic. |
|
23rd April 2023, 23:12 | #40 | Link | |||
Moderator
Join Date: Jan 2006
Location: Portland, OR
Posts: 4,752
|
Quote:
Quote:
The most cost-effective AWS instance for fast 4K encoding would probably be the c7g.8xlarge, with an on-demand cost of $1.16/hour. So that 2 hour movie in 2 days would cost about $56. Prices can get a lot lower as usage goes up. Spot instances can be a lot cheaper, but aren't great for traditional encoding since each operation can take hours or days. A segmenting encoder could take good advantage of spot pricing, but full quality with VBV compliance requires overlapping segments using --chunk-start and/or --chunk-end. It's good to test how many cores your content and settings can saturate, and then use the smallest instance that can accommodate that. More cores don't make encoding faster beyond that, but do make it more expensive. Quote:
For x86-64, AVX2 is pretty much required for good perf (and universal). Whether AVX512 is useful depends on settings, resolution, and specific processor. The ARM-based Graviton2 has received really great NEON etc. optimizations for x265, ffmpeg, and other open source encoder components over the last 18 months, and can be performance and price/performance at least as good as x86-64. Graviton3 using the current MASTER branch versions seems about the best price/performance today, and should pull away even more over time as wider SIMD optimization gets implemented for it. AWS has been checking in tons of optimized assembly for ARM. |
|||
Thread Tools | Search this Thread |
Display Modes | |
|
|