View Full Version : Which processor to encode x265 4K ?
Pages :
1
2
3
4
[
5]
6
7
8
Atak_Snajpera
24th January 2020, 14:36
So everything is ok then. Activate 3rd server and see if cpu usage goes up.
blublub
25th January 2020, 07:54
So everything is ok then. Activate 3rd server and see if cpu usage goes up.
Hi
I did have to fiddle aroud to get distributed encoding to work and while doing that I did not set the process priority correctly. After setting it to normal the usage is mostly 100%.
thx
Atak_Snajpera
8th February 2020, 14:33
ULTIMATE ENCODING CPU
https://i.postimg.cc/zGGxXvZk/Untitled-1.png
Source -> https://hk.xfastest.com/45759/amd-ryzen-threadripper-3990x-performance-test-kill-intel-i9-10980xe/
Emulgator
8th February 2020, 16:36
Muuuhahahaa !
Still so much improvement above the 3970, unbelievable.
excellentswordfight
8th February 2020, 18:23
Muuuhahahaa !
Still so much improvement above the 3970, unbelievable.
What? Twice the price, twice the number of cores, 18% perf increase.
If its not a saturation/thread utilization issue, i wonder were the bottlneck is? Cache/ram bandwith?
Atak_Snajpera
8th February 2020, 18:37
What? Twice the price, twice the number of cores, 18% perf increase.
I suspect that 5 instances of x265 encoding 1080p at the same time might not be enough for constant 100% cpu usage. If you overclock 3990X to 3.7 GHz then you should get extra ~50% vs 3970x.
Stereodude
8th February 2020, 19:54
What? Twice the price, twice the number of cores, 18% perf increase.
If its not a saturation/thread utilization issue, i wonder were the bottlneck is? Cache/ram bandwith?
Could be the OS. Most versions of Windows 10 only allow for 64 threads in a single processor group.
https://www.anandtech.com/show/15483/amd-threadripper-3990x-review/3
Sagittaire
8th February 2020, 23:35
I suspect that 5 instances of x265 encoding 1080p at the same time might not be enough for constant 100% cpu usage. If you overclock 3990X to 3.7 GHz then you should get extra ~50% vs 3970x.
I think that 5 instance for x265 1080p encoding can't saturate 128 thread. Perhaps time for update at 8 instance or even more with this really particular CPU. Even 3970x with 64 thread seem have problem. In good multithreaded application like cinebench, you have 100% improvement between 3950x and 3970x.
Atak_Snajpera
8th February 2020, 23:47
I think that 5 instance for x265 1080p encoding can't saturate 128 thread. Perhaps time for update at 8 instance or even more with this really particular CPU. Even 3970x with 64 thread seem have problem. In good multithreaded application like cinebench, you have 100% improvement between 3950x and 3970x.
The simplest solution would be reduction of ctu size from 64 to 32.
RanmaCanada
9th February 2020, 02:16
Or someone could test it on Linux, as it doesn't have the issues that windows does when it comes to high core count CPU's, even with "enterprise" edition it still has problems accessing all that power properly.
Nintendo Maniac 64
9th February 2020, 07:01
It could additionally be interesting to test it on normal non-enterprise Windows 10 but with SMT disabled.
NikosD
9th February 2020, 09:26
Summary:
Someone with access to a 3990X reviewer/ tester could ask for benchmarks using:
1) Windows 10 Pro
2) Windows 10 Enterprise
3) Linux
4) SMT disabled
excellentswordfight
9th February 2020, 10:35
I think that 5 instance for x265 1080p encoding can't saturate 128 thread. Perhaps time for update at 8 instance or even more with this really particular CPU. Even 3970x with 64 thread seem have problem. In good multithreaded application like cinebench, you have 100% improvement between 3950x and 3970x.
And "good multithreaded application" or maybe its more suted to call it calculations, are better of running on an GPU.
I suspect that 5 instances of x265 encoding 1080p at the same time might not be enough for constant 100% cpu usage. If you overclock 3990X to 3.7 GHz then you should get extra ~50% vs 3970x.
Yes that could be the case. But not sure about the practicality of that scenario... In one of the reviews I read is that they reached a maximum of 3.6Ghz for all core under load, that with a 280mm AIO and pullng 650W from the wall.
Could be the OS. Most versions of Windows 10 only allow for 64 threads in a single processor group.
https://www.anandtech.com/show/15483/amd-threadripper-3990x-review/3
Yes I read that article as well, and alot of others. It's still a guess game of why the resaults in that benchmark screenshot posted are the way they are.
3990X is a very cool CPU, but tbh its not a good buy for a workstation CPU *. Look at the benchmarks, you get good performance scaling in very few applications, 3970X is faster in quite a few cases cause of higher clock speeds and the applications were the scaling is good like in cinebench, yeah I rather just get octane and 3x 2080Ti if I were an c4d user...
* in general that is, iaf you have workloads were this makes sense for you, then yeah good for you.
Emulgator
9th February 2020, 23:42
What? Twice the price, twice the number of cores, 18% perf increase.
If its not a saturation/thread utilization issue, i wonder were the bottlneck is? Cache/ram bandwith?
I would have thought that thermal restrictions of these 9-die monsters would have capped any progress above the TR3960X around 5..10% per generation, so I am positively surprised by these +18% at the same 280W TDP.
(Price blanked out by me of course.)
I see the TR3990X base clock significantly limited to 2,9GHz for that reason...
(TR3960X was 3,8GHz, TR3970X was still 3,7GHz)
Well, maybe a bit unfair to compare multi-die CPUs against single-die CPUs anyway, but the race is on.
For bigger dies the manufacturer loses the bet against failures more often.
Yields of smaller dies seem more rewarding.
The best bet seems to be after the race: to compile a CPU out of the smaller die survivers;-)
Atak_Snajpera
10th February 2020, 15:27
Well, maybe a bit unfair to compare multi-die CPUs against single-die CPUs anyway, but the race is on.
Q6600 was also a multi-die CPU.
https://www.behardware.com/medias/photos_news/00/18/IMG0018284.jpg
Stereodude
24th February 2020, 16:09
I was able to benchmark my new R9 3950X system against my E5-2687W v2 in x265 using the settings I've been playing around with for high image quality encodes. Both are running stock clocks.
x265 version:
x265 [info]: HEVC encoder version 3.1.1+1-04b37fdfd2dc
x265 [info]: build info [Windows][GCC 9.1.1][64 bit] 8bit+10bit+12bit
x265 [info]: (libavcodec 58.53.101)
x265 [info]: (libavformat 58.28.101)
x265 [info]: (libavutil 56.30.100)
x265 [info]: (lsmash 2.16.1)
x265 options:
--pools 4 -F 1 --crf 16.0 -p veryslow --no-sao --aq-mode 1 --aq-strength 1.15 --vbv-maxrate 25000 --vbv-bufsize 25000 --level 5.0 --keyint 125 --open-gop -D 10 --colorprim "bt709" --transfer "bt709" --colormatrix "bt709" --sar 1:1
The source is a piece of a 1080i60 concert blu-ray that was restored to 1080p25 (1920x1080). I encoded the same 2500 frame segment multiple times and measured the time to encode. Four simultaneous encodes will effectively saturate the E5-2687W v2. It took an average time of 24001 seconds to encode the 2500 frames (four simultaneous encodes) on the E5-2687W v2. That gives a combined fps of the 4 simultaneous encodes of 0.417 fps. Four simultaneous encodes will hit 55-60% CPU on the R9 3950X. It took an average time of 7191 seconds to encode the 2500 frames (four simultaneous encodes) on the R9 3950X. That gives a combined fps of the 4 simultaneous encodes of 1.391 fps. Eight simultaneous encodes will effectively saturate the R9 3950X. It took an average time of 11302 seconds to encode the 2500 frames (eight simultaneous encodes) on the R9 3950X. That gives a combined fps of the 8 simultaneous encodes of 1.770 fps.
As a point of reference x265_FHD_Benchmark said my R9 3950X is ~4x faster than the E5-2687W v2 (77.49 fps vs. 19.05 fps).
TLDR / summary:
E5-2687W v2 - 0.417 fps (4 simultaneous encodes combined)
R9 3950X - 1.391 fps (4 simultaneous encodes combined)
R9 3950X - 1.770 fps (8 simultaneous encodes combined)
jfcarbel
15th March 2020, 00:25
Yep. However If you have x570 then you will have to add this step to get USB 3 ports to work
https://www.sevenforums.com/drivers/419688-will-new-amd-x570-chipset-mobos-have-win-7-chipset-drivers-post3441921.html?#post3441921
Hi Atak, finally getting to this build this weekend for 3900X
Is this still neccessary as I think the Jan 15 update you added x570 drivers. Doing this on a Asrock Taichi x570. Do the USB ports work after installing from an ISO updated using you latest now?
Seems there are modded drivers that will even allow for USB to work. Are these the ones you used?
https://www.win-raid.com/t4960f52-Solution-Win-Drivers-for-USB-Controllers-of-new-AMD-Chipset-Systems-5.html#msg99977
Also will this work with IDE SATA set to AHCI mode?
This is going to be installed on a Samsung NVMe drive.
Also any tips on a dual boot Win7 and Win10
As I notice that you mention to enable CSM on Motherboards to be able to install Win7, but this noticed this at AMD site. Is this true or is much of what they are saying negligible as well?
Also not sure if Win7/10 can peacefully co-exist on an NVMe. As I keep reading that in order for Win10 install to see NVMe drive you need UEFI.
For users running the AMD Ryzen Processor with Radeon Vega Graphics, AMD strongly recommends that your motherboard firmware (UEFI) be configured full UEFI Mode to ensure optimal performance, compatibility, and stability with the Windows 10 operating system.
Best pick for x265 4k encoding, Ryzen 5 3600 or Ryzen 7 2700x? How much fps I can expect from those? How much performance I would get if I go for Ryzen 7 3700x?
excellentswordfight
4th May 2020, 09:13
Best pick for x265 4k encoding, Ryzen 5 3600 or Ryzen 7 2700x? How much fps I can expect from those? How much performance I would get if I go for Ryzen 7 3700x?
3700x i quite a bit faster then 2700x, maybe about 30-40% cause of much better avx performance that was pretty limited in Ryzen 1000 and 2000-series. So i would say its worth the money.
How fast will it be? How long is a piece of string?
Between Ryzen 7 2700x and Ryzen 5 3600, which will give better performance? I have no clue about x265. I have to rip my 1080p and 4K BD collection.
Groucho2004
4th May 2020, 14:33
Between Ryzen 7 2700x and Ryzen 5 3600, which will give better performance? I have no clue about x265. I have to rip my 1080p and 4K BD collection.This (https://www.techpowerup.com/review/amd-ryzen-5-3600/13.html) might help.
Forteen88
5th May 2020, 00:06
This (https://www.techpowerup.com/review/amd-ryzen-5-3600/13.html) might help.And the comparisons here gives similar results,
https://forum.doom9.org/showthread.php?p=1908148#post1908148
MeteorRain
5th May 2020, 13:56
I have to say, zen2 is a great die.
crystalfunky
9th May 2020, 10:18
I have a 3700x and a B450 chipset with Crucial 3600 DDR4 16GB Ram.
I only get 2 fps when encoding 4K HDR10 videos. Is that normal? I'm encoding with Staxrip on preset slow.
microchip8
9th May 2020, 10:24
I have a 3700x and a B450 chipset with Crucial 3600 DDR4 16GB Ram.
I only get 2 fps when encoding 4K HDR10 videos. Is that normal? I'm encoding with Staxrip on preset slow.
Yes, it's normal
Sagittaire
9th May 2020, 11:22
I have a 3700x and a B450 chipset with Crucial 3600 DDR4 16GB Ram.
I only get 2 fps when encoding 4K HDR10 videos. Is that normal? I'm encoding with Staxrip on preset slow.
and it's really good speed for 4K HDR10 encoding.
You have 100% CPU usage?
RanmaCanada
9th May 2020, 21:48
I have a 3700x and a B450 chipset with Crucial 3600 DDR4 16GB Ram.
I only get 2 fps when encoding 4K HDR10 videos. Is that normal? I'm encoding with Staxrip on preset slow.
Not only is it normal, it's actually quite fast. What preset are you using?
crystalfunky
10th May 2020, 10:33
and it's really good speed for 4K HDR10 encoding.
You have 100% CPU usage?
Most of the time it's around 95 and 100.
Not only is it normal, it's actually quite fast. What preset are you using?
slow
blublub
15th May 2020, 06:55
Most of the time it's around 95 and 100.
slowFor preset slow that sounds about right.
crystalfunky
15th May 2020, 13:29
I'm encoding Black Hawk Down 4K HDR right now.
Preset slow
CRF 24, I get around 17 Mbit/s
FPS: around 2,6
59 % and it still says 9 hrs remaining.
Nico8583
26th June 2020, 10:10
2 FPS with a 3700X but anyone knows with a 3900X ?
excellentswordfight
26th June 2020, 13:43
2 FPS with a 3700X but anyone knows with a 3900X ?
No, but I'm using a 7402P, I get around 5fps for single file encodes using preset slow for 2160p content. Not 100% utilization though, around 80% (24C/48T).
7402P is actually very impressive, keeps clocks at 3.3Ghz in an 1u server under this rather heavy avx2 load. No intel high core count server/workstation system I've encoded on has been able to do so.
Boulder
26th June 2020, 19:04
With my 3900X and a script which has some motion-compensated denoising + downscaling to 1440p in linear light at 32bit precision and debanding, all in 16bit depth, I get over 2fps in x265 using --preset slower + hme-search umh,umh,star and slight tweaks. Using --rskip 2 makes things go faster and seems to be higher quality as the onion artifacts are less apparent. Then again, I use --rd-refine which should make things much slower.
benwaggoner
26th June 2020, 19:08
With my 3900X and a script which has some motion-compensated denoising + downscaling to 1440p in linear light at 32bit precision and debanding, all in 16bit depth, I get over 2fps in x265 using --preset slower + hme-search umh,umh,star and slight tweaks. Using --rskip 2 makes things go faster and seems to be higher quality as the onion artifacts are less apparent. Then again, I use --rd-refine which should make things much slower.
It'd be interesting to compare --preset ultrafast and see how much of the fps is x265 versus the preprocessing. It may well be the long pole isn't encoding.
Boulder
26th June 2020, 20:40
It'd be interesting to compare --preset ultrafast and see how much of the fps is x265 versus the preprocessing. It may well be the long pole isn't encoding.
Sure, I'll try to remember to do some tests after the chain of encodes finishes. Right now I'm processing a video like I described, and the frameserver part uses maybe an average of 20% of total CPU. Sometimes more, sometimes much less depending on how much the denoising part actually has to work on the frame. x265 uses 40-75% of CPU.
charliebaby
27th June 2020, 10:22
Me Benchmark Ryzen Threadrriper 2990wx 32 core 3700mhz x265 3.4.7 :-)
https://i.goopics.net/yWoZD.png
https://i.goopics.net/Z0mqG.png
Boulder
28th June 2020, 09:03
I did a little test of 500 frames, my standard Avisynth script template and encoded at 1440p (2560 x 1064).
Without any tweaks other than the HDR metadata, --preset ultrafast was 5.90 fps, --preset slower 2.81 fps.
With my x265 settings template, --preset ultrafast was 5.71 fps, --preset slower 2.63 fps.
Without any processing, i.e. just loading and decoding the source with DGSource and encoding at 4K (3840 x 1600), --preset ultrafast was 13.42 fps and --preset slower 1.54 fps using my x265 template.
Forteen88
29th June 2020, 10:28
Me Benchmark Ryzen Threadrriper 2990wx 32 core 3700mhz x265 3.4.7 :-)
https://i.goopics.net/yWoZD.png
https://i.goopics.net/Z0mqG.pngDo you normally encode with "Placebo" preset? When doing benchmarks, it's better to encode with the preset that you (or people in general) use mostly. I use almost only "Slower" preset.
charliebaby
29th June 2020, 17:23
Do you normally encode with "Placebo" preset? When doing benchmarks, it's better to encode with the preset that you (or people in generally) use mostly. I use almost only "Slower" preset.
YES juste PLACEBO Very Best :-)
DMD
18th January 2022, 22:18
Good evening.
I need to upgrade PC with CPU Ryzen 9 5900X, is it a good choice to encode H265 with reasonable fps?
Thank you
excellentswordfight
18th January 2022, 22:26
Good evening.
I need to upgrade PC with CPU Ryzen 9 5900X, is it a good choice to encode H265 with reasonable fps?
Thank you
Yes, either that or 12700k (with DDR4 if you want better bang for the buck).
https://tpucdn.com/review/intel-core-i7-12700k-alder-lake-12th-gen/images/encode-h265.png
I get about 20fps for 1080p Bluray re-encodes with x265 preset slow on a 12700k for reference, 5900X should be rather similar in terms of performance.
DMD
18th January 2022, 22:48
Yes, either that or 12700k (with DDR4 if you want better bang for the buck).
https://tpucdn.com/review/intel-core-i7-12700k-alder-lake-12th-gen/images/encode-h265.png
Personally I am interested in H265 4K encoding, I have no idea how much lower fps can be compared to 1080p (medium preset or slower?)
thank you
excellentswordfight
18th January 2022, 22:56
Personally I'm interested in H265 4K encoding, I have no idea how much lower fps can be than 1080p.
thank you
It will be much lower, I saw that you were talking about preset slower in the other thread as well, it will take a looong time to encode full length movies of thats your plan. I would guess that you only would get a few fps on an 5900x.
What speed is usable is individual I guess, but for me there isnt a use case for doing uhd bluray re-encodes with x265 at presets lower then slow for private use.
DMD
19th January 2022, 09:18
It will be much lower, I saw that you were talking about preset slower in the other thread as well, it will take a looong time to encode full length movies of thats your plan. I would guess that you only would get a few fps on an 5900x.
What speed is usable is individual I guess, but for me there isnt a use case for doing uhd bluray re-encodes with x265 at presets lower then slow for private use.
Good morning.
To understand and have a reference with the hardware, I did a test with sample video @ 23.976fps with the current PC (Intel Core i5 9600K \ GeForce RTX 2070).
I set the StaxRip preset to Slower, the process is 0.1 FPS.
MISSION IMPOSSIBLE!!! :(
https://i.postimg.cc/tRVz0PsG/Screenshot-19-01-2022-08-34-42.png
Making a prediction of around 2 FPS (I hope) with a Ryzen 9 5900X, I made this reasoning, if that's correct.
In the duration of a second of movie there are about 24fps and the processor processes 2 FPS, it means that the ratio is 1:12, so a movie with a duration of 2 hours needs 24 hours of processing, is that correct?
If necessary, it is possible to divide the source file into several parts, in order to do the processing at different times and then carry out the final union?
Thank you
Boulder
19th January 2022, 09:30
Some speedups: remove --hme (EDIT: note not --umh but --hme) and --rd-refine, set --no-amp --limit-modes 3 --ref 4. I'm not sure --pmode and --pme help at all. Set --ctu 32 for better threading (no, it won't reduce quality).
And set --no-sao unless you like to have all the details removed.
Blue_MiSfit
19th January 2022, 09:42
SAO is generally quite useful at lower bitrates. x265's implementation is not perfect, but it's net positive a lot of the time :)
When you're gunning for total transparency it may indeed be good to disable it.
excellentswordfight
19th January 2022, 09:54
Good morning.
Making a prediction of around 2 FPS (I hope) with a Ryzen 9 5900X, I made this reasoning, if that's correct.
In the duration of a second of movie there are about 24fps and the processor processes 2 FPS, it means that the ratio is 1:12, so a movie with a duration of 2 hours needs 24 hours of processing, is that correct?
If necessary, it is possible to divide the source file into several parts, in order to do the processing at different times and then carry out the final union?
Thank you
Yes, I would guess that 2fps isnt that far off (although I would think that it would be closer to 1). And yes that calculation would be correct. Slow would be about 2x faster then Slower, if you actually do some tests I would think you will see that it will not be worth the speed penalty.
And yes there are ways were you can do it in parts. Some easy (like pausing the encode), and some more complex (chunk encoding).
edit. Did a quick test and 1,2fps for 2160p with preset slower, and 4fps for preset slow. 12700k@125w. I noticed though with x265 that speed is a bit dependent on source complexity so your mileage may vary.
Set --ctu 32 for better threading (no, it won't reduce quality).
As we talking about UHD at low presets CPU saturation shouldnt be an issue even with --ctu 64 for 5900X/12700k. I've found scaling to be decent up to about 16c/32t at default settings (1080p up to about 8c/16t), then the performance boost starts to take a drastic turn.
RanmaCanada
20th January 2022, 02:00
I know on my 5800x it will take about 24 hours to do a 4k encode with preset slow. If OP wants to make an encoding only machine, I would honestly say at this point get a Threadripper or EPYC and use Ripbot to encode. But that's insane talk as no one wants to spend that much on a machine that will be beaten in say 5 years by consumer grade hardware, unless of course this is a tax write off haha.
benwaggoner
22nd January 2022, 02:32
I know on my 5800x it will take about 24 hours to do a 4k encode with preset slow. If OP wants to make an encoding only machine, I would honestly say at this point get a Threadripper or EPYC and use Ripbot to encode. But that's insane talk as no one wants to spend that much on a machine that will be beaten in say 5 years by consumer grade hardware, unless of course this is a tax write off haha.
Systems used for professional encoding get replaced a lot more often than every five years due to the big improvements in encoding time and watts/pixel over just a couple CPU generations. Heck, power savings alone would likely justify more frequent updates for a hobbyist that has their computers doing video encoding even 50% of the time.
Nico8583
14th February 2022, 13:27
Hi :)
Now there is a new Intel gen, what is the best choice to encode 4K ? I'm still on a B450M + 3700X combo but I would like to know which one between 5900X or i9 12900 would be the best choice if I upgrade ? The 5900X could be used on the B450M platform. Or perhaps a 3900X used if price is good.
Thanks !
vBulletin® v3.8.11, Copyright ©2000-2026, vBulletin Solutions Inc.