Log in

View Full Version : New advanced Benchmark


Pages : [1] 2 3

Sagittaire
7th March 2017, 13:18
Here the source:
http://jfl1974.free.fr/Benchmark/Benchmark.zip

1) x264 1080p 8 bits bt.709 with BD FHD H264 source
2) x265 2160p 10 bits HDR bt.2020 with ~BD UHD HEVC HDR source
3) decoding benchmark with libavcodec and ~BD UHD HEVC HDR source
4) Advanced x265 test for complete SIMD MMX, SSE, SSE2, SSE3, SSE4, AVX, FMA3, AVX2, FMA4, XOP

Update:
-14/04/2017: new automatic benchmark

try and report your result:

|----------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU (Ghz) | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|----------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| i5-3550@3.50 | 7.12 | 1.04 | 48.0 | 0.84 | 0.35 | 0.35 | 0.54 | 0.58 | 0.86 | 0.86 | N/A | N/A |
| i7-5960X@4.40 | 24.03 | 4.38 | 150.0 | 3.50 | 1.01 | 1.01 | 1.64 | 1.78 | 2.78 | 2.80 | 3.43 | N/A |
| R7 1700@3.75 | 21.66 | 3.45 | 110.0 | 2.74 | 1.00 | 1.01 | 1.67 | 1.87 | 2.61 | 2.60 | 2.67 | N/A |
| x5670@4.00 | 11.33 | 1.58 | 69.0 | 1.32 | 0.56 | 0.56 | 0.88 | 0.94 | N/A | N/A | N/A | N/A |
| i7-4770K@4.50 | 10.89 | 2.11 | 72.0 | 1.75 | 0.49 | 0.49 | 0.80 | 0.86 | 1.34 | 1.37 | 1.62 | N/A |
| i7-2600K@4.20 | 9.27 | 1.33 | 58.0 | 1.10 | 0.44 | 0.44 | 0.70 | 0.75 | 1.10 | 1.11 | N/A | N/A |
| i5-2500K@4.50 | 6.95 | 1.15 | 52.0 | 0.98 | 0.41 | 0.41 | 0.61 | 0.66 | 0.96 | 0.95 | N/A | N/A |
|----------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| R7 1700@stock | 18.51 | 2.98 | 103.0 | 2.32 | 0.96 | 0.95 | 1.41 | 1.54 | 2.11 | 2.05 | 2.08 | N/A |
| R7 1700X@stock | 20.61 | | | | | | | | | | | |
| R7 1800X@stock | 21.28 | 3.42 | 118.0 | 2.72 | 1.13 | 1.13 | 1.71 | 1.83 | 2.52 | 2.53 | 2.65 | N/A |
| i7-6700K@stock | 11.95 | 2.28 | 82.0 | 1.89 | 0.56 | 0.55 | 0.88 | 0.97 | 1.47 | 1.50 | 1.85 | N/A |
| i7-7700K@stock | 14.02 | | | | | | | | | | | |
| i7-5960X@stock | 18.18 | 3.30 | 116.0 | 2.66 | 0.76 | 0.76 | 1.23 | 1.33 | 2.09 | 2.09 | 2.58 | N/A |
| i7-6900K@stock | 19.83 | | | | | | | | | | | |
| i7-6950K@stock | 22.10 | | | | | | | | | | | |
| i7-4770@stock | 10.30 | 1.89 | 67.0 | 1.55 | 0.43 | 0.44 | 0.71 | 0.76 | 1.21 | 1.21 | 1.51 | N/A |
| E5-2670@stock | 12.64 | 1.80 | 79.0 | 1.50 | 0.62 | 0.62 | 0.94 | 1.02 | 1.46 | 1.50 | N/A | N/A |
|----------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|

x265 is new benchmark; You can find result for x264 here:

http://www.hardware.fr/getgraphimg.php?id=506&n=9

Sagittaire
2nd April 2017, 23:30
update

Brazil2
3rd April 2017, 10:15
The archive is broken.

Benchmark.zip.007 is only 20 MB and with 7-zip the archive test fails at 225 files 88%: Data error: Benchmark\Sample\Sample1080p.ts

And once extracted Sample1080p.ts is only 22 882 817 bytes large (199 159 304 expected).

Sagittaire
3rd April 2017, 21:02
I see that ... I (re)upload next weekend.

Sagittaire
8th April 2017, 16:26
Update ...

Try and report your result please ...

Danielcz
12th April 2017, 12:44
some error did not complete:

Motenai Yoda
12th April 2017, 21:07
Can you capture fps values and put them into a txt file?
also maybe adding -v quite to ffmpeg cli can help.

Selur
14th April 2017, 11:35
using a R7 1800X@stock I got the same problem as Danilecz -> https://pastebin.com/xEd81pxZ
ffmpeg\ffmpeg.exe -i Sample\Exodus_UHD_HDR_Exodus_draft.mp4 -an -f rawvideo - | x265\x265.exe --input-res 3840x2160 --fps 23.976 - -o Output\x265_2160p.265 --input-depth 10 --output-depth 10 --crf 24 --preset medium --tune grain --ssim --psnr --asm AVX,FMA3,FMA4,LZCNT,BMI1,BMI2,AVX2 --frames 100
aborts with:
frame= 105 fps=3.3 q=-0.0 size= 2551500kB time=00:00:04.37 bitrate=4777574.4kbiError writing trailer of pipe:: Broken pipe
frame= 105 fps=2.5 q=-0.0 Lsize= 2551500kB time=00:00:04.37 bitrate=4777574.4kbits/s speed=0.104x
video:2551500kB audio:0kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.000000%
Conversion failed!

Cu Selur

Sagittaire
14th April 2017, 13:41
broken pipe with ffmpeg is normal because i use --frames 100 for SIMD test with x265.

benchmark work well with R7 1700@stock and R7 1700@3.75 ghz for 2 samples here in France.

I make new automatised benchmark: downlaod, retry and report your result. Thanks.

Selur
14th April 2017, 15:41
I'm still getting the:
av_interleaved_write_frame(): Invalid argument
Error writing trailer of pipe:: Invalid argument
but this time a results.log was created:
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Ryzen 7 1800X Eight-Core Processor | 21.28 | 3.42 | 118 | 2.72 | 1.13 | 1.13 | 1.71 | 1.83 | 2.52 | 2.53 | 2.65 | N/A |
(I ran the Benchmark_auto.bat)

---

also ran Bechmark.bat an it runs through without a problem now

Danielcz
14th April 2017, 17:46
i7 4770 stock (non-K), R9 290X, 32GB RAM, Win 7 utimate x64

|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Core i7-4770 | 10.30 | 1.89 | 67 | 1.55 | 0.43 | 0.44 | 0.71 | 0.76 | 1.21 | 1.21 | 1.51 | N/A |

Sagittaire
14th April 2017, 20:22
I'm still getting the:
av_interleaved_write_frame(): Invalid argument
Error writing trailer of pipe:: Invalid argument
but this time a results.log was created:
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Ryzen 7 1800X Eight-Core Processor | 21.28 | 3.42 | 118 | 2.72 | 1.13 | 1.13 | 1.71 | 1.83 | 2.52 | 2.53 | 2.65 | N/A |
(I ran the Benchmark_auto.bat)

---

also ran Bechmark.bat an it runs through without a problem now

yes ... really powerfull CPU this Rysen 7.

Ma
14th April 2017, 23:03
On my i5 3450S (with enhanced turbo) x265 4K encoding speed was 1.06.
For curiosity I changed x265 to VS 2017 AVX version and the speed was 1.11. Then I used VS 2017 AVX PGO version and the speed was 1.15 (I added only '-v warning' to ffmpeg part to cleaner output).

Then I go to the decoding -- with included in benchmark ffmpeg the speed was 48 to 49, with Zeranoe ffmpeg 2017-04-11 it was 64, with compiled by GCC 7 today snapshot the speed was 66.

The speed difference in x265 encoding is normal, but it is surprising big difference in decoding speed.

RanmaCanada
15th April 2017, 08:07
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Xeon E5-2670 0 | 12.64 | 1.80 | 79 | 1.50 | 0.62 | 0.62 | 0.94 | 1.02 | 1.46 | 1.50 | N/A | N/A |

Sagittaire
15th April 2017, 12:29
On my i5 3450S (with enhanced turbo) x265 4K encoding speed was 1.06.
For curiosity I changed x265 to VS 2017 AVX version and the speed was 1.11. Then I used VS 2017 AVX PGO version and the speed was 1.15 (I added only '-v warning' to ffmpeg part to cleaner output).

Then I go to the decoding -- with included in benchmark ffmpeg the speed was 48 to 49, with Zeranoe ffmpeg 2017-04-11 it was 64, with compiled by GCC 7 today snapshot the speed was 66.

The speed difference in x265 encoding is normal, but it is surprising big difference in decoding speed.

Yes i know for x265, I will try to introduce script to use best x265 compilation for each CPU ... ;-)

Anyway your ffmpeg result is really big surprise. You have link for ffmpeg 64 bit with GCC7 compilation?

Ma
15th April 2017, 15:31
ffmpeg (without any libs) compiled by GCC 7 from snapshot 2017-04-14:
www.msystem.waw.pl/x265/ffmpeg-2017-04-14.7z

The ffmpeg2.exe is the same with '-O2' optimize option instead of default '-O3 -fno-tree-vectorize'. For my i5 3450S this ffmpeg2.exe is slightly faster.

It looks like ffmpeg is faster in decoding HEVC than one month before.

Sagittaire
16th April 2017, 16:31
ffmpeg (without any libs) compiled by GCC 7 from snapshot 2017-04-14:
www.msystem.waw.pl/x265/ffmpeg-2017-04-14.7z

The ffmpeg2.exe is the same with '-O2' optimize option instead of default '-O3 -fno-tree-vectorize'. For my i5 3450S this ffmpeg2.exe is slightly faster.

It looks like ffmpeg is faster in decoding HEVC than one month before.

yes ... really higher speed.

THX for the ffmpeg build.

shinchiro
16th April 2017, 16:43
It looks like ffmpeg is faster in decoding HEVC than one month before.
Yeah..lots of hevc asm landed in ffmpeg's upstream recently, like:
http://git.videolan.org/?p=ffmpeg.git;a=commit;h=947230837cb6d64323590650554dad7abaf9a93f

NikosD
18th April 2017, 10:31
Hey,

I have made a few corrections to your "run.sh" file in order to be more consistent to what is running and what is being displayed.

For example, there is no "SSE3" instruction set tested, it's "SSSE3" and so on.

Also, the "All" setting is actually like "auto" so we don't need that.

I added also --no-asm option and I named it "No SIMD" in the final text, in order to see better the speedups of various SIMD sets compared to "No SIMD" setting

All of my changes are here:
http://txt.do/drlqm

SquallMX
18th April 2017, 19:00
6700K Stock 4.0 Ghz

|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Core i7-6700K | 11.95 | 2.28 | 82 | 1.89 | 0.56 | 0.55 | 0.88 | 0.97 | 1.47 | 1.50 | 1.85 | N/A |

Sagittaire
19th April 2017, 16:17
Hey,

I have made a few corrections to your "run.sh" file in order to be more consistent to what is running and what is being displayed.

For example, there is no "SSE3" instruction set tested, it's "SSSE3" and so on.

Also, the "All" setting is actually like "auto" so we don't need that.

I added also --no-asm option and I named it "No SIMD" in the final text, in order to see better the speedups of various SIMD sets compared to "No SIMD" setting

All of my changes are here:
http://txt.do/drlqm

yes, I have little bug for "all SIMD" bench.
I will make new script for choose the best x264 and x265 build between VS2017, GCC7 and ICC17

Sagittaire
19th April 2017, 16:19
6700K Stock 4.0 Ghz

|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Core i7-6700K | 11.95 | 2.28 | 82 | 1.89 | 0.56 | 0.55 | 0.88 | 0.97 | 1.47 | 1.50 | 1.85 | N/A |

the most interessing result ... thx

you can (re)make benchmark at your max overclocking?

Stock 4.0 Ghz

in fact stock is at 4.1 Ghz because turbo max for all core is 4.1 Ghz
You choose imposed 4.0 Ghz for all core or your i7-6700K is really in default stock frequency?

NikosD
19th April 2017, 16:59
yes, I have little bug for "all SIMD" bench.
I will make new script for choose the best x264 and x265 build between VS2017, GCC7 and ICC17
Don't forget to add the --no-asm option.

The "auto" is just fine for "all" SIMD sets.

Also, as I had told you in PM, the ffmpeg LAV video decoder is a lot faster in HEVC decoding than the script, probably due to more threads involved.

But using that script we can compare different CPUs to each other on the same script and not to absolutely fastest decoding.

SquallMX
19th April 2017, 18:09
the most interesting result ... thx

you can (re)make benchmark at your max overclocking?



in fact stock is at 4.1 Ghz because turbo max for all core is 4.1 Ghz
You choose imposed 4.0 Ghz for all core or your i7-6700K is really in default stock frequency?

i7-6700K stock clock for all cores at full is 4.0 GHz. Unfortunately is a DTR so no overclocking because of the cooling conditions:(.

Motenai Yoda
20th April 2017, 12:19
here my results.log
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Core i7 920 @3.8 | 7.90 | 1.14 | 49 | 0.96 | 0.37 | 0.37 | 0.60 | 0.65 | 0.96 | N/A | N/A | N/A |

RanmaCanada
23rd April 2017, 17:40
Well it looks like according to these benches I need to upgrade to at least R1700.

Thank you for the comprehensive tables.

Fador
25th April 2017, 08:29
Xeon E5-2699 v4 @ Stock

|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Xeon E5-2699 v4 | 31.04 | 6.79 | 135 | 4.95 | 1.72 | 1.73 | 2.66 | 2.88 | 4.21 | 4.22 | 4.87 | N/A |

Yanak
27th April 2017, 15:58
Hello, CPU runs @4.2Ghz
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Core i7-3770K | 10.35 | 1.46 | 72 | 1.22 | 0.47 | 0.47 | 0.76 | 0.81 | 1.22 | 1.23 | N/A | N/A |

shh
30th April 2017, 09:51
2x Xeon E5-2699 v4 @2.20GHz, 128GB RAM, 2xThreads: 22/22 (HT: off)
https://ark.intel.com/de/products/91317/Intel-Xeon-Processor-E5-2699-v4-55M-Cache-2_20-GHz
2x22 full cores oviously are a little better than just one E5-2699v4 with Hyper-Threading on. :)

|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| 2x Xeon E5-2699 v4 | 37.19 | 9.36 | 118 | 5.48 | 2.13 | 2.12 | 3.25 | 3.48 | 4.90 | 4.89 | 5.28 | N/A |

Selur
3rd June 2017, 08:06
I overclocked my system to 4 GHz to see how this would affect the benchmark, and got some mixed results.
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Ryzen 7 1800X Eight-Core Processor | 21.91 | 3.39 | 74 | 2.59 | 1.19 | 1.14 | 1.66 | 1.76 | 2.05 | 2.18 | 2.12 | N/A |4GHz (Mainboard default)
| Ryzen 7 1800X Eight-Core Processor | 22.14 | 3.43 | 73 | 2.52 | 1.21 | 1.15 | 1.64 | 1.73 | 2.22 | 2.24 | 2.32 | N/A |4GHz (only changed the cpu mult)
| Ryzen 7 1800X Eight-Core Processor | 22.35 | 3.41 | 74 | 2.59 | 1.14 | 1.14 | 1.56 | 1.75 | 2.17 | 2.32 | 2.37 | N/A | 4GHz (only changed the cpu mult) + MalwareBytes and Windows Defender disabled
I'm wondering why the LAVC benchmark is so low compared to the old unoverclocked result I posted before (https://forum.doom9.org/showthread.php?p=1803733#post1803733):
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Ryzen 7 1800X Eight-Core Processor | 21.28 | 3.42 | 118 | 2.72 | 1.13 | 1.13 | 1.71 | 1.83 | 2.52 | 2.53 | 2.65 | N/A |@Stock speed

Not aware that I changed anything else on my system aside from updating Windows, since the last benchmark.

Cu Selur

Sagittaire
3rd June 2017, 20:04
Hi Selur

AMD make new compilator optimized for Ryzen (linux only?)
http://developer.amd.com/tools-and-sdks/cpu-development/amd-optimizing-cc-compiler/

You can try to make compilation for x264 and x265, make test with this benchmark and post your result?

THX

jd17
29th July 2017, 15:32
If anyone is interested, I tested my i5-7500 as well, thanks for compiling the benchmark! :)

|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Core i5-7500 | 9.21 | 1.89 | 64 | 1.55 | 0.45 | 0.45 | 0.72 | 0.78 | 1.19 | 1.20 | 1.52 | N/A |

sacd
5th August 2017, 10:24
Interesting benchmark, measures all the right applications.
This is my new editing PC running the benchmark_auto from the first post dated April 14 2017 (CPU OC 4.5GHz):


|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Core i9-7900X | 32.97 | 6.24 | 157 | 4.70 | 1.57 | 1.58 | 2.50 | 2.74 | 4.07 | 3.98 | 4.82 | N/A |

drizzit
6th August 2017, 17:07
My results with the auto bat, manually overclocked to 4.4GHz

|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Core i7-4790K | 11.70 | 2.10 | 78 | 1.74 | 0.49 | 0.49 | 0.82 | 0.89 | 1.39 | 1.40 | 1.75 | N/A |

Sagittaire
10th August 2017, 15:34
you can say if you have real full CPU charge at 100% for x264 test with 8C/16T CPU and higher (if someone have 16C/32T).

thx

Clare
10th August 2017, 21:45
I had to modify the script to run in Linux.


|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Core i7-6700HQ | 8.96 | 1.75 | 88 | 1.43 | 0.27 | 0.28 | 0.49 | 0.54 | 1.05 | 1.12 | 1.45 | N/A |


Here is the link to the modified script for Linux: https://gist.github.com/WyohKnott/72a7f35d28062bf48610a4aea92af788
It depends on python-cpuinfo, and of course, you need to install ffmpeg, x264 and x265 from your distribution repository.

Balthazar2k4
22nd August 2017, 22:02
Here you go:
|------------------------------------------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|------------------------------------------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Ryzen Threadripper 1950X 16-Core Processor | 37.59 | 6.50 | 136 | 4.65 | 2.02 | 2.00 | 2.95 | 3.10 | 3.89 | 4.10 | 4.26 | N/A |

Edit: x264 does NOT utilize all 32 threads. During that portion of the test I was seeing ~80% utilization.

kolak
24th August 2017, 21:01
2x Xeon E5-2699 v4 @2.20GHz, 128GB RAM, 2xThreads: 22/22 (HT: off)
https://ark.intel.com/de/products/91317/Intel-Xeon-Processor-E5-2699-v4-55M-Cache-2_20-GHz
2x22 full cores oviously are a little better than just one E5-2699v4 with Hyper-Threading on. :)

|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| 2x Xeon E5-2699 v4 | 37.19 | 9.36 | 118 | 5.48 | 2.13 | 2.12 | 3.25 | 3.48 | 4.90 | 4.89 | 5.28 | N/A |

Far beyond reasonable scaling. Such a waste of processing power :) This is only good for running many encodes at the same time or splitting long encodes into chunks.
Can you test it with 1 core, so we can see how all this cores are wasted :)

kolak
24th August 2017, 21:05
Interesting benchmark, measures all the right applications.
This is my new editing PC running the benchmark_auto from the first post dated April 14 2017 (CPU OC 4.5GHz):


|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Core i9-7900X | 32.97 | 6.24 | 157 | 4.70 | 1.57 | 1.58 | 2.50 | 2.74 | 4.07 | 3.98 | 4.82 | N/A |

Just shows how good clock makes things fast.

bladerunner1982
27th August 2017, 20:06
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Pentium G4560 | 5.02 | 0.77 | 37 | 0.66 | 0.25 | 0.25 | 0.40 | 0.44 | 0.67 | N/A | N/A | N/A |

zurv
10th October 2017, 19:15
Here is the auto benchmark run on (windows 10 x64)
i9-7980xe @ 4.7 (with AVX offset of 12 and avx512 of 15)

|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Core i9-7980XE | 44.89 | 9.34 | 186 | 6.84 | 2.48 | 2.46 | 3.69 | 3.86 | 5.25 | 5.75 | 6.71 | N/A |


I'm still playing around with the offsets. But 18 cores + avx. stuff gets HOT (mainly the VRMs)

NikosD
15th October 2017, 11:39
Here is the auto benchmark run on (windows 10 x64)
i9-7980xe @ 4.7 (with AVX offset of 12 and avx512 of 15)
[
I'm still playing around with the offsets. But 18 cores + avx. stuff gets HOT (mainly the VRMs)

With an all core turbo clock@4.7 GHz you are approaching a 600W power consumption.

Have you delidded the CPU ?

What is the translation of AVX offset of 12 in clock speed ?

I'm assuming that you have a good liquid cooler in order to go that high.

nevcairiel
15th October 2017, 12:06
What is the translation of AVX offset of 12 in clock speed ?

Every offset reduces the clock by 100MHz, so thats 1200 less in AVX2 mode.

zurv
15th October 2017, 15:46
oddly the 7980 is ez'r to cool than the 7900 (i have that too), the die is much larger on the 7980. Yes, i delided but it wasn't as helpful as with other CPUs (maybe 5C drop)

The issue with the AVX stuff (and 512 even more) is the power draw. Cooling the CPU isn't a problem, but so much power is going through the CPU package and VRM.. ugh. (I have a mono block too.) (Mesh is a problem too as the jacks up power usage.. normally i'd want to run it at 32 (default is 24.) But AVX + higher mesh is monster power/heat.
Also, quick stuff (even this benchmark) is fine. The hard part is something that will be stable for a long time. Doing a 4k video (game capture lossless) for 30min in x264 is still 2-3 hours. (hrmm.. video that no one really look at.. and YT totally re-encodes to pooo quality.. so maybe i shouldn't be using "very slow"... )

I wonder how much faster avx is vs not using it. Ie, over 18 cores I'm losing over 18gigs of speed when using avx. I'm assuming that it is worth it.

(also, the default speed for the 7980 is 2.6ghz. It OCs really well. Most people are getting 4.5-.49 on the OC. AVX is a problem, but most don't use that. That said, the new Time Spy 4k 3dmark test does.)

Atak_Snajpera
15th October 2017, 17:27
(also, the default speed for the 7980 is 2.6ghz. It OCs really well. Most people are getting 4.5-.49 on the OC. AVX is a problem, but most don't use that. That said, the new Time Spy 4k 3dmark test does.)
If you do not care about AVX performance then maybe you should just have bought ThreadRipper 1950x for 2 times less. In non-AVX tasks difference between those two is not big.

zurv
15th October 2017, 17:35
that is a silly reply. One could just have the encoder not use avx. clock to clock intel is faster and there are more core on the 7980. (also the 7980 OCs much higher than the threadripper too. So more cores, clock to clock faster and faster clocks.) perf matters more than cost. Other than encoding nothing i do uses avx. So even for single core usages this CPU is better for me than threadripper. (the 7980 OCs just as well as the 7900 (which i also have) ... well.. other than avx :) )
AVX 512 is a big deal (or so i'm told :) ) the i9s are the only desktop CPU with it. I'm not fully clear by that is :) but i haven't looked into it much.

Atak_Snajpera
15th October 2017, 18:06
clock to clock intel is faster
AMD 1 core+SMT=Intel 1 core+HT

http://i.cubeupload.com/Tsc1ox.png

AVX 512 is a big deal (or so i'm told ) the i9s are the only desktop CPU with it.
You will have to wait another 10 years to see decent support in apps for this SIMD instructions. AVX-512 for now is just a placebo.

NikosD
15th October 2017, 18:16
AVX 512 is a big deal (or so i'm told :) ) the i9s are the only desktop CPU with it. I'm not fully clear by that is :) but i haven't looked into it much.

AVX512 is simply not existent and as Atak_Snajpera said in the previous post, don't expect it soon.

Anandtech.com is looking for apps leveraging AVX512 since i9 7900X launch and they didn't find any to test it.

Also, as i asked you before, do you use an AIO liquid cooler for the 18core ?

littleD
27th December 2017, 07:38
Not really impressive but good to know
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| CPU | x264 | x265 | LAVC | auto | MMX2 | SSE | SSE2 | SSE3 | SSE4 | AVX | AVX2 | All |
|---------------------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|---------|
| Core i5-7200U | 4.79 | 0.94 | 35 | 0.78 | 0.23 | 0.23 | 0.36 | 0.39 | 0.61 | 0.61 | 0.77 | N/A |

Boulder
3rd January 2018, 18:41
So what is the current optimal bang for buck CPU to get if 95% of all use is encoding with x265 and processing with Vapoursynth? I'm unsure about the current reasonable prices because Ryzens and i7's are often quite close?