View Single Post
Old 25th March 2017, 18:57   #5077  |  Link
Ma
Registered User
 
Join Date: Feb 2015
Posts: 326
Quote:
Originally Posted by easyfab View Post
@Ma could you test this build https://www.sendspace.com/file/biunt6

GCC 6.3 but fprofile with my i2600k
normal GCC 6.3 = 100%
Code:
 --asm=	|SSSE3	|SSE4.1	|AVX
gcc63s2	|100.00%|100.00%|100.00%
gcc70s2	| +5.00%| +0.54%| +0.83%
gcc63pr	|-36.29%| +1.89%| +1.46%
vs2017s2|-30.60%| +0.99%| +0.87%
At SSSE3 level your profiled version is very slow but it is OK -- when you profile the functions that are not executed are compiled for minimize size.

At SSE4.1 level you speed-up +1.89% to normal GCC 6.3 build -- it is better than VS 2017 build (but a bit slower than VS 2017 AVX build). So it is the fastest version that you can execute.
Attached Files
File Type: txt screen2.txt (20.3 KB, 36 views)
Ma is offline   Reply With Quote