Log in

View Full Version : Current Patches, Where to get them, How they affect speed/output


Pages : 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 [23] 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69

Ranguvar
17th September 2008, 00:15
ahhh....

Soichiro
17th September 2008, 00:57
Excited about the b-adapt 2 speedboost!

Still would like some working multithreading up in there. :(

Dark Shikari
17th September 2008, 01:01
Still would like some working multithreading up in there. :(What's the problem? Works fine here...

kemuri-_9
17th September 2008, 01:07
What's the problem? Works fine here...

maybe whining that it's not as fast as b-adapt 1?

Sharktooth
17th September 2008, 01:27
Still would like some working multithreading up in there. :(
just use a sane number of b-frames.

Soichiro
17th September 2008, 02:13
Of course it's all my fault if it's being slow. Of course I'm just whining and being stupid. -_-;

I am using a sane number of b-frames. It's set to 6. And while it's better than before, it still doesn't use all of both cores with --b-adapt 2, which is something ideally I would like to see fixed. It's not too crippling on a dual core (though 70% CPU usage isn't something to be happy about either), but I imagine the new method would cause much more of a slowdown on a 4+ core system.

Sharktooth
17th September 2008, 02:15
6 is not sane... 3 (or 4 maybe) is...
and as Dark_Shikari said, even 2 with b-adapt 2 will look better then 16 with b-adapt 1.

Soichiro
17th September 2008, 02:18
Well my tests show that 6 is a good balance between speed and quality for animated material. Almost all my anime encodes have shown large (10%+) b-frame usage in up to 6-frame sequences, while beyond 6 frames is below 1%. I would call that "sane".

Sharktooth
17th September 2008, 02:21
not with b-adapt 2... using 6 b-frames will kill the encoding speed, so it's not a good balance.
use 3 b-frames with b-adapt 2. also for animes, maybe you better stick with b-adapt 1 and 6 or more b-frames.

Soichiro
17th September 2008, 02:23
Has b-adapt 2 shown bad results on anime? o.O

Dark Shikari
17th September 2008, 02:27
Has b-adapt 2 shown bad results on anime? o.ONo, I haven't done much testing on anime.

The question is whether you lose more from the lower --bframes than you gain from b-adapt 2, and I highly doubt that is the case.

Sharktooth
17th September 2008, 02:27
no, but since anime encodings will be happy with more b-frames, b-adapt 1 is faster and probably will be not much different than b-adapt 2 with less b-frames.

Dark Shikari
17th September 2008, 02:33
no, but since anime encodings will be happy with more b-frames, b-adapt 1 is faster and probably will be not much different than b-adapt 2 with less b-frames.We don't know this for sure :p Some testing will have to be done.

I have some anime clips that benefit tremendously from b-adapt 2. Though it might be because they're full of fades.

Soichiro
17th September 2008, 02:33
Ah, that makes sense, then. So I should keep using the old method until the new method is more optimized/properly multithreaded, then (unless I have extra time to kill :p).

Thank you. ;D

Quark.Fusion
17th September 2008, 12:37
AFAIK --b-adapt 2 can't be multithreaded as it relies on it's previous result (correct me if I'm wrong). Get process explorer and see how much cpu b-adapt thread uses, if it's greatly lesser that 100%/(num of cores) — try thread-pool patch.

kemuri-_9
17th September 2008, 13:05
working on some b-adapt comparisons
http://forum.doom9.org/showthread.php?p=1184814#post1184814

skystrife
17th September 2008, 14:04
x264.979.modified.exe (http://www.mediafire.com/?rzixxtfxxqc) - Alternate Download (http://skystrife.com/x264/x264.979.modified.exe)

Patches used:

x264_hrd_pulldown.09_interlace.diff

gcc 3.4.5 fprofiled build with -march=pentium2.

Sharktooth
17th September 2008, 14:07
tnx skystrife :)

gav1577
17th September 2008, 14:32
x264.979.modified.exe (http://www.mediafire.com/?rzixxtfxxqc) - Alternate Download (http://skystrife.com/x264/x264.979.modified.exe)

Patches used:

x264_hrd_pulldown.09_interlace.diff

gcc 3.4.5 fprofiled build with -march=pentium2.

Thanks :)

fields_g
17th September 2008, 14:34
If the next 21 builds come as quickly as the last 21, we will be at r1000 in 2 weeks...... wow....

It's not the quantity changes, but the QUALITY of the changes that impress me though! Thanks Devs!!

burfadel
17th September 2008, 14:41
Just wondering whether anyone has done tests adjusting the b-frame bias with --b-adapt 2? would be interesting to see the outcome of such tests. A different bias may be beneficial for anime, or resolve those having issues with psy trellis? (thats a question not a statement)!

Ranguvar
18th September 2008, 21:27
Sorry for the wait. Busy with encode tests.

Home (http://sites.google.com/site/ranguvar13/x264-builds)
Direct download (http://sites.google.com/site/ranguvar13/x264-builds/rang_x264_r0979.7z?attredirects=0), Mirrors (http://www.rapidspread.com/file.jsp?id=vjhzyxcku8)

x264 r979 from Git (patched, fprofiled).
Thanks to the x264 devs, including those who made the patches I use.
Compiled by Ranguvar on September 18th, 2008, with GCC 4.3.2.
DON'T think that because I used march=athlon it restricts the CPUs you can use.
It seems to improve performance (VERY slightly) for all CPUs.

Open this archive with the free, multi-platform tools 7-Zip or p7zip. Compressed with LZMA.
The src folder contains the patched source code.
The bin folder contains a binary executable, and a DLL for those apps that use it
(May not work in AviDemux. Get those DLLs from LoRd_MuldeR).

Git: git://git.videolan.org/x264.git
Info, and source tarballs: http://www.videolan.org/developers/x264.html
Changelog: http://git.videolan.org/gitweb.cgi?p=x264.git
Vanilla builds: http://x264.nl/
Discussion: http://forum.doom9.org/forumdisplay.php?f=77
http://forum.doom9.org/showthread.php?t=130364


Applied patches, in the order applied (included, unchanged, in the patches folder):

patch -p1 -i ../x264diffs/rang_x264_version.diff
patch -p1 -i ../x264diffs/x264_dll_alignment_fix.01.diff
patch -p1 -i ../x264diffs/x264_hrd_pulldown.09_interlace.diff
patch -p0 -i ../x264diffs/x264_fp-eta.01.r680.diff


CLI used for build: ./configure --enable-shared --extra-cflags="-march=athlon -pipe"
make fprofiled VIDS="../enctests/deadline_cif.y4m"

Platform: X86
System: MINGW
asm: yes
avis input: yes
mp4 output: yes
pthread: yes
gtk: no
debug: no
gprof: no
PIC: no
shared: yes
visualize: no

Ranguvar
20th September 2008, 16:20
WARNING: --bime b0rked in all revisions since and including r980, including this build. Don't use.

Home (http://sites.google.com/site/ranguvar13/x264-builds)
Direct download (http://sites.google.com/site/ranguvar13/x264-builds/rang_x264_r0982.7z?attredirects=0), Mirrors (http://www.rapidspread.com/file.jsp?id=jdfnjy3qms)

x264 r982 from Git (patched, fprofiled).
Thanks to the x264 devs, including those who made the patches I use.
Compiled by Ranguvar on September 20th, 2008, with GCC 4.3.2.
DON'T think that because I used march=athlon it restricts the CPUs you can use.
It seems to improve performance (VERY slightly) for all CPUs.

Open this archive with the free, multi-platform tools 7-Zip or p7zip. Compressed with LZMA.
The src folder contains the patched source code.
The bin folder contains a binary executable, and a DLL for those apps that use it
(May not work in AviDemux. Get those DLLs from LoRd_MuldeR).

Git: git://git.videolan.org/x264.git
Info, and source tarballs: http://www.videolan.org/developers/x264.html
Changelog: http://git.videolan.org/gitweb.cgi?p=x264.git
Vanilla builds: http://x264.nl/
Discussion: http://forum.doom9.org/forumdisplay.php?f=77
http://forum.doom9.org/showthread.php?t=130364


Applied patches, in the order applied (included, unchanged, in the patches folder):

patch -p1 -i ../x264diffs/rang_x264_version.diff
patch -p1 -i ../x264diffs/x264_dll_alignment_fix.01.diff
patch -p1 -i ../x264diffs/x264_hrd_pulldown.09_interlace.diff
patch -p0 -i ../x264diffs/x264_fp-eta.01.r680.diff


CLI used for build: ./configure --enable-shared --extra-cflags="-march=athlon -pipe"
make fprofiled VIDS="../enctests/deadline_cif.y4m"

Platform: X86
System: MINGW
asm: yes
avis input: yes
mp4 output: yes
pthread: yes
gtk: no
debug: no
gprof: no
PIC: no
shared: yes
visualize: no

gigah72
20th September 2008, 16:44
under which catergory do the last 3 patches fall (since 979), e.g. stability-, general-, internal-, bug- or ??-fix?
i ask, 'coz i notice a decrease of ~15% in encodingspeed with my usual testfiles?


EDIT:
the problem was in my setup, sorry, speed is OK.

Warpman
20th September 2008, 17:27
under which catergory do the last 3 patches fall (since 979), e.g. stability-, general-, internal-, bug- or ??-fix?
i ask, 'coz i notice a decrease of ~15% in encodingspeed with my usual testfiles?

The last 3 commits were mostly bugfixes I don't expect any big performence gain or loss.

skystrife
20th September 2008, 23:37
x264.983.modified.exe (http://www.mediafire.com/?jmwztuy5kbi) - Alternate Download (http://skystrife.com/x264/x264.983.modified.exe)

Patches used:

x264_hrd_pulldown.09_interlace.diff

gcc 3.4.5 fprofiled build with -march=pentium2.

Apologies for the recent absence--cleaning out the water loop was worth it though.

Ranguvar
21st September 2008, 00:26
Home (http://sites.google.com/site/ranguvar13/x264-builds)
Direct Download (http://sites.google.com/site/ranguvar13/x264-builds/rang_x264_r0983.7z?attredirects=0), Mirrors (http://www.rapidspread.com/file.jsp?id=zehc3cjzv8)

x264 r983 from Git (patched, fprofiled).
Thanks to the x264 devs, including those who made the patches I use.
Compiled by Ranguvar on September 20th, 2008, with GCC 4.3.2.
DON'T think that because I used march=athlon it restricts the CPUs you can use.
It seems to improve performance (VERY slightly) for all CPUs.

Open this archive with the free, multi-platform tools 7-Zip or p7zip. Compressed with LZMA.
The src folder contains the patched source code.
The bin folder contains a binary executable, and a DLL for those apps that use it
(May not work in AviDemux. Get those DLLs from LoRd_MuldeR).

Git: git://git.videolan.org/x264.git
Info, and source tarballs: http://www.videolan.org/developers/x264.html
Changelog: http://git.videolan.org/gitweb.cgi?p=x264.git
Vanilla builds: http://x264.nl/
Discussion: http://forum.doom9.org/forumdisplay.php?f=77
http://forum.doom9.org/showthread.php?t=130364


Applied patches, in the order applied (included, unchanged, in the patches folder):

patch -p1 -i ../x264diffs/rang_x264_version.diff
patch -p1 -i ../x264diffs/x264_dll_alignment_fix.01.diff
patch -p1 -i ../x264diffs/x264_hrd_pulldown.09_interlace.diff
patch -p0 -i ../x264diffs/x264_fp-eta.01.r680.diff


CLI used for build: ./configure --enable-shared --extra-cflags="-march=athlon -pipe"
make fprofiled VIDS="../enctests/deadline_cif.y4m"

Platform: X86
System: MINGW
asm: yes
avis input: yes
mp4 output: yes
pthread: yes
gtk: no
debug: no
gprof: no
PIC: no
shared: yes
visualize: no

skystrife
21st September 2008, 04:21
x264.985.modified.exe (http://www.mediafire.com/?yjuhyfzzykm) - Alternate Download (http://skystrife.com/x264/x264.985.modified.exe)

Patches used:

x264_hrd_pulldown.09_interlace.diff

gcc 3.4.5 fprofiled build with -march=pentium2.

Romario
21st September 2008, 04:23
Why you not use GCC 4.3.2, it's mush better then too old 3.4.5 ?

Sagekilla
21st September 2008, 05:14
Just saw r984 hit git, how much of a speedup can we expect from this?

Dark Shikari
21st September 2008, 05:23
Just saw r984 hit git, how much of a speedup can we expect from this?Depends on settings, since it speeds up hpel, and hpel is run a constant number of times regardless of settings (so slower settings == less speedup).

I think the speedup should be about 0-1% depending on settings. There's more coming afterwards; I'm going to be merging holger's intra pred and a few other things.

Sagekilla
21st September 2008, 05:30
Any speedup is good to hear. On that note, was this the one GSOC project that was "successful," besides the B frame patch you did yourself?

Dark Shikari
21st September 2008, 05:37
Any speedup is good to hear. On that note, was this the one GSOC project that was "successful," besides the B frame patch you did yourself?Yes. After this set of patches will come the merging of the other main successful patch.

skystrife
21st September 2008, 22:35
x264.986.modified.exe (http://www.mediafire.com/?tb2wv3r2nhm) - Alternate Download (http://skystrife.com/x264/x264.986.modified.exe)

Patches used:

x264_hrd_pulldown.09_interlace.diff

gcc 3.4.5 fprofiled build with -march=pentium2.

Audionut
21st September 2008, 23:00
I got problems trying to get the version info.

mingw32 says "./version.sh: git-rev-list: command not found"

msysgit says "./version.sh: line 5: join: command not found"

Dark Shikari
21st September 2008, 23:08
I got problems trying to get the version info.

mingw32 says "./version.sh: git-rev-list: command not found"

msysgit says "./version.sh: line 5: join: command not found"The current script doesn't work with the latest version of git because the command names were changed from "git-$command" to "git $command", I think.

LoRd_MuldeR
21st September 2008, 23:09
I got problems trying to get the version info.

mingw32 says "./version.sh: git-rev-list: command not found"

msysgit says "./version.sh: line 5: join: command not found"

You must add the Git "bin" folder to your PATH variable. Then the first warning will be gone :)

I do it this way:
export PATH=$PATH:"/c/Program Files (x86)/Git/bin"

EDIT: I currently use Git version 1.5.6.1.1071.g76fb and I get the "join: command not found" warning too (but this seems to be no problem).

foxyshadis
21st September 2008, 23:25
Machinae Supremacy talk moved (https://forum.doom9.org/showthread.php?t=141346).

Audionut
21st September 2008, 23:34
Thanks guys.

Lux Delux
21st September 2008, 23:50
I'm using megui with the latest skystrife build, just now I noticed (in just 1 scene for now) a strange blocking in an area

http://img217.imageshack.us/img217/1049/snapshot20080922004609uq2.jpg (http://imageshack.us)

below is the 1st and 2nd pass settings


Job commandline: "C:\Program Files (x86)\megui\tools\x264\x264.exe" --pass 1 --bitrate 1137 --stats "D:\WALKER S1\01\udegrain2.stats"
--ref 16 --mixed-refs --no-fast-pskip --bframes 6
--b-adapt 2 --b-pyramid --b-rdo --bime --weightb --direct auto --filter -1:-1 --subme 7 --trellis 2 --partitions p8x8,b8x8,i4x4,i8x8 --8x8dct
--me tesa --merange 32 --threads auto --thread-input --sar 59:65 --progress --no-dct-decimate --no-psnr --no-ssim --output NUL "D:\WALKER S1\01\udegrain2.avs"

Job commandline: "C:\Program Files (x86)\megui\tools\x264\x264.exe" --pass 2 --bitrate 1137 --stats "D:\WALKER S1\01\udegrain2.stats"
--ref 16 --mixed-refs --no-fast-pskip --bframes 6
--b-adapt 2 --b-pyramid --b-rdo --bime --weightb --direct auto --filter -1:-1 --subme 7 --trellis 2 --partitions p8x8,b8x8,i4x4,i8x8 --8x8dct
--me tesa --merange 32 --threads auto --thread-input --sar 59:65 --progress --no-dct-decimate --no-psnr --no-ssim --output "D:\WALKER S1\01\udegrain2.mkv" "D:\WALKER S1\01\udegrain2.avs"

LoRd_MuldeR
21st September 2008, 23:54
Lux Delux, it usually helps to upload a short sample of your source, which can be used to reproduce the problem...

Audionut
22nd September 2008, 00:28
I had funky blocking problems like that when I forgot to change the output in mpc from system default.

Changing to haali or vmr9 fixed it.

Lux Delux
22nd September 2008, 02:24
Yikes it seems it was the ffdshow build I was using, reverted to an older one had the output fine then.

Sorry for the false alarm :o

LoRd_MuldeR
22nd September 2008, 02:29
Yikes it seems it was the ffdshow build I was using, reverted to an older one had the output fine then.

Sorry for the false alarm :o

You should report that in the ffdshow thread:
http://forum.doom9.org/showthread.php?t=120465&page=214

Ranguvar
22nd September 2008, 04:53
NOTICE: I have switched back over to Arch GNU+Linux. Therefore, unless someone is willing to help me learn to cross-compile for Windows, I will either stop making builds, or may eventually put XP in a virtual machine and do it from there. :)

gruntster
22nd September 2008, 05:05
EDIT: I currently use Git version 1.5.6.1.1071.g76fb and I get the "join: command not found" warning too (but this seems to be no problem).
Installing MSYS coreutils (http://sourceforge.net/project/showfiles.php?group_id=2435&package_id=24963) should resolve this warning.

Audionut
22nd September 2008, 13:36
Revision 987

Fix deblocking + threads + AQ bug

At low QPs, with threads and deblocking on, deblocking could be improperly disabled.
Revision in which this bug was introduced is unknown; it may be as old as b_variable_qp in x264 itself.


Trafficshare Download (http://rapidshare.com/files/147408536/x264-987.rar)


Compiled with gcc 3.4.5
./configure --enable-mp4-output --extra-cflags="-march=athlon -pipe"
fprofiled.

skystrife
22nd September 2008, 13:56
x264.987.modified.exe (http://www.mediafire.com/?wozyejly3nz) - Alternate Download (http://skystrife.com/x264/x264.987.modified.exe)

Patches used:

x264_hrd_pulldown.09_interlace.diff

gcc 3.4.5 fprofiled build with -march=pentium2.

bob0r
22nd September 2008, 14:22
x264.987.modified.01.exe (http://files.x264.nl/x264.987.modified.01.exe)
libx264-64.987.modified.01.dll (http://files.x264.nl/libx264-64.987.modified.01.dll)

x264_hrd_pulldown.09_interlace.diff

komisar
22nd September 2008, 14:55
And my 987 builds also... :) (profiled, patched vaq-mod, unpatсhed as skyfire/bob0r, vfw, tuned)

http://komisar.gin.by/