Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
![]() |
#141 | Link |
Registered User
Join Date: Aug 2015
Posts: 276
|
8-bit video: SSE4.1 vs AVX2 vs AVX-512 (on 8C/16T Rocket Lake) - https://code.videolan.org/videolan/d..._requests/1301
Last edited by lvqcl; 23rd March 2022 at 19:46. |
![]() |
![]() |
![]() |
#142 | Link | |
Registered User
Join Date: Feb 2003
Location: New York, NY (USA)
Posts: 108
|
Quote:
Extreme example of the latter: 8-bit film grain is more than 3x as fast with AVX512 compared to AVX2. Last edited by Beelzebubu; 23rd March 2022 at 19:04. |
|
![]() |
![]() |
![]() |
#143 | Link |
Moderator
![]() Join Date: Jan 2006
Location: Portland, OR
Posts: 4,567
|
Wow, those are some very impressive speedups with AVX512! The new instructions are making at least as much of a difference than the "AVX2, but 2x wider" instructions.
Of course, Icelake CPUs don't have that much market share yet, but these kinds of speedups are quite promising in the long term for software decoding. |
![]() |
![]() |
![]() |
#145 | Link | |
Registered User
Join Date: Jun 2019
Posts: 16
|
dav1d 1.1.0 'Arctic Peregrine Falcon'
dav1d 1.1.0 was released yesterday. (Tag)
Quote:
|
|
![]() |
![]() |
![]() |
#146 | Link |
Registered User
Join Date: Mar 2004
Posts: 1,080
|
Changes for 1.2.0 'Arctic Peregrine Falcon':
------------------------------------------- - Improvements on attachments of props and T.35 entries on output pictures - NEON z1/z3 high bit-depth optimizations and improvements for 8bpc - SSSE3 z2/z3 8bpc and SSSE3 z1/z3 high bit-depth optimziations - refmvs.save_tmvs optimizations in SSSE3/AVX2/AVX-512 - AVX-512 optimizations for high bit-depth itx (16x64, 32x64, 64x16, 64x32, 64x64) - AVX2 optimizations for 12bpc for 16x32, 32x16, 32x32 itx |
![]() |
![]() |
![]() |
#147 | Link |
Registered User
Join Date: Mar 2004
Posts: 1,080
|
Changes for 1.2.1 'Arctic Peregrine Falcon':
------------------------------------------- - Fix a threading race on task_thread.init_done - NEON z2 8bpc and high bit-depth optimizations - SSSE3 z2 high bit-depth optimziations - Fix a desynced luma/chroma planes issue with Film Grain - Reduce memory consumption - Improve dav1d_parse_sequence_header() speed - OBU: Improve header parsing and fix potential overflows - OBU: Improve ITU-T T.35 parsing speed - Misc buildsystems, CI and headers fixes |
![]() |
![]() |
![]() |
Thread Tools | Search this Thread |
Display Modes | |
|
|