Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion.

Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules.

 

Go Back   Doom9's Forum > Video Encoding > VP9 and AV1

Reply
 
Thread Tools Search this Thread Display Modes
Old 23rd May 2020, 12:25   #2201  |  Link
Yups
Registered User
 
Join Date: Sep 2011
Posts: 289
Quote:
Originally Posted by Beelzebubu View Post
It says profile 0 only, which is 4:2:0 and 8+10bit only. 4:4:4 is profile 1, 12-bit is profile 2 (along with 4:2:2 for 8+10bit).

Even in this leak there is a discrepancy. In the table there is 10 bit for video but the text says it's 8 bit for video. I'm keen to believe the table in this case but not sure. In the RKL-S slide there is 12 bit AV1, it could be just a mistake (confused with 12 Bit HEVC), it could refer to still image or it could be that the Gen12LP in RKL-S uses a newer decoding unit than TGL-U.
Yups is offline   Reply With Quote
Old 23rd May 2020, 19:50   #2202  |  Link
Beelzebubu
Registered User
 
Join Date: Feb 2003
Location: New York, NY (USA)
Posts: 82
Quote:
Originally Posted by Yups View Post
Even in this leak there is a discrepancy. In the table there is 10 bit for video but the text says it's 8 bit for video.
I think you're misreading a table converted to text. The table says "profile 0 (8-bit and 10-bit 4:2:0) 4k60 video, 16k still (HW decode)". The text says "AV1 codec support<newline>Profile 0 (10-bit 4:2:0)<tab>16k (still picture)<newline>Profile 0 (8-bit 4:2:0)<tab>4k x 2k (video)". I inserted the <tab> because it matches what the table says: 4k60 (4k x 2k) video, 16k still (picture) and each supporting profile 0 (8-bit & 10-bit, 4:2:0).
Beelzebubu is offline   Reply With Quote
Old 23rd May 2020, 20:41   #2203  |  Link
Sagittaire
Testeur de codecs
 
Sagittaire's Avatar
 
Join Date: May 2003
Location: France
Posts: 2,489
Quote:
Originally Posted by Beelzebubu View Post
aomenc --denoise-noise-level= does the same as a FFT denoiser + examples/noise_model + aomenc --film-grain-table=.
Well seem not work. I have denoise but seem not have FGM in output.

You have command line exemple?
__________________
Le Sagittaire ... ;-)

1- Ateme AVC or x264
2- VP7 or RV10 only for anime
3- XviD, DivX or WMV9
Sagittaire is offline   Reply With Quote
Old 23rd May 2020, 20:54   #2204  |  Link
Beelzebubu
Registered User
 
Join Date: Feb 2003
Location: New York, NY (USA)
Posts: 82
Quote:
Originally Posted by Sagittaire View Post
Well seem not work. I have denoise but seem not have FGM in output.
Please show your commandlines.
Beelzebubu is offline   Reply With Quote
Old 23rd May 2020, 21:15   #2205  |  Link
Sagittaire
Testeur de codecs
 
Sagittaire's Avatar
 
Join Date: May 2003
Location: France
Posts: 2,489
Quote:
Originally Posted by Beelzebubu View Post
Please show your commandlines.
for exemple.

Quote:
aomenc.exe -o ToS-1000.ivf C:\ToS_1920x800_xdither.y4m --ivf --i420 --width=1920 --height=800 --fps=25000/1000 --passes=2 --pass=1 --fpf=stats.log --target-bitrate=960 --maxsection-pct=4000 --buf-sz=12000 --buf-initial-sz=8000 --buf-optimal-sz=10000 --end-usage=vbr --cpu-used=3 --verbose --tune=ssim --psnr --q-hist=30 --bit-depth=10 --bias-pct=75 --kf-max-dist=120 --aq-mode=2 --test-decode=fatal --limit=1332 --skip=214 --denoise-noise-level=25

aomenc.exe -o ToS-1000.ivf C:\ToS_1920x800_xdither.y4m --ivf --i420 --width=1920 --height=800 --fps=25000/1000 --passes=2 --pass=2 --fpf=stats.log --target-bitrate=960 --maxsection-pct=4000 --buf-sz=12000 --buf-initial-sz=8000 --buf-optimal-sz=10000 --end-usage=vbr --cpu-used=3 --verbose --tune=ssim --psnr --q-hist=30 --bit-depth=10 --bias-pct=75 --kf-max-dist=120 --aq-mode=2 --test-decode=fatal --limit=1332 --skip=214 --denoise-noise-level=25
the noise is dithering. Perhaps not adapted denoising value for that? I try with --denoise-noise-level=5,10,50 and no result.
__________________
Le Sagittaire ... ;-)

1- Ateme AVC or x264
2- VP7 or RV10 only for anime
3- XviD, DivX or WMV9

Last edited by Sagittaire; 23rd May 2020 at 21:19.
Sagittaire is offline   Reply With Quote
Old 24th May 2020, 03:42   #2206  |  Link
Beelzebubu
Registered User
 
Join Date: Feb 2003
Location: New York, NY (USA)
Posts: 82
Quote:
Originally Posted by Sagittaire View Post
for exemple.



the noise is dithering. Perhaps not adapted denoising value for that? I try with --denoise-noise-level=5,10,50 and no result.
Dithering should be fine. Does --help include a description for --denoise-noise-level? It might be built without CONFIG_DENOISE (in which case the --help output will be missing also). Otherwise I'm not entirely sure, it has worked fine for me. Does it work on 8-bit material?
Beelzebubu is offline   Reply With Quote
Old 25th May 2020, 23:58   #2207  |  Link
Sagittaire
Testeur de codecs
 
Sagittaire's Avatar
 
Join Date: May 2003
Location: France
Posts: 2,489
Quote:
Originally Posted by Beelzebubu View Post
Dithering should be fine. Does --help include a description for --denoise-noise-level? It might be built without CONFIG_DENOISE (in which case the --help output will be missing also). Otherwise I'm not entirely sure, it has worked fine for me. Does it work on 8-bit material?
well seem work only if you have noise_model.exe in same directory than aomenc.exe ... ?
__________________
Le Sagittaire ... ;-)

1- Ateme AVC or x264
2- VP7 or RV10 only for anime
3- XviD, DivX or WMV9
Sagittaire is offline   Reply With Quote
Old 26th May 2020, 01:50   #2208  |  Link
benwaggoner
Moderator
 
Join Date: Jan 2006
Location: Portland, OR
Posts: 3,596
Quote:
Originally Posted by soresu View Post
Isn't all this pointless when hardly any TVs actually support 12 bit though?
Certainly for the moment, yes. Perhaps 12-bit could become a default with VVC or AV2.
__________________
Ben Waggoner
Principal Video Specialist, Amazon Prime Video

My Compression Book
benwaggoner is offline   Reply With Quote
Old 26th May 2020, 17:42   #2209  |  Link
soresu
Registered User
 
Join Date: May 2005
Location: Swansea, Wales, UK
Posts: 184
Oh man, the new Cortex X1 CPU core has twice the NEON units of the A78 and previous Axx cores.

At 4x 128 bit units that's some real grunt for an off the shelf design, should be noice for encoding.
soresu is offline   Reply With Quote
Old 17th June 2020, 09:41   #2210  |  Link
mzso
Registered User
 
Join Date: Oct 2009
Posts: 901
What kind of beast of a computer would you need to play 8k AV1?
I wanted to check out a few 8k videos on Youtube, but I accidentally downloaded AV1 streams, and the videos ate up my R5 1600. Even VP9 did...
I wonder if any mere mortal will have such a machine in the next decade.

Last edited by mzso; 28th June 2020 at 16:30.
mzso is offline   Reply With Quote
Old 17th June 2020, 11:28   #2211  |  Link
ChaosKing
Registered User
 
Join Date: Dec 2005
Location: Germany
Posts: 1,467
Beast machine? You used the wrong player!
In my browser it is almost smooth, with some dopped frames
In mpv it is perfect with ~ 55% cpu

My pc: ryzen 2600 (no oc) + gtx 1070
8K Video: https://www.youtube.com/watch?v=ChOhcHD8fBA
__________________
AVSRepoGUI // VSRepoGUI - Package Manager for AviSynth // VapourSynth
VapourSynth Portable FATPACK || VapourSynth Database || https://github.com/avisynth-repository
ChaosKing is offline   Reply With Quote
Old 17th June 2020, 11:36   #2212  |  Link
huhn
Registered User
 
Join Date: Oct 2012
Posts: 6,901
your GPU can decode this video...
huhn is offline   Reply With Quote
Old 17th June 2020, 12:58   #2213  |  Link
ChaosKing
Registered User
 
Join Date: Dec 2005
Location: Germany
Posts: 1,467
gtx 1070 does not have a HW AV1 decoder...
And mpv uses the dav1d decoder which works on cpu only.
__________________
AVSRepoGUI // VSRepoGUI - Package Manager for AviSynth // VapourSynth
VapourSynth Portable FATPACK || VapourSynth Database || https://github.com/avisynth-repository
ChaosKing is offline   Reply With Quote
Old 17th June 2020, 16:52   #2214  |  Link
nevcairiel
Registered Developer
 
Join Date: Mar 2010
Location: Hamburg/Germany
Posts: 10,072
My 7900X (OC) can also play that video in 8K without drops, at 40% usage at most - straight in Chrome 83. Make sure its not your GPU that struggles downscaling the 8K video.

With Ryzen making 8 cores available to "mere mortals" for relatively low pricing, I don't think you need any particular "beast" right now, nevermind the next decade.
__________________
LAV Filters - open source ffmpeg based media splitter and decoders
nevcairiel is offline   Reply With Quote
Old 18th June 2020, 02:04   #2215  |  Link
benwaggoner
Moderator
 
Join Date: Jan 2006
Location: Portland, OR
Posts: 3,596
Quote:
Originally Posted by nevcairiel View Post
My 7900X (OC) can also play that video in 8K without drops, at 40% usage at most - straight in Chrome 83. Make sure its not your GPU that struggles downscaling the 8K video.

With Ryzen making 8 cores available to "mere mortals" for relatively low pricing, I don't think you need any particular "beast" right now, nevermind the next decade.
How well are the fast SW decoders scaling with multiple cores? VP9 struggled to get much parallelism due to some unfortunate serialization in the loop filters and the lack of a clear reference structure that allowed for parallel decoding of non-reference frames.

AV1 is certainly going to be more parallizable than VP9. But how far have the best decoders gone so far?
__________________
Ben Waggoner
Principal Video Specialist, Amazon Prime Video

My Compression Book
benwaggoner is offline   Reply With Quote
Old 18th June 2020, 04:00   #2216  |  Link
huhn
Registered User
 
Join Date: Oct 2012
Posts: 6,901
Quote:
Originally Posted by ChaosKing View Post
gtx 1070 does not have a HW AV1 decoder...
And mpv uses the dav1d decoder which works on cpu only.
the video is as far as i can see only VP9 8 bit.
if there is a AV1 version i could try it with an R7 3700X which could yield up to ~2.5x more performance then a r5 2600 in this special case
Quote:
Originally Posted by benwaggoner View Post
How well are the fast SW decoders scaling with multiple cores? VP9 struggled to get much parallelism due to some unfortunate serialization in the loop filters and the lack of a clear reference structure that allowed for parallel decoding of non-reference frames.

AV1 is certainly going to be more parallizable than VP9. But how far have the best decoders gone so far?
this doesn't seam to be the case anymore or it's not as bad as it was in the past. while the microsoft VP9 decoder has issues using more then 8 threads ffmpeg in lavfilter doesn't have this issues easily using 70-90% CPU usage on a 8 core 16 thread CPU. it sometimes falls to 50% or lower CPU usage but with usually over 100 FPS.

Last edited by huhn; 18th June 2020 at 04:51.
huhn is offline   Reply With Quote
Old 18th June 2020, 08:17   #2217  |  Link
ChaosKing
Registered User
 
Join Date: Dec 2005
Location: Germany
Posts: 1,467
I think I activated AV1 on YT, it was under /testtube or so. Just google it.
I used youtube-dl.exe to download the video, it was the av1 version.

@nevcairiel I use Vivaldi and have currently over 100 tabs open xD It can have an effect on performance


Quote:
Originally Posted by benwaggoner View Post
AV1 is certainly going to be more parallizable than VP9. But how far have the best decoders gone so far?
https://www.phoronix.com/scan.php?pa....7-Performance

Over 300fps for 4k is not bad
I hope 10bit decoding will be faster on ARM soon too. My 4k fire tv stick struggles with 1080p 10bit video
__________________
AVSRepoGUI // VSRepoGUI - Package Manager for AviSynth // VapourSynth
VapourSynth Portable FATPACK || VapourSynth Database || https://github.com/avisynth-repository

Last edited by ChaosKing; 18th June 2020 at 08:24.
ChaosKing is offline   Reply With Quote
Old 18th June 2020, 09:26   #2218  |  Link
huhn
Registered User
 
Join Date: Oct 2012
Posts: 6,901
i'm still confused by YTDL doesn't showing it before. but this make far more sense now to me.
the stream is very easy to decode i get about 40~ FPS with 50% CPU usage using lavfilter from mpc-hc 1.9.4.
huhn is offline   Reply With Quote
Old 18th June 2020, 11:41   #2219  |  Link
soresu
Registered User
 
Join Date: May 2005
Location: Swansea, Wales, UK
Posts: 184
Quote:
Originally Posted by ChaosKing View Post
I hope 10bit decoding will be faster on ARM soon too. My 4k fire tv stick struggles with 1080p 10bit video
10 bit isn't likely to get much better on ARM64 as most of the NEON asm has been written at this point - the problem is that all Fire TV products use 32 bit ARM Android as a base due to laziness on their part IMHO, therefore you won't get as much performance out of it as a phone or tablet with the same HW spec.

They probably will fill out the ARM32 NEON asm over time, though I'm not sure if the performance will match the ARM64 equivalent code path.

10 bit on x86 is another matter entirely - as far as I am aware there are zero SIMD assembly optimisations currently for AVX2, SSSE3 or SSE2 where 10+ bpc video is concerned.
soresu is offline   Reply With Quote
Old 18th June 2020, 13:35   #2220  |  Link
Beelzebubu
Registered User
 
Join Date: Feb 2003
Location: New York, NY (USA)
Posts: 82
Quote:
Originally Posted by benwaggoner View Post
How well are the fast SW decoders scaling with multiple cores? VP9 struggled to get much parallelism due to some unfortunate serialization in the loop filters and the lack of a clear reference structure that allowed for parallel decoding of non-reference frames.

AV1 is certainly going to be more parallizable than VP9. But how far have the best decoders gone so far?
I don't think this is true. From a decoder's point-of-view, the reference structure (most notably the cross-frame entropy dependency) and postfilter dependency (tile-crossing) between VP9 and AV1 are the same, and equally parallelizable. dav1d uses the same techniques for frame threading as FFmpeg's native VP9 decoder (ffvp9) and achieves siimilar concurrency multipliers as dav1d. Tile threading works the same. See e.g. graphs at https://youtu.be/WgfklAi50nM?t=1378 (VP9 vs. AV1) and https://youtu.be/WgfklAi50nM?t=345 (AV1 tile vs. frame vs. both multithreading), and note that these results are over a year old, dav1d has surpassed ffhevc (SW) decoding speed since then. See also the documentation for the threading model in dav1d.

The practical problem in ffvp9 is that it decided (to fit in FFmpeg's more static design) to only allow one threading type (frame or tile) instead of multiple concurrently (frame and tile) like dav1d does. That's the only reason dav1d scales better with multi-threading. We could have resolved that, but it was decided that ffvp9 was fast enough and it wasn't worth it.

(I can explain libaom's and libvpx' threading models if you want to learn more, but since they are a subset of dav1d/ffvp9, I was assuming this would be enough. I'm not familiar with gav1's threading model.)
Beelzebubu is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 20:11.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, vBulletin Solutions Inc.