Alliance for Open Media codecs - Page 112

LigH · 15th July 2020, 14:54

New uploads: (MSYS2; MinGW32 / MinGW64: GCC 10.1.0)

AOM v2.0.0-608-gcae201b6c

rav1e 0.3.0 (01052fc / 2020-07-15)

dav1d 0.7.1 (1317e61 / 2020-07-15)

marcomsousa · 17th July 2020, 13:39

Youtube is giving me AV1 videos without changing configuration on a fresh Windows 10 installed in Chrome without login in Youtube on a i5-1035G1 CPU - 499€ Laptop.

Justing clicking on the first 5 videos from the main page. 3 are AV01 at 1080p and the other VP9.

hbbs · 17th July 2020, 13:54

Since AOM AV1 2.0.0 was released a couple of months ago. Have anyone seen a YouTube AV01 video encoded using it?

I always thought AV01 meant AOM 1.X.X

Sent from my Moto Z3 Play using Tapatalk

marcomsousa · 17th July 2020, 14:01

AV01 = AOM (AV1)
The next AV2, would be AV02 in youtube.

Also Is normal that Google isn't executing exacly the same version as in the Web. Normal a big company adds small changes to source code for specific Google Platform or backporting fixes.

Note: libaom 2.0.0 is nothing related with AV02 is just an improved version ov AV01 encoder.

hbbs · 17th July 2020, 14:14

Quote:

Originally Posted by marcomsousa

AV01 = AOM (AV1) - the version encoded will not be shared (Also, is normal they adds small changes to source code for specific Google Platform)

The next AV2, would be AV02 in youtube.

Thanks for replying it.

Since you mentioned AV2. Now that VVC is out. Is there anything shared about the developments of AV2?

I remember reading somewhere that Apple was pushing AV2 since they joined AOM late in the game.

Sent from my Moto Z3 Play using Tapatalk

nevcairiel · 17th July 2020, 14:22

AV2 development has practically only just started, it'll be a while before anything substantial can be said about it.

mzso · 18th July 2020, 08:52

Quote:

Originally Posted by nevcairiel

AV2 development has practically only just started, it'll be a while before anything substantial can be said about it.

Will they do the same old? Stick with the basics and make it more convoluted and computation heavy?

(Somehow I doubt they'll try something different, like Daala did with lapped transforms)

benwaggoner · 18th July 2020, 22:26

Quote:

Originally Posted by mzso

Will they do the same old? Stick with the basics and make it more convoluted and computation heavy?

(Somehow I doubt they'll try something different, like Daala did with lapped transforms)

More years of patents will have expired, so there are likely patented techniques they couldn't use in AV1 which are now available to use in AV2. Plus they have experience seeing what limitations there are holding back encoders and decoders, and they can engineer around those. Getting more HW decoder vendors involved early could help a LOT. MPEG codecs get way, way more input on how to optimize bitstream design to allow for low-cost HW implementations than AV1 did.

For all the complexity of VVC on the encode side, from everything I've heard it'll still be able to have cheaper, simpler decoders than AV1 can.

mzso · 26th July 2020, 18:47

Quote:

Originally Posted by benwaggoner

More years of patents will have expired, so there are likely patented techniques they couldn't use in AV1 which are now available to use in AV2. Plus they have experience seeing what limitations there are holding back encoders and decoders, and they can engineer around those. Getting more HW decoder vendors involved early could help a LOT. MPEG codecs get way, way more input on how to optimize bitstream design to allow for low-cost HW implementations than AV1 did.

For all the complexity of VVC on the encode side, from everything I've heard it'll still be able to have cheaper, simpler decoders than AV1 can.

One would hope that there's something better out there than making decades old heritage ever more complicated. Just because wavelets and lapped transforms didn't quite work out, it doesn't mean there isn't a better way to encode video.

Honestly, creating another similar (but more complicated) format AV2 seems really pointless at this point. Either create something revolutionary or keep optimizing AV1 encoding/decoding. AVC has been around for many years and will remain for many more. MPEG-2 and ASP didn't die out yet either.

marcomsousa · 26th July 2020, 20:06

AV1 will continue to be optimized for the next 5-10 years without changing format.

Changing format is AV2. The development format process will takes 2 to 3 years+5 years for HW decoder+encoder. So developing the new AV2 can be started anytime now.

If software patents expired in 20 years, so AV2 can have a lot of old technics.

Marco Sousa

benwaggoner · 27th July 2020, 02:12

Quote:

Originally Posted by mzso

One would hope that there's something better out there than making decades old heritage ever more complicated. Just because wavelets and lapped transforms didn't quite work out, it doesn't mean there isn't a better way to encode video.

Honestly, creating another similar (but more complicated) format AV2 seems really pointless at this point. Either create something revolutionary or keep optimizing AV1 encoding/decoding. AVC has been around for many years and will remain for many more. MPEG-2 and ASP didn't die out yet either.

Well, the thing is that iterations and refinement of block-based frequency transform coding keep on showing bigger potential gains than the Big Idea alternative transforms. Some of this could be because of the momentum of R&D around the traditional stuff. Or it could be that we luckily hit upon the right essential transform that balances spatial and temporal prediction better than available alternatives. Arguably modern compression is reallly "just" elaborations of JPEG and H.261. But oh, how elaborate!

mzso · 27th July 2020, 15:03

Quote:

Originally Posted by benwaggoner

Well, the thing is that iterations and refinement of block-based frequency transform coding keep on showing bigger potential gains than the Big Idea alternative transforms. Some of this could be because of the momentum of R&D around the traditional stuff. Or it could be that we luckily hit upon the right essential transform that balances spatial and temporal prediction better than available alternatives. Arguably modern compression is reallly "just" elaborations of JPEG and H.261. But oh, how elaborate!

"Or it could be that we luckily hit upon the right essential transform that balances spatial and temporal prediction better than available alternatives."

I highly doubt it. I think it's more likely the illusion of familiarity and the blinders that come with it.

"Some of this could be because of the momentum of R&D around the traditional stuff."

It certainly seems like that alternatives got only limited efforts on them.

benwaggoner · 27th July 2020, 20:47

Quote:

Originally Posted by mzso

"Or it could be that we luckily hit upon the right essential transform that balances spatial and temporal prediction better than available alternatives."

I highly doubt it. I think it's more likely the illusion of familiarity and the blinders that come with it.

"Some of this could be because of the momentum of R&D around the traditional stuff."

It certainly seems like that alternatives got only limited efforts on them.

Yeah. It's really hard to disprove the hypothesis that there could be better fundamental ways of encoding.

That said, wavelets sure got a lot of attention for image and motion coding. Good for images, but no one figured out an efficient motion compensation strategy for it.

Daala had a lot of really intriguing notions, but the most interesting stuff in it never really got to a promising proof of concept. Sure, maybe with 10 years of 1000 engineers something could be found. Any alternative transforms have to compete with decades of refinement of block-based frequency coding.

A lot of promising ideas get figured out how to port into a block-based structure. For example, HEVC's transform skip mode can make anime, graphics, and text way easier to encode at low bitrates and high quality. So new features, like have been seen in VVC and AV1, can get included as tools. Arguably, once you have 64x64 or bigger blocks, you've pretty much got all the advantages of wavelet coding already, within a block based model. And intra-frame prediction brings a lot of the potential value of fractal encoding.

One exciting thing (to me at least) about Daala that didn't make it into AV1 was doing frequency-domain prediction, so there was no need to rasterize a frame that wasn't going to get displayed, and dithering didn't need to be included in quantization. It didn't work out for reasons I don't quite recall.

nhw_pulsar · 29th July 2020, 20:17

Hello,

Thank you for your nice comment about wavelets for images (as I have also made a wavelet image codec, called NHW...).

You said: "but no one figured out an efficient motion compensation strategy for it". Do you think this is this aspect that prevents organizations such as Alliance for Open Media from starting and supporting a wavelet codec?

Cheers,
Raphael

foxyshadis · 1st August 2020, 11:13

Quote:

Originally Posted by benwaggoner

Arguably, once you have 64x64 or bigger blocks, you've pretty much got all the advantages of wavelet coding already, within a block based model.

I remember in the late 90's, first working with codecs in code, thinking that 8x8 must be some kind of fundamental limit of DCT, and wavelets must be superior since they can go from 128x128 all the way to 4096x4096 in JPEG2000. No, it turns out engineers were just excited about the new hotness instead of extending the old battleaxe, DCT, plus it would take at least until SSE2 to really be able to optimize transforms larger than 8x8.

I still think that *lets, curvelets, ridgelets, etc, could help further reduce still images/I-frames, but all the new prediction modes have really put a huge dent in how residuals look.

Quote:

Originally Posted by benwaggoner

One exciting thing (to me at least) about Daala that didn't make it into AV1 was doing frequency-domain prediction, so there was no need to rasterize a frame that wasn't going to get displayed, and dithering didn't need to be included in quantization. It didn't work out for reasons I don't quite recall.

From the Graveyard of Dead Tools post, it just never worked as well as spatial-domain, since it was another NP-hard idea. It's notable that most of the dead tool ideas came from audio coding, which is Monty's real wheelhouse, but Xiph still managed to push the state of the art and conjure up a real codec; I'm still waiting for a good intra paint plugin for Photoshop, because that tool is amazing.

nhw_pulsar · 2nd August 2020, 10:28

Hello @foxyshadis,

Hope that I am not trolling too much, I of course agree on the technical side with you and the other impressive reference members here, but I contacted you about Xiph as you seem to know well Monty and this organization.

Do you think Xiph can be interested in the NHW Project? Unfortunately I can not have contact with them, and maybe just like Alliance for Open Media, Xiph is not interested in NHW because it does not work for any image resolution? And that's why my submissions at Xiph and AOM are ignored? I thought that NHW could be a good project for Xiph... (that's only my opinion of course), and certainly a better fit than AOM, but maybe Xiph also only supports excellent codecs and they don't estimate that NHW is one of them?

Many thanks.
Cheers,
Raphael

LigH · 3rd August 2020, 07:43

Dead tools and video codecs and wavelets ... hmm ... I believe I still have a copy of Rududu.

nhw_pulsar · 3rd August 2020, 09:33

I did not test Rududu video codec, but I have tested the latest Rududu Image codec (RIC) and it is very good.If I remember correctly, RIC is kind of enhanced and state-of-the-art SPIHT, which is a different technology from NHW.-For the little story, when Rududu author released RIC in march 2008, I was totally blown away by its very impressive results on objective metrics like PSNR and by its very good precision, and then I realized that I could not be at that level of PSNR and precision with NHW, and so then I definitely decided to orientate NHW towards neatness and visual aspect.-

To come back on-topic, yes possibly in the late 90's with JPEG2000, wavelets were the hotness, but frankly since 2001, DCT block-based intra prediction+residual coding is really the main research focus of the industry.Wavelet compression research has been abandoned for years (by industry) actually, the last release of Dirac was in 2008, the last release of Rududu was also in 2008, Snow is around 2008, and most of the main ideas of NHW were also made in 2008...

Who believes organizations like AOM could restart wavelet compression technology today?

Cheers,
Raphael

Yups · 7th August 2020, 01:25

A new slide appeared on imgur with the media capabilities of Tigerlake-U.

Previously it was 4k60 and here it's 8k30 AV1.

NikosD · 7th August 2020, 05:15

Quote:

Originally Posted by Yups

A new slide appeared on imgur with the media capabilities of Tigerlake-U.

They have increased the speed - from 8K30 to 8K60 - of HEVC/VP9 decoder too.
They have added 12bit HEVC/VP9 decoding.
Also, that SCC of the table means Screen Content Coding and it's a HEVC profile/extension, optimized for screen captured content.
It could be used by streaming apps/services like YouTube, Skype, Zoom, Netflix etc but I don't know the real use of this extension.
And it's the first time I see this in the supported features of any decoder.

15th July 2020, 14:54	#2221 \| Link
LigH German doom9/Gleitz SuMo Join Date: Oct 2001 Location: Germany, rural Altmark Posts: 6,782	New uploads: (MSYS2; MinGW32 / MinGW64: GCC 10.1.0) AOM v2.0.0-608-gcae201b6c rav1e 0.3.0 (01052fc / 2020-07-15) dav1d 0.7.1 (1317e61 / 2020-07-15) __________________ New German Gleitz board MediaFire: x264 \| x265 \| VPx \| AOM \| Xvid

17th July 2020, 13:39	#2222 \| Link
marcomsousa Registered User Join Date: Jul 2018 Posts: 80	Youtube is giving me AV1 videos without changing configuration on a fresh Windows 10 installed in Chrome without login in Youtube on a i5-1035G1 CPU - 499€ Laptop. Justing clicking on the first 5 videos from the main page. 3 are AV01 at 1080p and the other VP9. __________________ AV1 win64 VS2019 builds Last build here

17th July 2020, 14:01	#2224 \| Link
marcomsousa Registered User Join Date: Jul 2018 Posts: 80	AV01 = AOM (AV1) The next AV2, would be AV02 in youtube. Also Is normal that Google isn't executing exacly the same version as in the Web. Normal a big company adds small changes to source code for specific Google Platform or backporting fixes. Note: libaom 2.0.0 is nothing related with AV02 is just an improved version ov AV01 encoder. __________________ AV1 win64 VS2019 builds Last build here Last edited by marcomsousa; 17th July 2020 at 14:05.

17th July 2020, 14:22	#2226 \| Link
nevcairiel Registered Developer Join Date: Mar 2010 Location: Hamburg/Germany Posts: 10,346	AV2 development has practically only just started, it'll be a while before anything substantial can be said about it. __________________ LAV Filters - open source ffmpeg based media splitter and decoders

26th July 2020, 20:06	#2230 \| Link
marcomsousa Registered User Join Date: Jul 2018 Posts: 80	AV1 will continue to be optimized for the next 5-10 years without changing format. Changing format is AV2. The development format process will takes 2 to 3 years+5 years for HW decoder+encoder. So developing the new AV2 can be started anytime now. If software patents expired in 20 years, so AV2 can have a lot of old technics. Marco Sousa __________________ AV1 win64 VS2019 builds Last build here

17th July 2020, 13:54	#2223 \| Link
hbbs Registered User Join Date: Nov 2015 Posts: 9	Since AOM AV1 2.0.0 was released a couple of months ago. Have anyone seen a YouTube AV01 video encoded using it? I always thought AV01 meant AOM 1.X.X Sent from my Moto Z3 Play using Tapatalk

29th July 2020, 20:17	#2234 \| Link
nhw_pulsar Registered User Join Date: Apr 2017 Posts: 171	Hello, Thank you for your nice comment about wavelets for images (as I have also made a wavelet image codec, called NHW...). You said: "but no one figured out an efficient motion compensation strategy for it". Do you think this is this aspect that prevents organizations such as Alliance for Open Media from starting and supporting a wavelet codec? Cheers, Raphael

2nd August 2020, 10:28	#2236 \| Link
nhw_pulsar Registered User Join Date: Apr 2017 Posts: 171	Hello @foxyshadis, Hope that I am not trolling too much, I of course agree on the technical side with you and the other impressive reference members here, but I contacted you about Xiph as you seem to know well Monty and this organization. Do you think Xiph can be interested in the NHW Project? Unfortunately I can not have contact with them, and maybe just like Alliance for Open Media, Xiph is not interested in NHW because it does not work for any image resolution? And that's why my submissions at Xiph and AOM are ignored? I thought that NHW could be a good project for Xiph... (that's only my opinion of course), and certainly a better fit than AOM, but maybe Xiph also only supports excellent codecs and they don't estimate that NHW is one of them? Many thanks. Cheers, Raphael

3rd August 2020, 07:43	#2237 \| Link
LigH German doom9/Gleitz SuMo Join Date: Oct 2001 Location: Germany, rural Altmark Posts: 6,782	Dead tools and video codecs and wavelets ... hmm ... I believe I still have a copy of Rududu. __________________ New German Gleitz board MediaFire: x264 \| x265 \| VPx \| AOM \| Xvid

3rd August 2020, 09:33	#2238 \| Link
nhw_pulsar Registered User Join Date: Apr 2017 Posts: 171	I did not test Rududu video codec, but I have tested the latest Rududu Image codec (RIC) and it is very good.If I remember correctly, RIC is kind of enhanced and state-of-the-art SPIHT, which is a different technology from NHW.-For the little story, when Rududu author released RIC in march 2008, I was totally blown away by its very impressive results on objective metrics like PSNR and by its very good precision, and then I realized that I could not be at that level of PSNR and precision with NHW, and so then I definitely decided to orientate NHW towards neatness and visual aspect.- To come back on-topic, yes possibly in the late 90's with JPEG2000, wavelets were the hotness, but frankly since 2001, DCT block-based intra prediction+residual coding is really the main research focus of the industry.Wavelet compression research has been abandoned for years (by industry) actually, the last release of Dirac was in 2008, the last release of Rududu was also in 2008, Snow is around 2008, and most of the main ideas of NHW were also made in 2008... Who believes organizations like AOM could restart wavelet compression technology today? Cheers, Raphael

7th August 2020, 01:25	#2239 \| Link
Yups Registered User Join Date: Sep 2011 Posts: 362	A new slide appeared on imgur with the media capabilities of Tigerlake-U. Previously it was 4k60 and here it's 8k30 AV1. Last edited by Yups; 7th August 2020 at 07:02.

Thread Tools	Search this Thread
Show Printable Version Email this Page	Search this Thread: Advanced Search
Display Modes
Linear Mode Switch to Hybrid Mode Switch to Threaded Mode