Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
![]() |
#1 | Link |
Registered User
Join Date: Dec 2005
Location: Sweden
Posts: 688
|
Deep Denoise Temporal
Trying to find nn denoisers in the temporal domain.
Here is one that seems interesting: https://github.com/clausmichele/ViDeNN Has anyone here tried it or has any other suggestions of temporal nn denoisers? Edit: more links https://github.com/ZhouYiiFeng/TDMS-Net Last edited by anton_foy; 14th May 2023 at 17:15. |
![]() |
![]() |
![]() |
#2 | Link | ||
Registered User
Join Date: Sep 2007
Posts: 5,285
|
Quote:
Ok results so far on the test sets , but nothing spectacular No strength adjustment Sometimes there is a loss of a frame , I think based on the way it works if there is a scene change, such as in the table tennis set. You have to be careful if you're using PSNR or metrics to align the original vs. results Has CPU mode fallback , but no tiling options Older smaller neural net, relatively faster than larger NN. eg VRT deniosing model about 5-6x larger and more than that many times slower https://github.com/JingyunLiang/VRT Quote:
|
||
![]() |
![]() |
![]() |
#3 | Link | |
Registered User
Join Date: Dec 2005
Location: Sweden
Posts: 688
|
Thanks for that!
Quote:
|
|
![]() |
![]() |
![]() |
#4 | Link |
Registered User
Join Date: Aug 2002
Location: Italy
Posts: 292
|
We strongly suggest to check the User-friendly, Fast, Self-Supervised Image Denoising for All project:
https://github.com/royerlab/aydin/
__________________
HYbrid Multimedia Production Suite page @ GitHub |
![]() |
![]() |
![]() |
#5 | Link | |
Registered User
Join Date: Sep 2007
Posts: 5,285
|
Quote:
No adjustments for ViDeNN (other than model choice) , it's slower than mdegrain derivatives. VRT has adjustments for temporal frame : width: height tiling, otherwise you'd need massive amounts of GPU memory, and strength 0-50 . Speed is very, very, very, very slow. It's got to be the one of slowest NN processing algorithms (not just for denoising) The resulting quality depends on the noise type for the pretrained models. Both tend to do better on gaussian type noise, not on stuff like large grain, because that's what they were trained on. If you look at the table tennis example in ViDeNN, mdegrain derivatives typically won't do very well on that type of noise, but ViDeNN will do ok, and scunet will do great (even though it's spatial, there will be no temporal inconsistencies in that particular case) |
|
![]() |
![]() |
![]() |
#6 | Link |
Registered User
Join Date: Dec 2005
Location: Sweden
Posts: 688
|
@PatchWorks
Thanks looks interesting, it says also that it is spatiotemporal. Must look into, but it only seems to take still sequencies. @Poisondeathray Interesting that scunet doesn't leave those temporal inconsistencies yet probably because high ISO and low light footage has too little information especially in the darker parts while the tennis clip has added noise that does not distort in that way. Edit: Using Davinci Resolve's NR alot at work and while it smears (motionblurs) alot it has a great ability at low settings to remove lighter temporal fluctuations and blotches from bad compression. Last edited by anton_foy; 16th May 2023 at 17:00. |
![]() |
![]() |
![]() |
#7 | Link | |
Registered User
Join Date: Sep 2007
Posts: 5,285
|
Quote:
But on that specific table tennis sequence it does not, and you don't even need to add other smoothing afterwards. ViDNN OTOH does leave some temporal fluctuations on that sequence even though it's temporal. Scunet retains more detail and is cleaner, more temporally stable than ViDNN on that sequence . I mention the table tennis sequence because that's the highlight presented for ViDNN Last edited by poisondeathray; 16th May 2023 at 17:12. |
|
![]() |
![]() |
![]() |
#8 | Link | |
Registered User
Join Date: Dec 2005
Location: Sweden
Posts: 688
|
Quote:
|
|
![]() |
![]() |
![]() |
#9 | Link |
Registered User
Join Date: Sep 2007
Posts: 5,285
|
Yes - All the NN algorithms are several orders slower than mvtools2 derivatives or typical avs denoisers. The temporal ones like VRT can be exponentially slower unless you have a quadro with 48GB GPU memory
Scunet is only good for specific types of noise with the pretrained models, but it falls in the "remarkable" category. Very useful tool, sometimes as a prefilter for some avs/vpy filters . It makes you wonder how it would behave with custom training sets |
![]() |
![]() |
![]() |
#10 | Link |
Registered User
Join Date: Dec 2005
Location: Sweden
Posts: 688
|
I found this clip where they are using what they call "Overlap Loss" for temporal consistency:
https://youtu.be/LA747HTukTQ |
![]() |
![]() |
![]() |
#11 | Link | |
Registered User
Join Date: Sep 2007
Posts: 5,285
|
Quote:
Only paper link , no test code published that I could find . Might have to wait a few months, sometimes people eventually publish an implementation of a paper on github |
|
![]() |
![]() |
![]() |
#12 | Link | |
Registered User
Join Date: Dec 2005
Location: Sweden
Posts: 688
|
Quote:
and test code but it is only for temporal consistencies and I have no idea how useful it is for real footage. Edit: the kinetics videos used there are broken. Edit: and this and this but it is for superres. Last edited by anton_foy; 18th July 2023 at 08:33. |
|
![]() |
![]() |
![]() |
#13 | Link | |||
HeartlessS Usurer
Join Date: Dec 2009
Location: Over the rainbow
Posts: 10,877
|
I know little to nothing of A.I. generative techniques, but thought that this link to a Tom'sHARDWARE article,
posted via CodeProject might be of interest to some, might even give hints to potential problems in other A.I. disciplines. Generative AI Goes 'MAD' When Trained on AI-Created Data Over Five Times:- https://www.tomshardware.com/news/ge...ver-five-times Quote:
Quote:
Quote:
![]() (a sort of 'A.I. ringing') Perhaps pertinent for eg, Super Resolution generative techniques.
__________________
I sometimes post sober. StainlessS@MediaFire ::: AND/OR ::: StainlessS@SendSpace "Some infinities are bigger than other infinities", but how many of them are infinitely bigger ??? Last edited by StainlessS; 18th July 2023 at 15:15. |
|||
![]() |
![]() |
![]() |
#14 | Link | |
Registered User
Join Date: Dec 2005
Location: Sweden
Posts: 688
|
Quote:
![]() Last edited by anton_foy; 19th July 2023 at 06:56. |
|
![]() |
![]() |
![]() |
#15 | Link | ||
Registered User
Join Date: Aug 2002
Location: Italy
Posts: 292
|
Sorry for the delay in reply, we're quite busy.
Quote:
Quote:
__________________
HYbrid Multimedia Production Suite page @ GitHub |
||
![]() |
![]() |
![]() |
#16 | Link | |
Registered User
Join Date: Dec 2005
Location: Sweden
Posts: 688
|
Quote:
|
|
![]() |
![]() |
![]() |
#17 | Link |
Registered User
Join Date: Dec 2005
Location: Sweden
Posts: 688
|
Ok this is a shader but it seems interesting and I wonder if it can be used to reduce noise instead of just AA. Only previous frame for motion vectors though but it looks quite stable and the nvidia paper linked in the dropboxlink mentions reduced noise and reduced ghosting. Maybe possible to tweak or replace the AA part with spatial denoising (since AA is spatial in it self)?
Link: https://www.shadertoy.com/view/MscSD7 Last edited by anton_foy; 31st August 2023 at 10:08. |
![]() |
![]() |
![]() |
#18 | Link |
Registered User
Join Date: Jul 2018
Posts: 1,000
|
It looks current cameras have enough effective internal denoisers (and/or pro shooting conditions provide enough light) so development of good denoisers for processing after shooting close to stopped.
So developers of 'neural networks' make very little progress in this way. In previous decades when we had a big market for consumer video cameras of medium and poor (cheap) quality - there was higher demand on post-shooting denoise (but PCs were slow). So it looks the era of best development of denoise solutions is spreaded in some years of ending of consumer video cameras and beginning of not very slow PCs at home. Currently even if some business will invest in development of denoise solution - there is no market for it to return the investment (like was in the old times of Neat Video products for consumer market). And pro denoise solutions for video cameras are closed in the production of video cameras for both pro and smartphone video cameras. The RAW shooting 'video modes' of consumer photo cameras may expect good lighting conditions and not shipped with good RAW denoise post-processing software. And typical large-sensor consumer photo-cameras do not have enough processing hardware resources onboard to perform good temporal denoise. |
![]() |
![]() |
![]() |
#19 | Link |
Registered User
Join Date: Dec 2005
Location: Sweden
Posts: 688
|
Another interesting thing perhaps:
https://sreyas-mohan.github.io/udvd/ https://github.com/sreyas-mohan/udvd Up to 5 frames temporal radius. |
![]() |
![]() |
![]() |
#20 | Link | |
Registered User
Join Date: Sep 2007
Posts: 5,285
|
Quote:
udvd gets about 2-3db lower than VRT/RVRT and ReVid . PSNR isn't the best metric, and certainly doesn't measure any temporal component (unlike VMAF), but that delta is large enough that any metric will show similar trend. ReVid is the top on every dataset there, but there is no public code or model available yet . https://paperswithcode.com/task/video-denoising Some of the older papers have other comparisons like Neat Video for the dB included in the results chart, but it's not clear how Neat Video was used, or if it was used properly. Neat does surprisingly poorly, I suspect not used correctly. Might be fun to run current Neat video or some avs/vpy denoisers on one of the known datasets to compare https://github.com/m-tassano/dvdnet#psnrs-set8-testset I think currently VRT/RVRT are probably the best "working" ones, with configurable settings, ... but soooooooo slow . And you really need to train your own models for the best results, unless your sources are similar to DAVIS |
|
![]() |
![]() |
![]() |
Thread Tools | Search this Thread |
Display Modes | |
|
|