Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
#63 | Link |
Registered User
Join Date: Jan 2014
Posts: 2,358
|
Uploaded to Github as well. Code cleanup and further build enhancements: not now.
https://github.com/pinterf/amDCT
#65 | Link | |
Registered User
Join Date: Dec 2005
Location: Sweden
Posts: 721
|
Quote:
|
|
#66 | Link | |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,850
|
Quote:
I find them faster, both on Intel and AMD.
__________________
@turment on Telegram |
|
#68 | Link |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,850
|
Would it take much effort to extend it to HBD and RGB?
I am testing it in 8 bits, but it doesn't have the precision I would like. I have a small issue:
Code:
SetMemoryMax()
SetCacheMode(0)
SetFilterMTMode("DEFAULT_MT_MODE", 2)
LoadPlugin("D:\Eseguibili\Media\DGDecNV\DGDecodeNV.dll")
DGSource("M:\In\Monster ~BDGE\1-1-01.dgi",ct=0,cb=0,cl=240,cr=240)
z_ConvertFormat(resample_filter="spline64", pixel_type="yuv420p16")
Resize8(720,540, kernel="z_Spline36Resize", noring=true, noring_c=true, fullc=true)
z_ConvertFormat(resample_filter="spline64",dither_type="error_diffusion",pixel_type="YUV420P16")
fmtc_bitdepth (bits=8,dmode=8)
amDCT(adapt=31,shift=5,quality=4,quant=9,matrix=8,expand=16,sharpWpos=6,sharpTpos=7,sharpWAmt=24,sharpTAmt=31,brightStart=205,brightAmt=30)
Prefetch(2,6)
Where I get:
amDCT: out of memory
__________________
@turment on Telegram
Last edited by tormento; 3rd December 2024 at 16:48.
#69 | Link | |
Registered User
Join Date: Jan 2014
Posts: 2,358
|
Quote:
On the other hand, doesn't amDCT have to be the first filter in the chain, so that it can catch the original 8x8 blocks before any resize or other operation? Or are you not using the feature of the plugin where this matters?
|
#70 | Link |
Registered User
Join Date: Jan 2014
Posts: 2,358
|
New release 1.4.2.
https://github.com/pinterf/amDCT/releases

This changeset was a hard and long process, mainly involving assembler rewrites and tests. And the many small items. It took about another 30-35 hours of work, so I'm just taking a rest now before the further cleanups.

Code:
## Changelog
- (20241210) v1.4.2
  - Rewrite all external assembly codes (fdct, idct, h263 and mpeg quant-dequants) to Intel intrinsics. It's now quicker - sometimes significantly - than the original.
  - Source: changed Windows specific threading code into C++17 version.
  - Source cleanup: removed lots of never used test codes from the source, rewrite some others. Move to cpp.
  - Add ability to pass Avisynth+ frame properties
  - Add support for any 8 bit planar YUV(A) or Y format (was: YV12 only)
  - Copy A alpha plane as well, if exists. (The filter works only on luma channel, other planes are simply copied)
  - Fix: add meaningful error message (Issue #2) for clips with non-mod8 width or height dimensions (was: out of memory)
  - Add Clang-cl LLVM build option, make source Clang friendly
  - Speedup examples:

| qtype | 32 bit clangcl | 32 bit msvc | 32 bit old 1.3 | 64 bit clangcl |
|-------|----------------|-------------|----------------|----------------|
| 1     | 6.12 fps       | 5.49 fps    | 5.40 fps       | 6.56 fps       |
| 2     | 6.67 fps       | 5.93 fps    | 5.28 fps       | 7.08 fps       |
| 3     | 4.09 fps       | 3.61 fps    | 3.22 fps       | 4.29 fps       |
| 4     | 6.66 fps       | 5.98 fps    | 5.29 fps       | 7.16 fps       |

Last edited by pinterf; 10th December 2024 at 13:20.
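For readers who hit the old "out of memory" message: the changelog ties it to clips whose width or height is not a multiple of 8. The script posted earlier in the thread resizes to 720x540 before calling amDCT, and 540 is not divisible by 8, so that is a likely trigger there (an inference from the changelog, not something confirmed in the thread). Below is a minimal Python sketch of the check. The function names are hypothetical and are not part of amDCT or AviSynth; the code only illustrates the arithmetic.

```python
def is_mod(value: int, modulus: int = 8) -> bool:
    """Return True if value is an exact multiple of modulus."""
    return value % modulus == 0


def nearest_mod_sizes(value: int, modulus: int = 8) -> tuple[int, int]:
    """Return the closest multiples of modulus at or below and at or above value."""
    lower = (value // modulus) * modulus
    upper = lower if lower == value else lower + modulus
    return lower, upper


def check_dimensions(width: int, height: int, modulus: int = 8) -> list[str]:
    """Collect a readable message for each dimension that is not a multiple of modulus."""
    problems = []
    for name, value in (("width", width), ("height", height)):
        if not is_mod(value, modulus):
            lower, upper = nearest_mod_sizes(value, modulus)
            problems.append(
                f"{name} {value} is not mod{modulus}; try {lower} or {upper}"
            )
    return problems


if __name__ == "__main__":
    # 720x540 is the Resize8 target from the script quoted earlier in the thread.
    for line in check_dimensions(720, 540):
        print(line)
```

With 720x540 the width passes and only the height is reported, with 536 and 544 as the nearest mod8 values. Whether to crop or resize to one of those depends on the aspect ratio you need.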