Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
#63 | Link |
Registered User
Join Date: Jan 2014
Posts: 2,418
|
Uploaded to GitHub as well. Code cleanup and further build enhancements: not now.
https://github.com/pinterf/amDCT
#65 | Link | |
Registered User
Join Date: Dec 2005
Location: Sweden
Posts: 721
|
Quote:
|
|
#66 | Link | |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,885
|
Quote:
I find them faster, both on Intel and AMD.
__________________
@turment on Telegram |
#68 | Link |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,885
|
Would it take much effort to extend it to HBD and RGB?
I am testing it in 8 bits, but it doesn't have the precision I would like. I have a small issue with this script:
SetMemoryMax()
SetCacheMode(0)
SetFilterMTMode("DEFAULT_MT_MODE", 2)
LoadPlugin("D:\Eseguibili\Media\DGDecNV\DGDecodeNV.dll")
DGSource("M:\In\Monster ~BDGE\1-1-01.dgi",ct=0,cb=0,cl=240,cr=240)
z_ConvertFormat(resample_filter="spline64", pixel_type="yuv420p16")
Resize8(720,540, kernel="z_Spline36Resize", noring=true, noring_c=true, fullc=true)
z_ConvertFormat(resample_filter="spline64",dither_type="error_diffusion",pixel_type="YUV420P16")
fmtc_bitdepth (bits=8,dmode=8)
amDCT(adapt=31,shift=5,quality=4,quant=9,matrix=8,expand=16,sharpWpos=6,sharpTpos=7,sharpWAmt=24,sharpTAmt=31,brightStart=205,brightAmt=30)
Prefetch(2,6)
With it I get:
amDCT: out of memory
__________________
@turment on Telegram
Last edited by tormento; 3rd December 2024 at 16:48.
#69 | Link | |
Registered User
Join Date: Jan 2014
Posts: 2,418
|
Quote:
On the other hand, doesn't amDCT have to be the first filter in the chain, so that it can catch the original 8x8 blocks before any resize or other operation? Or are you not using the feature of the plugin where this matters?
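A quick check on the script from post #68, as a sketch only. The 1.4.2 changelog in this thread says that clips with non-mod8 width or height used to fail with "out of memory", and that script resizes to 720x540 before calling amDCT. The helper names below (`is_mod8`, `pad_to_mod8`) are made up for illustration and are not part of amDCT or Avisynth; whether this is really what triggered that particular error is an assumption.

```python
BLOCK = 8  # amDCT works on 8x8 blocks, per the discussion above


def is_mod8(width, height):
    """Return True if both dimensions are multiples of 8."""
    return width % BLOCK == 0 and height % BLOCK == 0


def pad_to_mod8(size):
    """Return the smallest multiple of 8 that is >= size."""
    return (size + BLOCK - 1) // BLOCK * BLOCK


if __name__ == "__main__":
    w, h = 720, 540  # target size of the Resize8 call in post #68
    print("mod8:", is_mod8(w, h))
    print("next mod8 size:", pad_to_mod8(w), "x", pad_to_mod8(h))
```

With these numbers the width passes but the height does not, so a target such as 720x544 (or a crop to 720x536) would satisfy the check.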
#70 | Link |
Registered User
Join Date: Jan 2014
Posts: 2,418
|
New release 1.4.2.
https://github.com/pinterf/amDCT/releases
This changeset was a long, hard process, mainly involving assembler rewrites and tests, plus many small items. It took about another 30-35 hours of work, so I'm taking a rest now before further cleanups.
Code:
## Changelog
- (20241210) v1.4.2
  - Rewrite all external assembly codes (fdct, idct, h263 and mpeg quant-dequants) to Intel intrinsics.
    It's now quicker - sometimes significantly - than the original.
  - Source: changed Windows specific threading code into C++17 version.
  - Source cleanup: removed lots of never used test codes from the source, rewrite some others. Move to cpp.
  - Add ability to pass Avisynth+ frame properties
  - Add support for any 8 bit planar YUV(A) or Y format (was: YV12 only)
  - Copy A alpha plane as well, if exists. (The filter works only on luma channel, other planes are simply copied)
  - Fix: add meaningful error message (Issue #2) for clips with non-mod8 width or height dimensions (was: out of memory)
  - Add Clang-cl LLVM build option, make source Clang friendly
  - Speedup examples:

| qtype | 32 bit clangcl | 32 bit msvc | 32 bit old 1.3 | 64 bit clangcl |
|-------|----------------|-------------|----------------|----------------|
| 1     | 6.12 fps       | 5.49 fps    | 5.40 fps       | 6.56 fps       |
| 2     | 6.67 fps       | 5.93 fps    | 5.28 fps       | 7.08 fps       |
| 3     | 4.09 fps       | 3.61 fps    | 3.22 fps       | 4.29 fps       |
| 4     | 6.66 fps       | 5.98 fps    | 5.29 fps       | 7.16 fps       |

Last edited by pinterf; 10th December 2024 at 13:20.
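To put the speedup table in relative terms, here is a small sketch that turns the 32-bit fps figures into percentage gains over the old 1.3 build. The fps values are copied from the table; the dictionary keys and function names are only illustrative. The 64-bit column is left out because the table gives no 64-bit baseline for 1.3.

```python
# fps figures copied from the 32-bit columns of the changelog table
FPS = {
    1: {"clangcl": 6.12, "msvc": 5.49, "old_1_3": 5.40},
    2: {"clangcl": 6.67, "msvc": 5.93, "old_1_3": 5.28},
    3: {"clangcl": 4.09, "msvc": 3.61, "old_1_3": 3.22},
    4: {"clangcl": 6.66, "msvc": 5.98, "old_1_3": 5.29},
}


def speedup_percent(new_fps, old_fps):
    """Relative gain of new_fps over old_fps, in percent."""
    return (new_fps / old_fps - 1.0) * 100.0


def table_speedups(fps=FPS):
    """Map qtype -> (clangcl gain, msvc gain) versus the old 1.3 build."""
    return {
        q: (
            speedup_percent(row["clangcl"], row["old_1_3"]),
            speedup_percent(row["msvc"], row["old_1_3"]),
        )
        for q, row in fps.items()
    }


if __name__ == "__main__":
    for q, (clang, msvc) in table_speedups().items():
        print(f"qtype {q}: clangcl {clang:+.1f}%, msvc {msvc:+.1f}%")
```

Read this way, the 32-bit clang-cl build is roughly 13% to 27% faster than 1.3 in these examples, and the 32-bit MSVC build roughly 2% to 13%.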
#73 | Link |
Broadcast Encoder
Join Date: Nov 2013
Location: Royal Borough of Kensington & Chelsea, UK
Posts: 3,190
|
i7 6700 4c/8th on Windows XP Professional x86
source: 1920x1080 Full HD YV12 8-bit planar BT709 SDR
tormento's settings:
amDCT(adapt=31,shift=5,quality=4,quant=9,matrix=8,expand=16,sharpWpos=6,sharpTpos=7,sharpWAmt=24,sharpTAmt=31,brightStart=205,brightAmt=30)
old version: 0.7 fps
new version: 1.0 fps
Looks like 1.4.2 rocks.
Thank you for supporting Windows XP with the "x86_from_2019" builds, by the way. It may only have up to SSE4.2, but it definitely feels it.
Last edited by FranceBB; 26th January 2025 at 19:32.
#76 | Link |
Registered User
Join Date: Jul 2015
Posts: 848
|
amDCT(adapt=31,shift=5,quality=4,quant=9,matrix=8,expand=16,sharpWpos=6,sharpTpos=7,sharpWAmt=24,sharpTAmt=31,brightStart=205,brightAmt=30)
What am I doing wrong?
Assertion failed: IsClip(), file interface.cpp, line 827
Last edited by Jamaika; 27th January 2025 at 21:15.
#77 | Link |
Broadcast Encoder
Join Date: Nov 2013
Location: Royal Borough of Kensington & Chelsea, UK
Posts: 3,190
|
Nope, x86_from_2019 works just fine; however, x86_msvc fails to load as it's missing the following kernel functions:
ReleaseSRWLockExclusive
ReleaseSRWLockShared
AcquireSRWLockExclusive
AcquireSRWLockShared
GetThreadId
Here's the x86_from_2019 build instead, which shows everything as green and indeed loads just fine.
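For anyone who wants to reproduce this kind of check without a dependency viewer, here is a rough sketch. The SRW lock functions were introduced with Windows Vista, which is consistent with the x86_msvc build not loading on XP. The function names `missing_exports` and `kernel32_has` are hypothetical helpers, not part of any tool mentioned in this thread; the probe relies on Python's `ctypes.WinDLL`, which exists only on Windows, so the last part is skipped elsewhere.

```python
import ctypes
import sys

# Imports reported as unresolved on Windows XP in the post above.
REQUIRED = [
    "ReleaseSRWLockExclusive",
    "ReleaseSRWLockShared",
    "AcquireSRWLockExclusive",
    "AcquireSRWLockShared",
    "GetThreadId",
]


def missing_exports(has_export, names=REQUIRED):
    """Return the names for which has_export(name) is false, in order."""
    return [name for name in names if not has_export(name)]


def kernel32_has(name):
    """True if kernel32.dll exports `name`. Windows only."""
    kernel32 = ctypes.WinDLL("kernel32")
    # ctypes raises AttributeError for an unknown export, so hasattr works.
    return hasattr(kernel32, name)


if __name__ == "__main__" and sys.platform == "win32":
    print("missing from kernel32:", missing_exports(kernel32_has))
```

On a system where the list comes back empty, a build that imports these functions should at least get past the loader; a non-empty list points to the same problem described above.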
#78 | Link | |
Registered User
Join Date: Jan 2014
Posts: 2,418
|
Quote:
EDIT: Re-uploaded to GitHub: package updated with WinXP builds. Let's hope for the best.
Last edited by pinterf; 29th January 2025 at 10:08. Reason: updated release pack
#80 | Link |
Registered User
Join Date: Jan 2012
Location: Mesopotamia
Posts: 2,611
|
In case pinterf didn't receive the GitHub notification: https://github.com/pinterf/amDCT/issues/3
__________________
See My Avisynth Stuff |