Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
23rd May 2003, 21:42 | #81 | Link | |
Registered User
Join Date: Oct 2001
Location: Gainesville FL USA
Posts: 2,092
|
Quote:
I don't know what VectorPath is, but I assume it's costly from the rest of it. What do you think of requiring SSE instead of SSEMMX for those 2 functions of mine that need bsr? That should eliminate the Athlon's but accept P3 & P4. And it's a one line change. - Tom |
|
23rd May 2003, 21:59 | #82 | Link |
Retired AviSynth Dev ;)
Join Date: Nov 2001
Location: Dark Side of the Moon
Posts: 3,480
|
VectorPath is AMD's word for complex "emulated" instructions, that usually requires several cycles to decode and execute.
Nic's Athlon XP has SSE, so it will be used there. It is however impressive that the "if" code can execute faster than the assembler.
__________________
Regards, sh0dan // VoxPod |
24th May 2003, 05:21 | #83 | Link | |
Registered User
Join Date: Oct 2001
Location: Gainesville FL USA
Posts: 2,092
|
Quote:
- Tom |
|
24th May 2003, 14:33 | #89 | Link |
Moderator
Join Date: Oct 2001
Location: England
Posts: 3,285
|
Sorry didn't check marcfd's override param, I always change it in the d2v...ill go ammend that now..
-Nic edit: Ok, that bugs fixed, please re-download: (updated links always at start of thread) http://nic.dnsalias.com/MPEG2Dec3v106.zip SourceCode: http://nic.dnsalias.com/MPEG2Dec3v106_src.zip pps BTW: Skal's iDCT is faster, but only slightly, you may not notice a difference, but its the fastest iDCT ive ever seen, so its included for completeness Last edited by Nic; 24th May 2003 at 14:46. |
24th May 2003, 15:32 | #90 | Link | |
Registered User
Join Date: May 2002
Location: Czech rep.
Posts: 390
|
Quote:
|
|
24th May 2003, 19:33 | #92 | Link |
Registered User
Join Date: Sep 2002
Posts: 10
|
Skal's iDCT is faster
Are you sure ? DVD zone2 : BLADE ( 3 min ) ~ Asus P4G8x ~ P4 2.53°2.89 ~ 512mo AviSynth 2.51 RC4 ~ Xvid's Koepi 14/05/2003 ~ VDubMOD 1.4.13 v2 (for debug .log) MPEG2Source("D:\...\.d2v",idct=X ).Crop(4,72,-8,-76).Trim(99500,104000) BicubicResize(608,256,0,0.5) idct=1 ___ 83 sec ___ 54.22 fps ___ 34.9 mo idct=2 ___ 82 sec ___ 54.88 fps ___ 34.9 mo idct=3 ___ 89 sec ___ 50.56 fps ___ 34.9 mo idct=4 ___ 133 sec ___ 33.83 fps ____ 35 mo idct=5 ___ 80 sec ___ 56.25 fps ___ 34.9 mo idct=6 ___ 82 sec ___ 54.88 fps ___ 34.9 mo idct=7 ___ 86 sec ___ 52.33 fps ___ 34.9 mo
__________________
~~~~~~ AviSynth's Guide : ~~~~~~ http://avisynth.unite-video.com/ ~~~~~~ French Forum : ~~~~~~ http://unite-video.com/phpbb/portal.php |
24th May 2003, 19:41 | #93 | Link | |
Registered User
Join Date: May 2002
Location: Czech rep.
Posts: 390
|
Quote:
I have AthlonXP and idct=6 (Skal's SSEMMX) is about +2% really faster than idct=2 (SSEMMX) for me. Of course - without Crop, Trim, etc. and with 'null - test speed' in xvid. |
|
24th May 2003, 22:24 | #96 | Link |
Registered User
Join Date: Sep 2002
Location: Seattle
Posts: 551
|
Is there a way to force an idct by placement of some kind of overriding text file with your preferred idct in the same dir as mpeg2dec3? I don't like the idea of tinkering with my .d2v, I'd rather have an automated solution for the future. The older versions used to have this feature . . . .
|
25th May 2003, 07:30 | #97 | Link | |
Registered User
Join Date: May 2002
Location: Czech rep.
Posts: 390
|
Quote:
Mpeg2Source("...", idct=6) |
|
25th May 2003, 09:12 | #98 | Link |
Moderator
Join Date: Oct 2001
Location: England
Posts: 3,285
|
@ctrl-alt-supr: "Are you sure?"
I did write for a reason: "Skal's iDCT is faster, but only slightly, you may not notice a difference, but its the fastest iDCT ive ever seen, so its included for completenes." And yes I am sure. -Nic |
25th May 2003, 12:10 | #99 | Link |
Newbie Forum Mod
Join Date: Aug 2002
Location: way too deep in (cyber)space
Posts: 2,436
|
wow... you got me confused now (:
what's supposed to be faster on a PIV now? idct=5 or 6? regards steVe
__________________
Search the forum, read the forum rules once more and use the search function on doom9.org before posting! oh btw my amazon.de wishlist |
25th May 2003, 12:34 | #100 | Link |
Moderator
Join Date: Oct 2001
Location: England
Posts: 3,285
|
@killingspree: Hmmm, 5 (SSE2) probably will be, for now I wouldn't start using 6. The speed difference is small and I havent tested it fully yet.
(when I tested it, I just left it doing random iDCTs and comparing it to other iDCTs, and it is faster, but it seems in MPEG-2 Decoding it doesnt make any real difference, still its there for completeness, ill test its accuracy properly soon) -Nic ps Im almost tempted to use 7 (simpleidct) myself...even though tests have shown that iDCT accuracy doesn't seem to make any real kind of a difference to the eye when mpeg-2 decoding, maybe it will make a difference to the encoder (even if very very tiny)...Hmmm, I may have to check. |
|
|