23rd August 2016, 20:19 | #401 | Link |
Registered User
Join Date: Jan 2016
Posts: 98
|
Let me rephrase my thought: in the same search at videolan, I thought I saw SSE2 and/or AVX optimizations of the SATD calculation for 8, 10 (maybe 12) bit depths.
My question is then: no matter what the original clip's bit depth is, could you use a “simplified” 8, 10 or 12 bit temporary version of the clip for the motion estimation (reusing the x265 SATD asm optimizations)?
Edit: By the way, thanks for the explanation.
Last edited by VS_Fan; 23rd August 2016 at 20:32. |
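To make the question concrete, a reduced bit depth temporary copy could be produced by something like the C++ sketch below. It is only an illustration: `toEightBit` is a hypothetical helper, not part of mvtools or x265, and it assumes the high bit depth samples are stored in `uint16_t`. Whether motion estimation on such a copy is accurate enough is exactly the open question, since the discarded low bits can no longer contribute to the block-matching cost.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not from mvtools or x265): reduce samples of
// `bits` significant bits (9..16, stored in uint16_t) to 8 bit.
// Rounds to nearest and clamps to 255 so the rounding cannot overflow.
std::vector<uint8_t> toEightBit(const std::vector<uint16_t> &in, int bits) {
    assert(bits > 8 && bits <= 16);

    const int shift = bits - 8;                 // number of low bits dropped
    const uint32_t round = 1u << (shift - 1);   // half of the dropped range

    std::vector<uint8_t> out(in.size());
    for (std::size_t i = 0; i < in.size(); i++) {
        const uint32_t v = (in[i] + round) >> shift;
        out[i] = static_cast<uint8_t>(v > 255 ? 255 : v);
    }
    return out;
}
```

The 8 bit copy would only be fed to the motion search; the vectors it finds would still be applied to the original high bit depth clip.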
23rd August 2016, 20:33 | #403 | Link | |
unsigned int
Join Date: Oct 2012
Location: 🇪🇺
Posts: 760
|
Quote:
__________________
Buy me a "coffee" and/or hire me to write code! |
|
23rd August 2016, 20:43 | #404 | Link | |
I'm Siri
Join Date: Oct 2012
Location: void
Posts: 2,633
|
Quote:
Why not just reimplement it in C/C++, like I did? Then it would be generic across all sample types. |
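A sketch of what "generic to all sample types" can look like in C++: one template over the sample type instead of a separate routine per bit depth. This is not feisty2's or the plugin's actual code. `satd4x4` is a hypothetical name, the block size is fixed at 4x4 for brevity, strides are assumed to be in samples rather than bytes, and the final halving is one possible normalization chosen here as an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>

// Hypothetical illustration: 4x4 SATD (sum of absolute Hadamard-transformed
// differences) for any unsigned integer sample type, e.g. uint8_t for 8 bit
// or uint16_t for 9..16 bit input. Strides are counted in samples.
template <typename PixelType>
int64_t satd4x4(const PixelType *src, std::ptrdiff_t srcStride,
                const PixelType *ref, std::ptrdiff_t refStride) {
    assert(src != nullptr && ref != nullptr);

    int32_t d[4][4];

    // Per row: take the differences, then apply a 4-point Hadamard transform.
    for (int y = 0; y < 4; y++) {
        const int32_t a0 = int32_t(src[0]) - int32_t(ref[0]);
        const int32_t a1 = int32_t(src[1]) - int32_t(ref[1]);
        const int32_t a2 = int32_t(src[2]) - int32_t(ref[2]);
        const int32_t a3 = int32_t(src[3]) - int32_t(ref[3]);

        const int32_t s0 = a0 + a1, s1 = a0 - a1;
        const int32_t s2 = a2 + a3, s3 = a2 - a3;

        d[y][0] = s0 + s2;
        d[y][1] = s1 + s3;
        d[y][2] = s0 - s2;
        d[y][3] = s1 - s3;

        src += srcStride;
        ref += refStride;
    }

    // Per column: the same transform, accumulating absolute coefficients.
    int64_t sum = 0;
    for (int x = 0; x < 4; x++) {
        const int32_t s0 = d[0][x] + d[1][x], s1 = d[0][x] - d[1][x];
        const int32_t s2 = d[2][x] + d[3][x], s3 = d[2][x] - d[3][x];

        sum += std::abs(s0 + s2) + std::abs(s1 + s3) +
               std::abs(s0 - s2) + std::abs(s1 - s3);
    }

    return sum / 2;  // assumed normalization, see the note above
}
```

The trade-off raised in this exchange still applies: plain C++ like this works for every sample type, while the hand-written asm routines are faster but tied to specific bit depths.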
|
23rd August 2016, 21:24 | #405 | Link | |
Registered User
Join Date: Jan 2016
Posts: 98
|
Quote:
Congratulations and thanks to you all on VapourSynth's 5th birthday, 2 days in advance!!! |
|
25th August 2016, 16:57 | #407 | Link | ||
I'm Siri
Join Date: Oct 2012
Location: void
Posts: 2,633
|
Quote:
Quote:
Last edited by feisty2; 25th August 2016 at 17:08. |
||
25th August 2016, 17:33 | #409 | Link | |
unsigned int
Join Date: Oct 2012
Location: 🇪🇺
Posts: 760
|
Quote:
Yes, ebooks.
Last edited by jackoneill; 25th August 2016 at 17:36. |
|
17th September 2016, 09:43 | #411 | Link |
unsigned int
Join Date: Oct 2012
Location: 🇪🇺
Posts: 760
|
I fixed the SATD functions. Please test: http://savedonthe.net/download/914/v...atd-win64.html
|
17th September 2016, 14:02 | #412 | Link |
I'm Siri
Join Date: Oct 2012
Location: void
Posts: 2,633
|
It works, but the result looks slightly different from my implementation: mine looks closer to dct=1, and yours closer to dct=0 (set thSAD to 10000 in MDeGrain and you will see the difference). Not sure why. |
23rd October 2016, 14:47 | #414 | Link |
unsigned int
Join Date: Oct 2012
Location: 🇪🇺
Posts: 760
|
v17 brings more speed for certain configurations, larger blocks, and a bug fix or two: https://github.com/dubhater/vapoursy...leases/tag/v17
Code:
* Analyse, Recalculate: Fix bug that broke 16 bit processing (patches by feisty2).
* Analyse, Recalculate: Support block sizes of 64x32, 64x64, 128x64, and 128x128.
* Analyse, Recalculate: Make dct=1..4 a bit faster on x86.
* FlowFPS, FlowInter: Add AVX2 code.
* Analyse, Recalculate: Fix SATD functions used when dct=5..10 and the input is 16 bit.
* Analyse, Recalculate: Allow dct=5..10 with blocks larger than 16x16.
|
24th October 2016, 16:14 | #419 | Link | |
unsigned int
Join Date: Oct 2012
Location: 🇪🇺
Posts: 760
|
Quote:
Code:
make distclean
./configure
make
|
|