Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
14th April 2014, 22:22 | #25941 | Link |
Kid for Today
Join Date: Aug 2004
Posts: 3,477
|
Goodie, goodie! Well, that seems faster than my 7850
Using monostatic ED1 LL, low debanding, CC AR LL downscaling, J3AR upscaling for chroma & luma, with my 1Ghz 7850 I'm capped to: -64x NNEDI luma for <=30fps SD to 1080p -64x NNEDI luma for <=24p >=1.85 720p@1080p -32x NNEDI luma for >=25p 720p@1080p Which model is it please? How hot does it get under load? Last edited by leeperry; 14th April 2014 at 23:03. |
14th April 2014, 22:35 | #25942 | Link | |
Registered User
Join Date: Apr 2014
Posts: 46
|
Quote:
|
|
14th April 2014, 22:41 | #25943 | Link | ||
QB the Slayer
Join Date: Feb 2011
Location: Toronto
Posts: 697
|
Quote:
Luma Doubling==> Chroma Upscaling==> Dithering==> Chroma Doubling If I have resources to spare I will toss in chroma doubling neurons (I don't pay for electricty and my system's stupid cool) just to make sure I am getting every last drop of performance. Quote:
Code:
if (srcHeight < 481) and (deintFps < 31) "SD 30fps" elseif (srcHeight < 541) and (deintFps < 26) "1/2 HD" elseif (srcHeight < 541) and (deintFps < 31) "1/2 HD 30fps" elseif (srcHeight < 721) and (deintFps < 26) "720p24" elseif (srcHeight < 721) and (deintFps < 31) "720p30" elseif (srcHeight < 721) and (deintFps > 31) "720p60" else "1080p24" QB
__________________
Last edited by QBhd; 15th April 2014 at 00:24. |
||
14th April 2014, 23:03 | #25944 | Link | |
Kid for Today
Join Date: Aug 2004
Posts: 3,477
|
Quote:
Are you using PCI-E 2.0 or 3.0? What's your ASIC quality in GPU-Z BTW? |
|
15th April 2014, 05:12 | #25948 | Link |
Registered User
Join Date: Dec 2011
Posts: 180
|
Not sure if mentioned previously.
Currently running a bay trail z3740 tablet, stable (that is, until it start throttling due to temperature xD).24fps 1080p downscale to 1200 x 800 using quick sync/DXVA decode, DXVA downscaling, random diethering and FRC. Havent had a chance to get a profile for color management, hopefully it can handle it when 3dlut is added. struggles with 1080i, drop frame if deinterlanced to 60fps, repeated frame if trade quality option "half frames" is used. Last edited by baii; 15th April 2014 at 05:18. |
15th April 2014, 07:38 | #25949 | Link |
Registered User
Join Date: May 2008
Posts: 1,840
|
madshi: Do you have any plans for using ivtc with 50/59/60 fps sources? It can be a large performance gain, in my case would allow nnedi3 doubling of 720p. I have some clips of 23, 25, 29 and 59 progressive in 720p59 if they are of any use.
__________________
PC: FX-8320 GTS250 HTPC: G1610 GTX650 PotPlayer/MPC-BE LAVFilters MadVR-Bicubic75AR/Lanczos4AR/Lanczos4AR LumaSharpen -Strength0.9-Pattern3-Clamp0.1-OffsetBias2.0 |
15th April 2014, 12:17 | #25950 | Link | |
Registered User
Join Date: Jan 2014
Posts: 216
|
Quote:
Lots of stuff I have from Fox HDTV is 720p59.94 but actually 720p23.976 filmed. |
|
15th April 2014, 12:32 | #25951 | Link | |
Registered Developer
Join Date: Mar 2010
Location: Hamburg/Germany
Posts: 10,348
|
Quote:
Just toggle the content type with Ctrl-Alt-Shift-T to "Film", and it may just work.
__________________
LAV Filters - open source ffmpeg based media splitter and decoders |
|
15th April 2014, 14:14 | #25953 | Link |
Registered User
Join Date: Mar 2007
Posts: 934
|
It's probably quite high on madshi's priority list. Combined with profiles it'd mean settings don't need to manually changed per video any more.
__________________
TV Setup: LG OLED55B7V; Onkyo TX-NR515; ODroid N2+; CoreElec 9.2.7 |
15th April 2014, 14:43 | #25954 | Link | |||||||||||||||||
Registered Developer
Join Date: Sep 2006
Posts: 9,140
|
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
4K master -> Blu-Ray downscale -> 720p downscale -> 1440p upscale All the downscales were done with linear interpolation. Which means that each pixel also contains a small portion of the original neighboring pixels. The best way to watch such movies is to upscale them with a good upscaling algorithm. This will get you nearer to the way the image looked in its original resolution. If you don't believe me, try this: (1) Take a sharp and detailed photo. (2) Downscale it with your favorite image editor to 50%, by using a good downscaling algorithm (e.g. Cubic or Lanczos). (3) Upscale it again 200% to get back to the original resolution. Now for step (3) try Nearest Neighbor scaling and compare it to e.g. Lanczos. Check which upscaled image looks nearer to the original photo. This test is very valid for video playback, too. After all you don't just want to see what is in the video file, you want to see an image which is as near to the original film scan as possible, don't you? Quote:
Quote:
Quote:
Quote:
Quote:
PCIe version is only important for AMD users. Quote:
Quote:
Yes, we do need that. Unfortunately it's not that easy to implement properly. Especially if we want to take mixed sources (e.g. film content with video overlay) into account. Quote:
Basically the old madVR builds did this: Code:
for each video frame do { clTargetTex = clCreateFromD3D9TextureNV(...); clEnqueueAcquireD3D9ObjectsNV(clTargetTex); clSetKernelArg(clTargetTex); clEnqueueNDRangeKernel(...); clEnqueueReleaseD3D9ObjectsNV(clTargetTex); clFinish(...); clReleaseMemObject(clTargetTex); } The latest madVR builds now use the following approach instead, which works around the issue: Code:
clTargetTex = clCreateFromD3D9TextureNV(...); for each video frame do { clEnqueueAcquireD3D9ObjectsNV(clTargetTex); clSetKernelArg(clTargetTex); clEnqueueNDRangeKernel(...); clEnqueueReleaseD3D9ObjectsNV(clTargetTex); clFinish(...); } clReleaseMemObject(clTargetTex); |
|||||||||||||||||
15th April 2014, 14:50 | #25955 | Link |
Registered Developer
Join Date: Sep 2006
Posts: 9,140
|
Here's a new test build set for AMD users wanting to do NNEDI3:
http://madshi.net/madVRinteropTest.rar In the rar file are two madVR.ax files which use different methods to try to improve the interop problem. Unfortunately the improvement is probably not as large as I had hoped, but there should be a small improvement at least. Probably one build will work better than the other build. Please try both and let me know which build works better for you. I've intentionally removed the rendering times from the OSD (only for these test builds, of course) because due to the way these 2 test builds work, judging them by looking at the rendering times would be misleading. So please judge these builds by testing which build allows you to use higher/more quality settings. Looking forward to your feedback! (FWIW, I've concentrated on NNEDI3 luma doubling, with NNEDI3 chroma upscaling and NNEDI3 chroma doubling disabled. Enabling those might still work, but I've not tested that.) |
15th April 2014, 15:31 | #25958 | Link | |
Registered User
Join Date: Mar 2007
Posts: 934
|
Quote:
v0.87.9: 35-40 dropped frames per second; render queue is 1-2/8; present queue is 0-1/8; GPU load ~95% Test 1: 1-2 dropped frames per second; render & present queues are 0-4/8 or 1-5/8 typically; GPU load ~80% Test 2: 0 dropped frames per second; render & present queues are 4-7/8 or 5-8/8 typically; GPU load ~82% Test 2 seems the best for me. Still can't use 32 neurons though, I get a dropped frame every few seconds and GPU usage rises to 89%.
__________________
TV Setup: LG OLED55B7V; Onkyo TX-NR515; ODroid N2+; CoreElec 9.2.7 |
|
15th April 2014, 15:54 | #25959 | Link | |
Registered User
Join Date: Sep 2012
Posts: 77
|
Quote:
Tested the new builds on 480 -> 1080 (+J3AR) content in FSE (new path), which gave me about ~8-10 dropped frames per second even with 16 neurons before. TestBuild1 - Seems to work smoothly up to 64 neurons, 128 starts to give loads of presentations glitches and the playback stutters quite a lot, but it doesn't report any dropped frames, thou. GPU load is stuck at ~63%. TestBuild2 - Seems smooth up to 128 (!) neurons with no dropped frames or presentation glitches, ~64% GPU load. Setting it to 256 neurons puts 99% load on the GPU and I'm starting to get frame drops. The improvement overall looks very large to me, TB2 is a beast. Could you implement these two in your OpenCL benchmark? I'd really like to see the raw numbers Great work! |
|
15th April 2014, 16:24 | #25960 | Link | |
Registered User
Join Date: Oct 2012
Posts: 7,926
|
Quote:
@tesbuilds for me on a r9 270 the build 1 is "faster" i tested 256 neuron 480p23 to 1080p. with the old build it is impossible with both new builds it works but with test 2 all queue drop but no frame is dropped. with test1 all queue fill up after some time so i think this is working better. i get 82 % gpu usage test1 and 84% with test2 both drop like crazy with opend gpu-z so they should't be judge with gpu-z |
|
Tags |
direct compute, dithering, error diffusion, madvr, ngu, nnedi3, quality, renderer, scaling, uhd upscaling, upsampling |
|
|