Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion.

Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules.

 

Go Back   Doom9's Forum > Capturing and Editing Video > Avisynth Development

Reply
 
Thread Tools Search this Thread Display Modes
Old 8th January 2011, 00:15   #41  |  Link
Malcolm
Registered User
 
Join Date: Sep 2002
Location: Germany
Posts: 352
@Wilbert
I will do that. But i would like to work some things out first!
I don't think your 6800LE will be capable of executing OpenCL. it's from around 2004 if i'm right. That's long, long ago... And even if, the performance would probably be not much better that TNLMeans.
Malcolm is offline   Reply With Quote
Old 8th January 2011, 00:51   #42  |  Link
TheRyuu
warpsharpened
 
Join Date: Feb 2007
Posts: 787
Quote:
Originally Posted by Wilbert View Post
Could you release the source code so that the community can enjoy your magic?

I have a GeFore 6800LE, is that good enough? Nvidia driver 61.76.
Only 8 series or higher supports OpenCL.

Edit: on a side note v 0.1.1 actually works for me. I did get a green bar on the bottom of the screen with non-mod16 resolution. I also get a green bar on the right side of the screen with plane=4 at 1280x720.

With plane=0 about 4-8 pixels on the right side turn black.

Edit2: padding like 16 pixels (addborders) on each side then cropping later seems to work.

Last edited by TheRyuu; 8th January 2011 at 01:00.
TheRyuu is offline   Reply With Quote
Old 8th January 2011, 18:14   #43  |  Link
naoan
Registered User
 
Join Date: Oct 2009
Posts: 151
Well I played around with smf parameters but it still won't work.

Here's the full XML export of my GPU, radeon HD4850 http://www.mediafire.com/?f4s4ktmocfk2hot

Hope that could help!
naoan is offline   Reply With Quote
Old 8th January 2011, 20:36   #44  |  Link
tormento
Acid fr0g
 
tormento's Avatar
 
Join Date: May 2002
Location: Italy
Posts: 2,580
Major issues when trying to load avs in VirtualDub. Same errors as some seen here.

P.S: x64 version please =)
__________________
@turment on Telegram

Last edited by tormento; 8th January 2011 at 20:57.
tormento is offline   Reply With Quote
Old 8th January 2011, 22:14   #45  |  Link
Malcolm
Registered User
 
Join Date: Sep 2002
Location: Germany
Posts: 352
Quote:
Originally Posted by naoan View Post
Here's the full XML export of my GPU, radeon HD4850 http://www.mediafire.com/?f4s4ktmocfk2hot
Hope that could help!
Yes, that helped.
Seems like all RV770 users are out of business at the moment.
The AMD OpenCL drivers do not suppport images.
Same issue as GoodzMastaJ.
Please see below for a snippet of the full XML export from GPU Caps Viewer.
Code:
CL_DEVICE_NAME="ATI RV770"
CL_DEVICE_VENDOR="Advanced Micro Devices, Inc."
CL_DRIVER_VERSION="CAL 1.4.900"
CL_DEVICE_PROFILE="FULL_PROFILE"
CL_DEVICE_VERSION="OpenCL 1.0 ATI-Stream-v2.3 (451)"
CL_DEVICE_TYPE="GPU"
CL_DEVICE_IMAGE_SUPPORT="NO"
CL_DEVICE_IMAGE2D_MAX_WIDTH="0"
CL_DEVICE_IMAGE2D_MAX_HEIGHT="0"
@tormento
Please provide the contents of the debug file and the xml export of GPU Caps Viewer. Otherwise i can't help.

@all
I have uploaded version 0.1.2.
See the changelog in my first post for details.

@Didée
Version 0.1.2 should fix you error with the dots spread all over the screen! Can you give me feedback if this version works for you? Thanks.

Last edited by Malcolm; 8th January 2011 at 23:28. Reason: uploaded version 0.1.2
Malcolm is offline   Reply With Quote
Old 9th January 2011, 00:51   #46  |  Link
TheRyuu
warpsharpened
 
Join Date: Feb 2007
Posts: 787
Getting a weird issue with regards to setting h = 1.0

Setting it to 1.0 will cause it to not work (gives error) but 0.9, 1.1 both work (any other value works I do believe).

Some tests someone I know did (credits to mirkosp):
Quote:
nlmeanscl(h=1.0000001) <<< works
nlmeanscl(h=1.00000001) <<< gives error
nlmeanscl(h=1.00000006) <<< works
nlmeanscl(h=1.00000005) <<< gives error
Edit: more random stuff
Quote:
nlmeanscl(h=1.0000000596046449899999990) <<< will work
nlmeanscl(h=1.000000059604644989999999) <<< gives error
19:14 < mirkosp> that's the limit, btw

Last edited by TheRyuu; 9th January 2011 at 01:16.
TheRyuu is offline   Reply With Quote
Old 9th January 2011, 01:30   #47  |  Link
Malcolm
Registered User
 
Join Date: Sep 2002
Location: Germany
Posts: 352
Quote:
Originally Posted by TheRyuu View Post
Getting a weird issue with regards to setting h = 1.0
Confirmed. It's a simple bug. Already fixed and will be gone with the next version.
Malcolm is offline   Reply With Quote
Old 9th January 2011, 11:35   #48  |  Link
tormento
Acid fr0g
 
tormento's Avatar
 
Join Date: May 2002
Location: Italy
Posts: 2,580
Here you are. I noticed even GPU Caps Viewer crashes in CL test. An encoding is going now, I'll try to change drivers version ASAP and report you.

PHP Code:
===================================================
GPU Caps Viewer v1.9.4
http
://www.ozone3d.net/gpu_caps_viewer/
===================================================


===================================[ 
System CPU ]
CPU NameIntel(RCore(TMi7 CPU         920  2.67GHz
CPU Core Speed3600 MHz
CPU Num Cores8
FamilyModel10 Stepping4
Physical Memory Size4087 MB
Operating SystemWindows 7 64-bit build 7600 [No Service Pack]
DirectX Version10.0
PhysX Version9100514


===================================[ Graphics Adapter GPU ]
SLIdisabled
GPUs1
Logical GPUs1
OpenGL RendererGeForce GTX 460/PCI/SSE2
Drivers RendererNVIDIA GeForce GTX 460
DB RendererNVIDIA GeForce GTX 460
Device Description
Adapter StringGeForce GTX 460
VendorNVIDIA Corporation
Vendor ID0x10DE
Device ID0x0E22
Sub device ID0x2381
Sub vendor ID0x1462
Drivers Version8.17.12.6590 (11-25-2010) - nvoglv64
GPU CodenameGF104
GPU Unified Shader Processors336
GPU Vertex Shader Processors0
GPU Pixel Shader Processors0
TPD (Watts): 160
Video Memory Size1024 MB
Video Memory TypeGDDR5
Clocks level #0: Core: 50MHz - Memory: 135MHz - Shader: 101MHz
Clocks level #1: Core: 405MHz - Memory: 324MHz - Shader: 810MHz
Clocks level #2: Core: 405MHz - Memory: 1950MHz - Shader: 810MHz
Clocks level #3: Core: 811MHz - Memory: 1950MHz - Shader: 1622MHz
BIOS String:  70.4.1b.0.2
Current Display Mode1920x1080 60 Hz 32 bpp


===================================[ OpenGL GPU Capabilities ]
OpenGL Version4.1.0
GLSL (OpenGL Shading LanguageVersion4.10 NVIDIA via Cg compiler
ARB Texture Units4
Vertex Shader Texture Units32
Pixel Shader Texture Units32
Geometry Shader Texture Units32
Max Texture Size16384x16384
Max Anisotropic Filtering ValueX16.0
Max Point Sprite Size63.4
Max Dynamic Lights8
Max Viewport Size16384x16384
Max Vertex Uniform Components4096
Max Fragment Uniform Components2048
Max Geometry Uniform Components2048
Max Varying Float60
Max Vertex Bindable Uniforms12
Max Fragment Bindable Uniforms12
Max Geometry Bindable Uniforms12
Frame Buffer Objects (FBOSupport:[yes]
Multiple Render Targets Max draw buffers8
Pixel Buffer Objects (PBOSupport:[yes]
S3TC Texture Compression Support:[yes]
ATI 3Dc Texture Compression Support:[no]
Texture Rectangle Support:[yes]
Floating Point Textures Support:[yes]
MSAA2X
MSAA4X
MSAA8X
MSAA16X
MSAA32X
OpenGL Extensions245 extensions (GL=222 and WGL=23)
    <
li>GL_ARB_blend_func_extended</li>
    <
li>GL_ARB_color_buffer_float</li>
    <
li>GL_ARB_compatibility</li>
    <
li>GL_ARB_copy_buffer</li>
    <
li>GL_ARB_debug_output</li>
    <
li>GL_ARB_depth_buffer_float</li>
    <
li>GL_ARB_depth_clamp</li>
    <
li>GL_ARB_depth_texture</li>
    <
li>GL_ARB_draw_buffers</li>
    <
li>GL_ARB_draw_buffers_blend</li>
    <
li>GL_ARB_draw_indirect</li>
    <
li>GL_ARB_draw_elements_base_vertex</li>
    <
li>GL_ARB_draw_instanced</li>
    <
li>GL_ARB_ES2_compatibility</li>
    <
li>GL_ARB_explicit_attrib_location</li>
    <
li>GL_ARB_fragment_coord_conventions</li>
    <
li>GL_ARB_fragment_program</li>
    <
li>GL_ARB_fragment_program_shadow</li>
    <
li>GL_ARB_fragment_shader</li>
    <
li>GL_ARB_framebuffer_object</li>
    <
li>GL_ARB_framebuffer_sRGB</li>
    <
li>GL_ARB_geometry_shader4</li>
    <
li>GL_ARB_get_program_binary</li>
    <
li>GL_ARB_gpu_shader5</li>
    <
li>GL_ARB_gpu_shader_fp64</li>
    <
li>GL_ARB_half_float_pixel</li>
    <
li>GL_ARB_half_float_vertex</li>
    <
li>GL_ARB_imaging</li>
    <
li>GL_ARB_instanced_arrays</li>
    <
li>GL_ARB_map_buffer_range</li>
    <
li>GL_ARB_multisample</li>
    <
li>GL_ARB_multitexture</li>
    <
li>GL_ARB_occlusion_query</li>
    <
li>GL_ARB_occlusion_query2</li>
    <
li>GL_ARB_pixel_buffer_object</li>
    <
li>GL_ARB_point_parameters</li>
    <
li>GL_ARB_point_sprite</li>
    <
li>GL_ARB_provoking_vertex</li>
    <
li>GL_ARB_robustness</li>
    <
li>GL_ARB_sample_shading</li>
    <
li>GL_ARB_sampler_objects</li>
    <
li>GL_ARB_seamless_cube_map</li>
    <
li>GL_ARB_separate_shader_objects</li>
    <
li>GL_ARB_shader_bit_encoding</li>
    <
li>GL_ARB_shader_objects</li>
    <
li>GL_ARB_shader_precision</li>
    <
li>GL_ARB_shader_subroutine</li>
    <
li>GL_ARB_shading_language_100</li>
    <
li>GL_ARB_shading_language_include</li>
    <
li>GL_ARB_shadow</li>
    <
li>GL_ARB_sync</li>
    <
li>GL_ARB_tessellation_shader</li>
    <
li>GL_ARB_texture_border_clamp</li>
    <
li>GL_ARB_texture_buffer_object</li>
    <
li>GL_ARB_texture_buffer_object_rgb32</li>
    <
li>GL_ARB_texture_compression</li>
    <
li>GL_ARB_texture_compression_bptc</li>
    <
li>GL_ARB_texture_compression_rgtc</li>
    <
li>GL_ARB_texture_cube_map</li>
    <
li>GL_ARB_texture_cube_map_array</li>
    <
li>GL_ARB_texture_env_add</li>
    <
li>GL_ARB_texture_env_combine</li>
    <
li>GL_ARB_texture_env_crossbar</li>
    <
li>GL_ARB_texture_env_dot3</li>
    <
li>GL_ARB_texture_float</li>
    <
li>GL_ARB_texture_gather</li>
    <
li>GL_ARB_texture_mirrored_repeat</li>
    <
li>GL_ARB_texture_multisample</li>
    <
li>GL_ARB_texture_non_power_of_two</li>
    <
li>GL_ARB_texture_query_lod</li>
    <
li>GL_ARB_texture_rectangle</li>
    <
li>GL_ARB_texture_rg</li>
    <
li>GL_ARB_texture_rgb10_a2ui</li>
    <
li>GL_ARB_texture_swizzle</li>
    <
li>GL_ARB_timer_query</li>
    <
li>GL_ARB_transform_feedback2</li>
    <
li>GL_ARB_transform_feedback3</li>
    <
li>GL_ARB_transpose_matrix</li>
    <
li>GL_ARB_uniform_buffer_object</li>
    <
li>GL_ARB_vertex_array_bgra</li>
    <
li>GL_ARB_vertex_array_object</li>
    <
li>GL_ARB_vertex_attrib_64bit</li>
    <
li>GL_ARB_vertex_buffer_object</li>
    <
li>GL_ARB_vertex_program</li>
    <
li>GL_ARB_vertex_shader</li>
    <
li>GL_ARB_vertex_type_2_10_10_10_rev</li>
    <
li>GL_ARB_viewport_array</li>
    <
li>GL_ARB_window_pos</li>
    <
li>GL_ATI_draw_buffers</li>
    <
li>GL_ATI_texture_float</li>
    <
li>GL_ATI_texture_mirror_once</li>
    <
li>GL_S3_s3tc</li>
    <
li>GL_EXT_texture_env_add</li>
    <
li>GL_EXT_abgr</li>
    <
li>GL_EXT_bgra</li>
    <
li>GL_EXT_bindable_uniform</li>
    <
li>GL_EXT_blend_color</li>
    <
li>GL_EXT_blend_equation_separate</li>
    <
li>GL_EXT_blend_func_separate</li>
    <
li>GL_EXT_blend_minmax</li>
    <
li>GL_EXT_blend_subtract</li>
    <
li>GL_EXT_compiled_vertex_array</li>
    <
li>GL_EXT_Cg_shader</li>
    <
li>GL_EXT_depth_bounds_test</li>
    <
li>GL_EXT_direct_state_access</li>
    <
li>GL_EXT_draw_buffers2</li>
    <
li>GL_EXT_draw_instanced</li>
    <
li>GL_EXT_draw_range_elements</li>
    <
li>GL_EXT_fog_coord</li>
    <
li>GL_EXT_framebuffer_blit</li>
    <
li>GL_EXT_framebuffer_multisample</li>
    <
li>GL_EXTX_framebuffer_mixed_formats</li>
    <
li>GL_EXT_framebuffer_object</li>
    <
li>GL_EXT_framebuffer_sRGB</li>
    <
li>GL_EXT_geometry_shader4</li>
    <
li>GL_EXT_gpu_program_parameters</li>
    <
li>GL_EXT_gpu_shader4</li>
    <
li>GL_EXT_multi_draw_arrays</li>
    <
li>GL_EXT_packed_depth_stencil</li>
    <
li>GL_EXT_packed_float</li>
    <
li>GL_EXT_packed_pixels</li>
    <
li>GL_EXT_pixel_buffer_object</li>
    <
li>GL_EXT_point_parameters</li>
    <
li>GL_EXT_provoking_vertex</li>
    <
li>GL_EXT_rescale_normal</li>
    <
li>GL_EXT_secondary_color</li>
    <
li>GL_EXT_separate_shader_objects</li>
    <
li>GL_EXT_separate_specular_color</li>
    <
li>GL_EXT_shader_image_load_store</li>
    <
li>GL_EXT_shadow_funcs</li>
    <
li>GL_EXT_stencil_two_side</li>
    <
li>GL_EXT_stencil_wrap</li>
    <
li>GL_EXT_texture3D</li>
    <
li>GL_EXT_texture_array</li>
    <
li>GL_EXT_texture_buffer_object</li>
    <
li>GL_EXT_texture_compression_latc</li>
    <
li>GL_EXT_texture_compression_rgtc</li>
    <
li>GL_EXT_texture_compression_s3tc</li>
    <
li>GL_EXT_texture_cube_map</li>
    <
li>GL_EXT_texture_edge_clamp</li>
    <
li>GL_EXT_texture_env_combine</li>
    <
li>GL_EXT_texture_env_dot3</li>
    <
li>GL_EXT_texture_filter_anisotropic</li>
    <
li>GL_EXT_texture_integer</li>
    <
li>GL_EXT_texture_lod</li>
    <
li>GL_EXT_texture_lod_bias</li>
    <
li>GL_EXT_texture_mirror_clamp</li>
    <
li>GL_EXT_texture_object</li>
    <
li>GL_EXT_texture_shared_exponent</li>
    <
li>GL_EXT_texture_sRGB</li>
    <
li>GL_EXT_texture_swizzle</li>
    <
li>GL_EXT_timer_query</li>
    <
li>GL_EXT_transform_feedback2</li>
    <
li>GL_EXT_vertex_array</li>
    <
li>GL_EXT_vertex_array_bgra</li>
    <
li>GL_EXT_vertex_attrib_64bit</li>
    <
li>GL_IBM_rasterpos_clip</li>
    <
li>GL_IBM_texture_mirrored_repeat</li>
    <
li>GL_KTX_buffer_region</li>
    <
li>GL_NV_blend_square</li>
    <
li>GL_NV_conditional_render</li>
    <
li>GL_NV_copy_depth_to_color</li>
    <
li>GL_NV_copy_image</li>
    <
li>GL_NV_depth_buffer_float</li>
    <
li>GL_NV_depth_clamp</li>
    <
li>GL_NV_explicit_multisample</li>
    <
li>GL_NV_fence</li>
    <
li>GL_NV_float_buffer</li>
    <
li>GL_NV_fog_distance</li>
    <
li>GL_NV_fragment_program</li>
    <
li>GL_NV_fragment_program_option</li>
    <
li>GL_NV_fragment_program2</li>
    <
li>GL_NV_framebuffer_multisample_coverage</li>
    <
li>GL_NV_geometry_shader4</li>
    <
li>GL_NV_gpu_program4</li>
    <
li>GL_NV_gpu_program4_1</li>
    <
li>GL_NV_gpu_program5</li>
    <
li>GL_NV_gpu_program_fp64</li>
    <
li>GL_NV_gpu_shader5</li>
    <
li>GL_NV_half_float</li>
    <
li>GL_NV_light_max_exponent</li>
    <
li>GL_NV_multisample_coverage</li>
    <
li>GL_NV_multisample_filter_hint</li>
    <
li>GL_NV_occlusion_query</li>
    <
li>GL_NV_packed_depth_stencil</li>
    <
li>GL_NV_parameter_buffer_object</li>
    <
li>GL_NV_parameter_buffer_object2</li>
    <
li>GL_NV_pixel_data_range</li>
    <
li>GL_NV_point_sprite</li>
    <
li>GL_NV_primitive_restart</li>
    <
li>GL_NV_register_combiners</li>
    <
li>GL_NV_register_combiners2</li>
    <
li>GL_NV_shader_buffer_load</li>
    <
li>GL_NV_texgen_reflection</li>
    <
li>GL_NV_texture_barrier</li>
    <
li>GL_NV_texture_compression_vtc</li>
    <
li>GL_NV_texture_env_combine4</li>
    <
li>GL_NV_texture_expand_normal</li>
    <
li>GL_NV_texture_multisample</li>
    <
li>GL_NV_texture_rectangle</li>
    <
li>GL_NV_texture_shader</li>
    <
li>GL_NV_texture_shader2</li>
    <
li>GL_NV_texture_shader3</li>
    <
li>GL_NV_transform_feedback</li>
    <
li>GL_NV_transform_feedback2</li>
    <
li>GL_NV_vertex_array_range</li>
    <
li>GL_NV_vertex_array_range2</li>
    <
li>GL_NV_vertex_attrib_integer_64bit</li>
    <
li>GL_NV_vertex_buffer_unified_memory</li>
    <
li>GL_NV_vertex_program</li>
    <
li>GL_NV_vertex_program1_1</li>
    <
li>GL_NV_vertex_program2</li>
    <
li>GL_NV_vertex_program2_option</li>
    <
li>GL_NV_vertex_program3</li>
    <
li>GL_NVX_conditional_render</li>
    <
li>GL_NVX_gpu_memory_info</li>
    <
li>GL_SGIS_generate_mipmap</li>
    <
li>GL_SGIS_texture_lod</li>
    <
li>GL_SGIX_depth_texture</li>
    <
li>GL_SGIX_shadow</li>
    <
li>GL_SUN_slice_accum</li>
    <
li>GL_WIN_swap_hint</li>
    <
li>WGL_EXT_swap_control</li>
    <
li>WGL_ARB_buffer_region</li>
    <
li>WGL_ARB_create_context</li>
    <
li>WGL_ARB_create_context_profile</li>
    <
li>WGL_ARB_create_context_robustness</li>
    <
li>WGL_ARB_extensions_string</li>
    <
li>WGL_ARB_make_current_read</li>
    <
li>WGL_ARB_multisample</li>
    <
li>WGL_ARB_pbuffer</li>
    <
li>WGL_ARB_pixel_format</li>
    <
li>WGL_ARB_pixel_format_float</li>
    <
li>WGL_ARB_render_texture</li>
    <
li>WGL_ATI_pixel_format_float</li>
    <
li>WGL_EXT_create_context_es2_profile</li>
    <
li>WGL_EXT_extensions_string</li>
    <
li>WGL_EXT_framebuffer_sRGB</li>
    <
li>WGL_EXT_pixel_format_packed_float</li>
    <
li>WGL_NVX_DX_interop</li>
    <
li>WGL_NV_DX_interop</li>
    <
li>WGL_NV_float_buffer</li>
    <
li>WGL_NV_multisample_coverage</li>
    <
li>WGL_NV_render_depth_texture</li>
    <
li>WGL_NV_render_texture_rectangle</li>


===================================[ 
NVIDIA CUDA Capabilities ]
CUDA Device 0
    
Device nameGeForce GTX 460
    
Compute Capability2.1
    
Total Memory993 MB
    
Shader Clock Rate1622 MHz
    
Multiprocessors7
    
Warp Size32
    
Max Threads Per Block1024
    
Threads Per Block1024 x 1024 x 64
    
Grid Size65535 x 65535 x 1
    
Registers Per Block32768
    
Texture Alignment512 byte
    
Total Constant Memory64 Kb


===================================[ OpenCL Capabilities ]
Num OpenCL platforms1
NameNVIDIA CUDA
VersionOpenCL 1.0 CUDA 3.2.1
ProfileFULL_PROFILE
VendorNVIDIA Corporation
Num devices1

    
CL_DEVICE_NAMEGeForce GTX 460
    
CL_DEVICE_VENDORNVIDIA Corporation
    
CL_DRIVER_VERSION265.90
    
CL_DEVICE_PROFILEFULL_PROFILE
    
CL_DEVICE_VERSIONOpenCL 1.0 CUDA
    
CL_DEVICE_TYPEGPU
    
CL_DEVICE_VENDOR_ID0x10DE
    
CL_DEVICE_MAX_COMPUTE_UNITS7
    
CL_DEVICE_MAX_CLOCK_FREQUENCY1622MHz
    
CL_NV_DEVICE_COMPUTE_CAPABILITY_MAJOR2
    
CL_NV_DEVICE_COMPUTE_CAPABILITY_MINOR1
    
CL_NV_DEVICE_REGISTERS_PER_BLOCK32768
    
CL_NV_DEVICE_WARP_SIZE32
    
CL_NV_DEVICE_GPU_OVERLAP1
    
CL_NV_DEVICE_KERNEL_EXEC_TIMEOUT1
    
CL_NV_DEVICE_INTEGRATED_MEMORY0
    
CL_DEVICE_ADDRESS_BITS32
    
CL_DEVICE_MAX_MEM_ALLOC_SIZE254320KB
    
CL_DEVICE_GLOBAL_MEM_SIZE993MB
    
CL_DEVICE_MAX_PARAMETER_SIZE4352
    
CL_DEVICE_GLOBAL_MEM_CACHELINE_SIZE128 Bytes
    
CL_DEVICE_GLOBAL_MEM_CACHE_SIZE112KB
    
CL_DEVICE_ERROR_CORRECTION_SUPPORTNO
    
CL_DEVICE_LOCAL_MEM_TYPELocal (scratchpad)
    - 
CL_DEVICE_LOCAL_MEM_SIZE48KB
    
CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE64KB
    
CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS3
    
CL_DEVICE_MAX_WORK_ITEM_SIZES: [1024 1024 64]
    - 
CL_DEVICE_MAX_WORK_GROUP_SIZE1024
    
CL_EXEC_NATIVE_KERNEL4746436
    
CL_DEVICE_IMAGE_SUPPORTYES
    
CL_DEVICE_MAX_READ_IMAGE_ARGS128
    
CL_DEVICE_MAX_WRITE_IMAGE_ARGS8
    
CL_DEVICE_IMAGE2D_MAX_WIDTH4096
    
CL_DEVICE_IMAGE2D_MAX_HEIGHT32768
    
CL_DEVICE_IMAGE3D_MAX_WIDTH2048
    
CL_DEVICE_IMAGE3D_MAX_HEIGHT2048
    
CL_DEVICE_IMAGE3D_MAX_DEPTH16
    
CL_DEVICE_MAX_SAMPLERS16
    
CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR1
    
CL_DEVICE_PREFERRED_VECTOR_WIDTH_SHORT1
    
CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT1
    
CL_DEVICE_PREFERRED_VECTOR_WIDTH_LONG1
    
CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT1
    
CL_DEVICE_PREFERRED_VECTOR_WIDTH_DOUBLE1
    
CL_DEVICE_EXTENSIONS16
    
Extensions:
        - 
cl_khr_byte_addressable_store
        
cl_khr_icd
        
cl_khr_gl_sharing
        
cl_nv_d3d9_sharing
        
cl_nv_d3d10_sharing
        
cl_khr_d3d10_sharing
        
cl_nv_d3d11_sharing
        
cl_nv_compiler_options
        
cl_nv_device_attribute_query
        
cl_nv_pragma_unroll
        

        - 
cl_khr_global_int32_base_atomics
        
cl_khr_global_int32_extended_atomics
        
cl_khr_local_int32_base_atomics
        
cl_khr_local_int32_extended_atomics
        
cl_khr_fp64


===================================[ Misc. ]


===================================[ 
Related Graphics Drivers ]
http://www.geeks3d.com/?page_id=752
http://downloads.guru3d.com/download.php?id=10
http://www.tweakguides.com/NVFORCE_1.html
http://www.nvidia.com/object/winxp-2k_archive.html
http://www.geeks3d.com/?p=65


===================================[ Related Graphics Cards Reviews ]
http://www.geeks3d.com/?tag=geforce-gtx-460
http://www.google.us/search?q=NVIDIA+GeForce+GTX+460+review 
__________________
@turment on Telegram
tormento is offline   Reply With Quote
Old 9th January 2011, 12:51   #49  |  Link
Dogway
Registered User
 
Join Date: Nov 2009
Posts: 2,361
-Remember my post?
It was caused by source being non mod16. I made work a few images with mod4, but below mod16 performance. I havent checked video sources, but its something to look up.
-Also there are some aliasing issues on b>0 (less than tnlmeans but still)
-crashed on a 1280x688 (mod16) video source with this message. (mkv through ffmpegsource) (Edit: problem was setting h<1.1 on v. 0.1.2)
__________________
i7-4790K@Stock::GTX 1070] AviSynth+ filters and mods on GitHub + Discussion thread

Last edited by Dogway; 9th January 2011 at 17:01.
Dogway is offline   Reply With Quote
Old 9th January 2011, 18:51   #50  |  Link
tormento
Acid fr0g
 
tormento's Avatar
 
Join Date: May 2002
Location: Italy
Posts: 2,580
Ok, updated to 266.44 drivers and works.

If only I could figure how to get a good value for parameters... They are a bit obscure to me.

For example, in a 1080p x264 encoding, what should I put to mimic SMDegrain (tr=3,thSAD=300,blksize=16,overlap=8) ? I suppose not... The temporal part is missing..
__________________
@turment on Telegram

Last edited by tormento; 9th January 2011 at 19:22.
tormento is offline   Reply With Quote
Old 10th January 2011, 20:42   #51  |  Link
frustum
Registered User
 
Join Date: Sep 2010
Location: Austin, TX
Posts: 40
My understanding is that OpenCL programs can be compiled to run either on a GPU or on the host CPU. If this is true, I'm curious, Malcom, if you have tried this, and how fast the filter runs on the host CPU vs the original TNLmeans filter. How much penalty is there (if any) for writing it as OpenCL vs C if there is no GPU to help out.
frustum is offline   Reply With Quote
Old 11th January 2011, 10:21   #52  |  Link
yup
Registered User
 
Join Date: Feb 2003
Location: Russia, Moscow
Posts: 854
Malcolm for Your time.
NLMeansCL work on my 8800 GTS 512 with last Nvidia driver (260.99). Speed for SD content little faster than real time 26-27 fps.
If You find time add also Az support for Az=1 and Az=2 it may be useful for motion compensating filtering.
yup.
yup is offline   Reply With Quote
Old 12th January 2011, 00:04   #53  |  Link
Malcolm
Registered User
 
Join Date: Sep 2002
Location: Germany
Posts: 352
Quote:
Originally Posted by frustum View Post
My understanding is that OpenCL programs can be compiled to run either on a GPU or on the host CPU. If this is true, I'm curious, Malcom, if you have tried this, and how fast the filter runs on the host CPU vs the original TNLmeans filter. How much penalty is there (if any) for writing it as OpenCL vs C if there is no GPU to help out.
I have created a version that runs on the CPU. Please see my first post for explanations as well as some performance figures.

Let me explain some aspects regarding CPU vs. GPU:
- CPUs are not generally slower than CPUs. The performance does heavily depend on the algorithm that is implemented. For NLMeans, the results are clear. But that's just one example. There are algorithms that will always be slower on a GPU than on a CPU.
- You can see from my performance measurements, that using buffers instead of images is much slower. But that's only half the truth. I have a version of NLMeansCL on my harddisk, that is faster on my Geforce when using buffers instead of images! So the execution speed does also heavily depend on how memory is used and accessed inside the filter. But on the other hand this version is actually even slower on the CPU that version 0.2 when using buffers! So though OpenCL executes on GPUs as well as CPUs, you typically have to code to the exact hardware to achieve peak performance!
- One benefit of OpenCL is that it's naturally multithreaded. So if you throw in another CPU with double the number of cores (or a new gfx card), it will typically scale well. That's not true for TNLMeans.
- My current implementation is tuned for execution on GPUs. There are some micro optimizations for individual math functions and so on. For a serious CPU version of the filter, i would go over the implementation and play around with different expressions to see if they execute faster than the existing ones. But this is - as i said -a micro optimization.

@tormento
Sorry, i can't help you out with suggestions to mimic SMDegrain.

@yup
Yeah, i will see what i can do.
Malcolm is offline   Reply With Quote
Old 12th January 2011, 03:55   #54  |  Link
naoan
Registered User
 
Join Date: Oct 2009
Posts: 151
I dunno what I did wrong, but I still get the same error message as the one in earlier version with 0.2

Have tried setting buffer and cpu to true (either and both).
naoan is offline   Reply With Quote
Old 12th January 2011, 11:39   #55  |  Link
ChaosKing
Registered User
 
Join Date: Dec 2005
Location: Germany
Posts: 1,795
yep, i have problems with cpu and buffer too.
__________________
AVSRepoGUI // VSRepoGUI - Package Manager for AviSynth // VapourSynth
VapourSynth Portable FATPACK || VapourSynth Database
ChaosKing is offline   Reply With Quote
Old 12th January 2011, 11:51   #56  |  Link
Malcolm
Registered User
 
Join Date: Sep 2002
Location: Germany
Posts: 352
@naoan, ChaosKing
If anyone of you would send me the logfile....
Malcolm is offline   Reply With Quote
Old 12th January 2011, 12:20   #57  |  Link
naoan
Registered User
 
Join Date: Oct 2009
Posts: 151
Oh, sorry. Here it is (with buffer=true) :

Code:
NLMeansCL Version 0.2
ScriptEnvironment present.
Number of OpenCL Compute Platforms = 1.
Trying OpenCL Compute Platform
  Advanced Micro Devices, Inc..
  OpenCL 1.1 ATI-Stream-v2.3 (451).
OpenCL Compute Context successfully created.
Number of OpenCL Devices in Platform = 2.
Using first OpenCL Device.
  ATI RV770.
  Device available.
OpenCL Command Queue successfully created.
OpenCL error code detected: BuildProgramFailure.
The log is exactly the same with buffer=false and stop at "Device available." with cpu and buffer set at true.

Thanks again.
naoan is offline   Reply With Quote
Old 12th January 2011, 13:49   #58  |  Link
Malcolm
Registered User
 
Join Date: Sep 2002
Location: Germany
Posts: 352
Ok, i see. There are 2 problems.
The behaviour of AMDs OpenCL platform is different than i thought. So if you have an AMD gfx card, as well as the CPU as OpenCL device, then the selection of the CPU fails and it is using the GPU. I can change the selection strategy, to overcome this. (Selecting the CPU as computation device actually works if you have an Nvidia GPU). I will provide this ASAP.

The second thing is, that the compilation of the kernel fails. Since i don't own an AMD gfx card, it's nearly impossible for me to tell what the reason could be!
So far i can only tell that my implementation is correct, since it runs on Nvidia GPUs as well as the CPU.
The cause can only be that i'm using some functions that are not implemented by AMD and thus the AMD OpenCL driver refuses to compile!
Sorry for any inconvenience i have caused, by providing something which obviously doesn't run on Radeons. But i'm just trying to give you someting..
Malcolm is offline   Reply With Quote
Old 12th January 2011, 14:50   #59  |  Link
naoan
Registered User
 
Join Date: Oct 2009
Posts: 151
Hey, no worries there, in fact I'm very thankful you even try to do something about it and for making this plugin available to community for free.

I've been meaning to upgrade my GPU, and now my choice is clear, thanks to this plugin.
naoan is offline   Reply With Quote
Old 12th January 2011, 18:13   #60  |  Link
ChaosKing
Registered User
 
Join Date: Dec 2005
Location: Germany
Posts: 1,795
NLMeansCL(cpu=true) ->
My CPU: Intel Q9550

I can't find a debug log, c:\temp is empty

PHP Code:
===================================================
GPU Caps Viewer v1.9.5
http
://www.ozone3d.net/gpu_caps_viewer/
===================================================


===================================[ 
System CPU ]
CPU NameIntel(RCore(TM)2 Quad CPU    Q9550  2.83GHz
CPU Core Speed2833 MHz
CPU Num Cores4
FamilyModelStepping10
Physical Memory Size4095 MB
Operating SystemWindows 7 64-bit build 7600 [No Service Pack]
DirectX Version10.0
PhysX Version9100514


===================================[ Graphics Adapter GPU ]
SLIdisabled
GPUs1
Logical GPUs1
OpenGL RendererGeForce GTX 260/PCI/SSE2
Drivers RendererNVIDIA GeForce GTX 260
DB RendererNVIDIA GeForce GTX 260
Device DescriptionNVIDIA GeForce GTX 260
Adapter StringGeForce GTX 260
VendorNVIDIA Corporation
Vendor ID0x10DE
Device ID0x05E2
Sub device ID0x1109
Sub vendor ID0x19DA
Drivers Version8.17.12.6099 (10-16-2010) - nvoglv64
GPU CodenameGT200
GPU Unified Shader Processors192
GPU Vertex Shader Processors0
GPU Pixel Shader Processors0
SM SIMD24
TPC8
TPD (Watts): 182
Video Memory Size896 MB
Video Memory TypeGDDR3
Clocks level #0: Core: 300MHz - Memory: 100MHz - Shader: 600MHz
Clocks level #1: Core: 400MHz - Memory: 300MHz - Shader: 800MHz
Clocks level #2: Core: 576MHz - Memory: 999MHz - Shader: 1242MHz
BIOS String:  62.0.61.0.0
Current Display Mode1280x1024 60 Hz 32 bpp


===================================[ OpenGL GPU Capabilities ]
OpenGL Version3.3.0
GLSL (OpenGL Shading LanguageVersion3.30 NVIDIA via Cg compiler
ARB Texture Units4
Vertex Shader Texture Units32
Pixel Shader Texture Units32
Geometry Shader Texture Units32
Max Texture Size8192x8192
Max Anisotropic Filtering ValueX16.0
Max Point Sprite Size63.4
Max Dynamic Lights8
Max Viewport Size8192x8192
Max Vertex Uniform Components4096
Max Fragment Uniform Components2048
Max Geometry Uniform Components2048
Max Varying Float60
Max Vertex Bindable Uniforms12
Max Fragment Bindable Uniforms12
Max Geometry Bindable Uniforms12
Frame Buffer Objects (FBOSupport:[yes]
Multiple Render Targets Max draw buffers8
Pixel Buffer Objects (PBOSupport:[yes]
S3TC Texture Compression Support:[yes]
ATI 3Dc Texture Compression Support:[no]
Texture Rectangle Support:[yes]
Floating Point Textures Support:[yes]
MSAA2X
MSAA4X
MSAA8X
MSAA16X
OpenGL Extensions221 extensions (GL=199 and WGL=22)
    <
li>GL_ARB_blend_func_extended</li>
    <
li>GL_ARB_color_buffer_float</li>
    <
li>GL_ARB_compatibility</li>
    <
li>GL_ARB_copy_buffer</li>
    <
li>GL_ARB_debug_output</li>
    <
li>GL_ARB_depth_buffer_float</li>
    <
li>GL_ARB_depth_clamp</li>
    <
li>GL_ARB_depth_texture</li>
    <
li>GL_ARB_draw_buffers</li>
    <
li>GL_ARB_draw_elements_base_vertex</li>
    <
li>GL_ARB_draw_instanced</li>
    <
li>GL_ARB_ES2_compatibility</li>
    <
li>GL_ARB_explicit_attrib_location</li>
    <
li>GL_ARB_fragment_coord_conventions</li>
    <
li>GL_ARB_fragment_program</li>
    <
li>GL_ARB_fragment_program_shadow</li>
    <
li>GL_ARB_fragment_shader</li>
    <
li>GL_ARB_framebuffer_object</li>
    <
li>GL_ARB_framebuffer_sRGB</li>
    <
li>GL_ARB_geometry_shader4</li>
    <
li>GL_ARB_get_program_binary</li>
    <
li>GL_ARB_half_float_pixel</li>
    <
li>GL_ARB_half_float_vertex</li>
    <
li>GL_ARB_imaging</li>
    <
li>GL_ARB_instanced_arrays</li>
    <
li>GL_ARB_map_buffer_range</li>
    <
li>GL_ARB_multisample</li>
    <
li>GL_ARB_multitexture</li>
    <
li>GL_ARB_occlusion_query</li>
    <
li>GL_ARB_occlusion_query2</li>
    <
li>GL_ARB_pixel_buffer_object</li>
    <
li>GL_ARB_point_parameters</li>
    <
li>GL_ARB_point_sprite</li>
    <
li>GL_ARB_provoking_vertex</li>
    <
li>GL_ARB_robustness</li>
    <
li>GL_ARB_sampler_objects</li>
    <
li>GL_ARB_seamless_cube_map</li>
    <
li>GL_ARB_separate_shader_objects</li>
    <
li>GL_ARB_shader_bit_encoding</li>
    <
li>GL_ARB_shader_objects</li>
    <
li>GL_ARB_shading_language_100</li>
    <
li>GL_ARB_shadow</li>
    <
li>GL_ARB_sync</li>
    <
li>GL_ARB_texture_border_clamp</li>
    <
li>GL_ARB_texture_buffer_object</li>
    <
li>GL_ARB_texture_compression</li>
    <
li>GL_ARB_texture_compression_rgtc</li>
    <
li>GL_ARB_texture_cube_map</li>
    <
li>GL_ARB_texture_env_add</li>
    <
li>GL_ARB_texture_env_combine</li>
    <
li>GL_ARB_texture_env_crossbar</li>
    <
li>GL_ARB_texture_env_dot3</li>
    <
li>GL_ARB_texture_float</li>
    <
li>GL_ARB_texture_mirrored_repeat</li>
    <
li>GL_ARB_texture_multisample</li>
    <
li>GL_ARB_texture_non_power_of_two</li>
    <
li>GL_ARB_texture_rectangle</li>
    <
li>GL_ARB_texture_rg</li>
    <
li>GL_ARB_texture_rgb10_a2ui</li>
    <
li>GL_ARB_texture_swizzle</li>
    <
li>GL_ARB_timer_query</li>
    <
li>GL_ARB_transform_feedback2</li>
    <
li>GL_ARB_transpose_matrix</li>
    <
li>GL_ARB_uniform_buffer_object</li>
    <
li>GL_ARB_vertex_array_bgra</li>
    <
li>GL_ARB_vertex_array_object</li>
    <
li>GL_ARB_vertex_buffer_object</li>
    <
li>GL_ARB_vertex_program</li>
    <
li>GL_ARB_vertex_shader</li>
    <
li>GL_ARB_vertex_type_2_10_10_10_rev</li>
    <
li>GL_ARB_viewport_array</li>
    <
li>GL_ARB_window_pos</li>
    <
li>GL_ATI_draw_buffers</li>
    <
li>GL_ATI_texture_float</li>
    <
li>GL_ATI_texture_mirror_once</li>
    <
li>GL_S3_s3tc</li>
    <
li>GL_EXT_texture_env_add</li>
    <
li>GL_EXT_abgr</li>
    <
li>GL_EXT_bgra</li>
    <
li>GL_EXT_bindable_uniform</li>
    <
li>GL_EXT_blend_color</li>
    <
li>GL_EXT_blend_equation_separate</li>
    <
li>GL_EXT_blend_func_separate</li>
    <
li>GL_EXT_blend_minmax</li>
    <
li>GL_EXT_blend_subtract</li>
    <
li>GL_EXT_compiled_vertex_array</li>
    <
li>GL_EXT_Cg_shader</li>
    <
li>GL_EXT_depth_bounds_test</li>
    <
li>GL_EXT_direct_state_access</li>
    <
li>GL_EXT_draw_buffers2</li>
    <
li>GL_EXT_draw_instanced</li>
    <
li>GL_EXT_draw_range_elements</li>
    <
li>GL_EXT_fog_coord</li>
    <
li>GL_EXT_framebuffer_blit</li>
    <
li>GL_EXT_framebuffer_multisample</li>
    <
li>GL_EXTX_framebuffer_mixed_formats</li>
    <
li>GL_EXT_framebuffer_object</li>
    <
li>GL_EXT_framebuffer_sRGB</li>
    <
li>GL_EXT_geometry_shader4</li>
    <
li>GL_EXT_gpu_program_parameters</li>
    <
li>GL_EXT_gpu_shader4</li>
    <
li>GL_EXT_multi_draw_arrays</li>
    <
li>GL_EXT_packed_depth_stencil</li>
    <
li>GL_EXT_packed_float</li>
    <
li>GL_EXT_packed_pixels</li>
    <
li>GL_EXT_pixel_buffer_object</li>
    <
li>GL_EXT_point_parameters</li>
    <
li>GL_EXT_provoking_vertex</li>
    <
li>GL_EXT_rescale_normal</li>
    <
li>GL_EXT_secondary_color</li>
    <
li>GL_EXT_separate_shader_objects</li>
    <
li>GL_EXT_separate_specular_color</li>
    <
li>GL_EXT_shadow_funcs</li>
    <
li>GL_EXT_stencil_two_side</li>
    <
li>GL_EXT_stencil_wrap</li>
    <
li>GL_EXT_texture3D</li>
    <
li>GL_EXT_texture_array</li>
    <
li>GL_EXT_texture_buffer_object</li>
    <
li>GL_EXT_texture_compression_latc</li>
    <
li>GL_EXT_texture_compression_rgtc</li>
    <
li>GL_EXT_texture_compression_s3tc</li>
    <
li>GL_EXT_texture_cube_map</li>
    <
li>GL_EXT_texture_edge_clamp</li>
    <
li>GL_EXT_texture_env_combine</li>
    <
li>GL_EXT_texture_env_dot3</li>
    <
li>GL_EXT_texture_filter_anisotropic</li>
    <
li>GL_EXT_texture_integer</li>
    <
li>GL_EXT_texture_lod</li>
    <
li>GL_EXT_texture_lod_bias</li>
    <
li>GL_EXT_texture_mirror_clamp</li>
    <
li>GL_EXT_texture_object</li>
    <
li>GL_EXT_texture_shared_exponent</li>
    <
li>GL_EXT_texture_sRGB</li>
    <
li>GL_EXT_texture_swizzle</li>
    <
li>GL_EXT_timer_query</li>
    <
li>GL_EXT_transform_feedback2</li>
    <
li>GL_EXT_vertex_array</li>
    <
li>GL_EXT_vertex_array_bgra</li>
    <
li>GL_IBM_rasterpos_clip</li>
    <
li>GL_IBM_texture_mirrored_repeat</li>
    <
li>GL_KTX_buffer_region</li>
    <
li>GL_NV_blend_square</li>
    <
li>GL_NV_conditional_render</li>
    <
li>GL_NV_copy_depth_to_color</li>
    <
li>GL_NV_copy_image</li>
    <
li>GL_NV_depth_buffer_float</li>
    <
li>GL_NV_depth_clamp</li>
    <
li>GL_NV_explicit_multisample</li>
    <
li>GL_NV_fence</li>
    <
li>GL_NV_float_buffer</li>
    <
li>GL_NV_fog_distance</li>
    <
li>GL_NV_fragment_program</li>
    <
li>GL_NV_fragment_program_option</li>
    <
li>GL_NV_fragment_program2</li>
    <
li>GL_NV_framebuffer_multisample_coverage</li>
    <
li>GL_NV_geometry_shader4</li>
    <
li>GL_NV_gpu_program4</li>
    <
li>GL_NV_half_float</li>
    <
li>GL_NV_light_max_exponent</li>
    <
li>GL_NV_multisample_coverage</li>
    <
li>GL_NV_multisample_filter_hint</li>
    <
li>GL_NV_occlusion_query</li>
    <
li>GL_NV_packed_depth_stencil</li>
    <
li>GL_NV_parameter_buffer_object</li>
    <
li>GL_NV_parameter_buffer_object2</li>
    <
li>GL_NV_pixel_data_range</li>
    <
li>GL_NV_point_sprite</li>
    <
li>GL_NV_primitive_restart</li>
    <
li>GL_NV_register_combiners</li>
    <
li>GL_NV_register_combiners2</li>
    <
li>GL_NV_shader_buffer_load</li>
    <
li>GL_NV_texgen_reflection</li>
    <
li>GL_NV_texture_barrier</li>
    <
li>GL_NV_texture_compression_vtc</li>
    <
li>GL_NV_texture_env_combine4</li>
    <
li>GL_NV_texture_expand_normal</li>
    <
li>GL_NV_texture_multisample</li>
    <
li>GL_NV_texture_rectangle</li>
    <
li>GL_NV_texture_shader</li>
    <
li>GL_NV_texture_shader2</li>
    <
li>GL_NV_texture_shader3</li>
    <
li>GL_NV_transform_feedback</li>
    <
li>GL_NV_transform_feedback2</li>
    <
li>GL_NV_vertex_array_range</li>
    <
li>GL_NV_vertex_array_range2</li>
    <
li>GL_NV_vertex_buffer_unified_memory</li>
    <
li>GL_NV_vertex_program</li>
    <
li>GL_NV_vertex_program1_1</li>
    <
li>GL_NV_vertex_program2</li>
    <
li>GL_NV_vertex_program2_option</li>
    <
li>GL_NV_vertex_program3</li>
    <
li>GL_NVX_conditional_render</li>
    <
li>GL_NVX_gpu_memory_info</li>
    <
li>GL_SGIS_generate_mipmap</li>
    <
li>GL_SGIS_texture_lod</li>
    <
li>GL_SGIX_depth_texture</li>
    <
li>GL_SGIX_shadow</li>
    <
li>GL_SUN_slice_accum</li>
    <
li>GL_WIN_swap_hint</li>
    <
li>WGL_EXT_swap_control</li>
    <
li>WGL_ARB_buffer_region</li>
    <
li>WGL_ARB_create_context</li>
    <
li>WGL_ARB_create_context_profile</li>
    <
li>WGL_ARB_create_context_robustness</li>
    <
li>WGL_ARB_extensions_string</li>
    <
li>WGL_ARB_make_current_read</li>
    <
li>WGL_ARB_multisample</li>
    <
li>WGL_ARB_pbuffer</li>
    <
li>WGL_ARB_pixel_format</li>
    <
li>WGL_ARB_pixel_format_float</li>
    <
li>WGL_ARB_render_texture</li>
    <
li>WGL_ATI_pixel_format_float</li>
    <
li>WGL_EXT_create_context_es2_profile</li>
    <
li>WGL_EXT_extensions_string</li>
    <
li>WGL_EXT_framebuffer_sRGB</li>
    <
li>WGL_EXT_pixel_format_packed_float</li>
    <
li>WGL_NVX_DX_interop</li>
    <
li>WGL_NV_float_buffer</li>
    <
li>WGL_NV_multisample_coverage</li>
    <
li>WGL_NV_render_depth_texture</li>
    <
li>WGL_NV_render_texture_rectangle</li>


===================================[ 
NVIDIA CUDA Capabilities ]
CUDA Device 0
    
Device nameGeForce GTX 260
    
Compute Capability1.3
    
Total Memory877 MB
    
Shader Clock Rate1242 MHz
    
Multiprocessors24
    
Warp Size32
    
Max Threads Per Block512
    
Threads Per Block512 x 512 x 64
    
Grid Size65535 x 65535 x 1
    
Registers Per Block16384
    
Texture Alignment256 byte
    
Total Constant Memory64 Kb


===================================[ OpenCL Capabilities ]
Num OpenCL platforms1
NameNVIDIA CUDA
VersionOpenCL 1.0 CUDA 3.2.1
ProfileFULL_PROFILE
VendorNVIDIA Corporation
Num devices1

    
CL_DEVICE_NAMEGeForce GTX 260
    
CL_DEVICE_VENDORNVIDIA Corporation
    
CL_DRIVER_VERSION260.99
    
CL_DEVICE_PROFILEFULL_PROFILE
    
CL_DEVICE_VERSIONOpenCL 1.0 CUDA
    
CL_DEVICE_TYPEGPU
    
CL_DEVICE_VENDOR_ID0x10DE
    
CL_DEVICE_MAX_COMPUTE_UNITS24
    
CL_DEVICE_MAX_CLOCK_FREQUENCY1242MHz
    
CL_NV_DEVICE_COMPUTE_CAPABILITY_MAJOR1
    
CL_NV_DEVICE_COMPUTE_CAPABILITY_MINOR3
    
CL_NV_DEVICE_REGISTERS_PER_BLOCK16384
    
CL_NV_DEVICE_WARP_SIZE32
    
CL_NV_DEVICE_GPU_OVERLAP1
    
CL_NV_DEVICE_KERNEL_EXEC_TIMEOUT1
    
CL_NV_DEVICE_INTEGRATED_MEMORY0
    
CL_DEVICE_ADDRESS_BITS32
    
CL_DEVICE_MAX_MEM_ALLOC_SIZE224608KB
    
CL_DEVICE_GLOBAL_MEM_SIZE877MB
    
CL_DEVICE_MAX_PARAMETER_SIZE4352
    
CL_DEVICE_GLOBAL_MEM_CACHELINE_SIZE0 Bytes
    
CL_DEVICE_GLOBAL_MEM_CACHE_SIZE0KB
    
CL_DEVICE_ERROR_CORRECTION_SUPPORTNO
    
CL_DEVICE_LOCAL_MEM_TYPELocal (scratchpad)
    - 
CL_DEVICE_LOCAL_MEM_SIZE16KB
    
CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE64KB
    
CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS3
    
CL_DEVICE_MAX_WORK_ITEM_SIZES: [512 512 64]
    - 
CL_DEVICE_MAX_WORK_GROUP_SIZE512
    
CL_EXEC_NATIVE_KERNEL4751356
    
CL_DEVICE_IMAGE_SUPPORTYES
    
CL_DEVICE_MAX_READ_IMAGE_ARGS128
    
CL_DEVICE_MAX_WRITE_IMAGE_ARGS8
    
CL_DEVICE_IMAGE2D_MAX_WIDTH4096
    
CL_DEVICE_IMAGE2D_MAX_HEIGHT32768
    
CL_DEVICE_IMAGE3D_MAX_WIDTH2048
    
CL_DEVICE_IMAGE3D_MAX_HEIGHT2048
    
CL_DEVICE_IMAGE3D_MAX_DEPTH16
    
CL_DEVICE_MAX_SAMPLERS16
    
CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR1
    
CL_DEVICE_PREFERRED_VECTOR_WIDTH_SHORT1
    
CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT1
    
CL_DEVICE_PREFERRED_VECTOR_WIDTH_LONG1
    
CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT1
    
CL_DEVICE_PREFERRED_VECTOR_WIDTH_DOUBLE1
    
CL_DEVICE_EXTENSIONS16
    
Extensions:
        - 
cl_khr_byte_addressable_store
        
cl_khr_icd
        
cl_khr_gl_sharing
        
cl_nv_d3d9_sharing
        
cl_nv_d3d10_sharing
        
cl_khr_d3d10_sharing
        
cl_nv_d3d11_sharing
        
cl_nv_compiler_options
        
cl_nv_device_attribute_query
        
cl_nv_pragma_unroll
        

        - 
cl_khr_global_int32_base_atomics
        
cl_khr_global_int32_extended_atomics
        
cl_khr_local_int32_base_atomics
        
cl_khr_local_int32_extended_atomics
        
cl_khr_fp64


===================================[ Misc. ]


===================================[ 
Related Graphics Drivers ]
http://www.geeks3d.com/?page_id=752
http://downloads.guru3d.com/download.php?id=10
http://www.tweakguides.com/NVFORCE_1.html
http://www.nvidia.com/object/winxp-2k_archive.html
http://www.geeks3d.com/?p=65


===================================[ Related Graphics Cards Reviews ]
http://www.geeks3d.com/?tag=geforce-gtx-260
http://www.google.us/search?q=NVIDIA+GeForce+GTX+260+review 
__________________
AVSRepoGUI // VSRepoGUI - Package Manager for AviSynth // VapourSynth
VapourSynth Portable FATPACK || VapourSynth Database

Last edited by ChaosKing; 12th January 2011 at 18:30.
ChaosKing is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 19:01.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2024, vBulletin Solutions Inc.