Forum > Windows
optimized sources
_heinz:
excerpt from v0.39 installer Readme:
The ATI MB application will not work on ATI cards with workgroup size 128
(e.g. HD43xx).
HD4670 has:
CL_DEVICE_MAX_WORK_GROUP_SIZE: 128
:'( :'( :'(
why ?
I'm disappointed....
GPUZ shows: gpuz_hd4670
heinz
_heinz:
I installed now:
for Astropulse
ap_5.06_win_x86_SSE2_OpenCL_ATI_r521.exe
MultiBeam
AK_v8b2_win_SSE2.exe
BOINC shows:
06.12.2011 20:58:39 ATI GPU 0: ATI Radeon HD 4600 series (R730) (CAL version 1.4.1607, 1024MB, 480 GFLOPS peak)
hopefully I will get some work....when seti is up again.
heinz
_heinz:
HD4670 AGP, here is what clinfo shows:
~~~~~~~~~~~~~~~~~~~~~~~~~
C:\A\clinfo>echo off
clinfo
Number of platforms: 1
Platform Profile: FULL_PROFILE
Platform Version: OpenCL 1.1 AMD-APP-SDK-v2.5 (79
3.1)
Platform Name: AMD Accelerated Parallel Proces
sing
Platform Vendor: Advanced Micro Devices, Inc.
Platform Extensions: cl_khr_icd cl_amd_event_callbac
k cl_amd_offline_devices
Platform Name: AMD Accelerated Parallel Proces
sing
Number of devices: 2
Device Type: CL_DEVICE_TYPE_GPU
Device ID: 4098
Max compute units: 8
Max work items dimensions: 3
Max work items[0]: 128
Max work items[1]: 128
Max work items[2]: 128
Max work group size: 128
Preferred vector width char: 16
Preferred vector width short: 8
Preferred vector width int: 4
Preferred vector width long: 2
Preferred vector width float: 4
Preferred vector width double: 0
Max clock frequency: 750Mhz
Address bits: 32
Max memory allocation: 134217728
Image support: No
Max size of kernel argument: 1024
Alignment (bits) of base address: 32768
Minimum alignment (bytes) for any datatype: 128
Single precision floating point capability
Denorms: No
Quiet NaNs: Yes
Round to nearest even: Yes
Round to zero: Yes
Round to +ve and infinity: Yes
IEEE754-2008 fused multiply-add: Yes
Cache type: None
Cache line size: 0
Cache size: 0
Global memory size: 536870912
Constant buffer size: 65536
Max number of constant args: 8
Local memory type: Global
Local memory size: 16384
Error correction support: 0
Profiling timer resolution: 1
Device endianess: Little
Available: Yes
Compiler available: Yes
Execution capabilities:
Execute OpenCL kernels: Yes
Execute native function: No
Queue properties:
Out-of-Order: No
Profiling : Yes
Platform ID: 011BA4F4
Name: ATI RV730
Vendor: Advanced Micro Devices, Inc.
Driver version: CAL 1.4.1607
Profile: FULL_PROFILE
Version: OpenCL 1.0 AMD-APP-SDK-v2.5 (79
3.1)
Extensions: cl_khr_gl_sharing cl_amd_device
_attribute_query
Device Type: CL_DEVICE_TYPE_CPU
Device ID: 4098
Max compute units: 1
Max work items dimensions: 3
Max work items[0]: 1024
Max work items[1]: 1024
Max work items[2]: 1024
Max work group size: 1024
Preferred vector width char: 16
Preferred vector width short: 8
Preferred vector width int: 4
Preferred vector width long: 2
Preferred vector width float: 4
Preferred vector width double: 0
Max clock frequency: 2672Mhz
Address bits: 32
Max memory allocation: 1073201152
Image support: Yes
Max number of images read arguments: 128
Max number of images write arguments: 8
Max image 2D width: 8192
Max image 2D height: 8192
Max image 3D width: 2048
Max image 3D height: 2048
Max image 3D depth: 2048
Max samplers within kernel: 16
Max size of kernel argument: 4096
Alignment (bits) of base address: 1024
Minimum alignment (bytes) for any datatype: 128
Single precision floating point capability
Denorms: Yes
Quiet NaNs: Yes
Round to nearest even: Yes
Round to zero: Yes
Round to +ve and infinity: Yes
IEEE754-2008 fused multiply-add: No
Cache type: Read/Write
Cache line size: 0
Cache size: 0
Global memory size: 1073201152
Constant buffer size: 65536
Max number of constant args: 8
Local memory type: Global
Local memory size: 32768
Error correction support: 0
Profiling timer resolution: 279
Device endianess: Little
Available: Yes
Compiler available: Yes
Execution capabilities:
Execute OpenCL kernels: Yes
Execute native function: Yes
Queue properties:
Out-of-Order: No
Profiling : Yes
Platform ID: 011BA4F4
Name: Intel(R) Pentium(
R) 4 CPU 2.66GHz
Vendor: GenuineIntel
Driver version: 2.0
Profile: FULL_PROFILE
Version: OpenCL 1.1 AMD-APP-SDK-v2.5 (79
3.1)
Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_
global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int3
2_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store
cl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_ve
c3 cl_amd_printf cl_amd_media_ops cl_amd_popcnt
Drücken Sie eine beliebige Taste . . .
heinz
Claggy:
--- Quote from: _heinz on 06 Dec 2011, 12:35:05 pm ---excerpt from v0.39 installer Readme:
The ATI MB application will not work on ATI cards with workgroup size 128
(e.g. HD43xx).
HD4670 has:
CL_DEVICE_MAX_WORK_GROUP_SIZE: 128
:'( :'( :'(
why ?
I'm disappointed....
GPUZ shows: gpuz_hd4670
heinz
--- End quote ---
heinz, you'll want to try the MB7_win_x86_SSE3_OpenCL_ATi_LHD4K_r390.exe app from the MB7 r390 sanity check thread, which is especially for GPUs with Max Workgroup size 128
Claggy
_heinz:
I run the testcase with MB7_win_x86_SSE3_OpenCL_ATi_LHD4K_r390,
but mine P4 2.66 has still SSE2
I need a SSE2 version of LHD4K
~~~~~~~~~~~~~~~~~~~~
Informationsliste Wert
CPU-Eigenschaften
CPU Typ Intel Pentium 4, 2666 MHz (20 x 133)
CPU Bezeichnung Northwood
CPU stepping C1
Befehlssatz x86, MMX, SSE, SSE2
Vorgesehene Taktung 2667 MHz
Min / Max CPU Multiplikator 20x / 20x
Engineering Sample Nein
L1 Trace Cache 12K Instructions
L1 Datencache 8 KB
L2 Cache 512 KB (On-Die, ECC, ATC, Full-Speed)
CPU Technische Informationen
Gehäusetyp 478 Pin FC-PGA2
Gehäusegröße 35 mm x 35 mm
Transistoren 55 Mio.
Fertigungstechnologie 6M, 0.13 um, CMOS, Cu, Low-K
Gehäusefläche 131 mm2
Kern Spannung 1.475 - 1.55 V
I/O Spannung 1.475 - 1.55 V
Typische Leistung 38.7 - 89.0 W (Abhängig von der Taktung)
Maximale Leistung 49 - 109 W (Abhängig von der Taktung)
CPU Hersteller
Firmenname Intel Corporation
Produktinformation http://ark.intel.com/search.aspx?q=Intel Pentium 4
Treiberupdate http://www.aida64.com/driver-updates
CPU Auslastung
CPU #1 0 %
heinz
Navigation
[0] Message Index
[#] Next page
[*] Previous page
Go to full version