GPU client

Devaster:
You can use the KNAbench system for speed comparison.

TheMule:
Ok, not what I expected. Using KNAbench and work unit 1, I got:

226 sec - setiathome_6.01_windows_intelx86
203 sec - setiathome_5.27_windows_intelx86

So 6.01 is about 23 sec slower. Is it due to the FFT messages on the screen? Data follows:

setiathome_5.27_windows_intelx86.exe -nographics / testWU-1.wu :
Started at  : 13:53:37
Ended at    : 13:57:00
Elapsed time: 203 seconds
 
[ stderr ]
Can't set up shared mem: -1
Will run in standalone mode.
setiathome_enhanced 5.27 DevC++/MinGW

Work Unit Info:
...............
WU true angle range is :  0.604884
Optimal function choices:
-----------------------------------------------------
name               
-----------------------------------------------------
              v_BaseLineSmooth (no other)
   v_vGetPowerSpectrumUnrolled 0.00006 0.00000
             sse3_ChirpData_ak 0.00899 0.00000
                 v_vTranspose4 0.00143 0.00000
                AK SSE folding 0.00076 0.00000

Flopcounter: 637401180238.359500

Spike count:    0
Pulse count:    0
Triplet count:  0
Gaussian count: 0
[ /stderr ]



setiathome_6.01_windows_intelx86.exe -nographics / testWU-1.wu :
Started at  : 13:44:56
Ended at    : 13:48:42
Elapsed time: 226 seconds
 
[ stderr ]
Device name: GeForce 8800 GTS 512
Device version: 1.1
Total global memory (MB): 512
Number of multiprocessors : 16
Number of cores :128
Shared memory per block (kB): 16
Registers per block: 8192
Warp size: 32
Max threads per block: 512
Shaders clock rate (MHz): 1674
Concurrent copy and execution: No
Can't set up shared mem: -1
Will run in standalone mode.
setiathome_enhanced 6.01 Visual Studio/Microsoft C++
libboinc: 6.3.4

Work Unit Info:
...............
WU true angle range is :  0.604884

Flopcounter: 627299330081.366820

Spike count:    0
Pulse count:    0
Triplet count:  0
Gaussian count: 0
called boinc_finish
[ /stderr ]
------------
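
For anyone who wants to double-check the elapsed times and the 23 sec gap, here is a minimal Python sketch. It is not part of KNAbench. The helper names `elapsed_seconds` and `slowdown_percent` are made up for illustration, and it assumes both timestamps fall on the same day (no run across midnight).

```python
from datetime import datetime


def elapsed_seconds(started: str, ended: str) -> float:
    """Seconds between two same-day HH:MM:SS or HH:MM:SS.fff timestamps."""
    def parse(stamp: str) -> datetime:
        # KNAbench logs in this thread use both forms, with and without milliseconds.
        fmt = "%H:%M:%S.%f" if "." in stamp else "%H:%M:%S"
        return datetime.strptime(stamp, fmt)

    return (parse(ended) - parse(started)).total_seconds()


def slowdown_percent(baseline: float, candidate: float) -> float:
    """How much longer the candidate run took, as a percentage of the baseline."""
    return (candidate - baseline) / baseline * 100.0


# Timestamps copied from the two runs above.
t_527 = elapsed_seconds("13:53:37", "13:57:00")  # setiathome 5.27
t_601 = elapsed_seconds("13:44:56", "13:48:42")  # setiathome 6.01

print(t_527, t_601, t_601 - t_527)
print(round(slowdown_percent(t_527, t_601), 2))
```

With the numbers above, the 6.01 run takes roughly 11% longer than the 5.27 run on this work unit.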



Devaster:
Okay, new code: now a 64-bit build, otherwise the same as the previous 32-bit build.

Compiled with VS2008 + VS2005 under Windows Server 2008 x64.

Small test:

--- Code: ---
============
setiathome_6.00S08_windows_intelx86.exe -verb -nog / testWU-4.wu :
Started at  : 20:57:18.970
Ended at    : 21:00:46.190
    207.126 secs Elapsed
    199.109 secs CPU time
 
[ stderr ]
Can't set up shared mem: -1
Will run in standalone mode.
setiathome_enhanced 6.00S08 DevC++/MinGW
libboinc: 6.1.6

DataIn=0x32b00c0, ChirpedData=0x2aa0040

Work Unit Info:
...............
WU true angle range is :  1.279649
Optimal function choices:
-----------------------------------------------------
                          name  timing   error
-----------------------------------------------------
              v_BaseLineSmooth (no other)

            v_GetPowerSpectrum 0.00079 0.00000  test
           v_vGetPowerSpectrum 0.00073 0.00000  test
          v_vGetPowerSpectrum2 0.00075 0.00000  test
   v_vGetPowerSpectrumUnrolled 0.00076 0.00000  test
  v_vGetPowerSpectrumUnrolled2 0.00075 0.00000  test
           v_vGetPowerSpectrum 0.00073 0.00000  choice

                   v_ChirpData 0.03327 0.00000  test
                 fpu_ChirpData 0.04556 0.00000  test
           v_vChirpData_x86_64 0.24693 0.00002  test
             sse1_ChirpData_ak 0.03216 0.00000  test
             sse2_ChirpData_ak 0.03455 0.00000  test
             sse3_ChirpData_ak 0.02924 0.00000  test
             sse3_ChirpData_ak 0.02924 0.00000  choice

                   v_Transpose 0.04322 0.00000  test
                  v_Transpose2 0.02599 0.00000  test
                  v_Transpose4 0.01550 0.00000  test
                  v_Transpose8 0.02781 0.00000  test
                v_pfTranspose2 0.02539 0.00000  test
                v_pfTranspose4 0.01571 0.00000  test
                v_pfTranspose8 0.02681 0.00000  test
                 v_vTranspose4 0.01173 0.00000  test
               v_vTranspose4np 0.01197 0.00000  test
              v_vTranspose4ntw 0.01090 0.00000  test
            v_vTranspose4x8ntw 0.00758 0.00000  test
           v_vTranspose4x16ntw 0.00580 0.00000  test
          v_vpfTranspose8x4ntw 0.01072 0.00000  test
           v_vTranspose4x16ntw 0.00580 0.00000  choice

               FPU opt folding 0.00423 0.00000  test
                AK SSE folding 0.00220 0.00000  test
                BH SSE folding 0.00201 0.00000  test
                BH SSE folding 0.00201 0.00000  choice


Flopcounter: 243285924139.522000

Spike count:    0
Pulse count:    0
Triplet count:  0
Gaussian count: 0
called boinc_finish
[ /stderr ]
------------
setiathome_6.01_windows_intelx64.exe -verb -st / testWU-4.wu :
Started at  : 21:00:46.346
Ended at    : 21:03:02.643
    136.219 secs Elapsed
    128.750 secs CPU time
Speedup     : 35.34%
Ratio       : 1.55 x
 
Result      : Strongly similar,  Q= 99.99%
[ stderr ]
Device name: GeForce 9600 GT
Device version: 1.1
Total global memory (MB): 512
Number of multiprocessors : 8
Number of cores :64
Shared memory per block (kB): 16
Registers per block: 8192
Warp size: 32
Max threads per block: 512
Shaders clock rate (MHz): 1625
Concurrent copy and execution: No
Can't set up shared mem: -1
Will run in standalone mode.
setiathome_enhanced 6.01 Visual Studio/Microsoft C++
libboinc: 6.3.5

Work Unit Info:
...............
WU true angle range is :  1.279649

Flopcounter: 238022320153.522060

Spike count:    0
Pulse count:    0
Triplet count:  0
Gaussian count: 0
called boinc_finish
[ /stderr ]
 
 
------------
 
Quick timetable
 
WU : testWU-4.wu
setiathome_6.00S08_windows_intelx86.exe : 199.109 secs CPU
setiathome_6.01_windows_intelx64.exe : 128.750 secs CPU
Speedup     : 35.34%
Ratio       : 1.55 x
 
------------
CPU:
Number of processors 1
Number of cores 1 (max 1)
Specification AMD Athlon(tm) 64 Processor 3000+
Codename Venice
Core Speed 1005.3 MHz (5.0 x 201.1 MHz)
Core Stepping DH-E6
Technology 90 nm
Stock frequency 1800 MHz
------------
Chipset:
Northbridge NVIDIA nForce4 rev. A3
Southbridge NVIDIA nForce4 MCP rev. A3
------------
RAM:
Memory Type DDR
Memory Size 2048 MBytes
Memory Frequency 201.1 MHz (CPU/5)
Max bandwidth PC3200 (200 MHz)
CAS# 3.0
RAS# to CAS# 3
RAS# Precharge 3
Cycle Time (tRAS) 8
DRAM Idle Timer 16
------------
OS:
Windows Version Microsoft Windows Vista (6.0) Enterprise Edition (Full)  Service Pack 1 (Build 6001)
============

--- End code ---
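
The "Speedup" and "Ratio" lines in the timetable can be reproduced from the two CPU times. The sketch below is only a guess at the arithmetic: it assumes speedup is the CPU time saved relative to the old build and ratio is old time divided by new time. KNAbench's actual formulas are not shown in the log, and the function names are hypothetical.

```python
def speedup_percent(old_cpu: float, new_cpu: float) -> float:
    """CPU time saved by the new build, as a percentage of the old build's time."""
    return (old_cpu - new_cpu) / old_cpu * 100.0


def ratio(old_cpu: float, new_cpu: float) -> float:
    """How many times faster the new build is (old time / new time)."""
    return old_cpu / new_cpu


# CPU times from the quick timetable for testWU-4.wu.
old_cpu = 199.109  # setiathome_6.00S08_windows_intelx86.exe
new_cpu = 128.750  # setiathome_6.01_windows_intelx64.exe

print(f"Speedup : {speedup_percent(old_cpu, new_cpu):.2f}%")
print(f"Ratio   : {ratio(old_cpu, new_cpu):.2f} x")
```

Under those assumptions the output matches the 35.34% and 1.55 x reported in the log.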

The app was running at 100 percent almost all the time. MS has done a very good job on performance with Server 2008.

Morten:
Hi,

Tested x64-version and got this:

==================
Device name: Device Emulation (CPU)
Device version: 9999.9999
Total global memory (MB): 4095
Number of multiprocessors : 16
Number of cores :128
Shared memory per block (kB): 16
Registers per block: 8192
Warp size: 1
Max threads per block: 512
Shaders clock rate (MHz): 1350
Concurrent copy and execution: No
Can't set up shared mem: -1
Will run in standalone mode.
GPU memory allocation error (source buffer) ...

==================

I'm running the CUDA display driver NVIDIADisplayWinVista64(177_35)Int.exe on a GeForce 8800 GT.

Morten

Devaster:
Does anyone else have the same problem?

Try the latest drivers.
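
A quick way to tell whether a run picked up the real card is to look at the "Device name" line in stderr. In Morten's log it reads "Device Emulation (CPU)" with a warp size of 1, which suggests the app did not see the 8800 GT at all. Below is a rough Python sketch that parses the device block and flags that case. The helper names are invented for illustration, and the check relies only on the strings visible in the logs in this thread.

```python
def parse_device_info(stderr_text: str) -> dict:
    """Collect 'key: value' lines from the app's stderr into a dict of strings."""
    info = {}
    for line in stderr_text.splitlines():
        key, sep, value = line.partition(":")
        # Skip lines without a colon or without anything after it.
        if sep and value.strip():
            info[key.strip()] = value.strip()
    return info


def is_emulation(info: dict) -> bool:
    """True if the reported device name looks like the CPU emulation device."""
    return info.get("Device name", "").startswith("Device Emulation")


# Excerpt copied from the log posted above.
sample = """Device name: Device Emulation (CPU)
Device version: 9999.9999
Total global memory (MB): 4095
Number of multiprocessors : 16
Number of cores :128
Warp size: 1
GPU memory allocation error (source buffer) ...
"""

info = parse_device_info(sample)
print(info["Device name"], "| emulation:", is_emulation(info))
```

If this reports emulation, the driver or CUDA setup is the first thing to check before comparing any timings.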
