[Split] PowerSpectrum Unit Test

PatrickV2:
Not sure if this is what you're looking for, but below are my results on my 8800GTX with the 260.99 drivers:

Device: GeForce 8800 GTX, 1350 MHz clock, 731 MB memory.
Compiled with CUDA 3020.
Stock GetPowerSpectrum():
     64 threads:       17.8 GFlops    7.1 GB/s 1183.3ulps


GetPowerSpectrum() mod 1:
     32 threads:       14.2 GFlops    5.7 GB/s 1183.3ulps
     64 threads:       17.8 GFlops    7.1 GB/s 1183.3ulps
    128 threads:       17.8 GFlops    7.1 GB/s 1183.3ulps
    256 threads:       17.6 GFlops    7.0 GB/s 1183.3ulps


GetPowerSpectrum() mod 2:
     32 threads:        6.8 GFlops    2.7 GB/s 1183.3ulps
     64 threads:        6.2 GFlops    2.5 GB/s 1183.3ulps
    128 threads:        9.1 GFlops    3.7 GB/s 1183.3ulps
    256 threads:        8.0 GFlops    3.2 GB/s 1183.3ulps

Regards, Patrick.

_heinz:
starting PowerSpectrum2
.
-device 0
Device: GeForce GTX 470, 810 MHz clock, 1248 MB memory.
Compiled with CUDA 3020.
Stock GetPowerSpectrum():
     64 threads:       20.6 GFlops    8.2 GB/s   0.0ulps


GetPowerSpectrum() mod 1:
     32 threads:       12.5 GFlops    5.0 GB/s   0.0ulps
     64 threads:       20.5 GFlops    8.2 GB/s   0.0ulps
    128 threads:       27.6 GFlops   11.0 GB/s   0.0ulps
    256 threads:       29.9 GFlops   12.0 GB/s   0.0ulps


GetPowerSpectrum() mod 2:
     32 threads:       14.4 GFlops    5.8 GB/s   0.0ulps
     64 threads:       28.3 GFlops   11.3 GB/s   0.0ulps
    128 threads:       42.4 GFlops   16.9 GB/s   0.0ulps
    256 threads:       42.5 GFlops   17.0 GB/s   0.0ulps


-device 1
Device: GeForce GTX 470, 810 MHz clock, 1249 MB memory.
Compiled with CUDA 3020.
Stock GetPowerSpectrum():
     64 threads:       20.6 GFlops    8.3 GB/s   0.0ulps


GetPowerSpectrum() mod 1:
     32 threads:       12.6 GFlops    5.0 GB/s   0.0ulps
     64 threads:       20.5 GFlops    8.2 GB/s   0.0ulps
    128 threads:       27.5 GFlops   11.0 GB/s   0.0ulps
    256 threads:       30.1 GFlops   12.0 GB/s   0.0ulps


GetPowerSpectrum() mod 2:
     32 threads:       14.4 GFlops    5.8 GB/s   0.0ulps
     64 threads:       28.4 GFlops   11.4 GB/s   0.0ulps
    128 threads:       42.2 GFlops   16.9 GB/s   0.0ulps
    256 threads:       41.1 GFlops   16.4 GB/s   0.0ulps


.
Done
Edit:
@Jason, I'm wondering how you get 20 GFlops more with 256 threads than my GTX 470 does.
Do you have the source so I can compile it with the 2011XE compiler?

arkayn:
I tried running it on my 460, but the program always crashes at the end of the 128-thread run or the beginning of the 256-thread run in mod 2.

I never see any results.

Claggy:
Here's my 9800GTX+ result. Like Richard's 9800GTX+, it's a factory-overclocked card, but from XFX:

Device: GeForce 9800 GTX/9800 GTX+, 1900 MHz clock, 496 MB memory.
Compiled with CUDA 3020.
Stock GetPowerSpectrum():
     64 threads:       16.1 GFlops    6.5 GB/s 1183.3ulps


GetPowerSpectrum() mod 1:
     32 threads:       15.1 GFlops    6.1 GB/s 1183.3ulps
     64 threads:       16.1 GFlops    6.5 GB/s 1183.3ulps
    128 threads:       16.0 GFlops    6.4 GB/s 1183.3ulps
    256 threads:       15.9 GFlops    6.3 GB/s 1183.3ulps


GetPowerSpectrum() mod 2:
     32 threads:        6.2 GFlops    2.5 GB/s 1183.3ulps
     64 threads:        8.2 GFlops    3.3 GB/s 1183.3ulps
    128 threads:        8.3 GFlops    3.3 GB/s 1183.3ulps
    256 threads:        8.1 GFlops    3.2 GB/s 1183.3ulps

Claggy

Claggy:
Here's the result from my 128 MB 8400M GS. While it doesn't have enough RAM for SETI, it at least gives you some figures for very slow GPUs:

Device: GeForce 8400M GS, 800 MHz clock, 114 MB memory.
Compiled with CUDA 3020.
Stock GetPowerSpectrum():
     64 threads:        1.2 GFlops    0.5 GB/s 1183.3ulps


GetPowerSpectrum() mod 1:
     32 threads:        1.2 GFlops    0.5 GB/s 1183.3ulps
     64 threads:        1.2 GFlops    0.5 GB/s 1183.3ulps
    128 threads:        1.2 GFlops    0.5 GB/s 1183.3ulps
    256 threads:        1.2 GFlops    0.5 GB/s 1183.3ulps


GetPowerSpectrum() mod 2:
     32 threads:        0.7 GFlops    0.3 GB/s 1183.3ulps
     64 threads:        0.7 GFlops    0.3 GB/s 1183.3ulps
    128 threads:        0.7 GFlops    0.3 GB/s 1183.3ulps
    256 threads:        0.6 GFlops    0.2 GB/s 1183.3ulps

Claggy
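
For anyone who wants to compare the pasted results across cards, here is a minimal sketch of a parser for the output format shown in the posts above. It is not part of the unit test itself. The function names (`parse_results`, `best_config`) are hypothetical, the regular expression only assumes the line layout as it appears in this thread, and the sample data is copied from the GTX 470 (device 0) post.

```python
import re

# One result line as posted in this thread, e.g.
# "    256 threads:       29.9 GFlops   12.0 GB/s   0.0ulps"
LINE_RE = re.compile(
    r"^\s*(\d+)\s+threads:\s+([\d.]+)\s+GFlops\s+([\d.]+)\s+GB/s\s+([\d.]+)\s*ulps\s*$"
)

def parse_results(text):
    """Return {section: {threads: (gflops, gb_per_s, ulps)}} (hypothetical helper)."""
    results = {}
    section = None
    for line in text.splitlines():
        stripped = line.strip()
        # Section headers look like "GetPowerSpectrum() mod 1:"
        if "GetPowerSpectrum()" in stripped and stripped.endswith(":"):
            section = stripped.rstrip(":")
            results[section] = {}
            continue
        match = LINE_RE.match(line)
        if match and section is not None:
            threads = int(match.group(1))
            results[section][threads] = (
                float(match.group(2)),
                float(match.group(3)),
                float(match.group(4)),
            )
    return results

def best_config(results):
    """Return (section, threads, gflops) for the highest GFlops figure."""
    best = None
    for section, rows in results.items():
        for threads, (gflops, _gbps, _ulps) in rows.items():
            if best is None or gflops > best[2]:
                best = (section, threads, gflops)
    return best

# Sample copied from the GTX 470 (device 0) post in this thread.
SAMPLE = """\
Stock GetPowerSpectrum():
     64 threads:       20.6 GFlops    8.2 GB/s   0.0ulps

GetPowerSpectrum() mod 1:
     32 threads:       12.5 GFlops    5.0 GB/s   0.0ulps
     64 threads:       20.5 GFlops    8.2 GB/s   0.0ulps
    128 threads:       27.6 GFlops   11.0 GB/s   0.0ulps
    256 threads:       29.9 GFlops   12.0 GB/s   0.0ulps

GetPowerSpectrum() mod 2:
     32 threads:       14.4 GFlops    5.8 GB/s   0.0ulps
     64 threads:       28.3 GFlops   11.3 GB/s   0.0ulps
    128 threads:       42.4 GFlops   16.9 GB/s   0.0ulps
    256 threads:       42.5 GFlops   17.0 GB/s   0.0ulps
"""

if __name__ == "__main__":
    parsed = parse_results(SAMPLE)
    section, threads, gflops = best_config(parsed)
    print(f"Fastest: {section} at {threads} threads ({gflops} GFlops)")
```

Pasting another card's output into `SAMPLE` should work as long as the lines keep the same layout. Device header lines are simply skipped.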