Forum > GPU crunching
[Split] PowerSpectrum Unit Test
PatrickV2:
Not sure if you're looking for this, but below my results on my 8800GTX, 260.99 drivers:
Device: GeForce 8800 GTX, 1350 MHz clock, 731 MB memory.
Compiled with CUDA 3020.
Stock GetPowerSpectrum():
64 threads: 17.8 GFlops 7.1 GB/s 1183.3ulps
GetPowerSpectrum() mod 1:
32 threads: 14.2 GFlops 5.7 GB/s 1183.3ulps
64 threads: 17.8 GFlops 7.1 GB/s 1183.3ulps
128 threads: 17.8 GFlops 7.1 GB/s 1183.3ulps
256 threads: 17.6 GFlops 7.0 GB/s 1183.3ulps
GetPowerSpectrum() mod 2:
32 threads: 6.8 GFlops 2.7 GB/s 1183.3ulps
64 threads: 6.2 GFlops 2.5 GB/s 1183.3ulps
128 threads: 9.1 GFlops 3.7 GB/s 1183.3ulps
256 threads: 8.0 GFlops 3.2 GB/s 1183.3ulps
Regards, Patrick.
_heinz:
starting PowerSpectrum2
.
-device 0
Device: GeForce GTX 470, 810 MHz clock, 1248 MB memory.
Compiled with CUDA 3020.
Stock GetPowerSpectrum():
64 threads: 20.6 GFlops 8.2 GB/s 0.0ulps
GetPowerSpectrum() mod 1:
32 threads: 12.5 GFlops 5.0 GB/s 0.0ulps
64 threads: 20.5 GFlops 8.2 GB/s 0.0ulps
128 threads: 27.6 GFlops 11.0 GB/s 0.0ulps
256 threads: 29.9 GFlops 12.0 GB/s 0.0ulps
GetPowerSpectrum() mod 2:
32 threads: 14.4 GFlops 5.8 GB/s 0.0ulps
64 threads: 28.3 GFlops 11.3 GB/s 0.0ulps
128 threads: 42.4 GFlops 16.9 GB/s 0.0ulps
256 threads: 42.5 GFlops 17.0 GB/s 0.0ulps
-device 1
Device: GeForce GTX 470, 810 MHz clock, 1249 MB memory.
Compiled with CUDA 3020.
Stock GetPowerSpectrum():
64 threads: 20.6 GFlops 8.3 GB/s 0.0ulps
GetPowerSpectrum() mod 1:
32 threads: 12.6 GFlops 5.0 GB/s 0.0ulps
64 threads: 20.5 GFlops 8.2 GB/s 0.0ulps
128 threads: 27.5 GFlops 11.0 GB/s 0.0ulps
256 threads: 30.1 GFlops 12.0 GB/s 0.0ulps
GetPowerSpectrum() mod 2:
32 threads: 14.4 GFlops 5.8 GB/s 0.0ulps
64 threads: 28.4 GFlops 11.4 GB/s 0.0ulps
128 threads: 42.2 GFlops 16.9 GB/s 0.0ulps
256 threads: 41.1 GFlops 16.4 GB/s 0.0ulps
.
Done
modify:
@Jason, woundering about you get 20 GFlops more with 256 threads than mine GTX470
have you source for me to compile with 2011XE Compiler ?
arkayn:
I tried running it on my 460 but the program always crashes on the end of 128/beginning of 256 threads in mod 2.
Never see any results.
Claggy:
Here's my 9800GTX+ result, like Richard's 9800GTX+ it's a factory overclocked example, but by XFX:
Device: GeForce 9800 GTX/9800 GTX+, 1900 MHz clock, 496 MB memory.
Compiled with CUDA 3020.
Stock GetPowerSpectrum():
64 threads: 16.1 GFlops 6.5 GB/s 1183.3ulps
GetPowerSpectrum() mod 1:
32 threads: 15.1 GFlops 6.1 GB/s 1183.3ulps
64 threads: 16.1 GFlops 6.5 GB/s 1183.3ulps
128 threads: 16.0 GFlops 6.4 GB/s 1183.3ulps
256 threads: 15.9 GFlops 6.3 GB/s 1183.3ulps
GetPowerSpectrum() mod 2:
32 threads: 6.2 GFlops 2.5 GB/s 1183.3ulps
64 threads: 8.2 GFlops 3.3 GB/s 1183.3ulps
128 threads: 8.3 GFlops 3.3 GB/s 1183.3ulps
256 threads: 8.1 GFlops 3.2 GB/s 1183.3ulps
Claggy
Claggy:
Here's my 128Mb 8400M GS's result, while it's not got enough RAM for Seti, it at least gives you some figures for very slow GPU's:
Device: GeForce 8400M GS, 800 MHz clock, 114 MB memory.
Compiled with CUDA 3020.
Stock GetPowerSpectrum():
64 threads: 1.2 GFlops 0.5 GB/s 1183.3ulps
GetPowerSpectrum() mod 1:
32 threads: 1.2 GFlops 0.5 GB/s 1183.3ulps
64 threads: 1.2 GFlops 0.5 GB/s 1183.3ulps
128 threads: 1.2 GFlops 0.5 GB/s 1183.3ulps
256 threads: 1.2 GFlops 0.5 GB/s 1183.3ulps
GetPowerSpectrum() mod 2:
32 threads: 0.7 GFlops 0.3 GB/s 1183.3ulps
64 threads: 0.7 GFlops 0.3 GB/s 1183.3ulps
128 threads: 0.7 GFlops 0.3 GB/s 1183.3ulps
256 threads: 0.6 GFlops 0.2 GB/s 1183.3ulps
Claggy
Navigation
[0] Message Index
[#] Next page
[*] Previous page
Go to full version