[Split] PowerSpectrum Unit Test
Jason G:
Cheers,
BTW: Avg roughly represents the overall improvement, Peak is the speed change in the fastest kernels, and Min is the speed change in the slowest kernels. So I regard Avg and Min as the most important, with Peak being mostly just a possible indicator of remaining headroom.
[Edit:] Similarish looking deal with the 480
--- Code: ---
Device: GeForce GTX 480, 810 MHz clock, 1503 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
Processing... Done!
Compute Thoughput GFlops Avg( 104.34) Peak( 157.97) Min( 12.79) [OK]
Memory thoughput GB/s Avg( 57.34) Peak( 82.25) Min( 22.55)
Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
revert to single stream from size 512
Processing... Done!
Compute thoughput [GFlops] -
Avg( 162.15, 1.55x) Peak( 232.02, 1.47x) Min( 26.47, 2.07x) [OK]
Memory thoughput [GB/s] -
Avg( 95.38, 1.66x) Peak( 127.32, 1.55x) Min( 46.67, 2.07x)
--- End code ---
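For anyone checking the numbers: the multiplier printed next to each Opt1 figure appears to be simply the optimised value divided by the matching stock value. A minimal Python sketch, using the GTX 480 compute figures copied from the output above; the helper name `speedup` is made up for illustration and is not part of the test program:

```python
# Figures copied from the GTX 480 run above: (stock, opt1) in GFlops.
gtx480_compute = {
    "Avg": (104.34, 162.15),
    "Peak": (157.97, 232.02),
    "Min": (12.79, 26.47),
}


def speedup(stock, optimised):
    """Return the optimised/stock ratio (hypothetical helper)."""
    return optimised / stock


# Round to two decimals, the same precision the test prints.
ratios = {name: round(speedup(s, o), 2) for name, (s, o) in gtx480_compute.items()}

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x")
```

This gives 1.55x, 1.47x and 2.07x, matching the log. On the slower cards the last digit can differ by one, presumably because the printed inputs are themselves already rounded.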
_heinz:
Hi Jason,
new results from Test10
~~~~~~~~~~~~~~~
PowerSpectrumTest10.exe -device 0
Device: GeForce GTX 470, 810 MHz clock, 1248 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
Processing... Done!
Compute Thoughput GFlops Avg( 82.93) Peak( 130.76) Min( 12.00) [OK]
Memory thoughput GB/s Avg( 46.20) Peak( 64.10) Min( 21.16)
Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
revert to single stream from size 512
Processing... Done!
Compute thoughput [GFlops] -
Avg( 125.13, 1.51x) Peak( 178.98, 1.37x) Min( 37.50, 3.12x) [OK]
Memory thoughput [GB/s] -
Avg( 75.48, 1.63x) Peak( 95.64, 1.49x) Min( 52.23, 2.47x)
PowerSpectrumTest10.exe -device 1
Device: GeForce GTX 470, 810 MHz clock, 1249 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
Processing... Done!
Compute Thoughput GFlops Avg( 80.74) Peak( 126.77) Min( 11.69) [OK]
Memory thoughput GB/s Avg( 44.99) Peak( 59.75) Min( 20.61)
Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
revert to single stream from size 512
Processing... Done!
Compute thoughput [GFlops] -
Avg( 125.57, 1.56x) Peak( 179.89, 1.42x) Min( 37.72, 3.23x) [OK]
Memory thoughput [GB/s] -
Avg( 75.75, 1.68x) Peak( 95.76, 1.60x) Min( 52.48, 2.55x)
.
Done
PowerSpectrumTest10.exe -device 0
Device: ION, 1161 MHz clock, 242 MB memory.
Compute capability 1.1
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
Processing... Done!
Compute Thoughput GFlops Avg( 4.38) Peak( 6.24) Min( 1.31) [OK]
Memory thoughput GB/s Avg( 2.66) Peak( 3.97) Min( 1.80)
Opt1 (worst case): 64 thrds/block, 2 x 524288 element streams
revert to single stream from size 128
Processing... Done!
Compute thoughput [GFlops] -
Avg( 4.86, 1.11x) Peak( 6.64, 1.06x) Min( 1.86, 1.41x) [OK]
Memory thoughput [GB/s] -
Avg( 3.08, 1.16x) Peak( 4.29, 1.08x) Min( 2.10, 1.17x)
.
Done
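To line up results from several cards, the Avg/Peak/Min figures can be pulled out of the pasted output with a regular expression. A rough Python sketch, assuming the line layout stays exactly as shown in this thread; `parse_stats` is a hypothetical helper, not something shipped with the test:

```python
import re

# Matches e.g. "Avg( 4.86, 1.11x)" or "Avg( 4.38)"; the ratio part is optional.
STAT = re.compile(r"(Avg|Peak|Min)\(\s*([\d.]+)(?:,\s*([\d.]+)x)?\)")


def parse_stats(line):
    """Return {name: (value, ratio_or_None)} for one line of test output."""
    stats = {}
    for name, value, ratio in STAT.findall(line):
        stats[name] = (float(value), float(ratio) if ratio else None)
    return stats


# Two lines copied from the ION run above.
stock = parse_stats("Compute Thoughput GFlops Avg( 4.38) Peak( 6.24) Min( 1.31) [OK]")
opt1 = parse_stats("Avg( 4.86, 1.11x) Peak( 6.64, 1.06x) Min( 1.86, 1.41x) [OK]")

print(stock["Avg"], opt1["Min"])
```

Stock lines carry no multiplier, so the second element comes back as `None` there.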
Jason G:
Works on ION, YaY! :)
Claggy:
On my 128 MB 8400M GS:
Device: GeForce 8400M GS, 800 MHz clock, 114 MB memory.
Compute capability 1.1
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
Processing... Done!
Compute Thoughput GFlops Avg( 4.07) Peak( 5.64) Min( 1.19) [OK]
Memory thoughput GB/s Avg( 2.44) Peak( 3.69) Min( 1.51)
Opt1 (worst case): 64 thrds/block, 2 x 524288 element streams
revert to single stream from size 128
Processing... Done!
Compute thoughput [GFlops] -
Avg( 4.30, 1.06x) Peak( 5.78, 1.03x) Min( 1.68, 1.41x) [OK]
Memory thoughput [GB/s] -
Avg( 2.70, 1.11x) Peak( 3.78, 1.03x) Min( 1.90, 1.26x)
Claggy
Jason G:
--- Quote from: Claggy on 26 Dec 2010, 01:20:11 pm ---
On my 128 MB 8400M GS:
--- End quote ---
Works on that too :D, looks like we've managed to max that one out ;)