Forum > GPU crunching

[Split] PowerSpectrum Unit Test

<< < (58/62) > >>

Jason G:
Cheers,
  BTW: average roughly represents overall improvement, Peak represents speed change in the fastest Kernels, and Min is the speed change in the slowest Kernels ... So I regard 'Avg' & 'Min' as most important, with Peak being mostly just a possible indicator of remaining headroom.

[Edit:] Similarish looking deal with the 480

--- Code: ---Device: GeForce GTX 480, 810 MHz clock, 1503 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
                PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(  104.34) Peak(  157.97) Min(   12.79) [OK]
   Memory thoughput GB/s   Avg(   57.34) Peak(   82.25) Min(   22.55)


Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
  revert to single stream from size 512
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(  162.15, 1.55x) Peak(  232.02, 1.47x) Min(   26.47, 2.07x) [OK]
   Memory thoughput [GB/s]   -
      Avg(   95.38, 1.66x) Peak(  127.32, 1.55x) Min(   46.67, 2.07x)
--- End code ---

_heinz:
Hi Jason,
new results from Test10
~~~~~~~~~~~~~~~
PowerSpectrumTest10.exe -device 0

Device: GeForce GTX 470, 810 MHz clock, 1248 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
                PowerSpectrum+summax Unit test #10 (FFT pipeline throughput
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(   82.93) Peak(  130.76) Min(   12.00) [OK]
   Memory thoughput GB/s   Avg(   46.20) Peak(   64.10) Min(   21.16)


Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
  revert to single stream from size 512
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(  125.13, 1.51x) Peak(  178.98, 1.37x) Min(   37.50, 3.12x) [OK]
   Memory thoughput [GB/s]   -
      Avg(   75.48, 1.63x) Peak(   95.64, 1.49x) Min(   52.23, 2.47x)


PowerSpectrumTest10.exe -device 1

Device: GeForce GTX 470, 810 MHz clock, 1249 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
                PowerSpectrum+summax Unit test #10 (FFT pipeline throughput
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(   80.74) Peak(  126.77) Min(   11.69) [OK]
   Memory thoughput GB/s   Avg(   44.99) Peak(   59.75) Min(   20.61)


Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
  revert to single stream from size 512
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(  125.57, 1.56x) Peak(  179.89, 1.42x) Min(   37.72, 3.23x) [OK]
   Memory thoughput [GB/s]   -
      Avg(   75.75, 1.68x) Peak(   95.76, 1.60x) Min(   52.48, 2.55x)


.
Done
PowerSpectrumTest10.exe -device 0

Device: ION, 1161 MHz clock, 242 MB memory.
Compute capability 1.1
Compiled with CUDA 3020.
                PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(    4.38) Peak(    6.24) Min(    1.31) [OK]
   Memory thoughput GB/s   Avg(    2.66) Peak(    3.97) Min(    1.80)


Opt1 (worst case): 64 thrds/block, 2 x 524288 element streams
  revert to single stream from size 128
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(    4.86, 1.11x) Peak(    6.64, 1.06x) Min(    1.86, 1.41x) [OK]
   Memory thoughput [GB/s]   -
      Avg(    3.08, 1.16x) Peak(    4.29, 1.08x) Min(    2.10, 1.17x)


.
Done

Jason G:
Works on ION, YaY!  :)

Claggy:
On my 128Mb 8400M GS:

Device: GeForce 8400M GS, 800 MHz clock, 114 MB memory.
Compute capability 1.1
Compiled with CUDA 3020.
                PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(    4.07) Peak(    5.64) Min(    1.19) [OK]
   Memory thoughput GB/s   Avg(    2.44) Peak(    3.69) Min(    1.51)


Opt1 (worst case): 64 thrds/block, 2 x 524288 element streams
  revert to single stream from size 128
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(    4.30, 1.06x) Peak(    5.78, 1.03x) Min(    1.68, 1.41x) [OK]
   Memory thoughput [GB/s]   -
      Avg(    2.70, 1.11x) Peak(    3.78, 1.03x) Min(    1.90, 1.26x)

Claggy

Jason G:

--- Quote from: Claggy on 26 Dec 2010, 01:20:11 pm ---On my 128Mb 8400M GS:
--- End quote ---

Work's on that too  :D,  looks like we've managed to max that one out  ;)

Navigation

[0] Message Index

[#] Next page

[*] Previous page

Go to full version