Forum > GPU crunching
how do i run the stock GPU app and Lunatic CPU apps
Jason G:
--- Quote from: perryjay on 28 Jan 2011, 11:05:48 am ---Here it is...
--- End quote ---
Thanks! It spins me out a little when I see the ~3+ times 'Min' figures. It means that the slowest kernels liked the attention I gave. 1.42x average processing rate should translate to a decent percentage improvement on whole tasks (probably around 20% or so without further refinement). The next few weeks levering in those to the pipeline is going to be interesting.
Jason
Josef W. Segur:
--- Quote from: RottenMutt on 27 Jan 2011, 10:06:58 pm ---i tried installing using lunatic unified installer and not checking the gpu app and ended dumping my cache :P
...
--- End quote ---
Richard Haselgrove wrote a very good FAQ, Run Seti Enhanced on Fermi class GPUs (4xx) (Advanced users). Although it's specifically for the 6.10 cuda_fermi stock application, the pattern is the same for any stock application.
Because the project has resend_lost_results on, when you get app_info.xml fixed up the "dumped" tasks may be sent again.
Joe
Claggy:
Since i've also got a new GPU, a GTX 460 GLH, here's my Test #10 results (Win 7 x64, 266.58):
--- Code: ---Device: GeForce GTX 460, 1600 MHz clock, 993 MB memory.
Compute capability 2.1
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
Processing... Done!
Compute Thoughput GFlops Avg( 69.92) Peak( 114.81) Min( 8.91) [OK]
Memory thoughput GB/s Avg( 37.67) Peak( 54.34) Min( 15.05)
Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
revert to single stream from size 512
Processing... Done!
Compute thoughput [GFlops] -
Avg( 102.27, 1.46x) Peak( 151.62, 1.32x) Min( 21.28, 2.39x) [OK]
Memory thoughput [GB/s] -
Avg( 60.11, 1.60x) Peak( 77.00, 1.42x) Min( 37.53, 2.49x)
--- End code ---
Claggy
RottenMutt:
all 266.58 drivers and x64 vista or 7
8800gts: interesting, loaded the video engine alittle...
--- Code: ---Device: GeForce 8800 GTS 512, 1674 MHz clock, 492 MB memory.
Compute capability 1.1
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
Processing... Done!
Compute Thoughput GFlops Avg( 41.89) Peak( 58.05) Min( 11.00) [OK]
Memory thoughput GB/s Avg( 25.23) Peak( 36.89) Min( 18.43)
Opt1 (worst case): 64 thrds/block, 2 x 524288 element streams
revert to single stream from size 128
Processing... Done!
Compute thoughput [GFlops] -
Avg( 48.37, 1.15x) Peak( 68.83, 1.19x) Min( 17.61, 1.60x) [OK]
Memory thoughput [GB/s] -
Avg( 30.32, 1.20x) Peak( 40.81, 1.11x) Min( 20.93, 1.14x)
--- End code ---
GTX475 (460 flashed to make it think it is a 475; i think i can, i think i can, I can)
--- Code: ---Device: GeForce GTX 470, 1250 MHz clock, 1248 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
Processing... Done!
Compute Thoughput GFlops Avg( 75.32) Peak( 119.15) Min( 8.83) [OK]
Memory thoughput GB/s Avg( 41.63) Peak( 60.16) Min( 15.57)
Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
revert to single stream from size 512
Processing... Done!
Compute thoughput [GFlops] -
Avg( 115.05, 1.53x) Peak( 164.29, 1.38x) Min( 38.70, 4.38x) [OK]
Memory thoughput [GB/s] -
Avg( 69.71, 1.67x) Peak( 88.85, 1.48x) Min( 49.02, 3.15x)
--- End code ---
GTX295:
--- Code: ---Device: GeForce GTX 295, 1369 MHz clock, 896 MB memory.
Compute capability 1.3
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
Processing... Done!
Compute Thoughput GFlops Avg( 81.66) Peak( 113.97) Min( 16.45) [OK]
Memory thoughput GB/s Avg( 47.72) Peak( 67.99) Min( 29.01)
Opt1 (worst case): 128 thrds/block, 2 x 524288 element streams
revert to single stream from size 256
Processing... Done!
Compute thoughput [GFlops] -
Avg( 78.32, 0.96x) Peak( 114.21, 1.00x) Min( 23.06, 1.40x) [OK]
Memory thoughput [GB/s] -
Avg( 47.65, 1.00x) Peak( 65.35, 0.96x) Min( 21.84, 0.75x)
--- End code ---
GTX480:
--- Code: ---Device: GeForce GTX 480, 1440 MHz clock, 1504 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
Processing... Done!
Compute Thoughput GFlops Avg( 80.18) Peak( 121.88) Min( 9.81) [OK]
Memory thoughput GB/s Avg( 44.31) Peak( 64.58) Min( 17.29)
Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
revert to single stream from size 512
Processing... Done!
Compute thoughput [GFlops] -
Avg( 145.60, 1.82x) Peak( 208.34, 1.71x) Min( 34.86, 3.55x) [OK]
Memory thoughput [GB/s] -
Avg( 86.97, 1.96x) Peak( 113.33, 1.75x) Min( 61.46, 3.55x)
--- End code ---
another gtx480
--- Code: ---Device: GeForce GTX 480, 1440 MHz clock, 1503 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
Processing... Done!
Compute Thoughput GFlops Avg( 84.40) Peak( 130.24) Min( 9.68) [OK]
Memory thoughput GB/s Avg( 46.41) Peak( 68.03) Min( 17.06)
Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
revert to single stream from size 512
Processing... Done!
Compute thoughput [GFlops] -
Avg( 144.95, 1.72x) Peak( 208.23, 1.60x) Min( 34.33, 3.55x) [OK]
Memory thoughput [GB/s] -
Avg( 86.54, 1.86x) Peak( 113.26, 1.66x) Min( 60.53, 3.55x)
--- End code ---
perryjay:
Okay, you guys are just showing off now!! ;D
Navigation
[0] Message Index
[*] Previous page
Go to full version