Forum > GPU crunching
how do i run the stock GPU app and Lunatic CPU apps
			Jason G:
			
			
--- Quote from: perryjay on 28 Jan 2011, 11:05:48 am ---Here it is...
--- End quote ---
Thanks!  It spins me out a little when I see the ~3+ times 'Min' figures.  It means that the slowest kernels liked the attention I gave.  1.42x average processing rate should translate to a decent percentage improvement on whole tasks (probably around 20% or so without further refinement).  The next few weeks levering in those to the pipeline is going to be interesting.
Jason
		
			Josef W. Segur:
			
			
--- Quote from: RottenMutt on 27 Jan 2011, 10:06:58 pm ---i tried installing using lunatic unified installer and not checking the gpu app and ended dumping my cache :P
...
--- End quote ---
Richard Haselgrove wrote a very good FAQ, Run Seti Enhanced on Fermi class GPUs (4xx) (Advanced users). Although it's specifically for the 6.10 cuda_fermi stock application, the pattern is the same for any stock application.
Because the project has resend_lost_results on, when you get app_info.xml fixed up the "dumped" tasks may be sent again.
                                                                                            Joe
		
			Claggy:
			
			Since i've also got a new GPU, a GTX 460 GLH, here's my Test #10 results (Win 7 x64, 266.58):
--- Code:  ---Device: GeForce GTX 460, 1600 MHz clock, 993 MB memory.
Compute capability 2.1
Compiled with CUDA 3020.
                PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(   69.92) Peak(  114.81) Min(    8.91) [OK]
   Memory thoughput GB/s   Avg(   37.67) Peak(   54.34) Min(   15.05)
Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
  revert to single stream from size 512
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(  102.27, 1.46x) Peak(  151.62, 1.32x) Min(   21.28, 2.39x) [OK]
   Memory thoughput [GB/s]   -
      Avg(   60.11, 1.60x) Peak(   77.00, 1.42x) Min(   37.53, 2.49x)
--- End code ---
Claggy
		
			RottenMutt:
			
			all 266.58 drivers and x64 vista or 7
8800gts:  interesting, loaded the video engine alittle...
--- Code:  ---Device: GeForce 8800 GTS 512, 1674 MHz clock, 492 MB memory.
Compute capability 1.1
Compiled with CUDA 3020.
		PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(   41.89) Peak(   58.05) Min(   11.00) [OK]
   Memory thoughput GB/s   Avg(   25.23) Peak(   36.89) Min(   18.43)
Opt1 (worst case): 64 thrds/block, 2 x 524288 element streams
  revert to single stream from size 128
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(   48.37, 1.15x) Peak(   68.83, 1.19x) Min(   17.61, 1.60x) [OK]
   Memory thoughput [GB/s]   -
      Avg(   30.32, 1.20x) Peak(   40.81, 1.11x) Min(   20.93, 1.14x)
--- End code ---
GTX475 (460 flashed to make it think it is a 475; i think i can, i think i can, I can) 
--- Code:  ---Device: GeForce GTX 470, 1250 MHz clock, 1248 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
		PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(   75.32) Peak(  119.15) Min(    8.83) [OK]
   Memory thoughput GB/s   Avg(   41.63) Peak(   60.16) Min(   15.57)
Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
  revert to single stream from size 512
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(  115.05, 1.53x) Peak(  164.29, 1.38x) Min(   38.70, 4.38x) [OK]
   Memory thoughput [GB/s]   -
      Avg(   69.71, 1.67x) Peak(   88.85, 1.48x) Min(   49.02, 3.15x)
--- End code ---
GTX295:
--- Code:  ---Device: GeForce GTX 295, 1369 MHz clock, 896 MB memory.
Compute capability 1.3
Compiled with CUDA 3020.
		PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(   81.66) Peak(  113.97) Min(   16.45) [OK]
   Memory thoughput GB/s   Avg(   47.72) Peak(   67.99) Min(   29.01)
Opt1 (worst case): 128 thrds/block, 2 x 524288 element streams
  revert to single stream from size 256
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(   78.32, 0.96x) Peak(  114.21, 1.00x) Min(   23.06, 1.40x) [OK]
   Memory thoughput [GB/s]   -
      Avg(   47.65, 1.00x) Peak(   65.35, 0.96x) Min(   21.84, 0.75x)
--- End code ---
GTX480:
--- Code:  ---Device: GeForce GTX 480, 1440 MHz clock, 1504 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
		PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(   80.18) Peak(  121.88) Min(    9.81) [OK]
   Memory thoughput GB/s   Avg(   44.31) Peak(   64.58) Min(   17.29)
Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
  revert to single stream from size 512
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(  145.60, 1.82x) Peak(  208.34, 1.71x) Min(   34.86, 3.55x) [OK]
   Memory thoughput [GB/s]   -
      Avg(   86.97, 1.96x) Peak(  113.33, 1.75x) Min(   61.46, 3.55x)
--- End code ---
another gtx480
--- Code:  ---Device: GeForce GTX 480, 1440 MHz clock, 1503 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
		PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(   84.40) Peak(  130.24) Min(    9.68) [OK]
   Memory thoughput GB/s   Avg(   46.41) Peak(   68.03) Min(   17.06)
Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams
  revert to single stream from size 512
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(  144.95, 1.72x) Peak(  208.23, 1.60x) Min(   34.33, 3.55x) [OK]
   Memory thoughput [GB/s]   -
      Avg(   86.54, 1.86x) Peak(  113.26, 1.66x) Min(   60.53, 3.55x)
--- End code ---
		
			perryjay:
			
			Okay, you guys are just showing off now!!   ;D
		
Navigation
[0] Message Index
[*] Previous page
Go to full version