Forum > GPU crunching

[Split] PowerSpectrum Unit Test

<< < (38/62) > >>

PatrickV2:

--- Quote from: Jason G on 06 Dec 2010, 08:41:10 pm ---Thanks,

    It's what you (the test #6 anyway) didn't do  :D

This line's missing:

--- Quote ---Opt1 (PSmod3+SM): 64 thrds/block
PowerSpectrumSumMax array pinned in host memory.
   64 threads, fftlen 64: (worst case: full summax copy)
         1.5 GFlops    5.9 GB/s 121.7ulps
Every ifft average & peak OK
   64 threads, fftlen 64: (best case, nothing to update)
         1.6 GFlops    6.7 GB/s 121.7ulps
--- End quote ---

When operational, that feature seems to add a touch of throughput to both XP & Vista/Win7, and seems to close the performance difference. (we've been so worried about).  You should get a boost when I fix that.

Jason

--- End quote ---

Ah, ok, thanks for the elaboration. Looking forward to test #7 then!

Regards, Patrick.

Frizz:
Windows XP32. GTX 570. Nvidia Driver 263.09.


Device: GeForce GTX 570, 1464 MHz clock, 1280 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
      PowerSpectrum+summax Unit test #6 (pinned mem)
Stock:
 PwrSpec<    64>   25.6 GFlops  102.5 GB/s   0.0ulps

 SumMax (    64)    1.9 GFlops    7.9 GB/s
Every ifft average & peak OK

 PS+SuMx(    64)    6.2 GFlops   25.1 GB/s


GetPowerSpectrum() choice for Opt1: 256 thrds/block
    256 threads:       33.3 GFlops  133.3 GB/s 121.7ulps


Opt1 (PSmod3+SM): 256 thrds/block
PowerSpectrumSumMax array pinned in host memory.
  256 threads, fftlen 64: (worst case: full summax copy)
        10.9 GFlops   44.0 GB/s 121.7ulps
Every ifft average & peak OK
  256 threads, fftlen 64: (best case, nothing to update)
        13.5 GFlops   54.7 GB/s 121.7ulps

Jason G:
570 wooot!  ;D

Frizz:

--- Quote from: Jason G on 08 Dec 2010, 08:47:31 am ---570 wooot!  ;D

--- End quote ---

Borrowed it from a friend. It's hot, almost non-overclockable, and slightly slower than 480.

I am really looking forward to AMD HD6950/6970 !

[EDIT] Seems I got a bad sample. I've seen reports where the 570 has been overclocked to 840@4250 (stock: 732@3800) with air cooling.

Jason G:

--- Quote from: Frizz on 08 Dec 2010, 08:50:00 am --- It's hot, almost non-overclockable
--- End quote ---

Why bother ?  harvesting faulty parts you think ?

[Edit:] that worst case is slightly better than my 480 worst case, but the best cases are inferioir.   From the powerspectrum I see the constraint is memory ( again  ::) ) ... So indeed these may not be be a good choice for seti in the short term ... probably do Batman really well though  ::)

Navigation

[0] Message Index

[#] Next page

[*] Previous page

Go to full version