Forum > GPU crunching
[Split] PowerSpectrum Unit Test
PatrickV2:
--- Quote from: Jason G on 06 Dec 2010, 08:41:10 pm ---Thanks,
It's what you (the test #6 anyway) didn't do :D
This line's missing:
--- Quote ---Opt1 (PSmod3+SM): 64 thrds/block
PowerSpectrumSumMax array pinned in host memory.
64 threads, fftlen 64: (worst case: full summax copy)
1.5 GFlops 5.9 GB/s 121.7ulps
Every ifft average & peak OK
64 threads, fftlen 64: (best case, nothing to update)
1.6 GFlops 6.7 GB/s 121.7ulps
--- End quote ---
When operational, that feature seems to add a touch of throughput to both XP & Vista/Win7, and seems to close the performance difference. (we've been so worried about). You should get a boost when I fix that.
Jason
--- End quote ---
Ah, ok, thanks for the elaboration. Looking forward to test #7 then!
Regards, Patrick.
Frizz:
Windows XP32. GTX 570. Nvidia Driver 263.09.
Device: GeForce GTX 570, 1464 MHz clock, 1280 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #6 (pinned mem)
Stock:
PwrSpec< 64> 25.6 GFlops 102.5 GB/s 0.0ulps
SumMax ( 64) 1.9 GFlops 7.9 GB/s
Every ifft average & peak OK
PS+SuMx( 64) 6.2 GFlops 25.1 GB/s
GetPowerSpectrum() choice for Opt1: 256 thrds/block
256 threads: 33.3 GFlops 133.3 GB/s 121.7ulps
Opt1 (PSmod3+SM): 256 thrds/block
PowerSpectrumSumMax array pinned in host memory.
256 threads, fftlen 64: (worst case: full summax copy)
10.9 GFlops 44.0 GB/s 121.7ulps
Every ifft average & peak OK
256 threads, fftlen 64: (best case, nothing to update)
13.5 GFlops 54.7 GB/s 121.7ulps
Jason G:
570 wooot! ;D
Frizz:
--- Quote from: Jason G on 08 Dec 2010, 08:47:31 am ---570 wooot! ;D
--- End quote ---
Borrowed it from a friend. It's hot, almost non-overclockable, and slightly slower than 480.
I am really looking forward to AMD HD6950/6970 !
[EDIT] Seems I got a bad sample. I've seen reports where the 570 has been overclocked to 840@4250 (stock: 732@3800) with air cooling.
Jason G:
--- Quote from: Frizz on 08 Dec 2010, 08:50:00 am --- It's hot, almost non-overclockable
--- End quote ---
Why bother ? harvesting faulty parts you think ?
[Edit:] that worst case is slightly better than my 480 worst case, but the best cases are inferioir. From the powerspectrum I see the constraint is memory ( again ::) ) ... So indeed these may not be be a good choice for seti in the short term ... probably do Batman really well though ::)
Navigation
[0] Message Index
[#] Next page
[*] Previous page
Go to full version