+- +-
Say hello if visiting :) by Gecko
11 Jan 2023, 07:43:05 pm

Seti is down again by Mike
09 Aug 2017, 10:02:44 am

Some considerations regarding OpenCL MultiBeam app tuning from algorithm view by Raistmer
11 Dec 2016, 06:30:56 am

Loading APU to the limit: performance considerations by Mike
05 Nov 2016, 06:49:26 am

Better sleep on Windows - new round by Raistmer
26 Aug 2016, 02:02:31 pm

Author Topic: [Split] PowerSpectrum Unit Test  (Read 137809 times)

Offline PatrickV2

  • Knight o' The Round Table
  • ***
  • Posts: 139
Re: [Split] PowerSpectrum Unit Test
« Reply #300 on: 27 Dec 2010, 08:04:57 am »
Cheers & No worries Patrick,
     Just wasn't sure extending the test was going to be needed.  Naked eye judgement is plenty for the purposes of testing scientific repeatability here, and running multiple times in the same exe would make it one large test rather than several small ones for comparison (if that makes any sense).  I'm happy that the 8800 seems to have some headroom left, and the 'Min' numbers indicate the sloest kernels have received a niice boost. 

Win7(WDDM) & XP(XPDM) driver model performance difference is 'gone'  ;D

Thanks for the extended explanation; my remark was merely given in by curiosity (and probably a large lack in understanding the underlying higher goals), but I feel more enlightened now. ;)

Regards, Patrick.

Offline Miep

  • Global Moderator
  • Knight who says 'Ni!'
  • *****
  • Posts: 964
Re: [Split] PowerSpectrum Unit Test
« Reply #301 on: 27 Dec 2010, 08:25:28 am »
Device: Quadro FX 570M, 950 MHz clock, 242 MB memory.
Compute capability 1.1
Compiled with CUDA 3020.
                PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(    9.58) Peak(   13.91) Min(    2.48) [OK]
   Memory thoughput GB/s   Avg(    5.70) Peak(    9.09) Min(    3.53)


Opt1 (worst case): 64 thrds/block, 2 x 524288 element streams
  revert to single stream from size 128
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(   11.23, 1.17x) Peak(   15.13, 1.09x) Min(    4.27, 1.72x) [OK]
   Memory thoughput [GB/s]   -
      Avg(    6.99, 1.23x) Peak(    9.88, 1.09x) Min(    5.01, 1.42x)


values roughly +- .3 on stock and +- .1 on opt1

[edit]compute speedup 1.56x - 1.76x memory speedup 1.22x -1.47x
« Last Edit: 27 Dec 2010, 08:44:54 am by Miep »
The road to hell is paved with good intentions

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: [Split] PowerSpectrum Unit Test
« Reply #302 on: 27 Dec 2010, 08:27:47 am »
Thanks for the extended explanation; my remark was merely given in by curiosity (and probably a large lack in understanding the underlying higher goals), but I feel more enlightened now. ;)

Yeah, a bit more info along those lines, the actual kernels under test run in timing loops set to roughly half a second, which is enough for ~thousands to millions of runs, so I was expecting 'fair' stability in the Avg, Peak & Min values, so we are alright for discrete kernel performance measurements. 

I have however picked up an interesting thing on a friends i7-860 w/GTX480 in comparing against mine ( 45nM core2 w/GTX480)
-  His Peaks & Averages are ~same as mine for the same clockrate ... BUT ... the 'Min (slowest kernels) are several times faster ... Better CPU & RAM does have significant impact on the running of the toughest parts of code, it seems

Jason

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: [Split] PowerSpectrum Unit Test
« Reply #303 on: 27 Dec 2010, 08:29:00 am »
values roughly +- .3 on stock and +- .1 on opt1

Hey that's decent! ... and there you were going to start a riot when initial mods yielded about 5% slowdown on yours ... tsk tsk tsk  ;D

Offline Miep

  • Global Moderator
  • Knight who says 'Ni!'
  • *****
  • Posts: 964
Re: [Split] PowerSpectrum Unit Test
« Reply #304 on: 27 Dec 2010, 08:51:17 am »
Hey that's decent! ... and there you were going to start a riot when initial mods yielded about 5% slowdown on yours ... tsk tsk tsk  ;D

Oh I just learned how to complain when not suffering ;D did the trick didn't it? ;)
The road to hell is paved with good intentions

Offline Claggy

  • Alpha Tester
  • Knight who says 'Ni!'
  • ***
  • Posts: 3111
    • My computers at Seti Beta
Re: [Split] PowerSpectrum Unit Test
« Reply #305 on: 28 Dec 2010, 08:52:57 am »
My 9800GTX+ on Win 7 x64:

Device: GeForce 9800 GTX/9800 GTX+, 1900 MHz clock, 496 MB memory.
Compute capability 1.1
Compiled with CUDA 3020.
                PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(   49.81) Peak(   71.73) Min(    8.11) [OK]
   Memory thoughput GB/s   Avg(   29.08) Peak(   44.80) Min(   14.31)


Opt1 (worst case): 64 thrds/block, 2 x 524288 element streams
  revert to single stream from size 128
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(   57.66, 1.16x) Peak(   80.19, 1.12x) Min(   18.07, 2.23x) [OK]
   Memory thoughput [GB/s]   -
      Avg(   35.80, 1.23x) Peak(   50.46, 1.13x) Min(   24.47, 1.71x)


Claggy

Offline glennaxl

  • Knight o' The Realm
  • **
  • Posts: 86
Re: [Split] PowerSpectrum Unit Test
« Reply #306 on: 04 Jan 2011, 01:53:38 am »
Device: GeForce GTX 260, 1441 MHz clock, 869 MB memory.
Compute capability 1.3
Compiled with CUDA 3020.
                PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg(   47.55) Peak(   65.20) Min(   10.16) [OK]
   Memory thoughput GB/s   Avg(   28.09) Peak(   37.12) Min(   17.92)


Opt1 (worst case): 128 thrds/block, 2 x 524288 element streams
  revert to single stream from size 256
  Processing... Done!
  Compute thoughput [GFlops] -
      Avg(   84.83, 1.78x) Peak(  111.50, 1.71x) Min(   31.57, 3.11x) [OK]
   Memory thoughput [GB/s]   -
      Avg(   52.63, 1.87x) Peak(   67.26, 1.81x) Min(   36.24, 2.02x)

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: [Split] PowerSpectrum Unit Test
« Reply #307 on: 04 Jan 2011, 03:14:52 pm »
Thanks both!

@glenaxl: that's some impressive speedup on GTX 260, I'll have to look at that here carefully on mine when I get a chance to do so.

@Claggy, average at 3/4 of peak seems pretty good, but I think we can get some more maybe.

@ALL, Thanks! I'm closing this test for now.  It's been an extremely valuable contribution from you all that has had a huge impact on the pace & quality of our progress (mine in particular). 

FYI: Some urgent issues may have come to light from Raistmer's OpenCL development when combined with the refinements here.  Those will need some fairly close attention for a short while, to get some information back to Berkeley, but stay tuned as there are more tests to come  :)

[Locking thread, Please stay tuned for further Unit Tests!]
Jason
« Last Edit: 04 Jan 2011, 03:43:57 pm by Jason G »

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: [Split] PowerSpectrum Unit Test
« Reply #308 on: 10 Jan 2011, 06:28:14 am »
@All:
   Just a note that the concerns that arose, and distracted me from testing & development along this line, have now been at least partially resolved, and don't require any immediate action on our part.   I'm back to ruggedising &  integrating what we've accomplished here into the X-builds, and plan to start devising tests for PoT (Power over Time) processing refinement soon, in similar fashion to this thread.  PoT processing covers Gaussian searches, Triplet & Pulse finding, for which all Cuda releases have known issues to address, so there'll be plenty of tests to devise & collect data for yet.

Cheers once again!  :)
Jason
« Last Edit: 10 Jan 2011, 06:40:55 am by Jason G »

 

Welcome, Guest.
Please login or register.
 
 
 
Forgot your password?
Members
Total Members: 97
Latest: ToeBee
New This Month: 0
New This Week: 0
New Today: 0
Stats
Total Posts: 59559
Total Topics: 1672
Most Online Today: 48
Most Online Ever: 983
(20 Jan 2020, 03:17:55 pm)
Users Online
Members: 0
Guests: 29
Total: 29
Powered by EzPortal