Author Topic: [Split] PowerSpectrum Unit Test  (Read 138511 times)

Offline PatrickV2

  • Knight o' The Round Table
  • ***
  • Posts: 139
Re: [Split] PowerSpectrum Unit Test
« Reply #60 on: 20 Nov 2010, 11:32:41 am »
EDIT: I still have WinXP32 installed on another HD of this machine; are you interested in a run of your tool under that OS?

Yes please. The difference picked up earlier (thanks, Frizz) between XP32 & XP64 was interesting (with stock, around 10% advantage to XP32, reduced to ~5% with Mod3). I've little doubt XP32 has a similar advantage over Win7x64, due to the simpler driver model, but it'd be nice to confirm whether the mods close that gap a bit too.

Sure, no problem. The results:

WinXP32-SP3/8GB/8800GTX/260.99 drivers:

Device: GeForce 8800 GTX, 1350 MHz clock, 768 MB memory.
Compute capability 1.0
Compiled with CUDA 3020.
                PowerSpectrum Unit Test #4
Stock GetPowerSpectrum():
     64 threads:       18.3 GFlops    7.3 GB/s 1183.3ulps


GetPowerSpectrum() mod 1: (made Fermi & Pre-Fermi match in accuracy.)
     32 threads:       14.1 GFlops    5.6 GB/s 121.7ulps
     64 threads:       18.2 GFlops    7.3 GB/s 121.7ulps
    128 threads:       18.2 GFlops    7.3 GB/s 121.7ulps
    256 threads:       17.9 GFlops    7.2 GB/s 121.7ulps


GetPowerSpectrum() mod 2 (fixed, but slow):
     32 threads:        2.9 GFlops    1.2 GB/s 1183.3ulps
     64 threads:        2.9 GFlops    1.2 GB/s 1183.3ulps
    128 threads:        2.9 GFlops    1.2 GB/s 1183.3ulps
    256 threads:        2.9 GFlops    1.2 GB/s 1183.3ulps


GetPowerSpectrum() mod 3: (As with mod1, +threads & split loads)
     32 threads:       14.7 GFlops    5.9 GB/s 121.7ulps
     64 threads:       18.3 GFlops    7.3 GB/s 121.7ulps
    128 threads:       18.2 GFlops    7.3 GB/s 121.7ulps
    256 threads:       18.0 GFlops    7.2 GB/s 121.7ulps
    512 threads:       16.4 GFlops    6.6 GB/s 121.7ulps
   1024 threads: N/A

Regards, Patrick.

Offline SciManStev

  • Alpha Tester
  • Knight Templar
  • ***
  • Posts: 263
Re: [Split] PowerSpectrum Unit Test
« Reply #61 on: 20 Nov 2010, 11:38:02 am »
What I did was to copy the folder with the dll and executable into the root (C:) directory. Stop crunching. Go to the command line, and get yourself into the directory where the two files are. Run the PowerSpectrum4 file, and then hand type the results shown in the command window into notepad. From there they can be copied and posted here. There is probably an easier way to do it, but you will get results. I think you need to be using at least the 260.89 GPU driver.

Steve

Offline PatrickV2

  • Knight o' The Round Table
  • ***
  • Posts: 139
Re: [Split] PowerSpectrum Unit Test
« Reply #62 on: 20 Nov 2010, 11:45:11 am »
What I did was to copy the folder with the dll and executable into the root (C:) directory. Stop crunching. Go to the command line, and get yourself into the directory where the two files are. Run the PowerSpectrum4 file, and then hand type the results shown in the command window into notepad. From there they can be copied and posted here. There is probably an easier way to do it, but you will get results. I think you need to be using at least the 260.89 GPU driver.

Steve

Whoa, you can copy and paste from a CMD window.  :o Right click in the title bar and you will be enlightened. Saves you a LOT of typing!

Regards,

Patrick.

Offline Richard Haselgrove

  • Messenger Pigeon
  • Knight who says 'Ni!'
  • *****
  • Posts: 2819
Re: [Split] PowerSpectrum Unit Test
« Reply #63 on: 20 Nov 2010, 11:48:57 am »
What I did was to copy the folder with the dll and executable into the root (C:) directory. Stop crunching. Go to the command line, and get yourself into the directory where the two files are. Run the PowerSpectrum4 file, and then hand type the results shown in the command window into notepad. From there they can be copied and posted here. There is probably an easier way to do it, but you will get results. I think you need to be using at least the 260.89 GPU driver.

Steve

Even easier: use a redirect.

PowerSpectrum4 > results.txt

Always avoid rekeying as much as you possibly can. Apart from the time wasted, it's a prolific source of errors.
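
Once the output sits in results.txt, it can also be read by a script instead of by eye. The following is a minimal sketch, assuming the file has the same layout as the results posted in this thread; the function name `parse_results` and the returned structure are illustrative choices, not part of the test tool.

```python
import re

# Section headers as they appear in the posted output, e.g.
# "Stock GetPowerSpectrum():" or "GetPowerSpectrum() mod 3: (...)".
HEADER = re.compile(r"^(?:Stock GetPowerSpectrum\(\)|GetPowerSpectrum\(\) (mod \d+))")

# Result rows, e.g. "64 threads:  18.3 GFlops  7.3 GB/s 1183.3ulps".
ROW = re.compile(r"^(\d+) threads:\s+([\d.]+) GFlops\s+([\d.]+) GB/s\s+([\d.]+)ulps")


def parse_results(text):
    """Return {variant: [(threads, gflops, gb_per_s, ulps), ...]}.

    Rows reported as "N/A" carry no numbers and are skipped.
    """
    results = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        header = HEADER.match(line)
        if header:
            # The stock kernel has no "mod N" label, so name it "stock".
            current = header.group(1) or "stock"
            results[current] = []
            continue
        row = ROW.match(line)
        if row and current is not None:
            threads, gflops, gbps, ulps = row.groups()
            results[current].append(
                (int(threads), float(gflops), float(gbps), float(ulps))
            )
    return results
```

Calling `parse_results(open("results.txt").read())` would then give one list of rows per kernel variant, which makes it easy to compare runs without retyping anything.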

Offline SciManStev

  • Alpha Tester
  • Knight Templar
  • ***
  • Posts: 263
Re: [Split] PowerSpectrum Unit Test
« Reply #64 on: 20 Nov 2010, 11:51:36 am »
What I did was to copy the folder with the dll and executable into the root (C:) directory. Stop crunching. Go to the command line, and get yourself into the directory where the two files are. Run the PowerSpectrum4 file, and then hand type the results shown in the command window into notepad. From there they can be copied and posted here. There is probably an easier way to do it, but you will get results. I think you need to be using at least the 260.89 GPU driver.

Steve

Whoa, you can copy and paste from a CMD window.  :o Right click in the title bar and you will be enlightened. Saves you a LOT of typing!

Regards,

Patrick.

Thank you! I was stumbling myself trying to figure out how to do it. I actually had to toss the cat out because he kept climbing on me while I was trying to type. My DOS is not as good as it once was. It was even trial and error just to get to the right directory.

Steve

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: [Split] PowerSpectrum Unit Test
« Reply #65 on: 20 Nov 2010, 12:02:39 pm »
BTW: Steve, I think your card has downclocked or something.

Here's mine, 480 @ 820MHz (Win7x64):
Quote
...GetPowerSpectrum() mod 3: (As with mod1, +threads & split loads)
     32 threads:       17.7 GFlops    7.1 GB/s 121.7ulps
     64 threads:       29.1 GFlops   11.6 GB/s 121.7ulps
    128 threads:       40.3 GFlops   16.1 GB/s 121.7ulps
    256 threads:       44.2 GFlops   17.7 GB/s 121.7ulps
    512 threads:       43.4 GFlops   17.4 GB/s 121.7ulps
   1024 threads:       36.8 GFlops   14.7 GB/s 121.7ulps...


Offline SciManStev

  • Alpha Tester
  • Knight Templar
  • ***
  • Posts: 263
Re: [Split] PowerSpectrum Unit Test
« Reply #66 on: 20 Nov 2010, 12:08:24 pm »
BTW: Steve, I think your card has downclocked or something.

Here's mine, 480 @ 820MHz (Win7x64):
Quote
...GetPowerSpectrum() mod 3: (As with mod1, +threads & split loads)
     32 threads:       17.7 GFlops    7.1 GB/s 121.7ulps
     64 threads:       29.1 GFlops   11.6 GB/s 121.7ulps
    128 threads:       40.3 GFlops   16.1 GB/s 121.7ulps
    256 threads:       44.2 GFlops   17.7 GB/s 121.7ulps
    512 threads:       43.4 GFlops   17.4 GB/s 121.7ulps
   1024 threads:       36.8 GFlops   14.7 GB/s 121.7ulps...



That's interesting. Normally I am running at 860 MHz, with the voltage at 1.05 VDC.

Steve

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: [Split] PowerSpectrum Unit Test
« Reply #67 on: 20 Nov 2010, 12:14:23 pm »
Me and my little 9500GT reporting for duty sir but it's time for a little hand holding. I downloaded the package from the first post. I got a DLL and the executable. Where do I put the DLL before I open the EXE?

9500GT would be a great double check of the theories so far (Mod3 64 thread should be the right choice & extremely close to stock for that one)

Just- 
   - chuck the exe & dll into a new folder somewhere easy to get to, such as C:\TEST
   - Open a command window (Start->Run->CMD.EXE),
   - change directory to that location ( cd \TEST )
   - run the test  ( powerspectrum4.exe > results.txt )
   - wait for it to finish & look at results.txt

Jason

Offline SciManStev

  • Alpha Tester
  • Knight Templar
  • ***
  • Posts: 263
Re: [Split] PowerSpectrum Unit Test
« Reply #68 on: 20 Nov 2010, 12:18:25 pm »


At least they crunch fast.

Steve

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: [Split] PowerSpectrum Unit Test
« Reply #69 on: 20 Nov 2010, 12:19:50 pm »
Ah Huh!  Memory clock

820/1640/2088 1.138V, that's about as hard as I can reasonably push it without going to water.

This particular test kernel is memory bound, so that'll be the difference.

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: [Split] PowerSpectrum Unit Test
« Reply #70 on: 20 Nov 2010, 12:33:33 pm »
Yes please. The difference picked up earlier (thanks, Frizz) between XP32 & XP64 was interesting (with stock, around 10% advantage to XP32, reduced to ~5% with Mod3). I've little doubt XP32 has a similar advantage over Win7x64, due to the simpler driver model, but it'd be nice to confirm whether the mods close that gap a bit too.

Sure, no problem. The results:
...
     64 threads:       18.3 GFlops    7.3 GB/s 121.7ulps
Thanks! Not enough in it (~2-3%) for me to consider switching back to XP32  :).
« Last Edit: 20 Nov 2010, 12:37:11 pm by Jason G »

Offline perryjay

  • Knight Templar
  • ****
  • Posts: 427
Re: [Split] PowerSpectrum Unit Test
« Reply #71 on: 20 Nov 2010, 12:44:37 pm »


Microsoft Windows [Version 6.0.6002]
Copyright (c) 2006 Microsoft Corporation.  All rights reserved.

C:\Users\perry>cd\test

C:\test>powerspectrum4.exe

Device: GeForce 9500 GT, 1840 MHz clock, 1008 MB memory.
Compute capability 1.1
Compiled with CUDA 3020.
                PowerSpectrum Unit Test #4
Stock GetPowerSpectrum():
     64 threads:        2.8 GFlops    1.1 GB/s 1183.3ulps


GetPowerSpectrum() mod 1: (made Fermi & Pre-Fermi match in accuracy.)
     32 threads:        2.7 GFlops    1.1 GB/s 121.7ulps
     64 threads:        2.9 GFlops    1.1 GB/s 121.7ulps
    128 threads:        2.9 GFlops    1.1 GB/s 121.7ulps
    256 threads:        2.9 GFlops    1.2 GB/s 121.7ulps


GetPowerSpectrum() mod 2 (fixed, but slow):
     32 threads:        0.5 GFlops    0.2 GB/s 1183.3ulps
     64 threads:        0.5 GFlops    0.2 GB/s 1183.3ulps
    128 threads:        0.5 GFlops    0.2 GB/s 1183.3ulps
    256 threads:        0.5 GFlops    0.2 GB/s 1183.3ulps


GetPowerSpectrum() mod 3: (As with mod1, +threads & split loads)
     32 threads:        2.8 GFlops    1.1 GB/s 121.7ulps
     64 threads:        2.9 GFlops    1.1 GB/s 121.7ulps
    128 threads:        2.9 GFlops    1.2 GB/s 121.7ulps
    256 threads:        2.9 GFlops    1.2 GB/s 121.7ulps
    512 threads:        2.9 GFlops    1.1 GB/s 121.7ulps
   1024 threads: N/A



C:\test>

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: [Split] PowerSpectrum Unit Test
« Reply #72 on: 20 Nov 2010, 12:57:05 pm »
Woohoo, I like the ones that say 1.2 GB/s. Might have to shift compute cap 1.1 cards into the Mod 3, 128 thread category (or add more digits next time, to find out where within 0-9% that difference is; 9% would be good).
« Last Edit: 20 Nov 2010, 01:06:57 pm by Jason G »
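
To illustrate why more digits would help, here is a small sketch of how much two one-decimal readings can hide. It assumes the tool rounds each figure to the nearest displayed decimal, which the thread does not confirm; the function name `ratio_bounds` is made up for this example.

```python
def ratio_bounds(low_reading, high_reading, step=0.1):
    """Bound the true ratio high/low behind two rounded readings.

    Assumes round-to-nearest, so each true value lies within half a
    display step of what was printed (an assumption, not a documented
    property of the test tool).
    """
    half = step / 2
    smallest = (high_reading - half) / (low_reading + half)
    largest = (high_reading + half) / (low_reading - half)
    return smallest, largest


# The two bandwidth readings discussed above: 1.1 GB/s and 1.2 GB/s.
lo, hi = ratio_bounds(1.1, 1.2)
print(f"true ratio lies between {lo:.3f} and {hi:.3f}")
```

With a second decimal in the output (step=0.01) the same function gives a much tighter interval, which is the point of asking for more digits.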

Offline arkayn

  • Janitor o' the Board
  • Knight who says 'Ni!'
  • *****
  • Posts: 1230
  • Aaaarrrrgggghhhh
    • My Little Place On The Internet
Re: [Split] PowerSpectrum Unit Test
« Reply #73 on: 20 Nov 2010, 06:49:29 pm »
I guess I was doing it wrong before as well; I was just running it straight.

Device: GeForce GTX 460, 1600 MHz clock, 768 MB memory.
Compute capability 2.1
Compiled with CUDA 3020.
      PowerSpectrum Unit Test #4
Stock GetPowerSpectrum():
     64 threads:       12.8 GFlops    5.1 GB/s   0.0ulps


GetPowerSpectrum() mod 1: (made Fermi & Pre-Fermi match in accuracy.)
     32 threads:        7.7 GFlops    3.1 GB/s 121.7ulps
     64 threads:       12.8 GFlops    5.1 GB/s 121.7ulps
    128 threads:       17.6 GFlops    7.0 GB/s 121.7ulps
    256 threads:       19.3 GFlops    7.7 GB/s 121.7ulps


GetPowerSpectrum() mod 2 (fixed, but slow):
     32 threads:        8.7 GFlops    3.5 GB/s   0.0ulps
     64 threads:       11.2 GFlops    4.5 GB/s   0.0ulps
    128 threads:       13.2 GFlops    5.3 GB/s   0.0ulps
    256 threads:       12.8 GFlops    5.1 GB/s   0.0ulps


GetPowerSpectrum() mod 3: (As with mod1, +threads & split loads)
     32 threads:        7.8 GFlops    3.1 GB/s 121.7ulps
     64 threads:       12.9 GFlops    5.1 GB/s 121.7ulps
    128 threads:       17.6 GFlops    7.0 GB/s 121.7ulps
    256 threads:       19.3 GFlops    7.7 GB/s 121.7ulps
    512 threads:       19.1 GFlops    7.6 GB/s 121.7ulps
   1024 threads:       15.2 GFlops    6.1 GB/s 121.7ulps

Offline perryjay

  • Knight Templar
  • ****
  • Posts: 427
Re: [Split] PowerSpectrum Unit Test
« Reply #74 on: 20 Nov 2010, 06:55:14 pm »
Just to add a little bit... I'm running Vista 32 on an E5400 dual 2.7GHz. My 9500GT has driver 260.99 and is slightly overclocked at core 723 / shader 1840 and memory at 400, to give me 118 GFLOPS peak.
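
For anyone wondering where a peak figure like that comes from, here is a rough sketch of the usual arithmetic. Two inputs are assumptions not stated in the post: that the 9500 GT has 32 shader cores, and that each core is counted as 2 floating-point operations per clock (one multiply-add). The function name `peak_gflops` is just for illustration.

```python
def peak_gflops(shader_cores, shader_clock_mhz, flops_per_clock=2):
    """Theoretical peak = cores x shader clock x flops per core per clock.

    flops_per_clock=2 counts one multiply-add per clock; this is an
    assumed convention, not something stated in the thread.
    """
    return shader_cores * shader_clock_mhz * flops_per_clock / 1000.0


# Assumed 32 cores, with the 1840 MHz shader clock quoted in the post.
print(round(peak_gflops(32, 1840)))
```

Under those assumptions the result rounds to the 118 quoted above. The unit test reports far lower numbers (about 2.9 GFlops on this card), which fits Jason's remark earlier in the thread that this test kernel is memory bound.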

 
