Device: GeForce GTX 480, 810 MHz clock, 1503 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg( 104.34) Peak( 157.97) Min( 12.79) [OK]
  Memory thoughput GB/s Avg( 57.34) Peak( 82.25) Min( 22.55)
Opt1 (worst case): 256 thrds/block, 2 x 524288 element streams revert to single stream from size 512
  Processing... Done!
  Compute thoughput [GFlops] - Avg( 162.15, 1.55x) Peak( 232.02, 1.47x) Min( 26.47, 2.07x) [OK]
  Memory thoughput [GB/s] - Avg( 95.38, 1.66x) Peak( 127.32, 1.55x) Min( 46.67, 2.07x)
Device: GeForce 8400M GS, 800 MHz clock, 114 MB memory.
Compute capability 1.1
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg( 4.07) Peak( 5.64) Min( 1.19) [OK]
  Memory thoughput GB/s Avg( 2.44) Peak( 3.69) Min( 1.51)
Opt1 (worst case): 64 thrds/block, 2 x 524288 element streams revert to single stream from size 128
  Processing... Done!
  Compute thoughput [GFlops] - Avg( 4.30, 1.06x) Peak( 5.78, 1.03x) Min( 1.68, 1.41x) [OK]
  Memory thoughput [GB/s] - Avg( 2.70, 1.11x) Peak( 3.78, 1.03x) Min( 1.90, 1.26x)
On my 128 MB 8400M GS:
And my 465:
Okay, I remembered to stop BOINC this time...
Device: GeForce 9500 GT, 1848 MHz clock, 1006 MB memory....
Device: GeForce GTX 480, 810 MHz clock, 1503 MB memory....
  Compute thoughput [GFlops] - Avg( 165.56, 1.45x) Peak( 234.17, 1.38x) Min( 61.06, 2.86x) [OK]
Quote from: SciManStev on 26 Dec 2010, 02:11:17 pm
Device: GeForce GTX 480, 810 MHz clock, 1503 MB memory....
  Compute thoughput [GFlops] - Avg( 165.56, 1.45x) Peak( 234.17, 1.38x) Min( 61.06, 2.86x) [OK]

Winning! (just) Glad you're on water cooling with those. My fan cranks up with that and creates a vortex in my room. It made me think '1.21 GigaWatts!'. I'll be checking out and researching water cooling for the 480 sometime in the new year, starting with the basics with guides like this one, and doing my homework.
Device: GeForce 8800 GTX, 1350 MHz clock, 768 MB memory.
Compute capability 1.0
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg( 51.45) Peak( 72.63) Min( 9.33) [OK]
  Memory thoughput GB/s Avg( 30.07) Peak( 47.47) Min( 16.45)
Opt1 (worst case): 64 thrds/block, 2 x 524288 element streams revert to single stream from size 128
  Processing... Done!
  Compute thoughput [GFlops] - Avg( 55.01, 1.07x) Peak( 75.98, 1.05x) Min( 13.89, 1.49x) [OK]
  Memory thoughput [GB/s] - Avg( 33.46, 1.11x) Peak( 49.65, 1.05x) Min( 24.23, 1.47x)
Device: GeForce 8800 GTX, 1350 MHz clock, 731 MB memory.
Compute capability 1.0
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg( 45.04) Peak( 62.72) Min( 8.62) [OK]
  Memory thoughput GB/s Avg( 26.39) Peak( 40.07) Min( 15.21)
Opt1 (worst case): 64 thrds/block, 2 x 524288 element streams revert to single stream from size 128
  Processing... Done!
  Compute thoughput [GFlops] - Avg( 54.49, 1.21x) Peak( 75.17, 1.20x) Min( 13.75, 1.59x) [OK]
  Memory thoughput [GB/s] - Avg( 33.12, 1.26x) Peak( 49.13, 1.23x) Min( 24.07, 1.58x)
Q6600/8GB/8800GTX.

One remark though: if you want the test run multiple times, why not do that inside the downloadable executable? I don't mind if a benchmark of yours runs for several minutes on my rig, so just do a few test runs, determine the max, min and standard deviation or something, and output that?

In any case, I ran the benchmark 3 times on both OS versions before running a 4th one redirected to a text file (and compared that one too). Results and speed-ups looked stable to my 'naked' eye.
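For anyone who wants to try that suggestion by hand in the meantime, here is a minimal sketch of summarizing the same figure (say, the Avg GFlops number) across repeated runs. It is not part of the benchmark itself; the helper name `summarize_runs` and the sample values are made up purely for illustration.

```python
import statistics


def summarize_runs(values):
    """Return min, max, mean and sample standard deviation of repeated runs.

    `values` holds one throughput figure per run (hypothetical input,
    e.g. the Avg GFlops number copied out of each run's output).
    """
    if len(values) < 2:
        raise ValueError("need at least two runs to estimate the spread")
    return {
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
    }


# Placeholder numbers, NOT measurements from this thread.
runs = [54.0, 55.0, 56.0]
summary = summarize_runs(runs)
print(
    "Avg GFlops over {n} runs: min {min:.2f}, max {max:.2f}, "
    "mean {mean:.2f}, stdev {stdev:.2f}".format(n=len(runs), **summary)
)
```

A small standard deviation relative to the mean would back up the "looked stable" impression with a number instead of an eyeball check.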
Device: GeForce 8800 GTS 512, 1625 MHz clock, 512 MB memory.
Compute capability 1.1
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg( 44.40) Peak( 66.68) Min( 7.85) [OK]
  Memory thoughput GB/s Avg( 26.26) Peak( 41.19) Min( 13.83)
Opt1 (worst case): 64 thrds/block, 2 x 524288 element streams revert to single stream from size 128
  Processing... Done!
  Compute thoughput [GFlops] - Avg( 47.57, 1.07x) Peak( 67.80, 1.02x) Min( 17.37, 2.21x) [OK]
  Memory thoughput [GB/s] - Avg( 30.04, 1.14x) Peak( 41.89, 1.02x) Min( 19.00, 1.37x)
Device: GeForce 8800 GTS 512, 1625 MHz clock, 500 MB memory.
Compute capability 1.1
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg( 40.57) Peak( 57.91) Min( 7.32) [OK]
  Memory thoughput GB/s Avg( 23.86) Peak( 35.82) Min( 12.91)
Opt1 (worst case): 64 thrds/block, 2 x 524288 element streams revert to single stream from size 128
  Processing... Done!
  Compute thoughput [GFlops] - Avg( 48.43, 1.19x) Peak( 66.67, 1.15x) Min( 15.87, 2.17x) [OK]
  Memory thoughput [GB/s] - Avg( 30.30, 1.27x) Peak( 41.94, 1.17x) Min( 20.41, 1.58x)
Did a few runs for test #10 on different cards/machines...

Cheers,
MarkJ
Device: GeForce GTX 260, 1242 MHz clock, 896 MB memory.
Compute capability 1.3
Compiled with CUDA 3020.
PowerSpectrum+summax Unit test #10 (FFT pipeline throughput)
Stock:
  Processing... Done!
  Compute Thoughput GFlops Avg( 62.64) Peak( 93.36) Min( 4.48) [OK]
  Memory thoughput GB/s Avg( 34.47) Peak( 52.71) Min( 7.89)
Opt1 (worst case): 128 thrds/block, 2 x 524288 element streams revert to single stream from size 256
  Processing... Done!
  Compute thoughput [GFlops] - Avg( 67.78, 1.08x) Peak( 95.96, 1.03x) Min( 5.69, 1.27x) [OK]
  Memory thoughput [GB/s] - Avg( 38.80, 1.13x) Peak( 55.48, 1.05x) Min( 10.03, 1.27x)