Forum > GPU crunching

Latest drivers (NVidia and ATI)

<< < (80/167) > >>

Raistmer:
Aha, and for nVidia...:


--- Quote ---I also have a NVIDIA Fermi card, and there is no way to get concurrent kernels on that using their OpenCL (driver v280.19 w/ OpenCL 1.1), with either single out-of-order queue or multiple queues.  You can get concurrent kernels using CUDA Runtime API, but then it's not in OpenCL.

Ken

--- End quote ---

Jason G:
280.26 WHQL is out.  If 280.19 is anything to go by it might me worth a look for many.  Will test it myself here soon.

[Later:] Similar performance with experimental optimised code to 280.19 beta driver here, Temps reached 107 degrees C.

Device: GeForce GTX 480, 1640 MHz clock, 1503 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.

                Chirp Unit test #3d (double Chirp)

Stock:
  Processing...
 Done!
  GFlops Avg(   68.94) Peak(   68.95) Min(   68.90)
  GB/s   Avg(   91.91) Peak(   91.93) Min(   91.87)


Opt: Elems/thrd 1, thrds/blk 64 - pk thru 156.4984 GB/s, 195.6230 GFlops
Opt: Elems/thrd 2, thrds/blk 64 - pk thru 149.9762 GB/s, 187.4702 GFlops
Opt: Elems/thrd 4, thrds/blk 64 - pk thru 159.4488 GB/s, 199.3110 GFlops
Opt: Elems/thrd 8, thrds/blk 64 - pk thru 164.2478 GB/s, 205.3098 GFlops

Opt: Elems/thrd 1, thrds/blk 128 - pk thru 154.4557 GB/s, 193.0696 GFlops
Opt: Elems/thrd 2, thrds/blk 128 - pk thru 170.7901 GB/s, 213.4876 GFlops
Opt: Elems/thrd 4, thrds/blk 128 - pk thru 163.2162 GB/s, 204.0203 GFlops
Opt: Elems/thrd 8, thrds/blk 128 - pk thru 162.6486 GB/s, 203.3107 GFlops

Opt: Elems/thrd 1, thrds/blk 192 - pk thru 161.0965 GB/s, 201.3706 GFlops
Opt: Elems/thrd 2, thrds/blk 192 - pk thru 160.4949 GB/s, 200.6186 GFlops
Opt: Elems/thrd 4, thrds/blk 192 - pk thru 164.0530 GB/s, 205.0662 GFlops
Opt: Elems/thrd 8, thrds/blk 192 - pk thru 160.4640 GB/s, 200.5800 GFlops

Opt: Elems/thrd 1, thrds/blk 256 - pk thru 162.5058 GB/s, 203.1322 GFlops
Opt: Elems/thrd 2, thrds/blk 256 - pk thru 148.2003 GB/s, 185.2503 GFlops
Opt: Elems/thrd 4, thrds/blk 256 - pk thru 161.9992 GB/s, 202.4989 GFlops
Opt: Elems/thrd 8, thrds/blk 256 -  N/A

Too hot.  I'll invent some way to throttle optimisations back....  ::)

Richard Haselgrove:
Let me eat lunch first, then I'll download it and run chirp test on the 420M (we have temps for that, too). Just 3d, or would any of the others be useful?

Jason G:

--- Quote from: Richard Haselgrove on 09 Aug 2011, 09:06:51 am ---Let me eat lunch first, then I'll download it and run chirp test on the 420M (we have temps for that, too). Just 3d, or would any of the others be useful?

--- End quote ---

Anything you can throw into the mix might turn out useful.  The current suspicion (of mine)  is that some of my kernels are simply 'too hot', so that throws some factory OC'd models over the edge.  Probably treading new ground then, so some effective way to handle things might be in order (like advanced user flags + failsafes).  We'll See.

Ghost0210:
From my 465 with the 280.26 drivers:

Device: GeForce GTX 465, 1450 MHz clock, 993 MB memory.
Compute capability 2.0
Compiled with CUDA 3020.

                Chirp Unit test #3d (double Chirp)

Stock:
  Processing...
 Done!
  GFlops Avg(   43.42) Peak(   43.49) Min(   43.07)
  GB/s   Avg(   57.90) Peak(   57.99) Min(   57.42)


Opt: Elems/thrd 1, thrds/blk 64 - pk thru  98.4503 GB/s, 123.0628 GFlops
Opt: Elems/thrd 2, thrds/blk 64 - pk thru  95.3779 GB/s, 119.2224 GFlops
Opt: Elems/thrd 4, thrds/blk 64 - pk thru  97.0317 GB/s, 121.2896 GFlops
Opt: Elems/thrd 8, thrds/blk 64 - pk thru  94.1059 GB/s, 117.6324 GFlops

Opt: Elems/thrd 1, thrds/blk 128 - pk thru  99.5902 GB/s, 124.4878 GFlops
Opt: Elems/thrd 2, thrds/blk 128 - pk thru  96.2985 GB/s, 120.3731 GFlops
Opt: Elems/thrd 4, thrds/blk 128 - pk thru  94.9835 GB/s, 118.7293 GFlops
Opt: Elems/thrd 8, thrds/blk 128 - pk thru  96.1054 GB/s, 120.1318 GFlops

Opt: Elems/thrd 1, thrds/blk 192 - pk thru  98.2473 GB/s, 122.8092 GFlops
Opt: Elems/thrd 2, thrds/blk 192 - pk thru  95.6510 GB/s, 119.5637 GFlops
Opt: Elems/thrd 4, thrds/blk 192 - pk thru  96.5704 GB/s, 120.7129 GFlops
Opt: Elems/thrd 8, thrds/blk 192 - pk thru  95.4054 GB/s, 119.2568 GFlops

Opt: Elems/thrd 1, thrds/blk 256 - pk thru  98.5406 GB/s, 123.1758 GFlops
Opt: Elems/thrd 2, thrds/blk 256 - pk thru  95.8551 GB/s, 119.8189 GFlops
Opt: Elems/thrd 4, thrds/blk 256 - pk thru  96.5229 GB/s, 120.6536 GFlops
Opt: Elems/thrd 8, thrds/blk 256 -  N/A

Card peaked @ 68c whilst running the last block
I'll run the rest and see if I get any different results to last time

Navigation

[0] Message Index

[#] Next page

[*] Previous page

Go to full version