+- +-
Say hello if visiting :) by Gecko
11 Jan 2023, 07:43:05 pm

Seti is down again by Mike
09 Aug 2017, 10:02:44 am

Some considerations regarding OpenCL MultiBeam app tuning from algorithm view by Raistmer
11 Dec 2016, 06:30:56 am

Loading APU to the limit: performance considerations by Mike
05 Nov 2016, 06:49:26 am

Better sleep on Windows - new round by Raistmer
26 Aug 2016, 02:02:31 pm

Author Topic: GPU client  (Read 196566 times)

Offline Devaster

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 653
  • I like Duke !!!
Re: GPU client
« Reply #90 on: 16 Jan 2008, 08:48:08 am »
i see that in some cases is GPU application better than stock app .... :o

interesting ....

Radiohead

  • Guest
Re: GPU client
« Reply #91 on: 16 Jan 2008, 11:30:11 am »
i see that in some cases is GPU application better than stock app .... :o

interesting ....

Hm...
This is 8800GTX  :)

guido.man

  • Guest
Re: GPU client
« Reply #92 on: 16 Jan 2008, 12:21:10 pm »
There are too many variables to make an accurate assessment of the true speedup of GPU client.
When I use default-515.exe as the reference client I also get Speedup Ratio  greater than 1.00 for some WU's.
Devaster what source files are you using to base your GPU client on, and what optimizations are done?
If you compiled a reference client to be used against your GPU client then most of the variables would
be known.

« Last Edit: 16 Jan 2008, 12:25:24 pm by guido.man »

Offline Devaster

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 653
  • I like Duke !!!
Re: GPU client
« Reply #93 on: 16 Jan 2008, 12:57:32 pm »
at home by tests i am using latest optimized app from  cruncher page and i have 8500GT only ....

remember , for now aren't in GPU code optimizations (shared mem usage ,memory coalescent access by read/write, optimal thread/block scheduling against core, ). used is only partial loop unroll ...

Offline Devaster

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 653
  • I like Duke !!!
Re: GPU client
« Reply #94 on: 21 Jan 2008, 01:08:11 am »
last weekend i have downloaded GUI profiler for CUDA. It has showed  many interesting things .... i will write more later - it would bigger ...

Gecko_R7

  • Guest
Re: GPU client
« Reply #95 on: 21 Jan 2008, 01:02:54 pm »
Hi Mimo,

Just saw this recent update on the Cuda developer forum.

http://forums.nvidia.com/index.php?showtopic=34241

Any benefit to your efforts?

Offline Devaster

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 653
  • I like Duke !!!
Re: GPU client
« Reply #96 on: 21 Jan 2008, 03:09:45 pm »
SETI@HOME uses 1d FFT and this is batched naturally in CUFFT library ....

Gecko_R7

  • Guest
Re: GPU client
« Reply #97 on: 21 Jan 2008, 05:30:15 pm »
SETI@HOME uses 1d FFT and this is batched naturally in CUFFT library ....
SETI@HOME uses 1d FFT and this is batched naturally in CUFFT library ....

Sorry  :-[.  My inquiry/interest was related to the batching, not the 2d fft itself since as you pointed-out, Seti uses 1d fft single precision complex ^2.
Didn't realize the batching was already done in the CUFFT library however.
Just ignore "the little man from behind the curtain"..... :P


« Last Edit: 21 Jan 2008, 08:09:44 pm by Gecko_R7 »

Offline Devaster

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 653
  • I like Duke !!!
Re: GPU client
« Reply #98 on: 22 Jan 2008, 05:09:41 pm »
try

[attachment deleted by admin]

Radiohead

  • Guest
Re: GPU client
« Reply #99 on: 22 Jan 2008, 05:22:42 pm »
What has changed?

Radiohead

  • Guest
Re: GPU client
« Reply #100 on: 22 Jan 2008, 06:43:03 pm »
New data...

[attachment deleted by admin]

popandbob

  • Guest
Re: GPU client
« Reply #101 on: 22 Jan 2008, 08:03:31 pm »
my first run...

8600GTS

Cpu and vid card not OC'ed.

~BoB

[attachment deleted by admin]

Offline Devaster

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 653
  • I like Duke !!!
Re: GPU client
« Reply #102 on: 23 Jan 2008, 12:19:10 am »
- changed data alignment to multiples of two (float2,float4) -some speedup
 

Offline Devaster

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 653
  • I like Duke !!!
Re: GPU client
« Reply #103 on: 23 Jan 2008, 12:25:03 am »
my first run...

8600GTS

Cpu and vid card not OC'ed.

~BoB

more multiprocessors and more MHz as me and better results ...

Batstat

  • Guest
Re: GPU client
« Reply #104 on: 26 Jan 2008, 06:10:53 am »
My result, open attachment

Result      : weakly similar.   
Result      : weakly similar.   
Result      : DIFFERENT. 
Result      : weakly similar.   
Result      : DIFFERENT. 
Result      : weakly similar.   
Result      : weakly similar.   
Speedup: 5.88%, Ratio: 1.06 x
Speedup: 9.83%, Ratio: 1.11 x
Speedup: 11.48%, Ratio: 1.13 x
Speedup: -31.30%, Ratio: 0.76 x
Speedup: 10.21%, Ratio: 1.11 x
Speedup: 24.52%, Ratio: 1.32 x
Speedup: 28.01%, Ratio: 1.39 x

[attachment deleted by admin]

 

Welcome, Guest.
Please login or register.
 
 
 
Forgot your password?
Members
Total Members: 97
Latest: ToeBee
New This Month: 0
New This Week: 0
New Today: 0
Stats
Total Posts: 59559
Total Topics: 1672
Most Online Today: 81
Most Online Ever: 1025
(17 Oct 2025, 10:50:36 am)
Users Online
Members: 0
Guests: 27
Total: 27
Powered by EzPortal