+- +-
Say hello if visiting :) by Gecko
11 Jan 2023, 07:43:05 pm

Seti is down again by Mike
09 Aug 2017, 10:02:44 am

Some considerations regarding OpenCL MultiBeam app tuning from algorithm view by Raistmer
11 Dec 2016, 06:30:56 am

Loading APU to the limit: performance considerations by Mike
05 Nov 2016, 06:49:26 am

Better sleep on Windows - new round by Raistmer
26 Aug 2016, 02:02:31 pm

Author Topic: MB/AP Bench Test Instruction  (Read 28566 times)

Offline Dirk

  • Squire
  • *
  • Posts: 44
  • BETA TESTER
    • SETI@home Account
MB/AP Bench Test Instruction
« on: 24 May 2016, 05:41:10 am »
Because of my Message in the SETI-Beta Forum which I wrote for 15 days...

I made something wrong because of the .wisdom file creation?

It's possible to make a MB/AP Bench Test Instruction?

Thanks.
BR :)


Offline Raistmer

  • Working Code Wizard
  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 14349
Re: MB/AP Bench Test Instruction
« Reply #1 on: 26 May 2016, 12:33:26 pm »
Did you read existing instruction before asking?

Offline Dirk

  • Squire
  • *
  • Posts: 44
  • BETA TESTER
    • SETI@home Account
Re: MB/AP Bench Test Instruction
« Reply #2 on: 28 May 2016, 04:54:59 am »
I'm a semi advanced user.
I know a bit how to make bench test runs.

AstroPulse worked well, MultiBeam not (slightly deeper explanation).

Of course, I read the readme.txt files.
But, there I can't read this which I asked in the above linked message.

E.g. (after my experiences)...

AP bench test runs (on NV GT730):

APbench211_minimal.7z (which have the WU #ap_genwis.dat incl. This kind of file isn't in the MultiBeam tool, because of this always two times superfluous runs)
ap_Zblank_2LC67.wu of ap_Zblank_2LC67and6LC67.7z

In short (something like this, for an entry level, low end VGA card)...
cmdline tests:
-unroll 2 up to 10 (+1)
-ffa_block 256 -ffa_block_fetch 128 up to -ffa_block 2048 -ffa_block_fetch 1024 (+128, rough) (+/-64, fine)
-tune 1 8 4 1 up to -tune 1 64 4 1 (1st N *2)
-tune 2 8 1 1 up to -tune 2 256 1 1 (1st N *2)
-oclFFT_plan 16 8 32 up to -oclFFT_plan 256 32 256 (all possible N's specially written)
-sbs 32 up to -sbs 512 (N *2)
-skip_ffa_precompute (without N's)


Like this above for AP, I would like to have this also for MB.
Until now I have no ideas which N's are possible and if the cmdline settings are alone (e.g. -unroll N) or connected together (e.g. -ffa_block N and -ffa_block_fetch N).

The currently available bench test MB WUs are too small/fast for very fast VGA cards.

For MB is possible (I have no idea which N's are possible and if the cmdline settings are alone or connected together):

-period_iterations_num N
-spike_fft_thresh N
-sbs N
-use_sleep_ex N
-no_caching
-tune N Mx My Mz
-oclfft_tune_gr N
-oclfft_tune_lr N
-oclfft_tune_wg N
-oclfft_tune_ls N
-oclfft_tune_bn N
-oclfft_tune_cw N


Of course -use_sleep is needed for AP and MB NV OpenCL apps.

Also the download possibilities of all available apps (stock are newer than Installer ones).

It's a pity, sad and annoying that I can't let run SETI@home like other projects (Plug&Play or "just out of the box and fine") for to get max. performance.


Until I finshed this AP and MB bench test runs (for to find the best/fastest cmdline settings) I don't let run SETI@home in my BOINCs.


Thanks.
BR :)


Offline Mike

  • Alpha Tester
  • Knight who says 'Ni!'
  • ***
  • Posts: 2427
Re: MB/AP Bench Test Instruction
« Reply #3 on: 28 May 2016, 10:02:44 am »
I`m not sure what you are talking about.

The NV read me`s have examples of best params to use, same as AMD.

Most users are quite happy with them.

Maybe you can get a few seconds more out of it but thats host dependent.

Offline Raistmer

  • Working Code Wizard
  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 14349
Re: MB/AP Bench Test Instruction
« Reply #4 on: 29 May 2016, 03:41:37 am »
It's a pity, sad and annoying that I can't let run SETI@home like other projects (Plug&Play or "just out of the box and fine") for to get max. performance.


What hell make you think you can do that for other projects ??? Or do you think that if there is no options then max possible performance already achieved?
What a perverted way of thinking started to spread recently... Ok, forget about options. Just run stock. It's the best you can do. Just as with other projects.

Offline Dirk

  • Squire
  • *
  • Posts: 44
  • BETA TESTER
    • SETI@home Account
Re: MB/AP Bench Test Instruction
« Reply #5 on: 29 May 2016, 11:55:58 am »
I have asked legitimate questions...
Why is offended someone here and responds aggressively?


The people who know me, know me - I'm a perfectionist.
I would like to squeeze out the max performance.

So it would be nice to have equipment (tools) for to make bench test runs on own hardware for to get max. possible performance.


In the meantime...
For the following GPUs (1 or 2 WUs/GPU?), which GPU apps (in Lunatics Installer or d/l URLs of stock apps?) and which cmdline settings I should use?


Intel Celeron J1900 CPU with iGPU 'Intel HD Graphics':
AP:
app:
cmdline settings:

MB:
app:
cmdline settings:

...with NV GT730:
AP:
app:
cmdline settings:

MB:
app:
cmdline settings:


AMD FuryX VGA cards:
AP:
app:
cmdline settings:

MB:
app:
cmdline settings:


Thanks.
« Last Edit: 29 May 2016, 12:03:55 pm by Dirk S. »
BR :)


Offline Raistmer

  • Working Code Wizard
  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 14349
Re: MB/AP Bench Test Instruction
« Reply #6 on: 29 May 2016, 05:53:03 pm »
I have asked legitimate questions...
Why is offended someone here and responds aggressively?
Cause while asking you made illegitimate comparison with other projects. If I provide some options for "perfectionists" it doesn't mean other soft can't be improved similarly - just nobody there cared about that. Try to go there and ask to implement tuning options....


Quote
The people who know me, know me - I'm a perfectionist.
I would like to squeeze out the max performance.

So do that, you have "the buttons" to play with.

Quote
For the following GPUs (1 or 2 WUs/GPU?), which GPU apps (in Lunatics Installer or d/l URLs of stock apps?) and which cmdline settings I should use?
You tell us? How is it possible to know what is better for your setup if you don't carry benches and just asking others for best tuning line?

All that looks strange to me. You have tunable app, you have benchmark package, you have readmes... And then all questions are "what better tuning line".
What exactly your problems with KWSN bench?

EDIT:

Quote
-tune N Mx My Mz
-oclfft_tune_gr N
-oclfft_tune_lr N
-oclfft_tune_wg N
-oclfft_tune_ls N
-oclfft_tune_bn N
-oclfft_tune_cw N

Try to find table I posted with experimental findings what is worked for my own C-60.
Also, maybe someone made smth similar but I not aware. With fast GPU you could make your own table quite fast. As rule of thumb ( should I teach "perfectionist" how to make tuning?? ) one should take best recommended ones (RTFM!) and vary them from those values.
If you "full perfectionist" - then do full parameter space scan. WG for AMD limited to 256. Most of options are fool-proof. Other will show improper input as invalid result (and benchmark will catch that).
« Last Edit: 29 May 2016, 06:04:48 pm by Raistmer »

Offline Raistmer

  • Working Code Wizard
  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 14349

Offline Dirk

  • Squire
  • *
  • Posts: 44
  • BETA TESTER
    • SETI@home Account
Re: MB/AP Bench Test Instruction
« Reply #8 on: 16 Jun 2016, 02:07:32 pm »
It looks like so more I write with my poor english, the less I were understood.

I was disappointed that guppi.vlar's are send to GPUs.
From my experiences at SETI-Beta (default settings), they last x3 than a mid-AR task (on my FuryX's). On CPU they are shorter than mid-AR tasks.

I optimized (just played around) the cmdline settings and now the guppi.vlar's last like mid-AR tasks (on my FuryX's).
Not so much performance loss like I thought.

I did this on my own, noone told it to me how to do this.


Because of this I would like to know, how to optimize the cmdline settings for to get max performance, specially on my own PC system.


For some time,  Josef W. Segur guided me through the AstroPulse bench test run (on my Intel J1900 iGPU).
After this I understood the procedure and I did the same on my NV GT730 alone.
I can do this also on my FuryX's PC.


It would be very nice if someone could guide me through the MultiBeam bench test run, e.g. on my FuryX's.
Then I can do this also on my other GPUs.

Other could also follow it and optimize their GPUs.

Maybe after I could make an instruction how to make bench test runs.
From the sight of an user, maybe it could be helpful to do this.


Thanks.
« Last Edit: 16 Jun 2016, 02:13:39 pm by Dirk S. »
BR :)


Offline William

  • Global Moderator
  • Knight Templar
  • *****
  • Posts: 342
Re: MB/AP Bench Test Instruction
« Reply #9 on: 17 Jun 2016, 04:13:45 am »
the principle behind AP and MB bench is the same - you should be able to directly transfer what you learned for AP.

 

Welcome, Guest.
Please login or register.
 
 
 
Forgot your password?
Members
Total Members: 97
Latest: ToeBee
New This Month: 0
New This Week: 0
New Today: 0
Stats
Total Posts: 59559
Total Topics: 1672
Most Online Today: 23
Most Online Ever: 983
(20 Jan 2020, 03:17:55 pm)
Users Online
Members: 0
Guests: 15
Total: 15
Powered by EzPortal