Seti@Home optimized science apps and information

Optimized Seti@Home apps => Discussion Forum => Topic started by: Dirk on 24 May 2016, 05:41:10 am

Title: MB/AP Bench Test Instruction
Post by: Dirk on 24 May 2016, 05:41:10 am
This is about my message (http://setiweb.ssl.berkeley.edu/beta/forum_thread.php?id=2266&postid=58266#58266) in the SETI-Beta forum, which I wrote 15 days ago...

Did I do something wrong with the .wisdom file creation?

Would it be possible to write MB/AP bench test instructions?

Thanks.
Title: Re: MB/AP Bench Test Instruction
Post by: Raistmer on 26 May 2016, 12:33:26 pm
Did you read the existing instructions before asking?
Title: Re: MB/AP Bench Test Instruction
Post by: Dirk on 28 May 2016, 04:54:59 am
I'm a semi-advanced user.
I know a bit about how to make bench test runs.

AstroPulse worked well, MultiBeam did not (a slightly deeper explanation follows).

Of course I read the readme.txt files.
But they don't cover what I asked in the message linked above.

E.g. (from my experience)...

AP bench test runs (on NV GT730):

APbench211_minimal.7z (which includes the WU #ap_genwis.dat. The MultiBeam tool has no file of this kind, so there are always two superfluous runs)
ap_Zblank_2LC67.wu from ap_Zblank_2LC67and6LC67.7z

In short (something like this, for an entry level, low end VGA card)...
cmdline tests:
-unroll 2 up to 10 (+1)
-ffa_block 256 -ffa_block_fetch 128 up to -ffa_block 2048 -ffa_block_fetch 1024 (+128, rough) (+/-64, fine)
-tune 1 8 4 1 up to -tune 1 64 4 1 (1st N *2)
-tune 2 8 1 1 up to -tune 2 256 1 1 (1st N *2)
-oclFFT_plan 16 8 32 up to -oclFFT_plan 256 32 256 (all possible N's specially written)
-sbs 32 up to -sbs 512 (N *2)
-skip_ffa_precompute (without N's)
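
The sweep ranges listed above can be written out mechanically. The following is a minimal sketch, not part of any SETI@home tool: the helper names are hypothetical, only the "rough" +128 step for -ffa_block is covered, and it assumes -ffa_block_fetch is always half of -ffa_block, which matches both endpoints given above but is otherwise a guess.

```python
def linear_steps(start, stop, step):
    """Values from start to stop (inclusive) in fixed increments."""
    return list(range(start, stop + 1, step))


def doubling_steps(start, stop):
    """Values from start to stop (inclusive), doubling each time."""
    values = []
    n = start
    while n <= stop:
        values.append(n)
        n *= 2
    return values


# -unroll 2 up to 10 (+1)
unroll_args = [f"-unroll {n}" for n in linear_steps(2, 10, 1)]

# -ffa_block 256..2048 (+128); fetch = block // 2 is an assumption
ffa_args = [
    f"-ffa_block {b} -ffa_block_fetch {b // 2}"
    for b in linear_steps(256, 2048, 128)
]

# -tune 1 N 4 1 and -tune 2 N 1 1, first N doubled each step
tune1_args = [f"-tune 1 {n} 4 1" for n in doubling_steps(8, 64)]
tune2_args = [f"-tune 2 {n} 1 1" for n in doubling_steps(8, 256)]

# -sbs 32 up to 512 (N *2)
sbs_args = [f"-sbs {n}" for n in doubling_steps(32, 512)]

if __name__ == "__main__":
    for group in (unroll_args, ffa_args, tune1_args, tune2_args, sbs_args):
        print("\n".join(group))
```

Each printed line is one candidate cmdline to try in a separate bench run; the -oclFFT_plan combinations are left out because the list above does not state how their values step.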


I would like to have something like the above for MB as well.
So far I have no idea which N values are possible, or whether the cmdline settings stand alone (e.g. -unroll N) or are linked together (e.g. -ffa_block N and -ffa_block_fetch N).

The currently available bench test MB WUs are too small/fast for very fast VGA cards.

For MB the following are available (I have no idea which N values are possible, or whether the cmdline settings stand alone or are linked together):

-period_iterations_num N
-spike_fft_thresh N
-sbs N
-use_sleep_ex N
-no_caching
-tune N Mx My Mz
-oclfft_tune_gr N
-oclfft_tune_lr N
-oclfft_tune_wg N
-oclfft_tune_ls N
-oclfft_tune_bn N
-oclfft_tune_cw N


Of course -use_sleep is needed for AP and MB NV OpenCL apps.

I would also like the download locations of all available apps (the stock ones are newer than those in the Installer).

It's a pity, sad and annoying that I can't run SETI@home like other projects (Plug&Play or "just out of the box and fine") and get max. performance.


Until I have finished these AP and MB bench test runs (to find the best/fastest cmdline settings), I won't run SETI@home in my BOINC installations.


Thanks.
Title: Re: MB/AP Bench Test Instruction
Post by: Mike on 28 May 2016, 10:02:44 am
I'm not sure what you are talking about.

The NV readmes have examples of the best params to use, same as AMD.

Most users are quite happy with them.

Maybe you can get a few seconds more out of it, but that's host dependent.
Title: Re: MB/AP Bench Test Instruction
Post by: Raistmer on 29 May 2016, 03:41:37 am
Quote
It's a pity, sad and annoying that I can't run SETI@home like other projects (Plug&Play or "just out of the box and fine") and get max. performance.

What the hell makes you think you can do that for other projects??? Or do you think that if there are no options, then the max possible performance has already been achieved?
What a perverted way of thinking has started to spread recently... OK, forget about options. Just run stock. It's the best you can do. Just as with other projects.
Title: Re: MB/AP Bench Test Instruction
Post by: Dirk on 29 May 2016, 11:55:58 am
I asked legitimate questions...
Why is someone here offended and responding aggressively?


The people who know me, know me - I'm a perfectionist.
I would like to squeeze out the max performance.

So it would be nice to have the tools to make bench test runs on my own hardware, to get the max. possible performance.


In the meantime...
For the following GPUs (1 or 2 WUs/GPU?), which GPU apps (in Lunatics Installer or d/l URLs of stock apps?) and which cmdline settings should I use?


Intel Celeron J1900 CPU with iGPU 'Intel HD Graphics':
AP:
app:
cmdline settings:

MB:
app:
cmdline settings:

...with NV GT730:
AP:
app:
cmdline settings:

MB:
app:
cmdline settings:


AMD FuryX VGA cards:
AP:
app:
cmdline settings:

MB:
app:
cmdline settings:


Thanks.
Title: Re: MB/AP Bench Test Instruction
Post by: Raistmer on 29 May 2016, 05:53:03 pm
Quote
I asked legitimate questions...
Why is someone here offended and responding aggressively?

Because while asking you made an illegitimate comparison with other projects. If I provide some options for "perfectionists", it doesn't mean other software can't be improved similarly; just nobody there cared about that. Try going there and asking them to implement tuning options...


Quote
The people who know me, know me - I'm a perfectionist.
I would like to squeeze out the max performance.

So do that, you have "the buttons" to play with.

Quote
For the following GPUs (1 or 2 WUs/GPU?), which GPU apps (in Lunatics Installer or d/l URLs of stock apps?) and which cmdline settings should I use?

You tell us. How is it possible to know what is better for your setup if you don't run benchmarks and just ask others for the best tuning line?

All of that looks strange to me. You have a tunable app, you have a benchmark package, you have readmes... And then all the questions are "what is the better tuning line".
What exactly are your problems with the KWSN bench?

EDIT:

Quote
-tune N Mx My Mz
-oclfft_tune_gr N
-oclfft_tune_lr N
-oclfft_tune_wg N
-oclfft_tune_ls N
-oclfft_tune_bn N
-oclfft_tune_cw N

Try to find the table I posted with experimental findings on what worked for my own C-60.
Also, maybe someone has made something similar, but I'm not aware of it. With a fast GPU you could make your own table quite quickly. As a rule of thumb (should I teach a "perfectionist" how to do tuning??), one should take the best recommended values (RTFM!) and vary them from there.
If you are a "full perfectionist", then do a full parameter space scan. WG for AMD is limited to 256. Most of the options are fool-proof. The others will show improper input as an invalid result (and the benchmark will catch that).
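
The two approaches described here (vary the recommended values one at a time, or scan the full parameter space, then discard invalid results) could be sketched roughly as follows. This is only an illustration: the function names, the baseline and the candidate values are hypothetical and not taken from any readme, and the timing and validity of each run would have to come from the actual bench output.

```python
from itertools import product


def one_at_a_time(baseline, candidates):
    """Vary each option alone around the baseline, keeping the others fixed."""
    configs = [dict(baseline)]
    for flag, values in candidates.items():
        for value in values:
            if value != baseline[flag]:
                configs.append({**baseline, flag: value})
    return configs


def full_scan(candidates):
    """Every combination of candidate values (full parameter space scan)."""
    flags = list(candidates)
    combos = product(*(candidates[f] for f in flags))
    return [dict(zip(flags, combo)) for combo in combos]


def to_cmdline(config):
    """Render a config dict as a cmdline string for one bench run."""
    return " ".join(f"{flag} {value}" for flag, value in config.items())


def pick_best(results):
    """results: (cmdline, seconds, valid) tuples; fastest valid run wins."""
    valid = [r for r in results if r[2]]
    return min(valid, key=lambda r: r[1]) if valid else None


# Hypothetical starting point and ranges; WG is kept at or below 256 for AMD.
baseline = {"-sbs": 256, "-oclfft_tune_wg": 128}
candidates = {"-sbs": [128, 256, 512], "-oclfft_tune_wg": [64, 128, 256]}

if __name__ == "__main__":
    for config in one_at_a_time(baseline, candidates):
        print(to_cmdline(config))
```

The one-at-a-time list grows with the sum of the candidate counts, while the full scan grows with their product, so the full scan is only practical when each bench run is short.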
Title: Re: MB/AP Bench Test Instruction
Post by: Raistmer on 30 May 2016, 04:56:10 am
Link to table:
https://drive.google.com/file/d/0BwjTLNvsJmLBcEtaNG5xTUc4TDA/view?usp=sharing
Title: Re: MB/AP Bench Test Instruction
Post by: Dirk on 16 Jun 2016, 02:07:32 pm
It looks like the more I write in my poor English, the less I am understood.

I was disappointed that guppi.vlar tasks are sent to GPUs.
From my experience at SETI-Beta (default settings), they take 3x as long as a mid-AR task (on my FuryX's). On CPU they are shorter than mid-AR tasks.

I optimized (just played around with) the cmdline settings, and now the guppi.vlar tasks take about as long as mid-AR tasks (on my FuryX's).
Not as much performance loss as I thought.

I did this on my own; no one told me how to do it.


Because of this I would like to know how to optimize the cmdline settings to get max performance, specifically on my own PC system.


Some time ago, Josef W. Segur (http://setiathome.berkeley.edu/forum_thread.php?id=76814&postid=1646777#1646777) guided me through the AstroPulse bench test run (on my Intel J1900 iGPU).
After this I understood the procedure and did the same on my NV GT730 on my own.
I can also do this on my FuryX PC.


It would be very nice if someone could guide me through the MultiBeam bench test run, e.g. on my FuryX's.
Then I can also do this on my other GPUs.

Others could also follow it and optimize their GPUs.

Maybe afterwards I could write instructions on how to make bench test runs.
From a user's point of view, that might be helpful.


Thanks.
Title: Re: MB/AP Bench Test Instruction
Post by: William on 17 Jun 2016, 04:13:45 am
The principle behind the AP and MB benches is the same - you should be able to directly transfer what you learned for AP.