Forum > GPU crunching

AK V8 + CUDA MB team work mod

(1/51) > >>

Raistmer:
This version intended to allow simultaneous SETI MB processing on CPU and GPU on the same host.

It's just workaround of BOINC's inability to maintain such config. So, as any workaround it has its own advantages, disadvantages and limitations. Peruse known issues and use this package wisely. Please, report all unknows issues in this thread.

Lets begin:
1) This first "proof of concept" version will work only on SSSE3-capable hosts (sorry AMD fans, if this will work SSE3  will be supported too of course). Only Intel Core and up for now.
1a) Now SSSE3,SSE3 and SSE2 supported.  AMD CPUs should use SSE3-variant with _AMD in name (if CPU supports SSE3 of course ;) otherwise - SSE2 ).
2) This package can work ONLY on SETI main. Even don't try to use it on SETI beta - you can lose tasks for nothing.
3) This CPU-GPU team will not play nicely with other GPU-related projects like GPU-grid. It's because no BOINC GPU-management mechanism is used in this version.
This fact has positive side too - you do not need GPU-aware BOINC at all. You just need host with CUDA-supported GPU. It should work even with BOINC 5.xx
4) <ncpus>NUMBER_OF_CORES+1</ncpus> is REQUIRED for productive work. If you let BOINC manage CPU cores number you will end up with one idle core, trust me ;)
5) This AK V8 build was not PGOed so it will show worse performance than current CPU-only AK V8 SSSE3x app (will be fixed if this approach will be useful)(fixed already)
6) Probably will not use second GPU on dual-GPU hosts. (confirmed)

How it works:
for BOINC it looks as usual CPU opt app installed. BOINC will call CPU app (AK_v8b_win_SSSE3x_GPU_CPU_team.exe in our case) ans assign one of SETI MB tasks for it. But this app aware of possibility to use GPU for computations. It will check if another instance (it knows only itself and its clones, so - no other GPU-related projects please) already use GPU and if not - will start GPU-related app (MB_6.08_mod_CPU_team_CUDA.exe in our case) and suspend itself until GPU app finish. This CUDA app will do all work as usually but will do it on GPU leave CPU almost free.
That's why you should increase number of cores. BOINC should run NUMBER_OF_CORES+1 app thinking they all are CPU-related (some cheating of poor old BOINC here ;) )

After installation try to keep eye on first few results - this is pretty new approach and I can't give any guaranties if it will work for your config. If something will go wrong, please, revert to old variant you used before and describe your issue  in this thread.

P.S. Now you can easily see how fast CUDA is indeed (on non-VLAR tasks). CPU apps completed <20% of their tasks when CUDA app finished its first task on my Q9450+9600GSO host ;)
Enjoy!

[attachment deleted by admin]

Richard Haselgrove:
I was in the middle of flushing my Beta queue on the Q9300 anyway, since we have no new versions to test just now. I'll give this a run when I'm ready to switch, but it may be six hours or so till I get there - not quite sure how many of the remaining tasks are VLAR.

Have you any idea how compatible this package is with AP, if I transfer my r103 block into the app_info?

Raistmer:
AP r103 is included as usually ! IT should work with AP too. But if you will have only AP tasks it will do NUMBER_OF_CORES+1 AP r103 and no CUDA MB. => GPU will be idle.
(AP r103 provided "as is" w/o any modifications, it will use CPU just as will do any another app from another CPU-only project as Einstein for example. I expect some issues only with another GPU-enabled projects. And some inefficiencies because of BOINC inability to pair projects. This VERY needed feature still missing! We need option to pair one project with another....
I will post beta-site variant in pre-release area. This one will not work on beta.

Grey Shadow:
Hi again.

Great mod, thanks a lot. The first one allowing me to crunch MB units with CUDA and at the same time use my PC for other tasks. However one bug still persists (it is present also in stock 6.08, according to my experience). Certain MB workunits start as usually (about 30 seconds with CPU, then switch to GPU), but are processed extremely slow (almost as AP ones). Also when they are processed PC starts working much slower so usually I have to cancel them manually.

The sample of such workunit is http://setiathome.berkeley.edu/result.php?resultid=1134111853

It looke like this is the same VLAR issue?

Is it possible to do something to prevent this? Do you mind adding VLAR-killing ability to this mod?

Raistmer:

--- Quote from: Grey Shadow on 25 Jan 2009, 12:47:06 pm ---
Is it possible to do something to prevent this? Do you mind adding VLAR-killing ability to this mod?

--- End quote ---

Yes. This mod could be extended in two ways.
1) More advanced CPU/GPU choosing at app start. That is, if task is VLAR - stay with CPU, else - try pass task to GPU.
2) VLAR autokill mod.

Navigation

[0] Message Index

[#] Next page

Go to full version