Forum > GPU crunching
AK V8 + CUDA MB team work mod
Richard Haselgrove:
--- Quote from: The Naja on 28 Jan 2009, 09:22:44 am ---Thanks for your answer and for the clue,
Would yit be possible to have a link to such a template, and also in which folder to put it ?
--- End quote ---
There's plenty of discussion on the main board, but here goes:
--- Code: ---<cc_config>
<options>
<ncpus>5</ncpus>
</options>
</cc_config>
--- End code ---
(from BOINC Client configuration)
As stated on that page, it goes at the root level of your BOINC data folder.
Once in place, you can load and activate it from the advanced menu in BOINC Manager, 'Read config file'. When you change the number of CPUs this way, expect BOINC to re-run benchmarks.
Raistmer:
And now test on SSE3-capable Athlon 64 (Venice)
WU : PG0009.wu
AK_v8b_win_SSE3.exe : 1082.844 secs CPU
AK_v8b_win_SSE2_GPU_CPU_team.exe : 1035.328 secs CPU
Speedup : 4.39%
Ratio : 1.05 x
WU : PG0395.wu
AK_v8b_win_SSE3.exe : 1012.813 secs CPU
AK_v8b_win_SSE2_GPU_CPU_team.exe : 986.641 secs CPU
Speedup : 2.58%
Ratio : 1.03 x
WU : PG0444.wu
AK_v8b_win_SSE3.exe : 871.516 secs CPU
AK_v8b_win_SSE2_GPU_CPU_team.exe : 824.000 secs CPU
Speedup : 5.45%
Ratio : 1.06 x
WU : PG1327.wu
AK_v8b_win_SSE3.exe : 946.563 secs CPU
AK_v8b_win_SSE2_GPU_CPU_team.exe : 875.781 secs CPU
Speedup : 7.48%
Ratio : 1.08 x
As you can see SSE2 build performs better than SSE3 one on SSE3-capable (early) AMD. Maybe on latest Phenom SSE3-situation was improved ?
Could someone of our pre-testers or regular users try to run KWSN bench for AK_v8b_SSE3 and AK_v8b_SSE2 on new Phenom CPU to shed light on current situation with AMD SSE3 support quality ?
Jason G:
It is quite possible the SSE3 build, being a generic P4 switch (QxO or QxP?) , is assuming the presence of hardware prefetchers that those generations of AMD don't have, and of course long Pipelines (Along with the IPP libraries too). It's difficult that the two architectures are so different at that period. I'm leaning more and more towards a preference for isolating the core functions, maybe even into DLL's one day, but starting simply with a delay loaded choice of FFTs might be a start.
The Naja:
--- Quote from: Haselgrove on 28 Jan 2009, 09:57:25 am ---
As stated on that page, it goes at the root level of your BOINC data folder.
Once in place, you can load and activate it from the advanced menu in BOINC Manager, 'Read config file'. When you change the number of CPUs this way, expect BOINC to re-run benchmarks.
--- End quote ---
Haselgrove: thanks a lot for your explanations.
I ran a few tests yesterday based on your message.
It went fine on configuration point of view, but my computer was nut usable anymore: screen refresh was horrible: an alt-tab between 2 applications was taking at least 8 seconds.
Graphic drivers: latest from Nvidia website from last Saturday.
I had to roll back to AK_V8 + AP r103 package I was using before, no GPU used.
Hope this can give some clues for you guys, thanks again...
Raistmer:
New combo for AMD CPUs with SSE3 support is available.
At least for early AMD CPUs with SSE3 support (x86 mode) AK_v8 SSE2 version works faster than SSE3 one.
So this build aimed for such CPUs (SSE3 AP and SSE2 AK_v8).
If new Phenoms don'r show such speed degradation on SSE3 instruction set I would like to know it ( with benchmark results posted of course).
Package name: Raistmer's_opt_package_V8a_CPU_GPU_team_SSE3_AMD.rar
Attached to first thread post.
Navigation
[0] Message Index
[#] Next page
[*] Previous page
Go to full version