Forum > GPU crunching

C-60 APU and Radeon HD6920

<< < (5/7) > >>

Urs Echternacht:
For Bulldozer arch try CompilerOptQuickRef-62004200.pdf

Jason G:

--- Quote from: Urs Echternacht on 04 Jan 2012, 09:59:48 pm ---For Bulldozer arch try CompilerOptQuickRef-62004200.pdf
--- End quote ---

Thanks! Interesting they recommend aggressive unrolling & prefetch, which suggests long pipelines. That's opposite to Core2 onward, which use loop stream detectors, often preferring to remain rolled up.   Probably when optimising for those it'll be worthwhile cross-checking what Agner Fog says for extra insight. 

I'm open in planning to try other compilers as well, so that's some good starting info.

Jason

skildude:
I did the benchmarks for the SSE3 X64 non AMD.  All WU's failed to start.
No real data to report at all on that test.
 The Following are the results from the AMD SSE3 on Win7 64 bit OC to 3.9Ghz.  The app doesn't state if it is 64 bit but it is from the 64 bit lunatics installer.
I think there is a dramatic speed difference from Mikes 32 bit testing.  I don't think the minimal OC can account for the speed difference.  In fact these times are substantially faster than Mikes!!!

WU : PG0009.wu
AK_v8b2_win_SSE3_AMD.exe : 326.697 secs CPU
AK_v8b2_win_SSE3_AMD.exe : 328.554 secs CPU
Speedup     : -0.57%
Ratio       : 0.99 x

WU : PG0395.wu
AK_v8b2_win_SSE3_AMD.exe : 307.104 secs CPU
AK_v8b2_win_SSE3_AMD.exe : 306.776 secs CPU
Speedup     : 0.11%
Ratio       : 1.00 x

WU : PG0444.wu
AK_v8b2_win_SSE3_AMD.exe : 249.430 secs CPU
AK_v8b2_win_SSE3_AMD.exe : 250.740 secs CPU
Speedup     : -0.53%
Ratio       : 0.99 x

WU : PG1327.wu
AK_v8b2_win_SSE3_AMD.exe : 201.584 secs CPU
AK_v8b2_win_SSE3_AMD.exe : 200.134 secs CPU
Speedup     : 0.72%
Ratio       : 1.01 x

Raistmer:
Currently I preparing new build environment on netbook. It will be x64 one cause it came with x64 Win7 onboard.
Wanna take opportunity and do great upgrade of buiuld environment too.
Ultimately will use VS2010 (unfortunately, I have access only to x86 prof version so will sit with VS2008 little more cause have full x64 pro suite).
Looks like Intel's part should be upgraded too. Perhaps, new Intel's composer? Should it support AVX? Should VS2010 support AVX? VS 2008 apparently should not ?Or some patches/service packs available?
I put new MB7 OCL NV onlyne, still CUDA 3.2 but will try CUDA 4.1RC2 on netbook so some more speed comparisons will be needed.
Testers stay tuned ;)

Raistmer:
http://software.intel.com/en-us/articles/intel-ipp-70-library-release-notes/

Navigation

[0] Message Index

[#] Next page

[*] Previous page

Go to full version