Forum > Windows
New apps based on code revision 2.2 'Noo? No, Ni!' have been released!
Richard Haselgrove:
Thanks to Simon and the Coop, as always. Another splendid effort.
One tiny wee bugette in CPU-ID - it thinks my Xeon 53xx 'Clovertown' is a Xeon 51xx 'Woodcrest' - but that's not important: it can be tidied up later, or not, as you see fit.
KarVi:
I've also found a tiny error in the output of WU's.
This is the features output for my A64 on the Intel SSE2 client.
Features: MMX, 3DNow!, 3DNow!+, SSE, SSE3, SSE3,
Notice that SSE3 seems to be supported twice, and SSE2 is not :)
Ronon Dex:
Thanx for the GOOD work! :)
I have the Intel QX6700.
I use the SSSE3 Core2 app.
The <stderr_txt> isn't correct or?
--------------------------------------------------------
Optimized SETI@Home Enhanced application
Optimizers: Ben Herndon, Josef Segur, Alex Kan, Simon Zadra
Version: Windows SSE3 32-bit based on seti V5.15 'Noo? No - Ni!'
Revision: R-2.2|xT|FFT:IPP_SSE3|Ben-Joe
CPUID: Intel Xeon 51xx 'Woodcrest'
CPUs: 1, cores: 4, threads: 1 cache: L1=32K, L2=4096K, L3=0K
Features: MMX, SSE, SSE3, SSE3, SSSE3
speed: 2666 MHz -- read MB/s: L1=9951, L2=8617, RAM=5579
--------------------------------------------------------
Only for information! :)
EDIT:
I downloaded (I have running (Task Manager)) the KWSN_2.2_SSSE3-C2_Ben-Joe.exe... (with the other files)
Or is the link not right for the SSSE3 app.? And I have the SSE3 app.?
Or the name of the/my app. is not rigtht? SSSE3 but is SSE3?
Urs Echternacht:
--- Quote from: Furex on 16 Feb 2007, 10:40:46 am ---What are the improvements of the newest release? I've read something about C2D, so isn't there anything new for older machines ?
--- End quote ---
credit for wu: 57.17
Pentium3 1.4GHz@1.63GHz:
v1.3_SSE 21700secs
v2.2_SSE 17775secs
improvement appr. -18%
credit for wu: 60.86
Pentium M 2.0GHz@2.4GHz:
v2.0_SSE2_PM 7821secs
v2.2_SSE2_PM 6348secs
improvement appr. -19% (but first wu was validated INVALID)
Thanks to all the optimizers over here.
Simon:
Eek!
Seems I made a typo when changing the stats output; the first "SSE3" should be SSE2, instead.
Guess I'll have to recompile the apps again :) Thanks for noticing. Since it's a cosmetic thing only (it doesn't affect any function choices), it won't be a required upgrade.
--- Quote from: Furex on 16 Feb 2007, 10:40:46 am ---What are the improvements of the newest release? I've read something about C2D, so isn't there anything new for older machines ?
--- End quote ---
As for changes vs. 2.0 -
Improved pulse folding
Improved accuracy (especially on Core 2 systems vs. 1.41)
Benchmarking for the various folding versions
Some extra chirp functions adapted from Alex Kan's code (SSE and SSE2, was only SSE3 before)
Benchmark improvements as far as correct function choices go (the app tests each available function for sub-tasks like chirping, pulse folding, etc. up to the supported SSE level and uses the quickest, but did choose incorrectly sometimes, fixed)
Major efficiency improvement by Joe Segur - Not doing transpose when it's not needed
Doing transpose on 4 FFT chunks at a time rather than 1
and some others I probably forgot. Ben and Joe can complete the list or correct it.
How the apps will perform depends a lot on your host. On my Xeon 3.0 HT system, I saw an unbelievable jump of 60%. On my A64, around 15%, on my PD 805, around 20%, same on PD 9xx hosts. Around 10% quicker than 1.41 on Core 2 systems for most WUs, and around 35% quicker for the dreaded 58.7s, if I remember correctly. These are values I've seen in my benchmarks, so they may not reflect your results.
That said, you have my word you'll be pleasantly surprised because it is quicker than 2.0B and 1.41 on ALL hosts I've tested it on.
HTH,
Simon.
Navigation
[0] Message Index
[#] Next page
[*] Previous page
Go to full version