Forum > Windows

New apps based on code revision 2.2 'Noo? No, Ni!' have been released!

<< < (4/22) > >>

KarVi:
You're welcome Simon.


You're probably right about the pipeline, it could be significant, but then it should have shown in the previous versions also. I know I tested the SSE2-PM with rev. 2.0, and it wasn't faster at that time (but not much slower either).

But many factors have changed, and even the new code-changes, (who as I understand it, put less pressure on the L2 cache and memory), could make the PM-version perform better on A64.

It could also be a fluke, but a 7 seconds difference on 212 seconds run-times is a big variation. I'm going to run the long test tomorrow, to see if the results are the same.

While I'm here I will just mention, that on my old AthlonXP Thoroughbread at 1936Mhz, with 256Kb cache, I'm seeing a 25+ % improvement on 62 points WU's. An extremely impressive result!

Simon:
Thanks to Ben, Joe and Alex, it is indeed impressive :)

The next app revision will probably deal with the new 5.18 code, and will offer different challenges.

Regards,
Simon.

Furex:

--- Quote from: Simon on 16 Feb 2007, 04:20:25 pm ---As for changes vs. 2.0 - (list follows)
--- End quote ---

Thank you Simon! :)

I'm testing GenSSE2 and it seems faster than patched iSSE3. I'm doing my tests on a batch of long WUs (~62.4) I've retrieved some time ago. After the interesting findings by KarVi I'll probably end up doing more tests on patched R2.2 apps to see whether the gains on the short benchmark units  show up in real world crunching, too.

Hope this release does something also for some classes of shorter units which turned out to be much slower (up to 40-50%) than the longer ones.

KarVi:
New results.

Running KWSN Test & Benchmark tool, with patched and renamed Rev. 2.2 applications, in long test mode, on my Athlon64.

Patched Intel "only" SSE3-P4 Rev. 2.2:   397 seconds.
Patched Intel "only" SSE2-P4 Rev. 2.2:   387 seconds.
Patched Intel "only" SSE2-PM Rev. 2.2:   381 seconds.
Generic SSE2 Rev. 2.2:                         395 seconds.

The results seem to be conclusive:

For my processor, the patched SSE2-PM is the fastest client.

msattler:

--- Quote from: KarVi on 17 Feb 2007, 08:33:30 am ---New results.

Running KWSN Test & Benchmark tool, with patched and renamed Rev. 2.2 applications, in long test mode, on my Athlon64.

Patched Intel "only" SSE3-P4 Rev. 2.2:   397 seconds.
Patched Intel "only" SSE2-P4 Rev. 2.2:   387 seconds.
Patched Intel "only" SSE2-PM Rev. 2.2:   381 seconds.
Generic SSE2 Rev. 2.2:                         395 seconds.

The results seem to be conclusive:

For my processor, the patched SSE2-PM is the fastest client.

--- End quote ---

KarVi,
Could you possibly PM me the patched clients to test on my FX60 rig?

Navigation

[0] Message Index

[#] Next page

[*] Previous page

Go to full version