Computation Error With Optimized Apps
Lysia:
Done. Results of the prerelease apps are exactly identical. The results are very similar to the ones of the released SSE3-app (peak power differs slightly, and one of the time_series differs in one digit). I have attached all results (including those from the released apps) and a diff between the nearly identical results.
My result is identical to Urs' for the prerelease.
All "good" apps produce the same indices.txt, which is written right at the beginning of the calculation, and it differs significantly from the one produced by the broken app. Although the file is of no use after the calculation, it might be the cause of the miscalculation (or the difference has the same origin), so I would start there when searching for the bug.
EDIT: For a performance comparison:
prerelease SSE2:
--- Code: ---real 1847m42.197s
user 1703m37.728s
sys 93m4.481s
--- End code ---
prerelease SSE3:
--- Code: ---real 1777m46.620s
user 1631m52.455s
sys 93m21.558s
--- End code ---
release SSE2:
--- Code: ---real 1429m15.220s
user 1317m58.278s
sys 86m18.336s
--- End code ---
release SSE3:
--- Code: ---real 1475m25.372s
user 1363m40.233s
sys 86m3.655s
--- End code ---
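To make the four runs easier to compare, here is a minimal sketch that converts the "real" values above into seconds and relates them to the prerelease SSE2 run. The helper name parse_time and the choice of prerelease SSE2 as the baseline are my own assumptions for illustration, not part of any of the apps.

```python
import re

def parse_time(text):
    """Convert a `time`-style value such as '1847m42.197s' to seconds."""
    m = re.fullmatch(r"(\d+)m(\d+(?:\.\d+)?)s", text.strip())
    if m is None:
        raise ValueError(f"unrecognized time value: {text!r}")
    return int(m.group(1)) * 60 + float(m.group(2))

# Wall-clock ("real") times copied from the runs listed above.
real_times = {
    "prerelease SSE2": "1847m42.197s",
    "prerelease SSE3": "1777m46.620s",
    "release SSE2": "1429m15.220s",
    "release SSE3": "1475m25.372s",
}

seconds = {name: parse_time(value) for name, value in real_times.items()}
baseline = seconds["prerelease SSE2"]  # assumed baseline, chosen for illustration

# Print the runs from fastest to slowest, with the ratio to the baseline.
for name, secs in sorted(seconds.items(), key=lambda item: item[1]):
    print(f"{name:16s} {secs / 3600:6.2f} h  {secs / baseline:5.2f}x of prerelease SSE2")
```

Keep in mind that the fastest entry, release SSE2, is the app whose results differ, so its speed says little until the bug is found.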
sunu:
I collected the results from Lysia, Urs and myself, compared them against each other, and I agree with what you've all said above. So to recap:
SSE2 app from Crunch3r's page is not good.
SSE3 app from Crunch3r's page and pre-release apps I've posted above are good.
So for the time being, until Crunch3r makes a new one, anyone who wants an SSE2 app can use the one attached.
Urs Echternacht:
Checking results:
First, the ones that are supposed to be OK:
--- Code: ---.../ap_5.03/linux/ap_5.03_sse2_lx32_r03/compare> ls
pulse_503pr_sse2.out pulse.out_pre pulse.out_rel_SSE3 pulse.out_SSE3 rescmpAP2
.../ap_5.03/linux/ap_5.03_sse2_lx32_r03/compare> ./rescmpAP2 pulse_503pr_sse2.out pulse.out_pre
\: <ap_signal>14,<pulses>4,<best_pulses>10
: <ap_signal>14,<pulses>4,<best_pulses>10
All Signals: Checked 14, 14 , Strongly Similar
Pulses: Checked 4, 4 , Strongly Similar
Best Pulses: Checked 10, 10 , Strongly Similar
.../ap_5.03/linux/ap_5.03_sse2_lx32_r03/compare> ./rescmpAP2 pulse_503pr_sse2.out pulse.out_rel_SSE3
\: <ap_signal>14,<pulses>4,<best_pulses>10
: <ap_signal>14,<pulses>4,<best_pulses>10
All Signals: Checked 14, 14 , Strongly Similar
Pulses: Checked 4, 4 , Strongly Similar
Best Pulses: Checked 10, 10 , Strongly Similar
.../ap_5.03/linux/ap_5.03_sse2_lx32_r03/compare> ./rescmpAP2 pulse_503pr_sse2.out pulse.out_SSE3
\: <ap_signal>14,<pulses>4,<best_pulses>10
: <ap_signal>14,<pulses>4,<best_pulses>10
All Signals: Checked 14, 14 , Strongly Similar
Pulses: Checked 4, 4 , Strongly Similar
Best Pulses: Checked 10, 10 , Strongly Similar
--- End code ---
They are strongly similar.
Now against the ones from the bad release version:
--- Code: ---.../ap_5.03/linux/ap_5.03_sse2_lx32_r03/compare> ls
pulse_503pr_sse2.out pulse_503rel_sse2.out pulse.out_pre pulse.out_rel_SSE2 pulse.out_rel_SSE3 pulse.out_SSE2 pulse.out_SSE3 rescmpAP2
.../ap_5.03/linux/ap_5.03_sse2_lx32_r03/compare> ./rescmpAP2 pulse_503pr_sse2.out pulse_503rel_sse2.out
\: <ap_signal>14,<pulses>4,<best_pulses>10
: <ap_signal>40,<pulses>30,<best_pulses>10
All Signals: Weakly similar or Different.
Pulses: Weakly similar or Different.
Best Pulses: Weakly similar or Different.
.../ap_5.03/linux/ap_5.03_sse2_lx32_r03/compare> ./rescmpAP2 pulse_503pr_sse2.out pulse.out_rel_SSE2
\: <ap_signal>14,<pulses>4,<best_pulses>10
: <ap_signal>40,<pulses>30,<best_pulses>10
All Signals: Weakly similar or Different.
Pulses: Weakly similar or Different.
Best Pulses: Weakly similar or Different.
.../ap_5.03/linux/ap_5.03_sse2_lx32_r03/compare> ./rescmpAP2 pulse_503pr_sse2.out pulse.out_SSE2
\: <ap_signal>14,<pulses>4,<best_pulses>10
: <ap_signal>40,<pulses>30,<best_pulses>10
All Signals: Weakly similar or Different.
Pulses: Weakly similar or Different.
Best Pulses: Weakly similar or Different.
--- End code ---
They are obviously different (compare the number of signals).
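The count mismatch can also be spotted without rescmpAP2. The following is only a rough sketch that reads the two summary lines printed above and reports which counts differ; it is not how rescmpAP2 itself compares results, and parse_counts is a hypothetical helper name.

```python
import re

# Matches entries such as "<ap_signal>14" in the summary lines shown above.
COUNT_RE = re.compile(r"<(ap_signal|pulses|best_pulses)>(\d+)")

def parse_counts(line):
    """Return a dict like {'ap_signal': 14, 'pulses': 4, 'best_pulses': 10}."""
    return {tag: int(number) for tag, number in COUNT_RE.findall(line)}

# Summary lines copied from the comparison of the prerelease and release SSE2 results.
good = parse_counts(r"\: <ap_signal>14,<pulses>4,<best_pulses>10")
bad = parse_counts(": <ap_signal>40,<pulses>30,<best_pulses>10")

# Keep only the counts that differ, as (good, bad) pairs.
mismatched = {tag: (good[tag], bad[tag]) for tag in good if good[tag] != bad.get(tag)}
print(mismatched)
```

This only flags differing counts; judging whether matching signals are "strongly similar" still needs the real comparison tool.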
Lysia:
--- Quote from: sunu on 01 Apr 2009, 06:07:40 pm ---So for the time being, till Crunc3r makes a new one, everyone who wants a SSE2 app you can use the one attached.
--- End quote ---
I think I will do some performance comparisons tomorrow and run the SSE version. It might be that the SSE version of the released app is faster than the SSE2 version of the prerelease. Can you also attach the SSE version of the prerelease?
sunu:
The SSE app on Crunch3r's page is the same one we had for testing; it didn't change. And it is slower than the SSE2 app.