Forum > Discussion Forum
AVX Optimized App Development
Josef W. Segur:
--- Quote from: KarVi on 08 May 2012, 12:27:17 pm ---...
Seems my 8150 is more consistent about choosing f6 than arkayn's FX-4100... I wonder why?
--- End quote ---
Maybe a measure of how well the hardware prefetching is matched to the memory system. In the attached J48 I've added an fn with no software prefetching, perhaps your system will prefer that over f6.
I've also modified the way the test time is calculated slightly. Each test consists of ten runs and the average of all was used, now I'm dropping the slowest of the ten runs to reduce the effect of transient conditions. I expect it to still vary more with BOINC running than without, though.
Edit: Attachment removed, see later post for current test.
Joe
arkayn:
I don't know if it makes all that much difference but I am running BOINC on the GTX460 while the test is running, the CPU's are idle though.
=========================================================
Ftst_v7_J48_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_ChirpData 0.009465 0.00000 test
fpu_ChirpData 0.018363 0.00000 test
fpu_opt_ChirpData 0.009566 0.00000 test
sse1_ChirpData_ak8e 0.007725 0.00000 test
sse2_ChirpData_ak8 0.004682 0.00000 test
sse3_ChirpData_ak8 0.004917 0.00000 test
avx_ChirpData_a 0.004643 0.00000 test
avx_ChirpData_b 0.004889 0.00000 test
avx_ChirpData_c 0.004597 0.00000 test
avx_ChirpData_d 0.004607 0.00000 test
avx_ChirpData_e 0.004117 0.00000 test
avx_ChirpData_f2 0.003935 0.00000 test
avx_ChirpData_f3 0.004306 0.00000 test
avx_ChirpData_f4 0.003974 0.00000 test
avx_ChirpData_f5 0.003947 0.00000 test
avx_ChirpData_f6 0.004002 0.00000 test
avx_ChirpData_fn 0.004189 0.00000 test
avx_ChirpData_f2 0.003935 0.00000 choice
Second run
v_ChirpData 0.009229 0.00000 test
fpu_ChirpData 0.017868 0.00000 test
fpu_opt_ChirpData 0.009018 0.00000 test
sse1_ChirpData_ak8e 0.007519 0.00000 test
sse2_ChirpData_ak8 0.004623 0.00000 test
sse3_ChirpData_ak8 0.004755 0.00000 test
avx_ChirpData_a 0.003940 0.00000 test
avx_ChirpData_b 0.003992 0.00000 test
avx_ChirpData_c 0.004212 0.00000 test
avx_ChirpData_d 0.004157 0.00000 test
avx_ChirpData_e 0.004050 0.00000 test
avx_ChirpData_f2 0.004179 0.00000 test
avx_ChirpData_f3 0.004142 0.00000 test
avx_ChirpData_f4 0.003875 0.00000 test
avx_ChirpData_f5 0.004078 0.00000 test
avx_ChirpData_f6 0.004131 0.00000 test
avx_ChirpData_fn 0.004142 0.00000 test
avx_ChirpData_f4 0.003875 0.00000 choice
Third run
v_ChirpData 0.009246 0.00000 test
fpu_ChirpData 0.018076 0.00000 test
fpu_opt_ChirpData 0.009144 0.00000 test
sse1_ChirpData_ak8e 0.007887 0.00000 test
sse2_ChirpData_ak8 0.004614 0.00000 test
sse3_ChirpData_ak8 0.004732 0.00000 test
avx_ChirpData_a 0.004012 0.00000 test
avx_ChirpData_b 0.003975 0.00000 test
avx_ChirpData_c 0.004200 0.00000 test
avx_ChirpData_d 0.004111 0.00000 test
avx_ChirpData_e 0.003970 0.00000 test
avx_ChirpData_f2 0.003894 0.00000 test
avx_ChirpData_f3 0.003784 0.00000 test
avx_ChirpData_f4 0.003861 0.00000 test
avx_ChirpData_f5 0.003912 0.00000 test
avx_ChirpData_f6 0.003801 0.00000 test
avx_ChirpData_fn 0.003966 0.00000 test
avx_ChirpData_f3 0.003784 0.00000 choice
Test duration 10.19 seconds
Ftst_v7 completed successfully.
arkayn:
Here is one with everything idle on the system.
=========================================================
Ftst_v7_J48_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_ChirpData 0.009247 0.00000 test
fpu_ChirpData 0.017567 0.00000 test
fpu_opt_ChirpData 0.008871 0.00000 test
sse1_ChirpData_ak8e 0.007218 0.00000 test
sse2_ChirpData_ak8 0.004589 0.00000 test
sse3_ChirpData_ak8 0.004675 0.00000 test
avx_ChirpData_a 0.003838 0.00000 test
avx_ChirpData_b 0.003844 0.00000 test
avx_ChirpData_c 0.004144 0.00000 test
avx_ChirpData_d 0.004042 0.00000 test
avx_ChirpData_e 0.003897 0.00000 test
avx_ChirpData_f2 0.003701 0.00000 test
avx_ChirpData_f3 0.003696 0.00000 test
avx_ChirpData_f4 0.003701 0.00000 test
avx_ChirpData_f5 0.003678 0.00000 test
avx_ChirpData_f6 0.003678 0.00000 test
avx_ChirpData_fn 0.004046 0.00000 test
avx_ChirpData_f6 0.003678 0.00000 choice
Second run
v_ChirpData 0.009537 0.00000 test
fpu_ChirpData 0.017549 0.00000 test
fpu_opt_ChirpData 0.008801 0.00000 test
sse1_ChirpData_ak8e 0.007091 0.00000 test
sse2_ChirpData_ak8 0.004581 0.00000 test
sse3_ChirpData_ak8 0.004691 0.00000 test
avx_ChirpData_a 0.003840 0.00000 test
avx_ChirpData_b 0.003835 0.00000 test
avx_ChirpData_c 0.004116 0.00000 test
avx_ChirpData_d 0.004036 0.00000 test
avx_ChirpData_e 0.003922 0.00000 test
avx_ChirpData_f2 0.003738 0.00000 test
avx_ChirpData_f3 0.003700 0.00000 test
avx_ChirpData_f4 0.003722 0.00000 test
avx_ChirpData_f5 0.003709 0.00000 test
avx_ChirpData_f6 0.003696 0.00000 test
avx_ChirpData_fn 0.003885 0.00000 test
avx_ChirpData_f6 0.003696 0.00000 choice
Third run
v_ChirpData 0.008990 0.00000 test
fpu_ChirpData 0.017410 0.00000 test
fpu_opt_ChirpData 0.008669 0.00000 test
sse1_ChirpData_ak8e 0.007166 0.00000 test
sse2_ChirpData_ak8 0.004573 0.00000 test
sse3_ChirpData_ak8 0.004653 0.00000 test
avx_ChirpData_a 0.003878 0.00000 test
avx_ChirpData_b 0.003845 0.00000 test
avx_ChirpData_c 0.004118 0.00000 test
avx_ChirpData_d 0.004046 0.00000 test
avx_ChirpData_e 0.003878 0.00000 test
avx_ChirpData_f2 0.003707 0.00000 test
avx_ChirpData_f3 0.003746 0.00000 test
avx_ChirpData_f4 0.003841 0.00000 test
avx_ChirpData_f5 0.003697 0.00000 test
avx_ChirpData_f6 0.003668 0.00000 test
avx_ChirpData_fn 0.003951 0.00000 test
avx_ChirpData_f6 0.003668 0.00000 choice
Test duration 9.64 seconds
Ftst_v7 completed successfully.
arkayn:
i3-2120, GTX560 crunching and CPU idle
=========================================================
Ftst_v7_J47_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_ChirpData 0.005192 0.00000 test
fpu_ChirpData 0.016616 0.00000 test
fpu_opt_ChirpData 0.005312 0.00000 test
sse1_ChirpData_ak8e 0.007554 0.00000 test
sse2_ChirpData_ak8 0.004366 0.00000 test
sse3_ChirpData_ak8 0.004680 0.00000 test
avx_ChirpData_a 0.002474 0.00000 test
avx_ChirpData_b 0.002890 0.00000 test
avx_ChirpData_c 0.002728 0.00000 test
avx_ChirpData_d 0.002087 0.00000 test
avx_ChirpData_e 0.002756 0.00000 test
avx_ChirpData_f2 0.002708 0.00000 test
avx_ChirpData_f3 0.002074 0.00000 test
avx_ChirpData_f4 0.002303 0.00000 test
avx_ChirpData_f5 0.002295 0.00000 test
avx_ChirpData_f6 0.002532 0.00000 test
avx_ChirpData_f3 0.002074 0.00000 choice
Second run
v_ChirpData 0.007313 0.00000 test
fpu_ChirpData 0.015500 0.00000 test
fpu_opt_ChirpData 0.005429 0.00000 test
sse1_ChirpData_ak8e 0.006784 0.00000 test
sse2_ChirpData_ak8 0.004199 0.00000 test
sse3_ChirpData_ak8 0.004349 0.00000 test
avx_ChirpData_a 0.002453 0.00000 test
avx_ChirpData_b 0.002080 0.00000 test
avx_ChirpData_c 0.002121 0.00000 test
avx_ChirpData_d 0.002363 0.00000 test
avx_ChirpData_e 0.002128 0.00000 test
avx_ChirpData_f2 0.002339 0.00000 test
avx_ChirpData_f3 0.002652 0.00000 test
avx_ChirpData_f4 0.002237 0.00000 test
avx_ChirpData_f5 0.003082 0.00000 test
avx_ChirpData_f6 0.002211 0.00000 test
avx_ChirpData_b 0.002080 0.00000 choice
Third run
v_ChirpData 0.005128 0.00000 test
fpu_ChirpData 0.013508 0.00000 test
fpu_opt_ChirpData 0.005062 0.00000 test
sse1_ChirpData_ak8e 0.005944 0.00000 test
sse2_ChirpData_ak8 0.004307 0.00000 test
sse3_ChirpData_ak8 0.004244 0.00000 test
avx_ChirpData_a 0.002100 0.00000 test
avx_ChirpData_b 0.002253 0.00000 test
avx_ChirpData_c 0.002125 0.00000 test
avx_ChirpData_d 0.001955 0.00000 test
avx_ChirpData_e 0.002250 0.00000 test
avx_ChirpData_f2 0.002177 0.00000 test
avx_ChirpData_f3 0.002137 0.00000 test
avx_ChirpData_f4 0.002128 0.00000 test
avx_ChirpData_f5 0.002131 0.00000 test
avx_ChirpData_f6 0.002574 0.00000 test
avx_ChirpData_d 0.001955 0.00000 choice
Test duration 8.86 seconds
Ftst_v7 completed successfully.
arkayn:
i3-2120, idle
=========================================================
Ftst_v7_J48_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_ChirpData 0.004554 0.00000 test
fpu_ChirpData 0.012295 0.00000 test
fpu_opt_ChirpData 0.004339 0.00000 test
sse1_ChirpData_ak8e 0.006798 0.00000 test
sse2_ChirpData_ak8 0.004179 0.00000 test
sse3_ChirpData_ak8 0.004091 0.00000 test
avx_ChirpData_a 0.002076 0.00000 test
avx_ChirpData_b 0.002049 0.00000 test
avx_ChirpData_c 0.002097 0.00000 test
avx_ChirpData_d 0.002275 0.00000 test
avx_ChirpData_e 0.002107 0.00000 test
avx_ChirpData_f2 0.002080 0.00000 test
avx_ChirpData_f3 0.002063 0.00000 test
avx_ChirpData_f4 0.002049 0.00000 test
avx_ChirpData_f5 0.002050 0.00000 test
avx_ChirpData_f6 0.002047 0.00000 test
avx_ChirpData_fn 0.002191 0.00000 test
avx_ChirpData_f6 0.002047 0.00000 choice
Second run
v_ChirpData 0.004551 0.00000 test
fpu_ChirpData 0.012311 0.00000 test
fpu_opt_ChirpData 0.004347 0.00000 test
sse1_ChirpData_ak8e 0.005693 0.00000 test
sse2_ChirpData_ak8 0.004187 0.00000 test
sse3_ChirpData_ak8 0.004125 0.00000 test
avx_ChirpData_a 0.002071 0.00000 test
avx_ChirpData_b 0.002055 0.00000 test
avx_ChirpData_c 0.002099 0.00000 test
avx_ChirpData_d 0.001932 0.00000 test
avx_ChirpData_e 0.001932 0.00000 test
avx_ChirpData_f2 0.002088 0.00000 test
avx_ChirpData_f3 0.002060 0.00000 test
avx_ChirpData_f4 0.002052 0.00000 test
avx_ChirpData_f5 0.002050 0.00000 test
avx_ChirpData_f6 0.002052 0.00000 test
avx_ChirpData_fn 0.002187 0.00000 test
avx_ChirpData_d 0.001932 0.00000 choice
Third run
v_ChirpData 0.004542 0.00000 test
fpu_ChirpData 0.012313 0.00000 test
fpu_opt_ChirpData 0.004336 0.00000 test
sse1_ChirpData_ak8e 0.005710 0.00000 test
sse2_ChirpData_ak8 0.004181 0.00000 test
sse3_ChirpData_ak8 0.004121 0.00000 test
avx_ChirpData_a 0.002073 0.00000 test
avx_ChirpData_b 0.002051 0.00000 test
avx_ChirpData_c 0.002099 0.00000 test
avx_ChirpData_d 0.001932 0.00000 test
avx_ChirpData_e 0.001932 0.00000 test
avx_ChirpData_f2 0.002078 0.00000 test
avx_ChirpData_f3 0.002061 0.00000 test
avx_ChirpData_f4 0.002052 0.00000 test
avx_ChirpData_f5 0.002053 0.00000 test
avx_ChirpData_f6 0.002048 0.00000 test
avx_ChirpData_fn 0.002186 0.00000 test
avx_ChirpData_e 0.001932 0.00000 choice
Test duration 8.94 seconds
Ftst_v7 completed successfully.
Navigation
[0] Message Index
[#] Next page
[*] Previous page
Go to full version