Forum > Discussion Forum
AVX Optimized App Development
Claggy:
i7-2600K @4.7GHz (Boinc running):
=========================================================
Ftst_v7_J49_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_ChirpData 0.007968 0.00000 test
fpu_ChirpData 0.015811 0.00000 test
fpu_opt_ChirpData 0.007296 0.00000 test
sse1_ChirpData_ak8e 0.006359 0.00000 test
sse2_ChirpData_ak8 0.004849 0.00000 test
sse3_ChirpData_ak8 0.004496 0.00000 test
avx_ChirpData_a 0.002711 0.00000 test
avx_ChirpData_b 0.002826 0.00000 test
avx_ChirpData_c 0.002719 0.00000 test
avx_ChirpData_d 0.002606 0.00000 test
avx_ChirpData_e 0.002469 0.00000 test
avx_ChirpData_f 0.002662 0.00000 test
avx_ChirpData_g 0.002796 0.00000 test
avx_ChirpData_h 0.003208 0.00000 test
avx_ChirpData_e 0.002469 0.00000 choice
Second run
v_ChirpData 0.007533 0.00000 test
fpu_ChirpData 0.015415 0.00000 test
fpu_opt_ChirpData 0.006911 0.00000 test
sse1_ChirpData_ak8e 0.006322 0.00000 test
sse2_ChirpData_ak8 0.004869 0.00000 test
sse3_ChirpData_ak8 0.004669 0.00000 test
avx_ChirpData_a 0.002702 0.00000 test
avx_ChirpData_b 0.002879 0.00000 test
avx_ChirpData_c 0.002848 0.00000 test
avx_ChirpData_d 0.002208 0.00000 test
avx_ChirpData_e 0.002697 0.00000 test
avx_ChirpData_f 0.002966 0.00000 test
avx_ChirpData_g 0.002573 0.00000 test
avx_ChirpData_h 0.003257 0.00000 test
avx_ChirpData_d 0.002208 0.00000 choice
Third run
v_ChirpData 0.007272 0.00000 test
fpu_ChirpData 0.016058 0.00000 test
fpu_opt_ChirpData 0.007339 0.00000 test
sse1_ChirpData_ak8e 0.006160 0.00000 test
sse2_ChirpData_ak8 0.004753 0.00000 test
sse3_ChirpData_ak8 0.004710 0.00000 test
avx_ChirpData_a 0.002605 0.00000 test
avx_ChirpData_b 0.002734 0.00000 test
avx_ChirpData_c 0.002658 0.00000 test
avx_ChirpData_d 0.002396 0.00000 test
avx_ChirpData_e 0.002445 0.00000 test
avx_ChirpData_f 0.002504 0.00000 test
avx_ChirpData_g 0.002488 0.00000 test
avx_ChirpData_h 0.003208 0.00000 test
avx_ChirpData_d 0.002396 0.00000 choice
Test duration 7.16 seconds
Ftst_v7 completed successfully.
=========================================================
i7-2600K @4.7GHz (Boinc computing suspended):
=========================================================
Ftst_v7_J49_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_ChirpData 0.003652 0.00000 test
fpu_ChirpData 0.008646 0.00000 test
fpu_opt_ChirpData 0.003595 0.00000 test
sse1_ChirpData_ak8e 0.004217 0.00000 test
sse2_ChirpData_ak8 0.003150 0.00000 test
sse3_ChirpData_ak8 0.002981 0.00000 test
avx_ChirpData_a 0.001477 0.00000 test
avx_ChirpData_b 0.001635 0.00000 test
avx_ChirpData_c 0.001485 0.00000 test
avx_ChirpData_d 0.001377 0.00000 test
avx_ChirpData_e 0.001524 0.00000 test
avx_ChirpData_f 0.001588 0.00000 test
avx_ChirpData_g 0.001619 0.00000 test
avx_ChirpData_h 0.002023 0.00000 test
avx_ChirpData_d 0.001377 0.00000 choice
Second run
v_ChirpData 0.003666 0.00000 test
fpu_ChirpData 0.008667 0.00000 test
fpu_opt_ChirpData 0.003789 0.00000 test
sse1_ChirpData_ak8e 0.004202 0.00000 test
sse2_ChirpData_ak8 0.003154 0.00000 test
sse3_ChirpData_ak8 0.002986 0.00000 test
avx_ChirpData_a 0.001475 0.00000 test
avx_ChirpData_b 0.001635 0.00000 test
avx_ChirpData_c 0.001483 0.00000 test
avx_ChirpData_d 0.001376 0.00000 test
avx_ChirpData_e 0.001531 0.00000 test
avx_ChirpData_f 0.001571 0.00000 test
avx_ChirpData_g 0.001618 0.00000 test
avx_ChirpData_h 0.002019 0.00000 test
avx_ChirpData_d 0.001376 0.00000 choice
Third run
v_ChirpData 0.003673 0.00000 test
fpu_ChirpData 0.008651 0.00000 test
fpu_opt_ChirpData 0.003576 0.00000 test
sse1_ChirpData_ak8e 0.004199 0.00000 test
sse2_ChirpData_ak8 0.003150 0.00000 test
sse3_ChirpData_ak8 0.002981 0.00000 test
avx_ChirpData_a 0.001474 0.00000 test
avx_ChirpData_b 0.001644 0.00000 test
avx_ChirpData_c 0.001483 0.00000 test
avx_ChirpData_d 0.001379 0.00000 test
avx_ChirpData_e 0.001525 0.00000 test
avx_ChirpData_f 0.001572 0.00000 test
avx_ChirpData_g 0.001617 0.00000 test
avx_ChirpData_h 0.002022 0.00000 test
avx_ChirpData_d 0.001379 0.00000 choice
Test duration 5.28 seconds
Ftst_v7 completed successfully.
Claggy
Josef W. Segur:
--- Quote from: KarVi on 10 May 2012, 04:19:07 pm ---...
Mine seems to like f, where arkayns prefers g. Not very conclusive :)
--- End quote ---
Indeed not, and these ~2% differences simply prove I haven't yet found what I'm looking for. Having the h test go the wrong way may be more revealing if I can correlate it with BD features.
Joe
Josef W. Segur:
Trying to find the strengths of Bulldozer, I've added a chirp variant using both AVX and FMA4 in J50. It does reduce the number of instructions in the loop by 8 or more so should have some measurable effect, though it still has to load and save just as much data. Other than the FMA4 changes, it's like the g AVX version.
If I have everything right, it ought to show as unsupported on Sandy Bridge and run on Bulldozer. If not, anything might happen. :P
Joe
arkayn:
FX-4100
BOINC running on GTX460
=========================================================
Ftst_v7_J50_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_ChirpData 0.009627 0.00000 test
fpu_ChirpData 0.019200 0.00000 test
fpu_opt_ChirpData 0.008951 0.00000 test
sse1_ChirpData_ak8e 0.007910 0.00000 test
sse2_ChirpData_ak8 0.005040 0.00000 test
sse3_ChirpData_ak8 0.004927 0.00000 test
avx_ChirpData_a 0.004119 0.00000 test
avx_ChirpData_b 0.004149 0.00000 test
avx_ChirpData_c 0.004650 0.00000 test
avx_ChirpData_d 0.004221 0.00000 test
avx_ChirpData_e 0.004187 0.00000 test
avx_ChirpData_f 0.004013 0.00000 test
avx_ChirpData_g 0.004171 0.00000 test
avx_ChirpData_h 0.005179 0.00000 test
avx_fma4_ChirpData_a 0.003669 0.00000 test
avx_fma4_ChirpData_a 0.003669 0.00000 choice
Second run
v_ChirpData 0.009635 0.00000 test
fpu_ChirpData 0.018249 0.00000 test
fpu_opt_ChirpData 0.009154 0.00000 test
sse1_ChirpData_ak8e 0.007586 0.00000 test
sse2_ChirpData_ak8 0.004708 0.00000 test
sse3_ChirpData_ak8 0.004546 0.00000 test
avx_ChirpData_a 0.004097 0.00000 test
avx_ChirpData_b 0.004024 0.00000 test
avx_ChirpData_c 0.004339 0.00000 test
avx_ChirpData_d 0.004329 0.00000 test
avx_ChirpData_e 0.004205 0.00000 test
avx_ChirpData_f 0.003973 0.00000 test
avx_ChirpData_g 0.003893 0.00000 test
avx_ChirpData_h 0.004708 0.00000 test
avx_fma4_ChirpData_a 0.003704 0.00000 test
avx_fma4_ChirpData_a 0.003704 0.00000 choice
Third run
v_ChirpData 0.009304 0.00000 test
fpu_ChirpData 0.019267 0.00000 test
fpu_opt_ChirpData 0.008838 0.00000 test
sse1_ChirpData_ak8e 0.007273 0.00000 test
sse2_ChirpData_ak8 0.004618 0.00000 test
sse3_ChirpData_ak8 0.004530 0.00000 test
avx_ChirpData_a 0.004216 0.00000 test
avx_ChirpData_b 0.004080 0.00000 test
avx_ChirpData_c 0.004223 0.00000 test
avx_ChirpData_d 0.004374 0.00000 test
avx_ChirpData_e 0.004065 0.00000 test
avx_ChirpData_f 0.003829 0.00000 test
avx_ChirpData_g 0.004143 0.00000 test
avx_ChirpData_h 0.004819 0.00000 test
avx_fma4_ChirpData_a 0.003452 0.00000 test
avx_fma4_ChirpData_a 0.003452 0.00000 choice
Test duration 9.33 seconds
Ftst_v7 completed successfully.
i3-2120
BOINC running on GTX560
=========================================================
Ftst_v7_J50_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_ChirpData 0.004834 0.00000 test
fpu_ChirpData 0.012480 0.00000 test
fpu_opt_ChirpData 0.004621 0.00000 test
sse1_ChirpData_ak8e 0.005919 0.00000 test
sse2_ChirpData_ak8 0.004306 0.00000 test
sse3_ChirpData_ak8 0.004118 0.00000 test
avx_ChirpData_a 0.002149 0.00000 test
avx_ChirpData_b 0.002130 0.00000 test
avx_ChirpData_c 0.002282 0.00000 test
avx_ChirpData_d 0.002143 0.00000 test
avx_ChirpData_e 0.002010 0.00000 test
avx_ChirpData_f 0.002116 0.00000 test
avx_ChirpData_g 0.002156 0.00000 test
avx_ChirpData_h 0.002744 0.00000 test
avx_fma4_ChirpData_a not supported by system
avx_ChirpData_e 0.002010 0.00000 choice
Second run
v_ChirpData 0.004538 0.00000 test
fpu_ChirpData 0.012324 0.00000 test
fpu_opt_ChirpData 0.004330 0.00000 test
sse1_ChirpData_ak8e 0.005758 0.00000 test
sse2_ChirpData_ak8 0.004179 0.00000 test
sse3_ChirpData_ak8 0.004003 0.00000 test
avx_ChirpData_a 0.002143 0.00000 test
avx_ChirpData_b 0.002585 0.00000 test
avx_ChirpData_c 0.002312 0.00000 test
avx_ChirpData_d 0.001930 0.00000 test
avx_ChirpData_e 0.002107 0.00000 test
avx_ChirpData_f 0.002309 0.00000 test
avx_ChirpData_g 0.002067 0.00000 test
avx_ChirpData_h 0.002657 0.00000 test
avx_fma4_ChirpData_a not supported by system
avx_ChirpData_d 0.001930 0.00000 choice
Third run
v_ChirpData 0.005770 0.00000 test
fpu_ChirpData 0.012279 0.00000 test
fpu_opt_ChirpData 0.004469 0.00000 test
sse1_ChirpData_ak8e 0.006027 0.00000 test
sse2_ChirpData_ak8 0.004288 0.00000 test
sse3_ChirpData_ak8 0.004026 0.00000 test
avx_ChirpData_a 0.002081 0.00000 test
avx_ChirpData_b 0.002052 0.00000 test
avx_ChirpData_c 0.002536 0.00000 test
avx_ChirpData_d 0.001946 0.00000 test
avx_ChirpData_e 0.001987 0.00000 test
avx_ChirpData_f 0.002057 0.00000 test
avx_ChirpData_g 0.002298 0.00000 test
avx_ChirpData_h 0.002790 0.00000 test
avx_fma4_ChirpData_a not supported by system
avx_ChirpData_d 0.001946 0.00000 choice
Test duration 7.68 seconds
Ftst_v7 completed successfully.
Mike:
FX 8150 @4.4 GHZ Win 8 CP
Ftst_v7_J50_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_ChirpData 0.008360 0.00000 test
fpu_ChirpData 0.013990 0.00000 test
fpu_opt_ChirpData 0.009692 0.00000 test
sse1_ChirpData_ak8e 0.005746 0.00000 test
sse2_ChirpData_ak8 0.003786 0.00000 test
sse3_ChirpData_ak8 0.003776 0.00000 test
avx_ChirpData_a 0.003227 0.00000 test
avx_ChirpData_b 0.003268 0.00000 test
avx_ChirpData_c 0.003384 0.00000 test
avx_ChirpData_d 0.003312 0.00000 test
avx_ChirpData_e 0.003169 0.00000 test
avx_ChirpData_f 0.003059 0.00000 test
avx_ChirpData_g 0.003046 0.00000 test
avx_ChirpData_h 0.003847 0.00000 test
avx_fma4_ChirpData_a 0.002927 0.00000 test
avx_fma4_ChirpData_a 0.002927 0.00000 choice
Second run
v_ChirpData 0.009693 0.00000 test
fpu_ChirpData 0.013975 0.00000 test
fpu_opt_ChirpData 0.009653 0.00000 test
sse1_ChirpData_ak8e 0.005742 0.00000 test
sse2_ChirpData_ak8 0.003836 0.00000 test
sse3_ChirpData_ak8 0.003730 0.00000 test
avx_ChirpData_a 0.003346 0.00000 test
avx_ChirpData_b 0.003312 0.00000 test
avx_ChirpData_c 0.003432 0.00000 test
avx_ChirpData_d 0.003368 0.00000 test
avx_ChirpData_e 0.003222 0.00000 test
avx_ChirpData_f 0.003203 0.00000 test
avx_ChirpData_g 0.003240 0.00000 test
avx_ChirpData_h 0.003889 0.00000 test
avx_fma4_ChirpData_a 0.003078 0.00000 test
avx_fma4_ChirpData_a 0.003078 0.00000 choice
Third run
v_ChirpData 0.009655 0.00000 test
fpu_ChirpData 0.013997 0.00000 test
fpu_opt_ChirpData 0.009633 0.00000 test
sse1_ChirpData_ak8e 0.005748 0.00000 test
sse2_ChirpData_ak8 0.003857 0.00000 test
sse3_ChirpData_ak8 0.003740 0.00000 test
avx_ChirpData_a 0.003364 0.00000 test
avx_ChirpData_b 0.003325 0.00000 test
avx_ChirpData_c 0.003434 0.00000 test
avx_ChirpData_d 0.003386 0.00000 test
avx_ChirpData_e 0.003235 0.00000 test
avx_ChirpData_f 0.003228 0.00000 test
avx_ChirpData_g 0.003249 0.00000 test
avx_ChirpData_h 0.003882 0.00000 test
avx_fma4_ChirpData_a 0.003090 0.00000 test
avx_fma4_ChirpData_a 0.003090 0.00000 choice
Test duration 7.26 seconds
Ftst_v7 completed successfully.
Navigation
[0] Message Index
[#] Next page
[*] Previous page
Go to full version