Forum > Discussion Forum
AVX Optimized App Development
arkayn:
FX-4100@3.6
BOINC idle
=========================================================
Ftst_v7_J55_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
First run, 1048576 sample testing
v_ChirpData 0.051682 0.00000 test mintime= 0.051531
fpu_ChirpData 0.017529 0.00000 test mintime= 0.017479
sse1_ChirpData_ak8e 0.007230 0.00000 test mintime= 0.007164
sse2_ChirpData_ak8 0.004583 0.00000 test mintime= 0.004521
sse3_ChirpData_ak8 0.004468 0.00000 test mintime= 0.004435
avx_ChirpData_a 0.003825 0.00000 test mintime= 0.003762
avx_ChirpData_b 0.003839 0.00000 test mintime= 0.003779
avx_ChirpData_c 0.004100 0.00000 test mintime= 0.004079
avx_ChirpData_d 0.003990 0.00000 test mintime= 0.003967
avx_ChirpData_e 0.003914 0.00000 test mintime= 0.003844
avx_ChirpData_f 0.003695 0.00000 test mintime= 0.003664
avx_ChirpData_g 0.003653 0.00000 test mintime= 0.003586
avx_ChirpData_h 0.004360 0.00000 test mintime= 0.004313
avx_ChirpData_i 0.003781 0.00000 test mintime= 0.003734
avx_fma4_ChirpData_a 0.003349 0.00000 test mintime= 0.003328
avx_fma4_ChirpData_d4 0.003376 0.00000 test mintime= 0.003356
avx_fma4_ChirpData_d6 0.003417 0.00000 test mintime= 0.003329
avx_fma4_ChirpData_d8 0.003378 0.00000 test mintime= 0.003339
avx_fma4_ChirpData_e 0.003745 0.00000 test mintime= 0.003706
avx_fma4_ChirpData_a 0.003349 0.00000 choice
Second run, 131072 sample testing
v_ChirpData 0.006409 0.00000 test mintime= 0.006369
fpu_ChirpData 0.002194 0.00000 test mintime= 0.002161
sse1_ChirpData_ak8e 0.000900 0.00000 test mintime= 0.000887
sse2_ChirpData_ak8 0.000573 0.00000 test mintime= 0.000564
sse3_ChirpData_ak8 0.000561 0.00000 test mintime= 0.000549
avx_ChirpData_a 0.000477 0.00000 test mintime= 0.000470
avx_ChirpData_b 0.000486 0.00000 test mintime= 0.000478
avx_ChirpData_c 0.000513 0.00000 test mintime= 0.000505
avx_ChirpData_d 0.000502 0.00000 test mintime= 0.000492
avx_ChirpData_e 0.000483 0.00000 test mintime= 0.000456
avx_ChirpData_f 0.000460 0.00000 test mintime= 0.000453
avx_ChirpData_g 0.000450 0.00000 test mintime= 0.000440
avx_ChirpData_h 0.000543 0.00000 test mintime= 0.000531
avx_ChirpData_i 0.000459 0.00000 test mintime= 0.000446
avx_fma4_ChirpData_a 0.000417 0.00000 test mintime= 0.000410
avx_fma4_ChirpData_d4 0.000429 0.00000 test mintime= 0.000415
avx_fma4_ChirpData_d6 0.000419 0.00000 test mintime= 0.000414
avx_fma4_ChirpData_d8 0.000423 0.00000 test mintime= 0.000414
avx_fma4_ChirpData_e 0.000465 0.00000 test mintime= 0.000456
avx_fma4_ChirpData_a 0.000417 0.00000 choice
Third run, 32768 sample testing
v_ChirpData 0.001609 0.00000 test mintime= 0.001590
fpu_ChirpData 0.000548 0.00000 test mintime= 0.000537
sse1_ChirpData_ak8e 0.000225 0.00000 test mintime= 0.000221
sse2_ChirpData_ak8 0.000144 0.00000 test mintime= 0.000140
sse3_ChirpData_ak8 0.000140 0.00000 test mintime= 0.000137
avx_ChirpData_a 0.000120 0.00000 test mintime= 0.000117
avx_ChirpData_b 0.000122 0.00000 test mintime= 0.000120
avx_ChirpData_c 0.000129 0.00000 test mintime= 0.000126
avx_ChirpData_d 0.000125 0.00000 test mintime= 0.000123
avx_ChirpData_e 0.000119 0.00000 test mintime= 0.000114
avx_ChirpData_f 0.000115 0.00000 test mintime= 0.000113
avx_ChirpData_g 0.000112 0.00000 test mintime= 0.000110
avx_ChirpData_h 0.000135 0.00000 test mintime= 0.000132
avx_ChirpData_i 0.000113 0.00000 test mintime= 0.000111
avx_fma4_ChirpData_a 0.000104 0.00000 test mintime= 0.000103
avx_fma4_ChirpData_d4 0.000106 0.00000 test mintime= 0.000104
avx_fma4_ChirpData_d6 0.000105 0.00000 test mintime= 0.000104
avx_fma4_ChirpData_d8 0.000105 0.00000 test mintime= 0.000104
avx_fma4_ChirpData_e 0.000117 0.00000 test mintime= 0.000114
avx_fma4_ChirpData_a 0.000104 0.00000 choice
Test duration 7.34 seconds
Ftst_v7 completed successfully.
i3-2120@3.3
BOINC idle
=========================================================
Ftst_v7_J55_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
First run, 1048576 sample testing
v_ChirpData 0.058924 0.00000 test mintime= 0.058849
fpu_ChirpData 0.012426 0.00000 test mintime= 0.012339
sse1_ChirpData_ak8e 0.005945 0.00000 test mintime= 0.005699
sse2_ChirpData_ak8 0.004193 0.00000 test mintime= 0.004164
sse3_ChirpData_ak8 0.004016 0.00000 test mintime= 0.003993
avx_ChirpData_a 0.002082 0.00000 test mintime= 0.002074
avx_ChirpData_b 0.002039 0.00000 test mintime= 0.002034
avx_ChirpData_c 0.002107 0.00000 test mintime= 0.002098
avx_ChirpData_d 0.001936 0.00000 test mintime= 0.001932
avx_ChirpData_e 0.001928 0.00000 test mintime= 0.001918
avx_ChirpData_f 0.002054 0.00000 test mintime= 0.002044
avx_ChirpData_g 0.002078 0.00000 test mintime= 0.002070
avx_ChirpData_h 0.002735 0.00000 test mintime= 0.002641
avx_ChirpData_i 0.002223 0.00000 test mintime= 0.002212
avx_fma4_ChirpData_a not supported by system
avx_fma4_ChirpData_d4 not supported by system
avx_fma4_ChirpData_d6 not supported by system
avx_fma4_ChirpData_d8 not supported by system
avx_fma4_ChirpData_e not supported by system
avx_ChirpData_e 0.001928 0.00000 choice
Second run, 131072 sample testing
v_ChirpData 0.007376 0.00000 test mintime= 0.007337
fpu_ChirpData 0.001547 0.00000 test mintime= 0.001540
sse1_ChirpData_ak8e 0.000714 0.00000 test mintime= 0.000712
sse2_ChirpData_ak8 0.000522 0.00000 test mintime= 0.000520
sse3_ChirpData_ak8 0.000500 0.00000 test mintime= 0.000498
avx_ChirpData_a 0.000260 0.00000 test mintime= 0.000258
avx_ChirpData_b 0.000255 0.00000 test mintime= 0.000254
avx_ChirpData_c 0.000264 0.00000 test mintime= 0.000261
avx_ChirpData_d 0.000242 0.00000 test mintime= 0.000241
avx_ChirpData_e 0.000242 0.00000 test mintime= 0.000239
avx_ChirpData_f 0.000257 0.00000 test mintime= 0.000255
avx_ChirpData_g 0.000260 0.00000 test mintime= 0.000257
avx_ChirpData_h 0.000329 0.00000 test mintime= 0.000322
avx_ChirpData_i 0.000271 0.00000 test mintime= 0.000267
avx_fma4_ChirpData_a not supported by system
avx_fma4_ChirpData_d4 not supported by system
avx_fma4_ChirpData_d6 not supported by system
avx_fma4_ChirpData_d8 not supported by system
avx_fma4_ChirpData_e not supported by system
avx_ChirpData_e 0.000242 0.00000 choice
Third run, 32768 sample testing
v_ChirpData 0.001841 0.00000 test mintime= 0.001834
fpu_ChirpData 0.000387 0.00000 test mintime= 0.000385
sse1_ChirpData_ak8e 0.000179 0.00000 test mintime= 0.000178
sse2_ChirpData_ak8 0.000131 0.00000 test mintime= 0.000130
sse3_ChirpData_ak8 0.000125 0.00000 test mintime= 0.000124
avx_ChirpData_a 0.000065 0.00000 test mintime= 0.000064
avx_ChirpData_b 0.000064 0.00000 test mintime= 0.000063
avx_ChirpData_c 0.000066 0.00000 test mintime= 0.000065
avx_ChirpData_d 0.000064 0.00000 test mintime= 0.000060
avx_ChirpData_e 0.000060 0.00000 test mintime= 0.000059
avx_ChirpData_f 0.000065 0.00000 test mintime= 0.000063
avx_ChirpData_g 0.000065 0.00000 test mintime= 0.000064
avx_ChirpData_h 0.000081 0.00000 test mintime= 0.000079
avx_ChirpData_i 0.000069 0.00000 test mintime= 0.000064
avx_fma4_ChirpData_a not supported by system
avx_fma4_ChirpData_d4 not supported by system
avx_fma4_ChirpData_d6 not supported by system
avx_fma4_ChirpData_d8 not supported by system
avx_fma4_ChirpData_e not supported by system
avx_ChirpData_e 0.000060 0.00000 choice
Test duration 5.55 seconds
Ftst_v7 completed successfully.
PatrickV2:
--- Quote from: Josef W. Segur on 28 May 2012, 11:17:57 am ---Although there are still puzzles from the tests so far, with the attached J55 I've added another dimension to the tests. J54 and earlier have been doing full Mebisample chirping as needed before doing Gaussian, Pulse, and Triplet finding. For cases where that's not needed, AK_v8 becomes more cache friendly by subdividing. So I modified all the chirp functions to support that, and J55 does testing at 128K and 32K in addition. The timings ought to be about 1/8 and 1/32 of the full length tests.
I do appreciate the testing, and am glad the Ivy Bridge system reacted like other Intel CPUs. Whatever form of dispatch is eventually used, keeping the number of code paths low will be more efficient.
Joe
--- End quote ---
I have handed over the Ivy Bridge system to my (happy) brother (who is ~150kms away from me), so I will not be able to provide more feedback in that respect.
Regards, Patrick.
Claggy:
i7-2600K @4.7GHz (Boinc running):
=========================================================
Ftst_v7_J55_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
First run, 1048576 sample testing
v_ChirpData 0.047452 0.00000 test mintime= 0.046795
fpu_ChirpData 0.015573 0.00000 test mintime= 0.014707
sse1_ChirpData_ak8e 0.006271 0.00000 test mintime= 0.006084
sse2_ChirpData_ak8 0.004890 0.00000 test mintime= 0.004704
sse3_ChirpData_ak8 0.004621 0.00000 test mintime= 0.004309
avx_ChirpData_a 0.003329 0.00000 test mintime= 0.002764
avx_ChirpData_b 0.003110 0.00000 test mintime= 0.002663
avx_ChirpData_c 0.002965 0.00000 test mintime= 0.002510
avx_ChirpData_d 0.002596 0.00000 test mintime= 0.002251
avx_ChirpData_e 0.002902 0.00000 test mintime= 0.002174
avx_ChirpData_f 0.003209 0.00000 test mintime= 0.002657
avx_ChirpData_g 0.002810 0.00000 test mintime= 0.002484
avx_ChirpData_h 0.003235 0.00000 test mintime= 0.003037
avx_ChirpData_i 0.002924 0.00000 test mintime= 0.002605
avx_fma4_ChirpData_a not supported by system
avx_fma4_ChirpData_d4 not supported by system
avx_fma4_ChirpData_d6 not supported by system
avx_fma4_ChirpData_d8 not supported by system
avx_fma4_ChirpData_e not supported by system
avx_ChirpData_d 0.002596 0.00000 choice
Second run, 131072 sample testing
v_ChirpData 0.005896 0.00000 test mintime= 0.005775
fpu_ChirpData 0.001855 0.00000 test mintime= 0.001815
sse1_ChirpData_ak8e 0.000724 0.00000 test mintime= 0.000704
sse2_ChirpData_ak8 0.000571 0.00000 test mintime= 0.000543
sse3_ChirpData_ak8 0.000554 0.00000 test mintime= 0.000531
avx_ChirpData_a 0.000306 0.00000 test mintime= 0.000278
avx_ChirpData_b 0.000340 0.00000 test mintime= 0.000290
avx_ChirpData_c 0.000356 0.00000 test mintime= 0.000288
avx_ChirpData_d 0.000298 0.00000 test mintime= 0.000264
avx_ChirpData_e 0.000323 0.00000 test mintime= 0.000264
avx_ChirpData_f 0.000318 0.00000 test mintime= 0.000271
avx_ChirpData_g 0.000340 0.00000 test mintime= 0.000283
avx_ChirpData_h 0.000396 0.00000 test mintime= 0.000339
avx_ChirpData_i 0.000351 0.00000 test mintime= 0.000297
avx_fma4_ChirpData_a not supported by system
avx_fma4_ChirpData_d4 not supported by system
avx_fma4_ChirpData_d6 not supported by system
avx_fma4_ChirpData_d8 not supported by system
avx_fma4_ChirpData_e not supported by system
avx_ChirpData_d 0.000298 0.00000 choice
Third run, 32768 sample testing
v_ChirpData 0.001490 0.00000 test mintime= 0.001426
fpu_ChirpData 0.000490 0.00000 test mintime= 0.000440
sse1_ChirpData_ak8e 0.000201 0.00000 test mintime= 0.000175
sse2_ChirpData_ak8 0.000158 0.00000 test mintime= 0.000137
sse3_ChirpData_ak8 0.000160 0.00000 test mintime= 0.000132
avx_ChirpData_a 0.000099 0.00000 test mintime= 0.000064
avx_ChirpData_b 0.000087 0.00000 test mintime= 0.000068
avx_ChirpData_c 0.000092 0.00000 test mintime= 0.000067
avx_ChirpData_d 0.000096 0.00000 test mintime= 0.000063
avx_ChirpData_e 0.000103 0.00000 test mintime= 0.000071
avx_ChirpData_f 0.000097 0.00000 test mintime= 0.000070
avx_ChirpData_g 0.000093 0.00000 test mintime= 0.000068
avx_ChirpData_h 0.000097 0.00000 test mintime= 0.000081
avx_ChirpData_i 0.000088 0.00000 test mintime= 0.000068
avx_fma4_ChirpData_a not supported by system
avx_fma4_ChirpData_d4 not supported by system
avx_fma4_ChirpData_d6 not supported by system
avx_fma4_ChirpData_d8 not supported by system
avx_fma4_ChirpData_e not supported by system
avx_ChirpData_b 0.000087 0.00000 choice
Test duration 5.56 seconds
Ftst_v7 completed successfully.
=========================================================
i7-2600K @4.7GHz (Boinc suspended):
=========================================================
Ftst_v7_J55_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
First run, 1048576 sample testing
v_ChirpData 0.041361 0.00000 test mintime= 0.041250
fpu_ChirpData 0.008720 0.00000 test mintime= 0.008688
sse1_ChirpData_ak8e 0.004220 0.00000 test mintime= 0.004196
sse2_ChirpData_ak8 0.003154 0.00000 test mintime= 0.003142
sse3_ChirpData_ak8 0.002983 0.00000 test mintime= 0.002965
avx_ChirpData_a 0.001472 0.00000 test mintime= 0.001470
avx_ChirpData_b 0.001646 0.00000 test mintime= 0.001642
avx_ChirpData_c 0.001486 0.00000 test mintime= 0.001484
avx_ChirpData_d 0.001385 0.00000 test mintime= 0.001377
avx_ChirpData_e 0.001538 0.00000 test mintime= 0.001532
avx_ChirpData_f 0.001587 0.00000 test mintime= 0.001578
avx_ChirpData_g 0.001638 0.00000 test mintime= 0.001632
avx_ChirpData_h 0.002020 0.00000 test mintime= 0.002002
avx_ChirpData_i 0.001712 0.00000 test mintime= 0.001708
avx_fma4_ChirpData_a not supported by system
avx_fma4_ChirpData_d4 not supported by system
avx_fma4_ChirpData_d6 not supported by system
avx_fma4_ChirpData_d8 not supported by system
avx_fma4_ChirpData_e not supported by system
avx_ChirpData_d 0.001385 0.00000 choice
Second run, 131072 sample testing
v_ChirpData 0.005160 0.00000 test mintime= 0.005146
fpu_ChirpData 0.001093 0.00000 test mintime= 0.001084
sse1_ChirpData_ak8e 0.000528 0.00000 test mintime= 0.000524
sse2_ChirpData_ak8 0.000394 0.00000 test mintime= 0.000393
sse3_ChirpData_ak8 0.000373 0.00000 test mintime= 0.000370
avx_ChirpData_a 0.000184 0.00000 test mintime= 0.000183
avx_ChirpData_b 0.000206 0.00000 test mintime= 0.000204
avx_ChirpData_c 0.000186 0.00000 test mintime= 0.000185
avx_ChirpData_d 0.000173 0.00000 test mintime= 0.000171
avx_ChirpData_e 0.000192 0.00000 test mintime= 0.000191
avx_ChirpData_f 0.000198 0.00000 test mintime= 0.000196
avx_ChirpData_g 0.000205 0.00000 test mintime= 0.000203
avx_ChirpData_h 0.000250 0.00000 test mintime= 0.000245
avx_ChirpData_i 0.000204 0.00000 test mintime= 0.000201
avx_fma4_ChirpData_a not supported by system
avx_fma4_ChirpData_d4 not supported by system
avx_fma4_ChirpData_d6 not supported by system
avx_fma4_ChirpData_d8 not supported by system
avx_fma4_ChirpData_e not supported by system
avx_ChirpData_d 0.000173 0.00000 choice
Third run, 32768 sample testing
v_ChirpData 0.001291 0.00000 test mintime= 0.001286
fpu_ChirpData 0.000272 0.00000 test mintime= 0.000271
sse1_ChirpData_ak8e 0.000132 0.00000 test mintime= 0.000131
sse2_ChirpData_ak8 0.000099 0.00000 test mintime= 0.000098
sse3_ChirpData_ak8 0.000093 0.00000 test mintime= 0.000092
avx_ChirpData_a 0.000047 0.00000 test mintime= 0.000046
avx_ChirpData_b 0.000052 0.00000 test mintime= 0.000051
avx_ChirpData_c 0.000047 0.00000 test mintime= 0.000046
avx_ChirpData_d 0.000043 0.00000 test mintime= 0.000043
avx_ChirpData_e 0.000048 0.00000 test mintime= 0.000047
avx_ChirpData_f 0.000050 0.00000 test mintime= 0.000049
avx_ChirpData_g 0.000051 0.00000 test mintime= 0.000051
avx_ChirpData_h 0.000062 0.00000 test mintime= 0.000061
avx_ChirpData_i 0.000051 0.00000 test mintime= 0.000050
avx_fma4_ChirpData_a not supported by system
avx_fma4_ChirpData_d4 not supported by system
avx_fma4_ChirpData_d6 not supported by system
avx_fma4_ChirpData_d8 not supported by system
avx_fma4_ChirpData_e not supported by system
avx_ChirpData_d 0.000043 0.00000 choice
Test duration 3.92 seconds
Ftst_v7 completed successfully.
Claggy
Josef W. Segur:
The J55 test was built with GCC 4.5.1, I'm attaching J55b built with GCC 4.6.1 to see if there's any significant difference. If you have time to run both in order for the environment to be as similar as possible, that would be best. All functions may be affected, not just those targeting Bulldozer or Sandy Bridge.
One of the particular puzzles is why avx_fma4_ChirpData_a seems to be faster than avx_fma4_ChirpData_d4, the 4.5.1 build seemed not to optimize the instruction ordering of the d subvariants as well, but the CPU out of order execution capabilities ought to have been sufficient to handle it. These routines necessarily have serious dependency chain problems so there isn't a lot of room for reordering anyhow.
Joe
arkayn:
FX-4100
BOINC Idle
=========================================================
Ftst_v7_J55_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
First run, 1048576 sample testing
v_ChirpData 0.051829 0.00000 test mintime= 0.051639
fpu_ChirpData 0.017697 0.00000 test mintime= 0.017562
sse1_ChirpData_ak8e 0.007256 0.00000 test mintime= 0.007218
sse2_ChirpData_ak8 0.004547 0.00000 test mintime= 0.004528
sse3_ChirpData_ak8 0.004486 0.00000 test mintime= 0.004450
avx_ChirpData_a 0.003830 0.00000 test mintime= 0.003812
avx_ChirpData_b 0.003889 0.00000 test mintime= 0.003792
avx_ChirpData_c 0.004161 0.00000 test mintime= 0.004119
avx_ChirpData_d 0.004023 0.00000 test mintime= 0.003978
avx_ChirpData_e 0.003911 0.00000 test mintime= 0.003853
avx_ChirpData_f 0.003730 0.00000 test mintime= 0.003684
avx_ChirpData_g 0.003687 0.00000 test mintime= 0.003626
avx_ChirpData_h 0.004389 0.00000 test mintime= 0.004343
avx_ChirpData_i 0.003824 0.00000 test mintime= 0.003775
avx_fma4_ChirpData_a 0.003376 0.00000 test mintime= 0.003330
avx_fma4_ChirpData_d4 0.003397 0.00000 test mintime= 0.003355
avx_fma4_ChirpData_d6 0.003379 0.00000 test mintime= 0.003348
avx_fma4_ChirpData_d8 0.003397 0.00000 test mintime= 0.003363
avx_fma4_ChirpData_e 0.003773 0.00000 test mintime= 0.003720
avx_fma4_ChirpData_a 0.003376 0.00000 choice
Second run, 131072 sample testing
v_ChirpData 0.006456 0.00000 test mintime= 0.006373
fpu_ChirpData 0.002205 0.00000 test mintime= 0.002167
sse1_ChirpData_ak8e 0.000905 0.00000 test mintime= 0.000891
sse2_ChirpData_ak8 0.000577 0.00000 test mintime= 0.000564
sse3_ChirpData_ak8 0.000561 0.00000 test mintime= 0.000550
avx_ChirpData_a 0.000482 0.00000 test mintime= 0.000470
avx_ChirpData_b 0.000488 0.00000 test mintime= 0.000478
avx_ChirpData_c 0.000515 0.00000 test mintime= 0.000505
avx_ChirpData_d 0.000502 0.00000 test mintime= 0.000493
avx_ChirpData_e 0.000480 0.00000 test mintime= 0.000456
avx_ChirpData_f 0.000461 0.00000 test mintime= 0.000453
avx_ChirpData_g 0.000455 0.00000 test mintime= 0.000441
avx_ChirpData_h 0.000545 0.00000 test mintime= 0.000531
avx_ChirpData_i 0.000462 0.00000 test mintime= 0.000446
avx_fma4_ChirpData_a 0.000419 0.00000 test mintime= 0.000411
avx_fma4_ChirpData_d4 0.000423 0.00000 test mintime= 0.000415
avx_fma4_ChirpData_d6 0.000421 0.00000 test mintime= 0.000415
avx_fma4_ChirpData_d8 0.000422 0.00000 test mintime= 0.000414
avx_fma4_ChirpData_e 0.000468 0.00000 test mintime= 0.000457
avx_fma4_ChirpData_a 0.000419 0.00000 choice
Third run, 32768 sample testing
v_ChirpData 0.001632 0.00000 test mintime= 0.001590
fpu_ChirpData 0.000555 0.00000 test mintime= 0.000539
sse1_ChirpData_ak8e 0.000227 0.00000 test mintime= 0.000222
sse2_ChirpData_ak8 0.000145 0.00000 test mintime= 0.000140
sse3_ChirpData_ak8 0.000141 0.00000 test mintime= 0.000137
avx_ChirpData_a 0.000120 0.00000 test mintime= 0.000117
avx_ChirpData_b 0.000119 0.00000 test mintime= 0.000116
avx_ChirpData_c 0.000129 0.00000 test mintime= 0.000126
avx_ChirpData_d 0.000127 0.00000 test mintime= 0.000123
avx_ChirpData_e 0.000120 0.00000 test mintime= 0.000114
avx_ChirpData_f 0.000117 0.00000 test mintime= 0.000113
avx_ChirpData_g 0.000114 0.00000 test mintime= 0.000110
avx_ChirpData_h 0.000137 0.00000 test mintime= 0.000133
avx_ChirpData_i 0.000115 0.00000 test mintime= 0.000111
avx_fma4_ChirpData_a 0.000105 0.00000 test mintime= 0.000103
avx_fma4_ChirpData_d4 0.000107 0.00000 test mintime= 0.000104
avx_fma4_ChirpData_d6 0.000106 0.00000 test mintime= 0.000104
avx_fma4_ChirpData_d8 0.000106 0.00000 test mintime= 0.000104
avx_fma4_ChirpData_e 0.000117 0.00000 test mintime= 0.000114
avx_fma4_ChirpData_a 0.000105 0.00000 choice
Test duration 7.39 seconds
Ftst_v7 completed successfully.
=========================================================
Ftst_v7_J55b_Chirponly started.
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
First run, 1048576 sample testing
v_ChirpData 0.052452 0.00000 test mintime= 0.052096
fpu_ChirpData 0.018704 0.00000 test mintime= 0.017854
sse1_ChirpData_ak8e 0.007731 0.00000 test mintime= 0.007256
sse2_ChirpData_ak8 0.004579 0.00000 test mintime= 0.004497
sse3_ChirpData_ak8 0.004591 0.00000 test mintime= 0.004549
avx_ChirpData_a 0.004131 0.00000 test mintime= 0.003764
avx_ChirpData_b 0.004169 0.00000 test mintime= 0.003948
avx_ChirpData_c 0.004434 0.00000 test mintime= 0.003979
avx_ChirpData_d 0.004127 0.00000 test mintime= 0.003956
avx_ChirpData_e 0.004005 0.00000 test mintime= 0.003870
avx_ChirpData_f 0.003865 0.00000 test mintime= 0.003655
avx_ChirpData_g 0.004126 0.00000 test mintime= 0.003680
avx_ChirpData_h 0.004696 0.00000 test mintime= 0.004399
avx_ChirpData_i 0.004318 0.00000 test mintime= 0.003751
avx_fma4_ChirpData_a 0.003619 0.00000 test mintime= 0.003408
avx_fma4_ChirpData_d4 0.003713 0.00000 test mintime= 0.003264
avx_fma4_ChirpData_d6 0.004176 0.00000 test mintime= 0.003271
avx_fma4_ChirpData_d8 0.003497 0.00000 test mintime= 0.003206
avx_fma4_ChirpData_e 0.003928 0.00000 test mintime= 0.003882
avx_fma4_ChirpData_d8 0.003497 0.00000 choice
Second run, 131072 sample testing
v_ChirpData 0.006478 0.00000 test mintime= 0.006380
fpu_ChirpData 0.002202 0.00000 test mintime= 0.002172
sse1_ChirpData_ak8e 0.000925 0.00000 test mintime= 0.000902
sse2_ChirpData_ak8 0.000579 0.00000 test mintime= 0.000565
sse3_ChirpData_ak8 0.000575 0.00000 test mintime= 0.000565
avx_ChirpData_a 0.000478 0.00000 test mintime= 0.000466
avx_ChirpData_b 0.000499 0.00000 test mintime= 0.000487
avx_ChirpData_c 0.000498 0.00000 test mintime= 0.000482
avx_ChirpData_d 0.000501 0.00000 test mintime= 0.000490
avx_ChirpData_e 0.000482 0.00000 test mintime= 0.000458
avx_ChirpData_f 0.000464 0.00000 test mintime= 0.000453
avx_ChirpData_g 0.000452 0.00000 test mintime= 0.000442
avx_ChirpData_h 0.000554 0.00000 test mintime= 0.000542
avx_ChirpData_i 0.000459 0.00000 test mintime= 0.000446
avx_fma4_ChirpData_a 0.000431 0.00000 test mintime= 0.000423
avx_fma4_ChirpData_d4 0.000408 0.00000 test mintime= 0.000399
avx_fma4_ChirpData_d6 0.000406 0.00000 test mintime= 0.000398
avx_fma4_ChirpData_d8 0.000417 0.00000 test mintime= 0.000398
avx_fma4_ChirpData_e 0.000493 0.00000 test mintime= 0.000478
avx_fma4_ChirpData_d6 0.000406 0.00000 choice
Third run, 32768 sample testing
v_ChirpData 0.001623 0.00000 test mintime= 0.001589
fpu_ChirpData 0.000556 0.00000 test mintime= 0.000544
sse1_ChirpData_ak8e 0.000228 0.00000 test mintime= 0.000222
sse2_ChirpData_ak8 0.000146 0.00000 test mintime= 0.000139
sse3_ChirpData_ak8 0.000144 0.00000 test mintime= 0.000141
avx_ChirpData_a 0.000118 0.00000 test mintime= 0.000116
avx_ChirpData_b 0.000126 0.00000 test mintime= 0.000122
avx_ChirpData_c 0.000123 0.00000 test mintime= 0.000121
avx_ChirpData_d 0.000124 0.00000 test mintime= 0.000122
avx_ChirpData_e 0.000117 0.00000 test mintime= 0.000114
avx_ChirpData_f 0.000115 0.00000 test mintime= 0.000113
avx_ChirpData_g 0.000118 0.00000 test mintime= 0.000110
avx_ChirpData_h 0.000136 0.00000 test mintime= 0.000133
avx_ChirpData_i 0.000114 0.00000 test mintime= 0.000111
avx_fma4_ChirpData_a 0.000108 0.00000 test mintime= 0.000106
avx_fma4_ChirpData_d4 0.000102 0.00000 test mintime= 0.000099
avx_fma4_ChirpData_d6 0.000101 0.00000 test mintime= 0.000099
avx_fma4_ChirpData_d8 0.000101 0.00000 test mintime= 0.000099
avx_fma4_ChirpData_e 0.000122 0.00000 test mintime= 0.000119
avx_fma4_ChirpData_d6 0.000101 0.00000 choice
Test duration 7.50 seconds
Ftst_v7 completed successfully.
Navigation
[0] Message Index
[#] Next page
[*] Previous page
Go to full version