Surprise Surprise, a QxN build is faster on my Northwood LOL
4- Chirp function Block Prefetch, memcpy++ zerocase & 3phase chirp Generic x86 Untested ~?.?%
measure its the best to try code and find optimal variants. the loop construct in pulsefind.cpp is ready now, but not measured. Today I will squeeze the case-construct code.have still some good ideas to eleminate code else and there...we will see...
Quote from: seti_britta on 07 Nov 2007, 11:47:04 am I am running vtune on the chirp one now to look for any p4 specific slowdowns, wickedly fast code though
have a strong modified chirpfft.cpp which we can try too