C:\Release-vc8>fft.exemin_n = 4max_n = 4RapidMind FFT Benchmark-----------------------------------------------Length: 16 = 2^4Warming up...Run timings, to and from host (in us): 11561.3 10482.5 8229.39 12829.6 8740.71 9539.26 9745.74 10875.1 11149.2 9760.27 12356 8845.49 11541.2 8558.26 9808.89 9916.74 9238.06 9773.12 8477.23 7909.47 11607.7 10333.6 7918.13 11377.5 7920.09 10473.6 8454.32 9801.9 10972.9 10767 9267.11 11145.3 9876.5 9839.62 13427.2 8664.71 10973.7 11119.3 9176.86 9062.31 9811.68 8923.72 7202.85 9036.6 9994.13 8747.42 10002.8 10443.1 9761.39 9866.44 10177.1 10808.3 8371.89 10052 9621.96 10266 11904.4 9640.12 9375.24 8899.69 9294.78 10726.2 6828.72 12483.1 9911.99 12466.6 8385.58 7925.68 10416.3 9766.97 9917.02 11196.4 9642.64 10324.1 11035.8 9518.3 8512.15 10829 9727.86 12404.3 10707.5 10192.5 10868.4 7899.13 9340.32 8048.62 7750.77 11226.9 8889.35 9273.54 7777.87 7842.69 7471.92 8830.4 10697.4 11466.3 8701.59 8419.39 7942.44 9761.11Average execution time: 9788.45usNormalized execution time (T/N): 611.778us/sampleNormalized by complexity (T/N lg N): 152.945Mflops (5 N lg N/T): 0.0326916Average execution time: 9788.45usMinimum execution time: 6828.72usNormalized average execution time (T/N): 611.778us/sampleNormalized minimum execution time (T/N): 426.795us/sampleAverage time normalized by complexity (T/N lg N): 152.945Minimum time normalized by complexity (T/N lg N): 106.699Average Mflops (5 N lg N/T): 0.0326916Peak Mflops (5 N lg N/T): 0.0468609---Warming up...Run timings, GPU-local (in us): 10815.9 11730.4 7816.99 7627.83 9804.42 9321.6 9801.34 9725.06 7585.92 9003.07 9982.68 6766.42 10917.9 8505.45 7894.38 10349.5 8926.79 11731.8 7668.62 8905.56 11206.2 9771.44 11598.2 8679.8 9933.78 9116.51 8855.83 9696 9815.87 8695.17 12109.5 9716.4 8787.65 8662.48 8444.54 7717.24 8718.36 9792.96 10747.7 9169.6 11555.5 8955.85 9709.7 6659.12 10377.2 9286.95 10160.9 11761.7 8587.87 12249.8 8761.67 10833.5 9495.95 7892.71 9270.47 9678.68 10709.1 9684.55 7819.5 10225.5 8822.58 12600.2 8660.8 8996.09 11010.3 6783.74 10320.5 10069.9 9703.83 10450.1 7650.74 10810.8 10639.8 9755.24 11815.3 8054.21 7740.15 10277.5 10128.5 10209.3 6895.78 7671.42 9653.26 9822.86 12298.4 10547.4 7820.62 7712.77 6761.39 8859.18 7419.95 8623.08 7702.71 8842.41 9383.91 9820.06 7636.21 8563.29 9718.36 8473.6Average execution time: 9385.19usMinimum execution time: 6659.12usNormalized average execution time (T/N): 586.574us/sampleNormalized minimum execution time (T/N): 416.195us/sampleAverage time normalized by complexity (T/N lg N): 146.644Minimum time normalized by complexity (T/N lg N): 104.049BenchFFT average Mflops (5 N lg N/T): 0.0340963BenchFFT peak Mflops (5 N lg N/T): 0.0480544Residuals (compare with inverse): Average absolute: 1.26059e-008 Maximum absolute: 5.96046e-008 Average relative: -1.#IND Maximum relative: 1.#INF-----------------------------------------------
C:\Release-vc8>fft2d.exeRapidMind 2D FFT Benchmark===============================================Size: 256 x 256 = 2^8 x 2^8Radix: 4 = 2^2Total number of floating point operations: 5.24288e+006Run timings, to and from host (in ms):Average execution time: 15.6239msOverall average execution time: 15.6285msMinimum execution time: 13.4389msAverage Mflops: 335.568Peak Mflops: 390.126Run timings, GPU-local (in ms):Average execution time: 13.8474msOverall average execution time: 13.851msMinimum execution time: 10.7656msAverage Mflops: 378.619Peak Mflops: 487.004
C:\Release-vc8>fft2d.exeRapidMind 2D FFT Benchmark===============================================Size: 256 x 256 = 2^8 x 2^8Radix: 4 = 2^2Total number of floating point operations: 5.24288e+006Run timings, to and from host (in ms):Average execution time: 14.0743msOverall average execution time: 14.0783msMinimum execution time: 13.1137msAverage Mflops: 372.515Peak Mflops: 399.801Run timings, GPU-local (in ms):Average execution time: 12.3266msOverall average execution time: 12.3304msMinimum execution time: 10.2948msAverage Mflops: 425.332Peak Mflops: 509.276
for G80 is better a CUDA version , i may search on my home computer some apps by Hans Dorn - he had builded some test apps based on CUDA ...