Forum > GPU crunching

WUs that CUDA MB can't do correctly

<< < (3/7) > >>

Raistmer:
VLAR AR~0.15
Errors at CUDA mem copy, invalid results.

AK_v8_win_SSSE3x.exe -verb -st / 23no08ad.15915.22976.9.8.127.wu :
Started at  : 03:07:17.808
Ended at    : 04:05:01.401
   3463.562 secs Elapsed
   3458.355 secs CPU time
 
[ stderr ]
Can't set up shared mem: -1
Will run in standalone mode.
Windows optimized S@H Enhanced application by Alex Kan
Version info: SSSE3x (Intel, Core 2-optimized v8-nographics) V5.13 by Alex Kan
SSSE3x Win32 Build 41 , Ported by : Jason G, Raistmer, JDWhale

     CPUID: Intel(R) Core(TM)2 Quad  CPU   Q9450  @ 2.66GHz
     Speed: 4 x 2655 MHz
     Cache: L1=64K L2=6144K
  Features: MMX SSE SSE2 SSE3 SSSE3
 
Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.154919

Flopcounter: 27945224690130.227000

Spike count:    2
Pulse count:    5
Triplet count:  0
Gaussian count: 0
called boinc_finish
[ /stderr ]

------------
MB_6.06r380mod_CUDA.exe -verb -st / 23no08ad.15915.22976.9.8.127.wu :
Started at  : 19:48:55.425
Ended at    : 20:17:32.767
   1717.310 secs Elapsed
     70.294 secs CPU time
Speedup     : -57.06%
Ratio       : 0.64 x
 
                ----- R1:R2 ------     ----- R2:R1 ------
                Good    Bad   Ugly     Good    Bad   Ugly
        Spike      2      0      0        2      0      4
     Gaussian      0      0      0        0      0      0
        Pulse      5      0      0        5      0      2
      Triplet      0      0      0        0      0      0
   Best Spike      0      0      1        0      0      1
Best Gaussian      1      0      0        1      0      0
   Best Pulse      0      0      1        0      0      1
 Best Triplet      0      0      0        0      0      0
                ----   ----   ----     ----   ----   ----
                   8      0      2        8      0      8

Result      : Weakly similar.

Cuda error 'cudaMemcpy(best_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1265 : unknown error.
Cuda error 'cudaMemcpy(best_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1265 : unknown error.
Cuda error 'cudaMemcpy(best_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1265 : unknown error.
Cuda error 'cudaMemcpy(tmp_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1269 : unknown error.
Cuda error 'cudaMemcpy(tmp_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1269 : unknown error.
Cuda error 'cudaMemcpy(tmp_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1269 : unknown error.
Cuda error 'cudaMemcpy(tmp_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1269 : unknown error.
Cuda error 'cudaMemcpy(tmp_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1269 : unknown error.

Online result:  http://setiathome.berkeley.edu/result.php?resultid=1108406664


[attachment deleted by admin]

Raistmer:
AR ~0.4
Invalid result:
MB_6.06r380mod_CUDA.exe -verb -st / 03dc08ab.11550.18882.10.8.130.wu :
Started at  : 21:22:41.537
Ended at    : 21:41:03.287
   1101.656 secs Elapsed
    120.339 secs CPU time
Speedup     : 95.83%
Ratio       : 24.00 x
 
                ----- R1:R2 ------     ----- R2:R1 ------
                Good    Bad   Ugly     Good    Bad   Ugly
        Spike      0      0      0        0      0      0
     Gaussian      0      0      0        0      0      0
        Pulse      0      0      1        0      0      0
      Triplet      0      0      7        0      0      0
   Best Spike      0      0      1        0      0      0
Best Gaussian      0      0      1        0      0      0
   Best Pulse      0      0      1        0      0      0
 Best Triplet      0      0      1        0      0      0
                ----   ----   ----     ----   ----   ----
                   0      0     12        0      0      0

Result      : Different.

 with CUDA error:

SETI@home error -12 Unknown error
cudaAcc_find_triplets erroneously found a triplet twice in find_triplets_kernel
File: d:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu
Line: 235

[ /stderr ]



[attachment deleted by admin]

Josef W. Segur:

--- Quote from: Raistmer on 30 Dec 2008, 12:32:06 pm ---VLAR AR~0.15
Errors at CUDA mem copy, invalid results.
Wall clock execution time greater than for CPU app.

============
AK_v8_win_SSSE3x.exe -verb -st / 23no08ad.15915.22976.9.8.127.wu :
Started at  : 19:48:10.591
Ended at    : 19:48:55.394
     44.772 secs Elapsed
     44.757 secs CPU time
...
No heartbeat from core client for 30 sec - exiting
...
--- End quote ---

I don't think the timing comparison is meaningful, though the CUDA mem copy errors obviously show a problem with that.
                                                                           Joe

Raistmer:
Oops, will retest...
(thanx for spotting this early exit)

ADDON:report edited, now correct CPU run there.
(and reference on online result added)

Maik:
-
AR: 5.324874
-
MB_6.06r380mod_CUDA AK_v8_win_SSE41 setiathome_6.06_windows_intelx86__cudaSpike count: 0 Spike count: 9 Spike count: 0  Pulse count: 0 Pulse count: 0 Pulse count: 0Triplet count: 31 Triplet count: 2 Triplet count: 31  Gaussian count: 0 Gaussian count: 0 Gaussian count: 0
[edit] found 2 more of them, same AR, nearly same results ... stock cuda on both with triplet count: 31[/edit]

[attachment deleted by admin]

Navigation

[0] Message Index

[#] Next page

[*] Previous page

Go to full version