Forum > GPU crunching
WUs that CUDA MB can't do correctly
Raistmer:
VLAR AR~0.15
Errors at CUDA mem copy, invalid results.
AK_v8_win_SSSE3x.exe -verb -st / 23no08ad.15915.22976.9.8.127.wu :
Started at : 03:07:17.808
Ended at : 04:05:01.401
3463.562 secs Elapsed
3458.355 secs CPU time
[ stderr ]
Can't set up shared mem: -1
Will run in standalone mode.
Windows optimized S@H Enhanced application by Alex Kan
Version info: SSSE3x (Intel, Core 2-optimized v8-nographics) V5.13 by Alex Kan
SSSE3x Win32 Build 41 , Ported by : Jason G, Raistmer, JDWhale
CPUID: Intel(R) Core(TM)2 Quad CPU Q9450 @ 2.66GHz
Speed: 4 x 2655 MHz
Cache: L1=64K L2=6144K
Features: MMX SSE SSE2 SSE3 SSSE3
Work Unit Info:
...............
Credit multiplier is : 2.85
WU true angle range is : 0.154919
Flopcounter: 27945224690130.227000
Spike count: 2
Pulse count: 5
Triplet count: 0
Gaussian count: 0
called boinc_finish
[ /stderr ]
------------
MB_6.06r380mod_CUDA.exe -verb -st / 23no08ad.15915.22976.9.8.127.wu :
Started at : 19:48:55.425
Ended at : 20:17:32.767
1717.310 secs Elapsed
70.294 secs CPU time
Speedup : -57.06%
Ratio : 0.64 x
----- R1:R2 ------ ----- R2:R1 ------
Good Bad Ugly Good Bad Ugly
Spike 2 0 0 2 0 4
Gaussian 0 0 0 0 0 0
Pulse 5 0 0 5 0 2
Triplet 0 0 0 0 0 0
Best Spike 0 0 1 0 0 1
Best Gaussian 1 0 0 1 0 0
Best Pulse 0 0 1 0 0 1
Best Triplet 0 0 0 0 0 0
---- ---- ---- ---- ---- ----
8 0 2 8 0 8
Result : Weakly similar.
Cuda error 'cudaMemcpy(best_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1265 : unknown error.
Cuda error 'cudaMemcpy(best_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1265 : unknown error.
Cuda error 'cudaMemcpy(best_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1265 : unknown error.
Cuda error 'cudaMemcpy(tmp_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1269 : unknown error.
Cuda error 'cudaMemcpy(tmp_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1269 : unknown error.
Cuda error 'cudaMemcpy(tmp_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1269 : unknown error.
Cuda error 'cudaMemcpy(tmp_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1269 : unknown error.
Cuda error 'cudaMemcpy(tmp_PoT, dev_tmp_pot, max_nb_of_elems * sizeof(float), cudaMemcpyDeviceToHost)' in file 'd:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu' in line 1269 : unknown error.
Online result: http://setiathome.berkeley.edu/result.php?resultid=1108406664
[attachment deleted by admin]
Raistmer:
AR ~0.4
Invalid result:
MB_6.06r380mod_CUDA.exe -verb -st / 03dc08ab.11550.18882.10.8.130.wu :
Started at : 21:22:41.537
Ended at : 21:41:03.287
1101.656 secs Elapsed
120.339 secs CPU time
Speedup : 95.83%
Ratio : 24.00 x
----- R1:R2 ------ ----- R2:R1 ------
Good Bad Ugly Good Bad Ugly
Spike 0 0 0 0 0 0
Gaussian 0 0 0 0 0 0
Pulse 0 0 1 0 0 0
Triplet 0 0 7 0 0 0
Best Spike 0 0 1 0 0 0
Best Gaussian 0 0 1 0 0 0
Best Pulse 0 0 1 0 0 0
Best Triplet 0 0 1 0 0 0
---- ---- ---- ---- ---- ----
0 0 12 0 0 0
Result : Different.
with CUDA error:
SETI@home error -12 Unknown error
cudaAcc_find_triplets erroneously found a triplet twice in find_triplets_kernel
File: d:/BTR/seticuda/Berkeley_rep/client/cuda/cudaAcc_pulsefind.cu
Line: 235
[ /stderr ]
[attachment deleted by admin]
Josef W. Segur:
--- Quote from: Raistmer on 30 Dec 2008, 12:32:06 pm ---VLAR AR~0.15
Errors at CUDA mem copy, invalid results.
Wall clock execution time greater than for CPU app.
============
AK_v8_win_SSSE3x.exe -verb -st / 23no08ad.15915.22976.9.8.127.wu :
Started at : 19:48:10.591
Ended at : 19:48:55.394
44.772 secs Elapsed
44.757 secs CPU time
...
No heartbeat from core client for 30 sec - exiting
...
--- End quote ---
I don't think the timing comparison is meaningful, though the CUDA mem copy errors obviously show a problem with that.
Joe
Raistmer:
Oops, will retest...
(thanx for spotting this early exit)
ADDON:report edited, now correct CPU run there.
(and reference on online result added)
Maik:
-
AR: 5.324874
-
MB_6.06r380mod_CUDA AK_v8_win_SSE41 setiathome_6.06_windows_intelx86__cudaSpike count: 0 Spike count: 9 Spike count: 0 Pulse count: 0 Pulse count: 0 Pulse count: 0Triplet count: 31 Triplet count: 2 Triplet count: 31 Gaussian count: 0 Gaussian count: 0 Gaussian count: 0
[edit] found 2 more of them, same AR, nearly same results ... stock cuda on both with triplet count: 31[/edit]
[attachment deleted by admin]
Navigation
[0] Message Index
[#] Next page
[*] Previous page
Go to full version