Forum > Linux
SETI MB CUDA for Linux
IanJ:
Guys,
Forgive if this is not the place for posting questions, but from what I read here I think it is.
I have installed the Crunchr CUDA app on my FedoraCore10 64bit machine. After a fair amount of grief with segfaults, today I finally managed to get my first result in. However two of my results this morning have a strange error. Could anyone elaborate on what the problem is and what I should do to fix it. The card is a 9600GT. Here is the output:-
<core_client_version>6.6.36</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
SETI@home MB CUDA_2.2 608 Linux 64bit SM 1.0 - r12 by Crunch3r :p
VLAR autokill mod
setiathome_CUDA: Found 1 CUDA device(s):
Device 1 : GeForce 9600 GT
totalGlobalMem = 536608768
sharedMemPerBlock = 16384
regsPerBlock = 8192
warpSize = 32
memPitch = 262144
maxThreadsPerBlock = 512
clockRate = 1625000
totalConstMem = 65536
major = 1
minor = 1
textureAlignment = 256
deviceOverlap = 1
multiProcessorCount = 8
setiathome_CUDA: CUDA Device 1 specified, checking...
Device 1: GeForce 9600 GT is okay
SETI@home using CUDA accelerated device GeForce 9600 GT
setiathome_enhanced 6.01 Revision: 402 g++ (GCC) 4.2.1 (SUSE Linux)
libboinc: BOINC 6.7.0
Work Unit Info:
...............
WU true angle range is : 0.388520
Cuda error 'cudaAcc_CalcChirpData_kernel2' in file './cudaAcc_CalcChirpData.cu' in line 106 : unspecified launch failure.
cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/execute.cu, line 1070
cufft: ERROR: CUFFT_EXEC_FAILED
cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/execute.cu, line 1070
cufft: ERROR: CUFFT_EXEC_FAILED
cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/cufft.cu, line 151
cufft: ERROR: CUFFT_EXEC_FAILED
CUFFT error in file './cudaAcc_fft.cu' in line 62.
</stderr_txt>
]]>
Thanks
Ian
sunu:
Thanks!
Just yesterday I put the new ap 5.06 in, but I haven't got any astropulse workunits yet. So currently only MB.
My load averages are above 4 and usually below 5.5
A GTX285 should be able to do 10000-14000 RAC alone.
With all those stuff running in your desktop I don't know if it would be a good idea to buy a low non-CUDA capable card for X and have your other 2 cards dedicated to CUDA. Of course your motherboard would need 3 PCI-E slots.
EDIT:
--- Quote from: IanJ on 04 Sep 2009, 08:55:47 am ---...
SETI@home MB CUDA_2.2 608 Linux 64bit SM 1.0 - r12 by Crunch3r :p
VLAR autokill mod
...
cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/execute.cu, line 1070
cufft: ERROR: CUFFT_EXEC_FAILED
cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/execute.cu, line 1070
cufft: ERROR: CUFFT_EXEC_FAILED
cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/cufft.cu, line 151
cufft: ERROR: CUFFT_EXEC_FAILED
...
--- End quote ---
You're probably running the 2.2 cuda app with 2.1 libraries. Get the newer 2.2 or even better the 2.3 cuda libraries. Also you'll have to upgrade your NVIDIA driver to a 2.2 (185.18.xx) or 2.3 (190.xx) compatible one.
IanJ:
Sunu,
I'll try with the 2.2. I've installed Cuda Driver 185.18.14 (2.2? from the nvidia website). Previously installed I had 185.18.36. I have now installed the Cuda Toolkit 2.2, set my PATH and amended ldconfig.
I now await tasks from SETI, at the moment it's out of work.
Ian
riofl:
well problem is i am now spoiled.. i had an 8600gt 256mb card i used for my desktop and ran the tesla for cuda before i got the 285. the 285 is several orders of magnitude better in desktop performance. i think i would rather just replace the tesla with a 2nd 285 and let that one crunch full speed and let this one do as it can. would still be a large improvement over the tesla in the 2nd slot. either that or maybe buy a motherboard with 3 slots that can take 3 of these cards leaving room for them to breathe and get a gtx 260 to use for my desktop and minor cuda crunching and let both 285 have at it full steam. i expect the 260 should be up to the task for my desktops.
sunu:
--- Quote from: IanJ on 04 Sep 2009, 10:15:44 am ---I've installed Cuda Driver 185.18.14 (2.2? from the nvidia website). Previously installed I had 185.18.36.
--- End quote ---
185.18.14 is older than 185.18.36, why rollback? Also try cuda 2.3 with 190.xx driver, it's faster than 2.2.
--- Quote from: riofl on 04 Sep 2009, 10:34:44 am ---well problem is i am now spoiled.. i had an 8600gt 256mb card i used for my desktop and ran the tesla for cuda before i got the 285. the 285 is several orders of magnitude better in desktop performance. i think i would rather just replace the tesla with a 2nd 285 and let that one crunch full speed and let this one do as it can. would still be a large improvement over the tesla in the 2nd slot. either that or maybe buy a motherboard with 3 slots that can take 3 of these cards leaving room for them to breathe and get a gtx 260 to use for my desktop and minor cuda crunching and let both 285 have at it full steam. i expect the 260 should be up to the task for my desktops.
--- End quote ---
Or maybe get a GTX295 in place of tesla and no need for a new motherboard.
TO ALL
Please see this thread and take proper action (abort those workunits): I've lost quite a few credits because of this.
Navigation
[0] Message Index
[#] Next page
[*] Previous page
Go to full version