Are you still running CUDA 2.1? The 100% CPU usage was apparently a bug in those libraries. Upgrade CUDA to 2.3 and nvidia-drivers to 190.xx, replace your setiathome executable with the 2.2 version, and optionally renice that process if you think it's too slow.
Macros, what pp says. Make sure you're using cuda 2.2 or later together with a compatible nvidia driver.
I think you will find the best response by setting your preferences to use 6 or 7 CPUs instead of 8, leaving 1 for CUDA and your desktop to use. I played around a bit with max_ncpus but did not find a huge difference. Mine is set at 0.35.
Absolutely, if you do nothing else, change your CUDA toolkit and SDK to 2.2 and get the 2.2 application. Make sure your driver is at the minimum 185.14 or 185.29. I am using 185.29. Version 2.1 had huge flaws in it. I have heard 2.3 is even better, however I have not had good luck with 2.3, so I went back to 2.2 until I can figure out what went wrong.
Small correction to riofl: The driver versions are 185.18.14 and 185.18.29. Latest is 185.18.31. Macros, if you go to CUDA 2.3 you'll need 190.18.
Macros, what card are you using? Maybe that 99% is because your card runs out of memory?
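To make the driver requirement above concrete, here is a minimal Python sketch, assuming the minimums quoted in this thread (185.18.14 for CUDA 2.2, 190.18 for CUDA 2.3) are right. The table and the helper names `parse_version` and `driver_ok` are made up for illustration and are not an official compatibility matrix.

```python
# Minimum driver versions as quoted in this thread (an assumption,
# not an official NVIDIA compatibility table).
MIN_DRIVER = {
    "2.2": (185, 18, 14),
    "2.3": (190, 18),
}


def parse_version(text):
    """Turn '185.18.31' into (185, 18, 31) so versions compare numerically."""
    return tuple(int(part) for part in text.strip().split("."))


def driver_ok(cuda_version, driver_version):
    """Return True if the driver meets the minimum quoted for that CUDA version."""
    return parse_version(driver_version) >= MIN_DRIVER[cuda_version]


if __name__ == "__main__":
    # 185.18.31 is the "latest" 185 driver mentioned above.
    for cuda in ("2.2", "2.3"):
        print("CUDA", cuda, "with driver 185.18.31:", driver_ok(cuda, "185.18.31"))
```

Comparing tuples of integers avoids the trap of comparing version strings character by character, which goes wrong as soon as a component gains a digit.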
<core_client_version>6.6.37</core_client_version>
<![CDATA[
<message>process exited with code 193 (0xc1, -63)</message>
<stderr_txt>
SETI@home MB CUDA 608 Linux 64bit SM 1.0 - r06 by Crunch3r :p
setiathome_CUDA: Found 1 CUDA device(s):
Device 1 : Quadro FX 4600
totalGlobalMem = 804585472
sharedMemPerBlock = 16384
regsPerBlock = 8192
warpSize = 32
memPitch = 262144
maxThreadsPerBlock = 512
clockRate = 1188000
totalConstMem = 65536
major = 1
minor = 0
textureAlignment = 256
deviceOverlap = 0
multiProcessorCount = 12
setiathome_CUDA: CUDA Device 1 specified, checking...
Device 1: Quadro FX 4600 is okay
SIGSEGV: segmentation violation
Stack trace (16 frames):
setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x47cba9]
/lib/libpthread.so.0[0x7f96066ac080]
/usr/lib/libcuda.so.1[0x7f9607123020]
/usr/lib/libcuda.so.1[0x7f9607128d84]
/usr/lib/libcuda.so.1[0x7f96070f210f]
/usr/lib/libcuda.so.1[0x7f9606e7db3b]
/usr/lib/libcuda.so.1[0x7f9606e8e46b]
/usr/lib/libcuda.so.1[0x7f9606e76211]
/usr/lib/libcuda.so.1(cuCtxCreate+0xaa)[0x7f9606e6ffaa]
setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x5ace4b]
setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x40d4ca]
setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x419f23]
setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x424c7d]
setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x407f60]
/lib/libc.so.6(__libc_start_main+0xe6)[0x7f96063495a6]
setiathome-CUDA-6.08.x86_64-pc-linux-gnu(__gxx_personality_v0+0x241)[0x407be9]
Exiting...
</stderr_txt>
]]>
The crash dump is still referencing the old executable.
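A quick way to see which binary a dump like the one above came from is to pull the file names out of the stack frames. This is a rough Python sketch, assuming the frames follow the `name[0xADDR]` or `name(symbol+0xOFF)[0xADDR]` shape seen in that dump. The pattern `FRAME` and the helper `binaries_in_trace` are my own names, not part of BOINC or the SETI@home application.

```python
import re
from collections import Counter

# One frame: a file name, an optional "(symbol+offset)" part, then "[0xADDR]".
# The name may not contain whitespace, brackets, parentheses or colons.
FRAME = re.compile(r"([^\s\[\]():]+)(?:\([^)]*\))?\[0x[0-9a-fA-F]+\]")


def binaries_in_trace(stderr_text):
    """Count how many stack frames belong to each binary or library."""
    return Counter(match.group(1) for match in FRAME.finditer(stderr_text))


if __name__ == "__main__":
    # A few frames copied from the dump above.
    sample = (
        "setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x47cba9]\n"
        "/lib/libpthread.so.0[0x7f96066ac080]\n"
        "/usr/lib/libcuda.so.1(cuCtxCreate+0xaa)[0x7f9606e6ffaa]\n"
        "setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x5ace4b]\n"
    )
    for name, frames in binaries_in_trace(sample).most_common():
        print(frames, name)
```

If the name that shows up still contains 6.08 after you have swapped in the new executable, the result you are looking at was produced by the old one.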
Did you update your app_info.xml? Also make sure you copy the new libcudart.so.2 and libcufft.so.2 to your projects/setiathome.berkeley.edu directory. And finally, as stated in another thread, also copy the new executable to /usr/local/bin or whatever directory you have in your PATH. I have had no problems since following this advice (well, apart from having to renice the executable to level 0 to give it enough CPU time).
Quote from: pp on 19 Aug 2009, 11:04:18 am
The crash dump is still referencing the old executable.
True, but I got the same for the newer one. I just picked one from the error list and didn't notice it's the old one...
Quote
Did you update your app_info.xml? Also make sure you copy the new libcudart.so.2 and libcufft.so.2 to your projects/setiathome.berkeley.edu directory. And finally, as stated in another thread, also copy the new executable to /usr/local/bin or whatever directory you have in your PATH. I have had no problems since following this advice (well, apart from having to renice the executable to level 0 to give it enough CPU time).
Yes, I did all that. Anyway, it seems to be running now. Because I didn't make one change at a time, I don't know exactly what the cause was. Besides, it's just the first WU; hopefully there will be no more errors.
edit: It works. Finally
One thing you need to make sure of is that the project directory where the CUDA libs are is listed in the ld.so.conf file and that you have run ldconfig. Without that, it is very likely it would crash, possibly a few times, and then find its libraries by accident.
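The ld.so.conf check can be sketched in Python as below. This assumes one directory per line, ignores `#` comments, and does not follow `include` lines, so a directory added through an included file would be missed. The helper `dir_listed` and the `/home/user/BOINC` prefix are hypothetical; use wherever your BOINC data directory actually lives.

```python
import os


def dir_listed(conf_text, directory):
    """Return True if `directory` is its own entry in ld.so.conf-style text."""
    wanted = os.path.normpath(directory)
    for raw in conf_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or line.startswith("include"):
            continue  # blank line, or an include that this sketch does not follow
        if os.path.normpath(line) == wanted:
            return True
    return False


if __name__ == "__main__":
    # Hypothetical location of the project directory holding the CUDA libs.
    project_dir = "/home/user/BOINC/projects/setiathome.berkeley.edu"
    with open("/etc/ld.so.conf") as handle:
        print("listed" if dir_listed(handle.read(), project_dir) else "not listed")
```

If it reports the directory as not listed, add it to ld.so.conf and run ldconfig as root so the loader cache is rebuilt.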
riofl, what is happening? I checked your host again today and saw this: http://setiathome.berkeley.edu/results.php?hostid=4166601&offset=40&show_names=0&state=2
All the 200- and 300-second tasks were done by your 285. All the two-digit-second tasks were done by your Tesla. This is completely abnormal.
Hi. I was just wondering if there are any known issues in using the CUDA client with the 2.6.30 kernel? I recently built a 2.6.30 kernel (to see if the AP units will fail), and noticed that my CUDA units were appreciably slower (taking over an hour). I just switched back to the latest Ubuntu kernel (2.6.28-15-generic), which seems to work fine. Any suggestions? Any particular info you need? I'm using the same nvidia driver in both cases (185.18.31, on an x86_64 platform).
I'm gonna try a vanilla-built 2.6.28.10 kernel, see if I get the same performance issues (and hopefully successful AP units...). This is fun! Damn, I missed this stuff!