Forum > Linux

SETI MB CUDA for Linux

<< < (31/162) > >>

sunu:
Your app_info xml seem ok. If I'm not mistaken 6.4.5 doesn't support running two different versions of the same app simultaneously (e.g. 6.03 on the cpu and 6.08 on the gpu). If you already have 6.03s it will run them, but all new wus will be 6.08 (gpu).

s52d:
Thanks!


--- Quote from: sunu on 15 Jun 2009, 08:58:05 pm ---Your app_info xml seem ok. If I'm not mistaken 6.4.5 doesn't support running two different versions of the same app simultaneously (e.g. 6.03 on the cpu and 6.08 on the gpu). If you already have 6.03s it will run them, but all new wus will be 6.08 (gpu).

--- End quote ---

Uh, ugly.... Maybe perl CPU_GPU_rebrand_V5.pl should be run daily?
One day of 6.4.5: seems to handle X and CPUs better.
- X is more responsive (still stops a bit once in a while, price for using GPU)
- one day with no errors
I guess overwriting libcuda.so helped a bit.

ls -lrt boinc/lib*
-rwxr-xr-x 1 iztok users 1150912 2009-01-16 23:30 libcufft.so.2*
-rwxr-xr-x 1 iztok users  252680 2009-01-16 23:30 libcudart.so.2*
-rwxr-xr-x 1 iztok users  254912 2009-06-15 03:10 libcudart.so*

a bit ugly to have two libcudart at the same time (and outside /usr/lib).



6.3.20 is then next to try, after SETI starts sending new job again.

BR
Iztok





Sligltly later: boinc 6.3.20, library copied to ~/boinc directory, smells ok.
validated WU:
http://setiweb.ssl.berkeley.edu/result.php?resultid=1263122254

X works, CPU load OK (shared among X/CPU-seti/GPU-seti). Now testing for "error ephidemia".

BR
Iztok

a bit later: happened.
two instances of seti-CUDA running, kill -TERM to boinc, 2 minutes to restart from crontab.
A stream of WUs gone with:


<message>
process exited with code 193 (0xc1, -63)
</message> 
<stderr_txt>

SETI@home MB CUDA 608 Linux 64bit SM 1.0 - r06 by Crunch3r :p

setiathome_CUDA: Found 1 CUDA device(s):
   Device 1 : GeForce GTX 260
           totalGlobalMem = 938803200
           sharedMemPerBlock = 16384 
           regsPerBlock = 16384     
           warpSize = 32
           memPitch = 262144
           maxThreadsPerBlock = 512
           clockRate = 1350000     
           totalConstMem = 65536
           major = 1             
           minor = 3
           textureAlignment = 256
           deviceOverlap = 1
           multiProcessorCount = 27
setiathome_CUDA: CUDA Device 1 specified, checking...
   Device 1: GeForce GTX 260 is okay
SETI@home using CUDA accelerated device GeForce GTX 260
setiathome_enhanced 6.01 Revision: 402 g++ (GCC) 4.1.2 20070925 (Red Hat 4.1.2-33)
libboinc: BOINC 6.5.0

Work Unit Info:
...............
WU true angle range is :  0.410807
Optimal function choices:
-----------------------------------------------------
name               
-----------------------------------------------------
              v_BaseLineSmooth (no other)
            v_GetPowerSpectrum 0.00019 0.00000
                   v_ChirpData 0.01607 0.00000
                  v_Transpose4 0.00651 0.00000
               FPU opt folding 0.00152 0.00000

SETI@home MB CUDA 608 Linux 64bit SM 1.0 - r06 by Crunch3r :p

setiathome_CUDA: Found 1 CUDA device(s):
   Device 1 : GeForce GTX 260
           totalGlobalMem = 938803200
           sharedMemPerBlock = 16384 
           regsPerBlock = 16384
           warpSize = 32
           memPitch = 262144
           maxThreadsPerBlock = 512
           clockRate = 1350000
           totalConstMem = 65536
           major = 1
           minor = 3
           textureAlignment = 256
           deviceOverlap = 1
           multiProcessorCount = 27
setiathome_CUDA: CUDA Device 1 specified, checking...
   Device 1: GeForce GTX 260 is okay
SIGSEGV: segmentation violation
Stack trace (16 frames):

setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x47cba9]
/lib64/libpthread.so.0[0x7f3f289e2f30]
/usr/lib64/libcuda.so.1[0x7f3f29454020]
/usr/lib64/libcuda.so.1[0x7f3f29459d84]
/usr/lib64/libcuda.so.1[0x7f3f2942310f]
/usr/lib64/libcuda.so.1[0x7f3f291aeb3b]
/usr/lib64/libcuda.so.1[0x7f3f291bf46b]
/usr/lib64/libcuda.so.1[0x7f3f291a7211]
/usr/lib64/libcuda.so.1(cuCtxCreate+0xaa)[0x7f3f291a0faa]
setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x5ace4b]
setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x40d4ca]
setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x419f23]
setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x424c7d]
setiathome-CUDA-6.08.x86_64-pc-linux-gnu[0x407f60]
/lib64/libc.so.6(__libc_start_main+0xe6)[0x7f3f28682526]
setiathome-CUDA-6.08.x86_64-pc-linux-gnu(__gxx_personality_v0+0x241)[0x407be9]

Exiting...

</stderr_txt>




sunu:

--- Quote from: s52d on 16 Jun 2009, 01:34:20 am ---6.3.20 is then next to try, after SETI starts sending new job again.

--- End quote ---

Do you mean 6.6.20? 6.3.20 is pretty old now.

Try something else. Put in your app_info.xml

   <avg_ncpus>1.0000</avg_ncpus>
   <max_ncpus>1.0000</max_ncpus>

in all AKv8 entries and also make all cuda entries 1.0000.  Then put in your cc_config.xml

<ncpus>5</ncpus>

Test it and see how it goes.

s52d:
Done.

let me wait a day or two if it stikes again.

Thanks,

Iztok

Of course: <core_client_version>6.6.20</core_client_version>

Sp@r0:
Hello,

I have some problem with "SETI@Home MB NVIDIA CUDA V6.08", I run my PC under Fedora 11 x86_64, i have a GTS 250. I have setup the CUDA 2.2 driver with the toolkit and the SDK.
The SDK sample apps run without problems.

I use without problem the CPU optimized applications  AK V8 Linux 64 SSSE3  and it run very well, but with the cuda apps i have 2 big problems
* I have got calculation error on all my WU's within a second
* Boinc did not detect Cuda card if i launch it as service but it works if i launch it form a terminal

I have try lots of things with differents version of Cuda but now i'm seek, i do not know the reason of the calcultaion error because i do not know if there is an error log file

Can you give me some tips to solve my problem ???

PS : I m sorry for the quality of my english
 

Navigation

[0] Message Index

[#] Next page

[*] Previous page

Go to full version