Forum > Linux
SETI MB CUDA for Linux
rja:
I tried the MB only version of setiathome-CUDA-6.08.i686.tar.bz2 and got the GaussFit_kernel errors that smurf mentioned earlier in « Reply #7 on: 21 Jan 2009, 05:53:48 pm » after about 14 minutes or processing.
It has an Nvidia 8800 GTS 640MB. The GPU core temp is usually 68C, but went to 80C when processing a CUDA workunit. There was a single cpu process of setiathome running at 100%.
Would it help to use a newer version of boinc than 6.4.5?
Fedora 9, nvidia-graphics180.22-kmdl-2.6.27.12-78.2.8.fc9-180.22-106.fc9.i686 from atrpms, boinc-client-6.4.5-2.20081217svn.fc9.i386 from Fedora, with the latest Nvidia cudatoolkit_2.1_linux32_fedora9.run, for this computer - http://setiathome.berkeley.edu/show_host_detail.php?hostid=4131779
Here is an example taskid http://setiathome.berkeley.edu/result.php?resultid=1154715237
I tried removing the cuda 2.1 libs and using the cudalibs (2.0?) that were in setiathome-CUDA-6.08.i686.tar.bz2, but got the same GaussFit_kernel errors.
This was in a different workunit's slots/0/stderr.txt before it was uploaded:
SETI@home MB CUDA 608 Linux 32bit SM 1.0 - r06 by Crunch3r :p
setiathome_CUDA: Found 1 CUDA device(s):
Device 1 : GeForce 8800 GTS
totalGlobalMem = 670760960
sharedMemPerBlock = 16384
regsPerBlock = 8192
warpSize = 32
memPitch = 262144
maxThreadsPerBlock = 512
clockRate = 1188000
totalConstMem = 65536
major = 1
minor = 0
textureAlignment = 256
deviceOverlap = 0
multiProcessorCount = 12
setiathome_CUDA: CUDA Device 1 specified, checking...
Device 1: GeForce 8800 GTS is okay
SETI@home using CUDA accelerated device GeForce 8800 GTS
setiathome_enhanced 6.01 Revision: 402 g++ (GCC) 4.1.2 20070925 (Red Hat 4.1.2-33)
libboinc: BOINC 6.5.0
Work Unit Info:
...............
WU true angle range is : 0.447901
Optimal function choices:
-----------------------------------------------------
name
-----------------------------------------------------
v_BaseLineSmooth (no other)
v_vGetPowerSpectrumUnrolled2 0.00010 0.00000
sse1_ChirpData_ak 0.00814 0.00098
v_vTranspose4 0.00381 0.00000
BH SSE folding 0.00144 0.00000
Cuda error 'GaussFit_kernel' in file './cudaAcc_gaussfit.cu' in line 506 : invalid configuration argument.
Cuda error 'GaussFit_kernel' in file './cudaAcc_gaussfit.cu' in line 506 : invalid configuration argument.
and lots more of these Cuda error 'GaussFit_kernel' lines
zjones:
I have been trying to get the SETI MB CUDA client running and ever work unit so far immediately hits a computation error as soon as it trys to crunch.
The error log looks like this:
<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
process exited with code 127 (0x7f, -129)
</message>
<stderr_txt>
setiathome-CUDA-6.08.x86_64-pc-linux-gnu: error while loading shared libraries: libcufft.so.2: invalid ELF header
</stderr_txt>
]]>
The machine has a AMD Opteron 248, with 2 Quadro 5600 FX. These are equivalent to GeForce 8800 GTXs and G80 chipsets. I am using CentOS 5.2 (x86_64) with Linux Kernel 2.6.18-8.el5. I am using NVIDIA drivers x86_64-180.22 and CUDA toolkit 2.1 64-bit. I have tried BOINC clients 6.4.5 and 6.6.2. I have tried using the CUDA toolkit libs and the ones provided in Crunch3r's package.
Have any of you run into this problem and/or have any suggestions?
Thanks.
ML1:
--- Quote from: bigzachattack on 08 Feb 2009, 03:12:41 pm ---I have been trying to get the SETI MB CUDA client running and ever work unit so far immediately hits a computation error as soon as it trys to crunch.
The error log looks like this:
[...]
setiathome-CUDA-6.08.x86_64-pc-linux-gnu: error while loading shared libraries: libcufft.so.2: invalid ELF header
[...]
The machine has a AMD Opteron 248, with 2 Quadro 5600 FX. These are equivalent to GeForce 8800 GTXs and G80 chipsets. I am using CentOS 5.2 (x86_64) with Linux Kernel 2.6.18-8.el5. I am using NVIDIA drivers x86_64-180.22 and CUDA toolkit 2.1 64-bit. I have tried BOINC clients 6.4.5 and 6.6.2. I have tried using the CUDA toolkit libs and the ones provided in Crunch3r's package.
Have any of you run into this problem and/or have any suggestions?
--- End quote ---
'Tis working fine here and for the same revision for the nVidia drivers.
Kernel 2.6.18 is from a while ago now... It could well be that Crunch3r has used a much more recent kernel and a more recent version of gcc for his compiles.
Can you try a more recent kernel/distro?
I'm using Mandriva 2.6.27.7-server-1mnb.
Good luck,
Martin
CorranHorn:
It's the same problem on my computer.
http://setiathome.berkeley.edu/show_host_detail.php?hostid=4023395
Cuda is ok
--- Quote ---chess@chess-desktop:~/Documents/setiathome-CUDA-6.08.x86_64$ ldd setiathome-CUDA-6.08.x86_64-pc-linux-gnu
linux-vdso.so.1 => (0x00007ffff01ff000)
libcufft.so.2 => /usr/lib/libcufft.so.2 (0x00007f80e7cbc000)
libcudart.so.2 => /usr/lib/libcudart.so.2 (0x00007f80e7a7e000)
libcuda.so.1 => /usr/lib/libcuda.so.1 (0x00007f80e7607000)
libstdc++.so.6 => /usr/lib/libstdc++.so.6 (0x00007f80e72fa000)
libm.so.6 => /lib/libm.so.6 (0x00007f80e7075000)
libpthread.so.0 => /lib/libpthread.so.0 (0x00007f80e6e59000)
libc.so.6 => /lib/libc.so.6 (0x00007f80e6ae7000)
libdl.so.2 => /lib/libdl.so.2 (0x00007f80e68e3000)
libgcc_s.so.1 => /lib/libgcc_s.so.1 (0x00007f80e66cb000)
librt.so.1 => /lib/librt.so.1 (0x00007f80e64c2000)
libz.so.1 => /usr/lib/libz.so.1 (0x00007f80e62aa000)
/lib64/ld-linux-x86-64.so.2 (0x00007f80e7fd6000)
--- End quote ---
OS
--- Quote ---Operating System Linux
2.6.27-11-generic
--- End quote ---
dtiger:
--- Quote from: smurf on 21 Jan 2009, 05:53:48 pm ---I crunshed several units. One was already correctly validated. The others have error messages:
<core_client_version>6.4.5</core_client_version>
<![CDATA[
<stderr_txt>
file './cudaAcc_gaussfit.cu' in line 506 : invalid configuration argument.
Cuda error 'GaussFit_kernel' in file './cudaAcc_gaussfit.cu' in line 506 : invalid configuration argument.
Cuda error 'GaussFit_kernel' in file './cudaAcc_gaussfit.cu' in line 506 : invalid configuration argument.
Cuda error 'GaussFit_kernel' in file './cudaAcc_gaussfit.cu' in line 506 : invalid configuration argument.
Cuda error 'GaussFit_kernel' in file './cudaAcc_gaussfit.cu' in line 506 : invalid configuration argument.
Cuda error 'GaussFit_kernel' in file './cudaAcc_gaussfit.cu' in line 506 : invalid configuration argument.
....
I have a GeForce 8800 GT. Whats wrong here ? :(
--- End quote ---
--- Quote from: smurf on 24 Jan 2009, 12:57:28 pm ---
Now I updated to 180.22. But same situation. The card is not overclocked and temperature is 62 C. There are two types of cases:
- small WUs with a time to completion of ~7 min are running fine without any errors and granted credit is the same as claimed credit (14-15)
- big WUs with a time to completion of ~25 min have the gausfit error message and the granted credit is smaller than the claimed credit (claimed ~50, granted ~40)
--- End quote ---
I have exactly the same situation.
Video card is 8600 GT 256 MB, NVIDIA 180.29 driver.
Exactly the same uncountable error messages in stderr out, the same clamed/granted credits.
http://setiathome.berkeley.edu/result.php?resultid=1170131986
Navigation
[0] Message Index
[#] Next page
[*] Previous page
Go to full version