Forum > Linux

SETI MB CUDA for Linux

<< < (80/162) > >>

letni:
Hey folks, I'm trying to set up 2 separate CUDA devices on a dedicated 64 bit Linux system with the setiathome-6.08.CUDA_2.2_x86_64-pc-linux-gnu.  I have the system running with 1 CUDA (9600GSO) no problem, but I decided to stick in the 8800GTS from my desktop machine and now I'm getting this message:

CUDA device: GeForce 8800GTS (driver version 0, compute capability 1.0, 320MB, est. 41GFLOPS)
CUDA device (not used): GeForce 8800GTS (driver version 0, compute capability 1.0, 320MB, est. 41GFLOPS)

For some reason It doesn't detect my 9600 GSO as an 9600 anymore, but still uses it (the card is physically hot).  The 8800 I added in as a secondary card is cool to the tough and I don't see a thread running for it.   Here is the LSPCI output:

02:00.0 VGA compatible controller: nVidia Corporation GeForce 9600 GSO (rev a2)
03:00.0 VGA compatible controller: nVidia Corporation G80 [GeForce 8800 GTS] (rev a2)

Is it even possible to use CUDA with two different devices?  If so, what am I missing?

Thanks,

Letni

sunu:
After a small hiatus I'm back  :)


--- Quote from: riofl on 21 Aug 2009, 08:22:04 pm --- now i dont know about desktop settings much but there is one change i made in the past few weeks with nvidia-settings. i unchecked Sync to VBlank in xvideo settings and also unchecked sync to vblank and allow flipping in the opengl settings. wasnt sure what they did but there seemed to be no difference.   should they be checked?

--- End quote ---
Sync to vblank and flipping don't have anything to do with cuda computations. They are about tearing.


--- Quote from: riofl on 21 Aug 2009, 08:22:04 pm ---power mizer which seems to not have settings says adaptive clocking enabled performance level2 perforamce mode desktop. level2 is the 3d settings above however i remember when i first got the card, performance mode said maximum performance and somewhere along the line it changed to desktop. since the other settings are the same i can only assume it is a function of which driver is being used for which text shows up.

--- End quote ---
I don't like powermizer at all, but it doesn't seem you have a problem there.


--- Quote from: riofl on 21 Aug 2009, 08:27:17 pm ---since 6.9.0 reports 2 teslas, could it be possible it is mixing up which device is 0 and which is 1? because it is completely odd since the tesla is running gpu 500mhz and memory 900mhz so it should be considerably slower. it rates both devices it thinks are teslas at 74gflops yet the 285 is rated by 6.6.11 as 127gflops

--- End quote ---
Again, what boinc reports is irrelevant, just cosmetic. What cuda client sees is important and in your case it reports your cards right.


--- Quote from: riofl on 21 Aug 2009, 08:27:17 pm ---i am going to reboot this tomorrow so when i do i am going to go over the settings in cmos. presently it is set to auto on pci-e bus frequency. maybe i will fix it at 100mhz .. it could be doing God knows what in auto.

--- End quote ---
I don't think pci-e bus frequency has any noticeable effect in cuda speed and even if it did, it should affect both your cards, not only one.


--- Quote from: riofl on 22 Aug 2009, 10:45:36 pm ---also the 3 digit time workunits are still the 285 and 2 digit the tesla.  i wonder if it has something to do with how busy my desktops are? i have quite a lot going on 24/7 with 18 gkrellm server monitors running in one desktop, usually 4 or 5 browser windows in different desktops with maybe 28 or so tabs open, average 8 or 10 ssh konqueror tabs open into our servers, email, virtualbox running xp which also runs boinc, kopete, 8 or 9 postit notes in the various desktops, a few kedit windows open plus momentary things like adobe reader, smplayer or whatever.. im in totally new territory here. my experience in graphics cards is plug it in and make sure it works with a stable and peppy screen :)

however the 'busyness' of the desktops is not new and was basically the same when i had 10-13min workunits out of both cards.


--- End quote ---
This is a very busy desktop. Have you tried running a few workunits with absolutely nothing of the above running? I think your times will return to "normal".


--- Quote from: vejpuste on 25 Aug 2009, 07:55:46 am ---Hello,
I try Crunch3rs CUDA seti application and search google and this forum too but result is not OK.
I tried to CUDA 2.1 and aplication setiathome-CUDA-6.08.x86_64-pc-linux-gnu. This compute OK but take 100% of CPU.
http://setiathome.berkeley.edu/result.php?resultid=1340190227

Now I test CUDA 2.3 with same apliaction and result is SIGSEGV: segmentation violation

I try add setiathome-CUDA-6.08.x86_64-pc-linux-gnu to /usr/local/bin but still not working.
CUDA 2.2 take segmentation violation too.
Thanks for any ideas
Libor

--- End quote ---
Cuda 2.1 libraries have a bug, that's why you have 100% CPU utilisation. You need 2.2 or later. Also please follow my post in http://lunatics.kwsn.net/linux/seti-mb-cuda-for-linux.msg19014.html#msg19014 very carefully.


--- Quote from: vejpuste on 25 Aug 2009, 08:59:23 am ---I try this application too but here is another problem
ldd setiathome-6.08.CUDA_2.2_x86_64-pc-linux-gnu
./setiathome-6.08.CUDA_2.2_x86_64-pc-linux-gnu: /usr/lib64/libstdc++.so.6: version `GLIBCXX_3.4.9' not found (required by ./setiathome-6.08.CUDA_2.2_x86_64-pc-linux-gnu)
CentOS5 have no GLIBCXX_3.4.9 in updates now.
Libor

--- End quote ---
Centos, since it's more enterprise oriented, uses old versions of ...well everything. Have you had any success?

@riofl and lordvader about kernel versions
Have you compiled these kernels yourselves or have you got them from elsewhere? Maybe some performance/optimization options you left out? Do you have any nvidia related errors in your syslog when running cuda? Any other observations with these newer kernels?


--- Quote from: letni on 31 Aug 2009, 06:59:36 pm ---Is it even possible to use CUDA with two different devices?  If so, what am I missing?

--- End quote ---
Of course it is. Any link to your host? Also try putting

<use_all_gpus>1</use_all_gpus>

in the options section of your cc_config.xml and tell boinc to read the config file.

riofl:
i always compile my own kernels and have been since i first ran linux 8 yrs ago. in switching back and forth there was no difference in my cuda processing times. at first i thought there was but it was just different size workunits.

no, i have not tried just running boinc without a gui.. i will try that this coming weekend when i can spare some downtime from work and monitoring the servers. will let it run for 1 hr with no X running and then will go in and see if there are any differences.   

thing is, the usage of my desktops has not changed much at all during the past year so i had the same stuff open with the 13min workunits a few months ago. will be interesting to see if the 3 digit numbers move into 2 digit though on the tasks report.

thanks for the info about video settings. i was not sure what they did but when i did not see any changes, i figured "off" may be better..  i hate powermizer myself but i cannot find any options to turn it off and leave the card in high perf mode at all times. every time i spot check it its always in hi perf mode so maybe my temps are not high enough to trigger it (assuming temp is its onlly trigger) and if idle is a trigger, my desktop is never idle even when i go to bed, all the gkrellm monitors are advancing their graphs every second.

seems so strange with all the mb servers down, my cuda cards are both idling at around 46c. really odd since i am used to them being in the low or mid 60s all the time.

sunu:

--- Quote from: riofl on 02 Sep 2009, 12:05:14 pm ---no, i have not tried just running boinc without a gui.. i will try that this coming weekend when i can spare some downtime from work and monitoring the servers. will let it run for 1 hr with no X running and then will go in and see if there are any differences.   

--- End quote ---
Leave X, just close all those apps you have running. Just the desktop with boinc in the background.


--- Quote from: riofl on 02 Sep 2009, 12:05:14 pm ---thing is, the usage of my desktops has not changed much at all during the past year so i had the same stuff open with the 13min workunits a few months ago. will be interesting to see if the 3 digit numbers move into 2 digit though on the tasks report.

--- End quote ---
The bigger multibeam workunits started about a month or two ago.


--- Quote from: riofl on 02 Sep 2009, 12:05:14 pm ---i hate powermizer myself but i cannot find any options to turn it off and leave the card in high perf mode at all times. every time i spot check it its always in hi perf mode so maybe my temps are not high enough to trigger it (assuming temp is its onlly trigger) and if idle is a trigger, my desktop is never idle even when i go to bed, all the gkrellm monitors are advancing their graphs every second.

--- End quote ---
Many people have tried many ways to turn off powermizer usually with no success. :D   Powermizer levels are triggered by GPU usage or very high (95+°C) temperatures.


--- Quote from: riofl on 02 Sep 2009, 12:05:14 pm ---seems so strange with all the mb servers down, my cuda cards are both idling at around 46c. really odd since i am used to them being in the low or mid 60s all the time.

--- End quote ---
I have some WUs cached for a few days more  ;D

riofl:

--- Quote from: sunu on 02 Sep 2009, 01:48:51 pm ---
--- Quote from: riofl on 02 Sep 2009, 12:05:14 pm ---no, i have not tried just running boinc without a gui.. i will try that this coming weekend when i can spare some downtime from work and monitoring the servers. will let it run for 1 hr with no X running and then will go in and see if there are any differences.   

--- End quote ---
Leave X, just close all those apps you have running. Just the desktop with boinc in the background.

ok ill close down all my 'server' functions as well like my jabber server, bind, etc. so its just x and boinc running.


--- Quote from: riofl on 02 Sep 2009, 12:05:14 pm ---thing is, the usage of my desktops has not changed much at all during the past year so i had the same stuff open with the 13min workunits a few months ago. will be interesting to see if the 3 digit numbers move into 2 digit though on the tasks report.

--- End quote ---
The bigger multibeam workunits started about a month or two ago.

hehe thats about the time i started noticing issues. maybe they're not issues afer all.



--- Quote from: riofl on 02 Sep 2009, 12:05:14 pm ---i hate powermizer myself but i cannot find any options to turn it off and leave the card in high perf mode at all times. every time i spot check it its always in hi perf mode so maybe my temps are not high enough to trigger it (assuming temp is its onlly trigger) and if idle is a trigger, my desktop is never idle even when i go to bed, all the gkrellm monitors are advancing their graphs every second.

--- End quote ---
Many people have tried many ways to turn off powermizer usually with no success. :D   Powermizer levels are triggered by GPU usage or very high (95+°C) temperatures.

ok well i hardly do anything involving true graphics besides cuda running on that stuff and i have my hardware monitors set to shut the system down if the gpu gets to 80c.. once i adjusted the fans and air flow in the case they have never gone above 70c.


--- Quote from: riofl on 02 Sep 2009, 12:05:14 pm ---seems so strange with all the mb servers down, my cuda cards are both idling at around 46c. really odd since i am used to them being in the low or mid 60s all the time.

--- End quote ---
I have some WUs cached for a few days more  ;D

--- End quote ---

lucky. im set for 10 days but ran out of cuda.. still have 400+ cpu units which are all vlar/vhar. been trying to get more since saturday.

i'll just wait out the stampede and give the servers a chance to settle and then go for it again.

Navigation

[0] Message Index

[#] Next page

[*] Previous page

Go to full version