It works!
Yellow_Horror:
No freezes with V7 so far. I think the V9 app from the multi-GPU package is the one to blame.
Raistmer:
--- Quote from: Yellow_Horror on 24 Feb 2009, 03:27:17 pm ---No freezes with V7 so far. I think the V9 app from the multi-GPU package is the one to blame.
--- End quote ---
Replace the DLLs with older versions.
Yellow_Horror:
--- Quote from: Raistmer on 24 Feb 2009, 03:31:35 pm ---Replace the DLLs with older versions.
--- End quote ---
Got a freeze with the same symptoms using MB_6.08_mod_CUDA_V9.exe with old DLLs.
Jason G:
Yeah. I've now confirmed this behaviour on my machine too (V9 with the old DLLs). At about 6.4% the task spontaneously paused: it was marked 'Waiting to Run' but kept the GPU at full use (judging by temperature) for an hour with no progress (it normally finishes in under 30 minutes or so). Increasing ncpus by 1 (and rereading the config file) started up another Astropulse task instead :-\ so I reset ncpus and restarted BOINC. The task resumed normally, but the next task got stuck in the same way. No obvious complaints in stderr, and the angle ranges of both tasks were ~0.44.
I've switched back to v7vlarkill to check that everything else is OK, and all is running normally (2x AP + 1x CUDA), with Maik's script monitoring the show (modified to restart BOINC in case of a stuck WU instead of terminating the process).
Important note: I am using a development BOINC 6.6.9 at this time (after getting the same response from 6.6.10), with 'work_fetch_debug' turned on, and I believe some mechanism in these new versions may be causing the waiting. I get the impression that some 'twiddling' of the scheduling operations is going on behind the scenes, interacting with the app while trying to get things 'right' for the normal user. I don't know for sure, partly because I don't know what some of the more detailed log messages mean.
Jason
Raistmer:
--- Quote from: Jason G on 25 Feb 2009, 11:07:59 am ---Yeah. I've now confirmed this behaviour on my machine too (V9 with the old DLLs). At about 6.4% the task spontaneously paused: it was marked 'Waiting to Run' but kept the GPU at full use (judging by temperature) for an hour with no progress (it normally finishes in under 30 minutes or so). Increasing ncpus by 1 (and rereading the config file) started up another Astropulse task instead :-\ so I reset ncpus and restarted BOINC. The task resumed normally, but the next task got stuck in the same way. No obvious complaints in stderr, and the angle ranges of both tasks were ~0.44.
--- End quote ---
'Waiting to Run' is BOINC's mark. The app refused to wait and continued crunching. Pity that you didn't look at state.sah for that task to see whether it progressed or not...
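A rough sketch of that check, in case it helps next time. It assumes state.sah is rewritten whenever the app checkpoints, so unchanged content over a few checkpoint periods suggests the task is not progressing. The function names, the polling interval and the example path are made up for illustration; nothing here comes from BOINC or from Maik's script.

```python
import hashlib
import time


def fingerprint(path):
    """Return a SHA-1 hex digest of the file's bytes, or None if it cannot be read."""
    try:
        with open(path, "rb") as f:
            return hashlib.sha1(f.read()).hexdigest()
    except OSError:
        return None


def progressed(before, after):
    """True if the file was readable the second time and its content changed."""
    return after is not None and before != after


def watch(path, interval_s=300):
    """Fingerprint the file twice, interval_s seconds apart, and compare.

    interval_s is an assumed value; it should be longer than the
    checkpoint interval configured in BOINC.
    """
    before = fingerprint(path)
    time.sleep(interval_s)
    return progressed(before, fingerprint(path))
```

Something like `watch(r"C:\BOINC\slots\0\state.sah")` (hypothetical path, adjust to the actual slot directory) returning False while the GPU is hot would point to the app spinning without progress, and True would mean it keeps crunching despite the 'Waiting to Run' mark.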