Forum > GPU crunching
CUDA_V12_app
Sutaru Tsureku:
Hello opt. crew!
I 'helped(?)' Jon in the SETI@home NC subforum to install opt. apps.
Core i7 920 & 2x GTX295 (old 2x PCB), Win7 64bit.
But after installation he have CUDA probs/errors.
Please if you have time, have a small look here: 'RAC with 2 gtx295'
IIRC, nVIDIA_driver_191.x and all stock and everything was well.
The probs started here.. 'Message 962334'.
In my messages you can find some interesting infos about his system.. but now - I'm out of ideas.. :-\
Thanks! :)
Richard Haselgrove:
You mean when you told him to install the VLAR_kill application, without explaining what it does, how it works, and the requirement for a user of 'anonymous platform' applications to manage and maintain their own science app from that point forward?
Most of his errors are -6 "Bad workunit header". It's a VLAR WU. VLAR kills it. That's what it does.
Task 1478354101 is interesting. -6 error, but no VLAR_kill message. Raistmer?
There are also a number of "Incorrect function. (0x1) - exit code 1 (0x1)", after what would appear to be full-term runtimes, but with none of the standard data in stderr_txt. Anyone?
Raistmer:
Yes, most errors just VLAR rejections. But there are some that very similar to my own troubles with 9400GT in dual-GPU config on Core2 Duo host.
Same "0" available memory readings time to time, same "unknown error".
For my own host it was only one solution - to remove 9400GT from it and leave 9600GSO only. It works perfect now.
Also, 9400GT works just perfect in Q9450 host.
What the reasons for such behavior?
I see 3 possibilities:
1) overheating.
2) system underpowered
3) system PCI-E bus overloaded and brings corruption to bus transfers.
Check them.
last one could be checked by using bandwidth sample from nVidia's CUDA or OpenCL samples.
Raistmer:
--- Quote from: Richard Haselgrove on 13 Jan 2010, 10:47:53 am ---
Task 1478354101 is interesting. -6 error, but no VLAR_kill message. Raistmer?
--- End quote ---
Perhaps task was aborted in especially rough way and stderr buffer was no flushed into file.
I'm more concerned with such errors:
http://setiathome.berkeley.edu/result.php?resultid=1478354023
efmer (fred):
--- Quote from: Raistmer on 13 Jan 2010, 10:56:49 am ---Yes, most errors just VLAR rejections. But there are some that very similar to my own troubles with 9400GT in dual-GPU config on Core2 Duo host.
Same "0" available memory readings time to time, same "unknown error".
For my own host it was only one solution - to remove 9400GT from it and leave 9600GSO only. It works perfect now.
Also, 9400GT works just perfect in Q9450 host.
--- End quote ---
The system has 2 old 2 pcb 295's probably the worst cards that nVidia made. They are not suitable for CUDA work, one maybe but two together get way way too hot.
I had my experience with them, and not good. The newer 1 PCB version or the 295 are ok though, quite a different design.
And to top thing off he uses Win 7 with drivers I couldn't get to work properly with my 2 295. Too many driver crashes.
So this system is asking for trouble and he is sometimes OC them as well. So not the best testbed.
Navigation
[0] Message Index
[#] Next page
[*] Previous page
Go to full version