Forum > GPU crunching
GTX295 CUDA Issues
madmac:
OK, I decided to splash out and spent £250 getting a second hand 295 as I was fed up watching my position slide down the tables :-(
I built a new (2nd hand) rig, installed my card, latest nvidia driver, turned of sli and put the latest optimised apps on and fired her up.
(It would appear that I have a problem with one of the cores as one gpu just errors typically after 19 sec's but never much longer than that, though there has been a couple of times where it has crunched to about 80% and then errored, but this has only happened with 1 or 2 units out of too many to mention, so might be a fluke.
Reverted back to stock apps - no difference
dropped the memory clocks right down to 500Mhz - no difference, the card is now back at stock speeds with one gpu disabled
This is my new pc
http://setiathome.berkeley.edu/show_host_detail.php?hostid=5400786
And this is the task list - Could someone on here have a look at the errors and see if there is a common theme or if there is anything I can change as Im stumped on this one...
The error messages are not always the same
http://setiathome.berkeley.edu/results.php?hostid=5400786
Is it hardware related or is it software related..
Any help gratefully received
sunu:
99.99% it's a hardware problem. All invalids and error workunits are from the first GPU (device 1).
A bit puzzling is http://setiathome.berkeley.edu/result.php?resultid=1604052193 that gives an out of memory error.
Pizzadude:
Mmmm, thats weird, I've got a gtx295 and been suffering similar issues for about the last three weeks. I thought it was a heat issue and completely dismantled the GTX295 and removed all dustballs etc and replaced heatsink paste with Artic silver. Overall temperatures have reduced by about 5 to 8 degrees but the Seti problem persisted. I assumed it may be a OS or registry issue so I clean installed Win7 64bit with various Nvidia drivers but still the problem persists. The errors always occur on GPU 0. I am not convinced its a hardware issue as all other Cuda apps work flawlessly. The GTX 295 plays intensive games without a hitch.
I have performed a burnin test using furmark which took the GTX295 to 93 degrees well within its 105 degree design limit. In case it was an issue with gtx295 interfacing with my motherboard i removed all over clocks from my I7 processor and memory and put everything back to stock Intel settings.
Still GPU 0 throws an error every couple units or sometimes 10 in a row.
:-\
Pepi:
weak power supply?
madmac:
--- Quote from: Pizzadude on 11 May 2010, 03:39:20 am --- I am not convinced its a hardware issue as all other Cuda apps work flawlessly. The GTX 295 plays intensive games without a hitch.
I have performed a burnin test using furmark which took the GTX295 to 93 degrees well within its 105 degree design limit.
Still GPU 0 throws an error every couple units or sometimes 10 in a row.
:-\
--- End quote ---
l
Interesting, can you clarify the 'all other CUDA apps work flawlessly' bit?
I have read that this is an issue with the older dual pcb versions over on the seti forums
I have the problem that 99.99% of wu's error on one of the gpu's
Trying to get my money back...
Navigation
[0] Message Index
[#] Next page
Go to full version