Advanced search

Message boards : Graphics cards (GPUs) : gtx295 returning nearly constant errors

Author Message
Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10014 - Posted: 21 May 2009 | 8:46:15 UTC

I had a gtx295 running gpugrid with only the occasional w/u that errors.

Something changed on the 18th and now i get nearly constant errors.
I'm running boinc 6.6.28 with NV 185.85 drivers on Vista HP 64bit.

I cant work out what changed as all automatic updates are turned off and i update everything manually every weekend.Any suggestions after looking through my task history would be much appreciated.

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10024 - Posted: 21 May 2009 | 10:24:34 UTC - in response to Message 10014.

Mhh, occasionally you still finish a WU. And it looks like you alread tried a project reset? Otherwise.. tried a reboot and to switch power off + remove the power cord for >10 mins?

MrS
____________
Scanning for our furry friends since Jan 2002

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10031 - Posted: 21 May 2009 | 11:58:05 UTC - in response to Message 10024.

I've just gone back to basics :) un-installed everything nvidia and boinc related. Restarted and cleaned out all traces of nvidia anywhere. Fresh download of 185.85 and fresh download of boinc client.
Reattached to project and now running first 2 W/U. Fingers crossed it was just a bit of corruption somewhere amongst all the files i just replaced.

2 hours into the W/u so far and all looks good.

Will update when they finish.



Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10032 - Posted: 21 May 2009 | 12:22:33 UTC - in response to Message 10031.

And there goes the first error

Profile Bender10
Avatar
Send message
Joined: 3 Dec 07
Posts: 167
Credit: 8,368,897
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 10035 - Posted: 21 May 2009 | 14:21:54 UTC - in response to Message 10032.

Try going back to the 182.50 driver....
____________


Consciousness: That annoying time between naps......

Experience is a wonderful thing: it enables you to recognize a mistake every time you repeat it.

popandbob
Send message
Joined: 18 Jul 07
Posts: 67
Credit: 40,277,822
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10043 - Posted: 21 May 2009 | 18:55:22 UTC
Last modified: 21 May 2009 | 18:56:41 UTC

Since the re-install the errors are only on GPU1 (meaning the second GPU of the 295)
Make sure you've got it configured correctly.

Bob

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10049 - Posted: 21 May 2009 | 20:46:16 UTC - in response to Message 10043.

Since the re-install the errors are only on GPU1 (meaning the second GPU of the 295)
Make sure you've got it configured correctly.

Bob

Not sure what there is to configure? Its not in sli mode or gpu 1 would not exist.
Physx is on as it always has been. Am i missing something else as i thought that was pretty much it. Is there something else?


Try going back to the 182.50 driver....


Thats what i'm now trying. Just got it ready and started 2 more w/u



popandbob
Send message
Joined: 18 Jul 07
Posts: 67
Credit: 40,277,822
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10057 - Posted: 22 May 2009 | 0:46:50 UTC

The new drivers for some people made the need for a dummy plug/monitor on the second GPU unnecessary. As long as phyisx is enabled on the second GPU it should detect it. Others have found out they still need the dummy plug/monitor on the card.
The desktop still needs to be extended onto the second GPU.

The new drivers have changed the rules for cuda and has caught a few people off guard. Of course for some the new drivers wouldn't work either.

Bob

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10064 - Posted: 22 May 2009 | 11:34:32 UTC - in response to Message 10057.

I never needed a plug and have never had to extend the desktop onto 2nd GPU.
No drivers or settings changed between it working and it not working.
In the NVIDIA control panel Physx is enabled and the SLI is set to not use multiple gpu mode as it always has been.


popandbob
Send message
Joined: 18 Jul 07
Posts: 67
Credit: 40,277,822
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10067 - Posted: 22 May 2009 | 18:50:24 UTC

The amazing thing is about the drivers is that sometimes they change their mind...

For this next batch of work please dont abort any as you keep aborting the work on card 0 (the first of the 2 in the 295) and we haven't seen if it will error out or not.

Bob

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10073 - Posted: 22 May 2009 | 22:29:33 UTC - in response to Message 10067.

For this next batch of work please dont abort any as you keep aborting the work on card 0 (the first of the 2 in the 295) and we haven't seen if it will error out or not.

Bob

OK i'm now running 2 w/u and will be leaving it overnight so both cards should be on their second W/u's by the time i get up :)

Will see if my hard work today paid off or not. Been a busy bee indeed.

Thats the reason i cancelled all jobs and dropped out for a fresh start.

Fingers crossed now or its definately the card!!

popandbob
Send message
Joined: 18 Jul 07
Posts: 67
Credit: 40,277,822
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10081 - Posted: 23 May 2009 | 5:11:10 UTC

I'm sure its not the card.
I see the new host (37017)
currently 4 errors all on the second GPU of the 295.
Nothing yet from the first GPU...
Sounding more like a config issue as there would be errors from the first GPU as well if something was wrong...

Bob

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10085 - Posted: 23 May 2009 | 10:24:10 UTC - in response to Message 10081.

I was thinking the fault might be on the 2nd gpu only hence thats why it is failing.

might try 2 seti w/u and see if both finish.

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10086 - Posted: 23 May 2009 | 10:28:11 UTC - in response to Message 10085.

I have reinstalled everything including vista so everything was clean instal with fresh driver downloads.
Seeing as both gpu's get configured with same setting how can it be a setting issue. If i plug monitor into 2nd gpu instead of first and it still fails it must be card. However if first gpu starts failing tasks then it must be settings. If thats how it works anyway.

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10092 - Posted: 23 May 2009 | 14:59:38 UTC - in response to Message 10086.

Agreed. What about other software? 3DMark or FurMark in SLI mode? Any artefacts?

MrS
____________
Scanning for our furry friends since Jan 2002

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10096 - Posted: 23 May 2009 | 17:11:50 UTC - in response to Message 10092.
Last modified: 23 May 2009 | 18:00:05 UTC

ok tasks are still failing on the 2nd gpu regardless of which socket monitor is plugged into.
I'm going to get in touch with vendor of card on tuesday and see what they say as i refuse to overclock something that cost so much in the first place lmao. So at least my warranty is valid still.
EDIT 1
in the meantime im going to run seti enhanced w/u as they dont fail it seems.

Edit 2: Seti enhanced w/u for cuda fly through without a problem on both cores. I'm still going to try for a replacement as it worked then stopped working within a few weeks of use. I might even be inside the ole 28 day mark :)

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10102 - Posted: 23 May 2009 | 19:41:26 UTC - in response to Message 10096.

Did you watch the temperatures on both projects? I suppose they're rather similar at automatic fan control?

MrS
____________
Scanning for our furry friends since Jan 2002

Curt Timmerman
Send message
Joined: 24 Dec 08
Posts: 13
Credit: 17,931,283
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwat
Message 10114 - Posted: 23 May 2009 | 22:45:51 UTC - in response to Message 10102.

Did you watch the temperatures on both projects? I suppose they're rather similar at automatic fan control?

MrS


I had a similar problem with my gtx295, when I set the fan to 100% duty cycle I stopped having most of the WU errors.

Curt

popandbob
Send message
Joined: 18 Jul 07
Posts: 67
Credit: 40,277,822
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10118 - Posted: 24 May 2009 | 0:21:22 UTC

So its quite clear there is something wrong somewhere with the second GPU

A list of things to try

1) plug monitors into both monitor sockets (if you have a second monitor)If not you could try and switch the monitor over while the PC is running (that sometimes tricks it into seeing 2 GPU's)
2) Ensure the desktop is extended onto the second GPU
3) Try disabling PhysX on the first card while enabling it on the second (I think thats possible)
4) Try disabling PhysX on both

5) Run Cuda memory tester (see sticky) on the second GPU

Bob

P.S. If they'll replace it that may/may not work. Its worth a try if they'll go for it.

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10120 - Posted: 24 May 2009 | 7:20:13 UTC - in response to Message 10118.

My temps for running the gtx have never been over 80 while crunching. I have the fan set to 75% which keeps it down around 72 degrees.

I've tried without physx on both cards.
My system doesn't need to be tricked into "seeing 2 gpu's" there are 2 or it couldn't be failing or starting on the second one.

Are you copy and pasting this from elsewhere cos you've said the same thing repeatedly with the same advice time and time again.
Thats not me having a go, i apreciate everyones help its just you keep telling me to do stuff so the second gpu is detected!! thats the same one that must be detected cos its failing work units.

Cuda mem test i have run a couple times but in the readme it states that "In our
testing, we have found that even "problematic" cards may only fail sporadically
(e.g., once every 50,000 test iterations). Like other stress testing tools,
to properly verify stability MemtestG80 should be run for an extended period of
time."

This test throws up no errors when i run it.

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10126 - Posted: 24 May 2009 | 10:33:29 UTC - in response to Message 10120.

You're totally right, seeing the 2nd GPU is not your problem.

And CUDA memtest.. well, would have been nice if it detected something. But memory doesn't fail that often over time - either it's broken from the beginning or it's the chip. Since your card worked fine and then started throwing errors, I' rather suppose the 2nd chip got damaged / decayed excessively.

72°C sounds fine.. but still, I'd test with the fan speed set to 100% (and leave the room). If that helps you have a clear case for RMA. If not I'd run the regular 3D stability tests I suggested above. If they fail you also have a clear case for RMA.

MrS
____________
Scanning for our furry friends since Jan 2002

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10137 - Posted: 24 May 2009 | 19:46:47 UTC - in response to Message 10126.

As a by note to fixing this maybe it means something who knows but under windows 7 my gtx 295 shows only 1 gpu to boinc no matter how i configure it. 2 monitors plugged in desktops extended and not. Any driver or version all only let boinc see the single gpu.

Just had a fun afternoon anyway playing quickly with W7 :)



Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 10146 - Posted: 24 May 2009 | 23:11:48 UTC - in response to Message 10137.

As a by note to fixing this maybe it means something who knows but under windows 7 my gtx 295 shows only 1 gpu to boinc no matter how i configure it. 2 monitors plugged in desktops extended and not. Any driver or version all only let boinc see the single gpu.

Just had a fun afternoon anyway playing quickly with W7 :)

SLI is off?

I don't know how W7 handles SLI but there is an internal SLI mode on the 295 cards IIRC, XP Does not do that ... or at least none of my installs have done that (yet) ... then again, I have not had the courage to try Vista or W7 ...

THough I seem to increasingly hit the upper bound in memory which in XP is 2.something G ...

popandbob
Send message
Joined: 18 Jul 07
Posts: 67
Credit: 40,277,822
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10150 - Posted: 25 May 2009 | 6:05:57 UTC - in response to Message 10120.

My system doesn't need to be tricked into "seeing 2 gpu's" there are 2 or it couldn't be failing or starting on the second one.


I meant working on 2 GPU's.
What I've seen elsewhere is that the second GPU being the only one to throw errors is the Drivers/CUDA are not happy with the config. This is why I've been suggesting what I have. I'm sorry if I sounded like a broken record.

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 10151 - Posted: 25 May 2009 | 6:48:04 UTC - in response to Message 10150.

Well, I have 3 systems all with 2 or more GPUs running and my error rate is vanishingly small. I mean in almost all cases of error it has turned out to be a bad task or some event where I have caused an issue.

To put it another way, I don't see a problem here ...

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10152 - Posted: 25 May 2009 | 8:31:16 UTC - in response to Message 10151.

how can you not see a problem with the 2nd gpu always erroring w/u Paul. Just because your system is not erroring constantly then all systems are ok? i dont understand that statement.I had the same errors as you to start, the occasional w/u here and there. Then bang, all w/u that 2nd gpu does fail everytime.

Look through my tasks and try to find the last completed w/u on gpu 1.


Popandbob. I salute you for your suggestions i dont mean to be grumpy honest. I can only see the one way of changing sli config. There is no bios option for it and nvidia control panel just has the 2 important ones, SLI and Physx.

I'm nearly back to where i started again with no light at end of tunnel yet but am about to try fresh instal of 64 vista and 182.50 drivers first. Just doing the last updates before i instal boinc again.



Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10159 - Posted: 25 May 2009 | 13:29:30 UTC - in response to Message 10152.
Last modified: 25 May 2009 | 13:40:06 UTC

OK i'm 2hours 46 minutes into running 2 tasks on it.
723126
723205
Fingers crossed and they both finish so i can sleep easy.

May sound silly but its always failed by now...........

The difference.........

Turned Aero interface off........

I'm just running on the vista basic theme so all the fancy graphic card utilising bumf is off

Could it be that simple for my problem!

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 10164 - Posted: 25 May 2009 | 15:58:59 UTC - in response to Message 10152.

how can you not see a problem with the 2nd gpu always erroring w/u Paul. Just because your system is not erroring constantly then all systems are ok? i dont understand that statement.I had the same errors as you to start, the occasional w/u here and there. Then bang, all w/u that 2nd gpu does fail everytime.

Here, as in my house ... I guess I was not clear ... obviously...

I did not say that there was not a problem THERE ...

The way I interpreted the earlier posts was that all systems with 2 or more GPUs are seeing errors on the second GPU. I was saying that I don't see that problem, nothing more or less ...

Sudden changes like what you report sound to me more like a failure of the card than anything else. Do you have other cards? I mean, can you swap things around?

Liuqyn
Send message
Joined: 16 May 08
Posts: 5
Credit: 68,721,860
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 10185 - Posted: 26 May 2009 | 11:43:49 UTC - in response to Message 10146.

As a by note to fixing this maybe it means something who knows but under windows 7 my gtx 295 shows only 1 gpu to boinc no matter how i configure it. 2 monitors plugged in desktops extended and not. Any driver or version all only let boinc see the single gpu.

Just had a fun afternoon anyway playing quickly with W7 :)

SLI is off?

I don't know how W7 handles SLI but there is an internal SLI mode on the 295 cards IIRC, XP Does not do that ... or at least none of my installs have done that (yet) ... then again, I have not had the courage to try Vista or W7 ...

THough I seem to increasingly hit the upper bound in memory which in XP is 2.something G ...


I tried W7 with 2 GTX285's and could only get boinc to see one card regardless of what settings I made.
Has anyone gotten W7 to use more then one GPU?

Profile Edboard
Avatar
Send message
Joined: 24 Sep 08
Posts: 72
Credit: 12,410,275
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 10192 - Posted: 26 May 2009 | 14:41:15 UTC - in response to Message 10081.
Last modified: 26 May 2009 | 14:47:00 UTC

I'm sure its not the card.
I see the new host (37017)
currently 4 errors all on the second GPU of the 295.
Nothing yet from the first GPU...
Sounding more like a config issue as there would be errors from the first GPU as well if something was wrong...

Bob


I have a PC with a gtx295 and a 8800GT. The GPUs assignation is:

GPU#0: gtx295
GPU#1: 8800GT
GPU#2: gtx295

I get many erros with the WUs that are processed in GPU#1. I thought it was dued to mixed GPU chips (g92 and gt200). I changed the 8800GT with a gtx280 with this new GPU assignation:

GPU#0: gtx280
GPU#1: gtx295
GPU#2: gtx295

The same, many WUs being processed on GPU#1 give error. Note that GPU#1 is a different graphic card in each case.

I thought it could be the mobo. I installed the gtx295 in another PC (with only one PCIe for graphics). The GPUs assignation was:

GPU#0: gtx295
GPU#1: gtx295

The same: many WUs being processed in GPU#1 give error...

Windows Vista Home Premium 32 bits and Nvidia Driver 182.50 in boht PCs.

Now I have the PC with gtx295 + 8800GT configured so It only process WUs in the gtx295 (GPU#0 and GPU#2) (the GPU#1 -> 8800GT not processing) and I have NO problem (no errors) but I have one Graphic Card wasted...

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10202 - Posted: 26 May 2009 | 19:42:45 UTC - in response to Message 10192.

Its in a parcel at the post office atm :) will update as it pans out

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10209 - Posted: 26 May 2009 | 21:18:43 UTC - in response to Message 10192.

Edboard,

oh.. what a ****** mess! What does that mean for other people? Take a close look at which GPU errors out. Otherwise.. it can't be a general problem, as some people are running fine. That makes me point to the combination of OS and driver. Would you mind checking one of the 185 series drivers?

MrS
____________
Scanning for our furry friends since Jan 2002

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10228 - Posted: 27 May 2009 | 8:26:07 UTC - in response to Message 10209.

My problem started very suddenly and nothing i did changed anything in the slightest, drivers, O/S, whole different pc which has none of the problems this card gave me and i still get the same result.

Before this started the GTX295 was running both w/u with just the occasional fail on either core so either something is wrong enough with the gpu to fail w/u which it does in 2 different systems or who knows. the other system i tried it in is running 2 w/u on its own without a problem until i put this card in then it also fails on GPU 1.

Profile Edboard
Avatar
Send message
Joined: 24 Sep 08
Posts: 72
Credit: 12,410,275
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 10230 - Posted: 27 May 2009 | 13:14:56 UTC - in response to Message 10209.
Last modified: 27 May 2009 | 13:26:55 UTC

I want to state with facts that many WUs processed in any GPU assigned #1 fails, mainly with the newest units.

I begun to have this kind of problems some time ago with Folding. But NO with GpuGrid/SETI. Now I have this problem with GpuGrid too. SETI works fine.

Joe
Send message
Joined: 1 Sep 08
Posts: 37
Credit: 5,864,088
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwat
Message 10365 - Posted: 1 Jun 2009 | 19:35:49 UTC - in response to Message 10230.

It's Gainward card??? I have ONE GTX 295 from Gainward - one GPU works fine GPUGrid and the other crashed after a few seconds. But its ok for short WUs like SaH. And this not a problem from XP, Vista, Linux or the driver/Boinc version or an unused monitorport, it's the card... Perhaps you can chance it to another...

Profile nutcase
Avatar
Send message
Joined: 16 Oct 08
Posts: 7
Credit: 5,348,057
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 10369 - Posted: 1 Jun 2009 | 21:34:19 UTC

ok, you are getting the same errors as I was.

I tried mine in 2 computers (with different OS's) and errored on both.

I suspected I had a Bad card but I can run benchmarks with no errors.

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10377 - Posted: 2 Jun 2009 | 14:30:44 UTC - in response to Message 10369.

I'm just waiting for the argument with customer support when they say they can find nothing wrong. I have a feeling they wont like my answer that its not fit for the purpose i bought it for, namely gpugrid.

Cant be long now till they get back in touch with me, they have had it since Thursday afternoon :(

pharrg
Send message
Joined: 12 Jan 09
Posts: 36
Credit: 1,075,543
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 10380 - Posted: 2 Jun 2009 | 15:52:12 UTC
Last modified: 2 Jun 2009 | 15:53:13 UTC

Check what kind of slots you have your cards plugged into on the motherboard. Not all PCI-E slots are the same. Some run at x16, some at X8, some at X4, etc. I searched hard when shopping for my motherboard to find one that had 3 full speed x16 slots and that was also capable for triple SLI (though you need SLI disabled when running CUDA like GPUgrid). Then, I made sure to use the x16 slots for my cards. Oh, and the x16 slots have to be spaced correctly too since the newer GTX200 series cards are double thick, or they won't fit. Then, of course, the case has to have room for that setup. Check for motherboard BIOS and chipset driver updates as well. Anyway, just a thought.

Gamekiller
Avatar
Send message
Joined: 17 Jan 09
Posts: 11
Credit: 191,851,704
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwat
Message 10385 - Posted: 2 Jun 2009 | 18:10:18 UTC - in response to Message 10380.

I'm getting a ton of errors with my gtx 295 as well. =[
I just started gpugrid, and this doesn't make me very happy.
I gave up on using the 2nd gpu and just tried using the first.... and the 1st gpu errors out everything... some within second and some after 20 minutes.
My card is not overclocked, and the temps are in a very good range.

Win7, 185.55, 6.6.31

Annoying to say the least.

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10466 - Posted: 9 Jun 2009 | 19:11:21 UTC - in response to Message 10385.

The product GX-191-AS on RMA RMA****** has been replaced, this will be sent on a new order with order number *******.

This will be despatched as soon as possible, subject to stock availability.

Well thats that sorted....they tested the card and the 2nd gpu was indeed faulty. It was ok for a bit then the fault showed under load.

Pharg my slots are both PCIe 2.0 16x so that wasn't the problem.

My system ran ok for 2 weeks till the fault showed itself. There where no system/driver changes to blame between working and not working.

Thanks to you all for keeping me sane while i waited for their decision.

:) cant wait to take my emergency EAX800 XT out and slide a 295 back in. It will be a lot quieter believe it or not :)

Profile Edboard
Avatar
Send message
Joined: 24 Sep 08
Posts: 72
Credit: 12,410,275
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 10492 - Posted: 12 Jun 2009 | 18:29:41 UTC - in response to Message 10365.
Last modified: 12 Jun 2009 | 18:32:30 UTC

It's Gainward card??? I have ONE GTX 295 from Gainward - one GPU works fine GPUGrid and the other crashed after a few seconds. But its ok for short WUs like SaH. And this not a problem from XP, Vista, Linux or the driver/Boinc version or an unused monitorport, it's the card... Perhaps you can chance it to another...



No, it is an EVGA Card.
Boths GPUs work fine with Folding WUs, which last from one to 7 hours each.

Besides, as I have explained in my "messy" post, in some configurations both gtx295 GPUs work fine in GPUGRID, being the 8800GT the one that get errors.

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10517 - Posted: 12 Jun 2009 | 22:12:25 UTC - in response to Message 10466.

My system ran ok for 2 weeks till the fault showed itself. There where no system/driver changes to blame between working and not working.


Or more like: until the fault developed.

MrS
____________
Scanning for our furry friends since Jan 2002

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10681 - Posted: 19 Jun 2009 | 14:56:13 UTC - in response to Message 10517.

Well its here and installed and running its first w/u. New Gainward card.

Faulty one was an Asus GTX295

Hopefully thats my woes sorted out :) will know tomorrow morning :)

Profile madas91
Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10723 - Posted: 20 Jun 2009 | 10:53:56 UTC

Ahhhhhhh lovely 4 tasks done nicely :)

Post to thread

Message boards : Graphics cards (GPUs) : gtx295 returning nearly constant errors

//