Message boards : Number crunching : GPU computation errors
Author | Message |
---|---|
I was a victim of the new nVidia driver that had a fan speed problem. My card ran at over 95 °C for a while and now has compute errors, but it only errors when I run 2 GPUGRID tasks simultaneously on my 9800 GX2. I can run 1 GPUGRID task and one of anything else and not have any errors. I cannot run 2 of the other project's GPU tasks either, because those tasks error out as well. How can I force BOINC to always run 1 GPUGRID task and one of something else? I just can't figure it out... Anyone's help is much appreciated! | |
ID: 16164 | Rating: 0 | |
Short answer: you can't. You can tell BOINC not to use a particular GPU, but that applies to all projects. In your cc_config file, put (within the options tag) <ignore_cuda_dev>0</ignore_cuda_dev> where 0 is the CUDA device number. | |
ID: 16166 | Rating: 0 | |
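The suggestion above can be sketched as a complete cc_config.xml. This is a minimal sketch, assuming the file does not exist yet and is placed in the BOINC data directory; device number 0 is only an example, and restarting the client is probably the safest way to make sure the setting is picked up.

```xml
<cc_config>
  <options>
    <!-- Assumption: 0 is an example CUDA device number; use the number of
         the GPU you want BOINC to ignore. The setting applies to all
         projects, not just GPUGRID. -->
    <ignore_cuda_dev>0</ignore_cuda_dev>
  </options>
</cc_config>
```

Note that this takes the ignored GPU out of use entirely, so it does not give the "one GPUGRID task plus one other task" split asked about in the first post.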
Thanks for the help... I would like to use both GPUs though. Is there a way to | |
ID: 16185 | Rating: 0 | |
Anyone have an answer to this???? | |
ID: 16191 | Rating: 0 | |
About all you can do is set your cache to zero, but even then I think it will pick up a new WU when it's close to finishing the one that's running. If it was me, I'd get the card fixed or replaced. Given that nvidia have admitted their driver was faulty, you'd have a pretty good chance of getting them to wear the cost or compensate you in some way. Is the thing still under warranty? A lot of them come with 2 or more years now. | |
ID: 16192 | Rating: 0 | |
I submitted a ticket to XFX since the card is supposed to have a double lifetime warranty. We'll see what happens. | |
ID: 16196 | Rating: 0 | |
You could also try upgrading to the latest BOINC client, 6.10.43. | |
ID: 16198 | Rating: 0 | |
Quoted: "I thought of something yesterday. I don't quite know how the 'switch between tasks' setting works exactly, but I set it to 9999 for GPUGRID and also to 9999 for the other project that runs on my GPU. If I'm running one of each, it should not try to switch for that many minutes, right? Then when it does, they will both switch around the same time, and what I'm guessing will happen is that project A on core 0 and project B on core 1 will get switched to project A on core 1 and project B on core 0. I don't know, it's just a theory. What do you think? If it works, it seems only one of each will run, unless the times overlap somehow. How can I start the 9999 timers at the same time so there's no overlap?" The "switch between tasks" setting is used for CPU tasks. If BOINC needs to share the CPU between projects, it swaps tasks based upon this time. The default is 60 minutes, which means that a task has to run for an hour before BOINC can swap it out. GPU tasks run from beginning to end; they don't normally get swapped out. | |
ID: 16199 | Rating: 0 | |
Sorry, but I have to say that on my system GPU tasks don't run to completion. I had one Collatz task that ran for 16 of its 45 minutes, and then BOINC switched from running one task of each project to running 2 GPUGRID tasks. Maybe something isn't right because it does this? | |
ID: 16224 | Rating: 0 | |
GPU tasks still have to use the CPU to some extent. Perhaps that explains it: the default switch interval is 60 min for the CPU, and a GPUGrid task could use more than an hour of CPU time. I think I saw the same thing in the past, with MW tasks starting mid-run through a GPUGrid task (I don't much care for MW or Aqua, and the others have never got a look in). If you have a GTX275, it is likely to be able to finish a task in less than 1 hour of CPU time, but lots of other cards are not so fast. | |
ID: 16225 | Rating: 0 | |