Advanced search

Message boards : Graphics cards (GPUs) : Client Error / Compute error

Author Message
Profile Sabroe_SMC
Send message
Joined: 30 Aug 08
Posts: 24
Credit: 500,287,085
RAC: 31,523
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 7850 - Posted: 25 Mar 2009 | 21:21:02 UTC

Since end of February I have many calculation errors (more than 50%).
The error code is always:
core_client_version>6.6.11</core_client_version>
<![CDATA[
<message>
Unzul�ssige Funktion. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# Using CUDA device 0
# Device 0: "GeForce GTX 280"
# Clock rate: 1502000 kilohertz
# Total amount of global memory: 1073741824 bytes
# Number of multiprocessors: 30
# Number of cores: 240
MDIO ERROR: cannot open file "restart.coor"
Cuda error: Kernel [shake_step_1] failed in file 'shake.cu' in line 79 : unknown error.

</stderr_txt>
]]>

I´m using:
Processor: 4 GenuineIntel Intel(R) Core(TM)2 Quad CPU Q9550 @ 2.83GHz
Processor features: fpu tsc pae nx sse sse2 pni
Microsoft Windows Vista: Ultimate x64 Editon, Service Pack 1, (06.00.6001.00)
Memory: 4.00 GB physical, 8.17 GB virtual
Disk: 931.50 GB total, 652.09 GB free
CUDA device: GeForce GTX 280 (driver version 18208, CUDA version 1.3, 1024MB, est. 111GFLOPS)

Updating of nvidia-driver was without any luck.

Any Ideas??
Thanks in advance.

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 7885 - Posted: 26 Mar 2009 | 21:50:38 UTC - in response to Message 7850.

Your GPU shader clock is 1.50 GHz, whereas the stock setting is 1.30 GHz. Lower your shader clock to maybe 1.40 GHz and, if the core is OC'ed as well, lower that one by ~50 MHz and see if it helps.

MrS
____________
Scanning for our furry friends since Jan 2002

[B^S] HenryHunter
Send message
Joined: 27 Dec 08
Posts: 4
Credit: 6,055,773
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 7951 - Posted: 29 Mar 2009 | 4:32:22 UTC

All WUs during the last week ended with:
<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
Unzul�ssige Funktion. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# Using CUDA device 0
# Device 0: "GeForce 9500 GS"
# Clock rate: 1375000 kilohertz
# Total amount of global memory: 536870912 bytes
# Number of multiprocessors: 4
# Number of cores: 32
MDIO ERROR: cannot open file "restart.coor"
# Using CUDA device 0
# Device 0: "GeForce 9500 GS"
# Clock rate: 1375000 kilohertz
# Total amount of global memory: 536870912 bytes
# Number of multiprocessors: 4
# Number of cores: 32
# Using CUDA device 0
# Device 0: "GeForce 9500 GS"
# Clock rate: 1375000 kilohertz
# Total amount of global memory: 536870912 bytes
# Number of multiprocessors: 4
# Number of cores: 32
Cuda error: Kernel [mshake_position] failed in file 'mshake.cu' in line 101 : unknown error.



ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 7965 - Posted: 29 Mar 2009 | 19:02:17 UTC - in response to Message 7951.

Did you change anything? Did you reboot the machine and/or power it off for >10 mins with removed power cord?

MrS
____________
Scanning for our furry friends since Jan 2002

[B^S] HenryHunter
Send message
Joined: 27 Dec 08
Posts: 4
Credit: 6,055,773
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 7986 - Posted: 30 Mar 2009 | 2:17:26 UTC - in response to Message 7965.

Yes i even waited a couple of hours to restart

1. new WU:
Name Ok12252-SH2_US_8-0-10-SH2_US_8680000_0
Workunit 344487
Created 29 Mar 2009 18:29:35 UTC
Sent 30 Mar 2009 0:17:44 UTC
Received 30 Mar 2009 2:09:26 UTC
Server state Over
Outcome Client error
Client state Compute error
Exit status 1 (0x1)
Computer ID 20839
Report deadline 4 Apr 2009 0:17:44 UTC
CPU time 90.57418
stderr out

<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
Unzul�ssige Funktion. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# Using CUDA device 0
# Device 0: "GeForce 9500 GS"
# Clock rate: 1375000 kilohertz
# Total amount of global memory: 536870912 bytes
# Number of multiprocessors: 4
# Number of cores: 32
MDIO ERROR: cannot open file "restart.coor"
Cuda error: Kernel [copy_mul] failed in file 'com.cu' in line 45 : unknown error.

</stderr_txt>
]]>

Validate state Invalid
Claimed credit 3098.73148148148
Granted credit 0
application version 6.62

Will buy a new graphic card LOL

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 7998 - Posted: 30 Mar 2009 | 20:23:00 UTC - in response to Message 7986.

Will buy a new graphic card LOL


Well, if that's funny for you go ahead ;)

Otherwise there are a couple of things to check:
- did you change anything, hard- or software wise?
- did the GPU fan fail? What's the GPU temperature under GPU-Grid? (Everest or GPU-Z can read that out)
- does it help if you underclock a bit? (~100 MHz shader and ~50MHz GPU core)
- can it still run 3D Mark in a loop for >1 hour?

If you want to get a faster card anyway this won't matter much. But if you want to know if you can throw the old card away or reuse it in an office computer it might be good to invest some time into checking it.

MrS
____________
Scanning for our furry friends since Jan 2002

Post to thread

Message boards : Graphics cards (GPUs) : Client Error / Compute error

//