Advanced search

Message boards : Number crunching : WU Computing errors?

Author Message
Profile dskagcommunity
Avatar
Send message
Joined: 28 Apr 11
Posts: 456
Credit: 817,865,789
RAC: 0
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 21200 - Posted: 17 May 2011 | 6:18:01 UTC

Good morning (here in austria its 8:16 ;)) to all!

I got a problem now with gpugrid? Is something changed on standart WU´s? Got only Computing errors since some days on XP SP3 with 9800GTX. Dont know after what time cos this computer compute them without any human attention ^^

ErrorMSG from last WU Task:

<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
- exit code 98 (0x62)
</message>
<stderr_txt>
# Using device 0
# There is 1 device supporting CUDA
# Device 0: "GeForce 9800 GTX/9800 GTX+"
# Clock rate: 1.84 GHz
# Total amount of global memory: 536543232 bytes
# Number of multiprocessors: 16
# Number of cores: 128
MDIO ERROR: cannot open file "restart.coor"
ERROR: file c:\cygwin\home\speechserver\gpumd2_c\src\pme\CPME_cufft.cpp line 106: cufftExecC2C (gridCalc2.2)
called boinc_finish

</stderr_txt>
]]>


____________
DSKAG Austria Research Team: http://www.research.dskag.at



Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 21202 - Posted: 17 May 2011 | 9:33:01 UTC - in response to Message 21200.

Your GeForce 9800 GTX/9800 GTX+ (511MB) is no longer a recommended card here. That said, it was completing tasks successfully up until the last 2, so it looks like some sort of change somewhere caused the cuda fft errors (more common on CC1.1 cards). The same task types ran and then failed (KASHIF_HIVPR, TONI_KKAL), so if something change it must be universal.

An announcement of intended app changes was recently made, but I am not aware of it's implementation (might have expected a new app number). Perhaps some tasks are being tested for the new app?

There is very little we crunchers can do about such problems.
I would suggest a system shutdown and cold start (shutdown, leave the system off for a minute and then start-up).

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,861,436,851
RAC: 10,008,829
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 21206 - Posted: 17 May 2011 | 17:33:23 UTC - in response to Message 21202.
Last modified: 17 May 2011 | 17:33:44 UTC

An announcement of intended app changes was recently made, but I am not aware of it's implementation (might have expected a new app number). Perhaps some tasks are being tested for the new app?

If there are any new apps, they will appear on the applications page (linked from the very bottom of the home page) first - and there's nothing there yet.

Profile dskagcommunity
Avatar
Send message
Joined: 28 Apr 11
Posts: 456
Credit: 817,865,789
RAC: 0
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 21210 - Posted: 18 May 2011 | 14:19:22 UTC
Last modified: 18 May 2011 | 14:20:23 UTC

Ah ok thx.

Think it was a slowly dying Mainboard. I reset project, shut the Computer down (and this was the last thing this computer did in his life) waiting the recommend minute turn it back on and nothing happens only cpu&GPU cooling running. After some component changes, diagnosis: Mainboard failure :/

PS: Einstein was running clearly..but ok its not that efficient and dont use 99% GPU Power ^^
____________
DSKAG Austria Research Team: http://www.research.dskag.at



Post to thread

Message boards : Number crunching : WU Computing errors?

//