Message boards : Number crunching : Too many errors (may have bug)
Author | Message |
---|---|
My task, g240-TONI_CAPBIND99SB-48-200-RND7652, and other TONI WUs have the same bug. | |
ID: 18705 | Rating: 0 | |
I aborted 2 in the last week that seemed to run forever. | |
ID: 18716 | Rating: 0 | |
I have been seeing a raft of computation errors on several systems, mostly running Windows XP with 9800GT cards. This seems to be a return of a problem that plagued the GPUGrid applications for me a year ago. It has affected ALL of my systems running that combination, and I have NOT encountered similar problems with three other BOINC projects that use the same GPU on the same systems (SETI, Dnetc, Collatz). So rather than simply shrug and try, try again, I am backing off of GPUGrid for now. | |
ID: 18728 | Rating: 0 | |
I'm getting over a 50% error rate across several quads. The majority of the cards are GTS-250s, but there is a GTX-275 and a couple of 8800GTs thrown in as well. The same machines also run either DNETC or Collatz without issues. | |
ID: 18731 | Rating: 0 | |
As I noted before, the normal scenario here when problems like this crop up is that there is a real limit to the amount of response we can expect... I'm getting over a 50% error rate across several quads. The majority of the cards are GTS-250s, but there is a GTX-275 and a couple of 8800GTs thrown in as well. The same machines also run either DNETC or Collatz without issues. | |
ID: 18742 | Rating: 0 | |
I just aborted another one that ran on too long. Between the problems others are having and the ones I am experiencing, I am pulling out until the bad WUs clear the catch. | |
ID: 18744 | Rating: 0 | |
Bill, you aborted a long task about 60% through its run. | |
ID: 18745 | Rating: 0 | |
Sorry, I already moved off; I'll be back later. Trying to run SETI since 1999 has made me gun shy (crazy). So many projects, so little time. Thanks for the response. I generally get criticism on other sites, so I don't post. | |
ID: 18747 | Rating: 0 | |
I got an error on a TONI_CAPBIND as well, like many others. I can't see the original error in this thread any longer, as the WU is already purged, but here is my stderr: <core_client_version>6.10.17</core_client_version> <![CDATA[ <message> process exited with code 98 (0x62, -158) </message> <stderr_txt> # There is 1 device supporting CUDA # Device 0: "GeForce GT 240" # Clock rate: 1.34 GHz # Total amount of global memory: 536150016 bytes # Number of multiprocessors: 12 # Number of cores: 96 MDIO ERROR: read error for file "input.coor", byte number 0: expected to read number of atoms ERROR: file mdioload.cpp line 80: Unable to read bincoordfile 14:58:03 (10049): called boinc_finish </stderr_txt> ]]> It failed after 2 seconds, so no real harm was done, except that a CASHIF_HIVPR is now running, so far without problems. I hope it won't be affected, as I can't restart the computer without trashing a checkpointless RNA-world WU after 24h, and I don't want to do that ;) If this is a problem with the work generator (some input file not generated properly), perhaps it should be looked into. ____________ Greetings from Saenger. For questions about BOINC, look in the BOINC-Wiki | |
ID: 18818 | Rating: 0 | |
Found some errors <core_client_version>6.10.58</core_client_version> | |
ID: 18919 | Rating: 0 | |
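
For anyone comparing failures across hosts, the stderr Saenger posted in this thread can be boiled down to a few fields. The following is only a minimal sketch: `summarize_stderr` is a hypothetical helper, the sample is an abridged copy of that one log, and the regular expressions are assumptions based on that single log rather than on any documented BOINC or GPUGrid output format.

```python
import re

# Abridged copy of the stderr quoted in the thread, flattened to one line as posted.
SAMPLE = (
    '<core_client_version>6.10.17</core_client_version> <![CDATA[ <message> '
    'process exited with code 98 (0x62, -158) </message> <stderr_txt> '
    '# There is 1 device supporting CUDA # Device 0: "GeForce GT 240" '
    'MDIO ERROR: read error for file "input.coor", byte number 0: '
    'expected to read number of atoms '
    'ERROR: file mdioload.cpp line 80: Unable to read bincoordfile '
    '14:58:03 (10049): called boinc_finish </stderr_txt> ]]>'
)


def summarize_stderr(text):
    """Extract a few fields from a task report (hypothetical helper).

    The patterns are guesses derived from the one log in this thread;
    any field that does not match is returned as None.
    """
    version = re.search(r"<core_client_version>([^<]+)</core_client_version>", text)
    exit_code = re.search(r"process exited with code (\d+)", text)
    device = re.search(r'# Device \d+: "([^"]+)"', text)
    read_error = re.search(r'read error for file "([^"]+)", byte number (\d+)', text)
    return {
        "client_version": version.group(1) if version else None,
        "exit_code": int(exit_code.group(1)) if exit_code else None,
        "device": device.group(1) if device else None,
        "failed_file": read_error.group(1) if read_error else None,
        "failed_byte": int(read_error.group(2)) if read_error else None,
    }


summary = summarize_stderr(SAMPLE)
print(summary)
```

A read failure at byte 0 of `input.coor` would be consistent with the suggestion that an input file was not generated properly, but the log alone does not confirm the cause.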