Message boards : Number crunching : What am I doing wrong
Author | Message |
---|---|
I now have about 15 WU that spontaneously abort at 1 Hr. | |
ID: 40296 | Rating: 0 | rate: / Reply Quote | |
(I don't see an "edit" option) | |
ID: 40297 | Rating: 0 | rate: / Reply Quote | |
They all say for a status "Abandoned". That's so weird! | |
ID: 40300 | Rating: 0 | rate: / Reply Quote | |
I see that your computer's details show: | |
ID: 40301 | Rating: 0 | rate: / Reply Quote | |
Dayle Diamond wrote: They all say for a status "Abandoned". That's so weird! The root of this phenomenon should be some kind of BOINC work folder access rights problem. Are there more than one user on this PC? Is the BOINC installed as a system service (protected execution mode)? Jacob Klein wrote: I see that your computer's details show: That's true, but it still has to work under Windows 7 x64. Until recently, I've used 6.10.60 on my hosts. The only reason for the update was to have such spare projects which are using OpenCL. Jacob Klein wrote: Please try the latest release, BOINC v7.4.36. Updating to this version is still a good idea, as this will update the folder access rights. | |
ID: 40307 | Rating: 0 | rate: / Reply Quote | |
No, 'Abandoned' is a server-only phenomenon - the tasks are marked thus in the server database record, but as the OP stated in the first post, the BOINC client locally knows nothing about this, and carries on processing - there are no permission problems locally. | |
ID: 40308 | Rating: 0 | rate: / Reply Quote | |
Well ... I have no idea ... but I removed the project and reconnected ... | |
ID: 40309 | Rating: 0 | rate: / Reply Quote | |
As to why the server is marking the tasks as abandoned - nobody really knows, and I'd appreciate more help in tracking it down. It's done by the function mark_results_over(), which is called in two places in sched/handle_request.cpp (and nowhere else). It's supposed to happen "when there's evidence that the host has detached.", or "If the [RPC] seqno from the host is less than what we expect, the user must have copied the state file to a different host". But it seems to happen more than that, and the finger of suspicion seems to point at communication problems between host and server resulting in RPC requests being processed out of order on the server. It happened on one of my dual boot (WinXP/Win7) hosts, when I've tried to make the BOINC manager use the same working folder on both OSes. I've succeeded to do it on my other similar host by setting the proper access rights for the BOINC work folder (which is located on the Win7's partition on this host), but on the first host the ongoing GPUGrid workunits gets abandoned, whenever I boot to Win7 (the BOINC working folder is located on the WinXP's partition on this host). | |
ID: 40311 | Rating: 0 | rate: / Reply Quote | |
Message boards : Number crunching : What am I doing wrong