Advanced search

Message boards : Graphics cards (GPUs) : Not gettting work, how to see LTD?

Author Message
Jeremy
Send message
Joined: 15 Feb 09
Posts: 55
Credit: 3,542,733
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 7033 - Posted: 27 Feb 2009 | 18:19:51 UTC

Upgraded to BOINC 6.6.11 this morning, seems to be working well. However, I'm getting errors when GPUgrid requests work.

GPUGRID 2/27/09 12:16:34 PM Requesting new tasks
GPUGRID 2/27/09 12:16:34 PM Message from server: No work sent
GPUGRID 2/27/09 12:16:34 PM Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
GPUGRID 2/27/09 12:16:34 PM Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
GPUGRID 2/27/09 12:16:34 PM Message from server: Full-atom molecular dynamics is not available for your type of computer.

SETI@home continues to receive and process cuda tasks properly. When I wasn't getting new tasks from GPUGRID yesterday, I figured it was a LTD issue. There were no errors like these under 6.6.10. On another, semi-related note, is there a way to see the "debt" of various projects? I participate in 8 projects, so sometimes it can be difficult to determine if a project isn't requesting work due to an error or because LTD is working as it's supposed to.

Profile K1atOdessa
Send message
Joined: 25 Feb 08
Posts: 249
Credit: 370,320,941
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 7034 - Posted: 27 Feb 2009 | 19:05:35 UTC - in response to Message 7033.

Try BOINC 6.5.0 -- that's what I am running and many other users have reported it works the best for cpu and gpu projects right now. I checked the Server and there are 100+ GPU WU's ready to be sent out.

Profile Dieter Matuschek
Avatar
Send message
Joined: 28 Dec 08
Posts: 58
Credit: 231,884,297
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 7035 - Posted: 27 Feb 2009 | 19:27:20 UTC - in response to Message 7033.

... is there a way to see the "debt" of various projects?

Find file 'client_state.xml' and search therein the tag <long_term_debt>.

____________

[B^S] Gamma^Ray
Send message
Joined: 24 Aug 07
Posts: 7
Credit: 1,354,047
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwatwat
Message 7039 - Posted: 28 Feb 2009 | 2:27:47 UTC

Well Im having the same issue as of lately too. Keep getting the same message listed by Jeremy. Was running 6.6.11 64 bit so dropped down to 6.4.6 64 bit to see if that would help but it didnt. Even tried detaching and reattaching but still getting the same message. As of right now there are 233 work units ready to be sent and yet I havent run one (Or gotten one) in the past day or so.

GR
____________

Michael Doerner
Send message
Joined: 28 Feb 09
Posts: 37
Credit: 666,889
RAC: 0
Level
Gly
Scientific publications
watwatwatwat
Message 7040 - Posted: 28 Feb 2009 | 3:46:05 UTC
Last modified: 28 Feb 2009 | 3:48:08 UTC

Hello,

I'm a newbie from Enigma@Home. I'm running BOINC 6.6.11 on Linux X86_64 and I am getting the same messages. I have no issues like this with Enigma@Home. What's the workaround?

Mike Doerner

Fri 27 Feb 2009 10:36:46 PM EST|GPUGRID|Requesting new tasks
Fri 27 Feb 2009 10:36:51 PM EST|GPUGRID|Scheduler request completed: got 0 new tasks
Fri 27 Feb 2009 10:36:51 PM EST|GPUGRID|Message from server: Not sending work - last request too recent: 11 sec
Fri 27 Feb 2009 10:39:22 PM EST|Enigma@Home|Sending scheduler request: Requested by user.
Fri 27 Feb 2009 10:39:22 PM EST|Enigma@Home|Reporting 1 completed tasks, not requesting new tasks
Fri 27 Feb 2009 10:39:27 PM EST|Enigma@Home|Scheduler request completed: got 0 new tasks
Fri 27 Feb 2009 10:44:13 PM EST|GPUGRID|Sending scheduler request: Requested by user.
Fri 27 Feb 2009 10:44:13 PM EST|GPUGRID|Requesting new tasks
Fri 27 Feb 2009 10:44:18 PM EST|GPUGRID|Scheduler request completed: got 0 new tasks
Fri 27 Feb 2009 10:44:18 PM EST|GPUGRID|Message from server: No work sent
Fri 27 Feb 2009 10:44:18 PM EST|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
Fri 27 Feb 2009 10:44:18 PM EST|GPUGRID|Message from server: Full-atom molecular dynamics is not available for your type of computer.
Fri 27 Feb 2009 10:44:18 PM EST|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
Fri 27 Feb 2009 10:44:23 PM EST|GPUGRID|Sending scheduler request: To fetch work.
Fri 27 Feb 2009 10:44:23 PM EST|GPUGRID|Requesting new tasks
Fri 27 Feb 2009 10:44:28 PM EST|GPUGRID|Scheduler request completed: got 0 new tasks
Fri 27 Feb 2009 10:44:28 PM EST|GPUGRID|Message from server: Not sending work - last request too recent: 10 sec
Fri 27 Feb 2009 10:46:30 PM EST|Enigma@Home|Sending scheduler request: To fetch work.
Fri 27 Feb 2009 10:46:30 PM EST|Enigma@Home|Requesting new tasks
Fri 27 Feb 2009 10:46:40 PM EST|Enigma@Home|Scheduler request completed: got 0 new tasks

Michael Doerner
Send message
Joined: 28 Feb 09
Posts: 37
Credit: 666,889
RAC: 0
Level
Gly
Scientific publications
watwatwatwat
Message 7041 - Posted: 28 Feb 2009 | 4:42:18 UTC - in response to Message 7040.

OK, got it....Upgraded to the true 6.6.11 client (not the 6.6.11 compat version, which I pulled down somehow.) Anyhoo, I have WU's to work with now. L8r.

Mike Doerner

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 7042 - Posted: 28 Feb 2009 | 7:07:54 UTC

With 6.5.0 and later (maybe some version earlier) you can look at the internals of a project by clicking on the projects tab, then the project name, then the "Properties" button. That gives a pop-up window with most of the data.

I have suggested a cross-tab view as being potentially more usable in that if you have a fair number of projects that is a lot of clicking and looking...

MarkJ
Volunteer moderator
Volunteer tester
Send message
Joined: 24 Dec 08
Posts: 738
Credit: 200,909,904
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 7045 - Posted: 28 Feb 2009 | 10:05:21 UTC - in response to Message 7042.
Last modified: 28 Feb 2009 | 10:08:05 UTC

With 6.5.0 and later (maybe some version earlier) you can look at the internals of a project by clicking on the projects tab, then the project name, then the "Properties" button. That gives a pop-up window with most of the data.

I have suggested a cross-tab view as being potentially more usable in that if you have a fair number of projects that is a lot of clicking and looking...


BOINCview can also display STD (short term debt) and LTD (long term debt) on the projects tab (yes even for 6.6.11 clients). They could do something similar in BOINCmgr.

In BOINCmgr 6.6.11 in Advanced view you can go to the Projects tab, click on a project and click the Properties button. It displays the scheduling priority, work fetch priority, the DCF and some other stats. STD seems to = scheduling priority. LTD seems to = work fetch priority.
____________
BOINC blog

Jeremy
Send message
Joined: 15 Feb 09
Posts: 55
Credit: 3,542,733
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 7061 - Posted: 1 Mar 2009 | 5:18:23 UTC - in response to Message 7045.

Installed 6.5.0 today. All SEEMED to be going well until I checked the message log just now.

GPUGRID 3/1/2009 12:02:59 AM Sending scheduler request: To fetch work. Requesting 4919 seconds of work, reporting 0 completed tasks.
GPUGRID 3/1/2009 12:03:04 AM Scheduler request completed: got 0 new tasks
GPUGRID 3/1/2009 12:03:04 AM Message from server: No work sent
GPUGRID 3/1/2009 12:03:04 AM Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
GPUGRID 3/1/2009 12:03:04 AM Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
GPUGRID 3/1/2009 12:03:04 AM Message from server: (won't finish in time) BOINC runs 67.1% of time, computation enabled 93.8% of that

Now, I was screwing around with the clocks on my system for a lot of the day, but I've never seen that last particular message before now. It looks like BOINC is requesting work from GPUgrid about once every 5 minutes, but the server is refusing to dole it out. Any ideas why?

[B^S] Gamma^Ray
Send message
Joined: 24 Aug 07
Posts: 7
Credit: 1,354,047
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwatwat
Message 7062 - Posted: 1 Mar 2009 | 7:02:03 UTC

I'm at a loss whats causing the error or message, I've dropped down to a earlier version of the client, back up to the newest, detached-reattached, even deleted boinc completely and reinstalled fresh and yet just get this message...

3/1/2009 12:54:12 AM GPUGRID Message from server: No work sent
3/1/2009 12:54:12 AM GPUGRID Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
3/1/2009 12:54:12 AM GPUGRID Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.

My type of PC ?

Running Q4 intel, with windows 64bit ??

____________

Ulf Ohlsson
Send message
Joined: 1 Jan 09
Posts: 20
Credit: 616,384
RAC: 0
Level
Gly
Scientific publications
watwatwat
Message 7067 - Posted: 1 Mar 2009 | 9:44:11 UTC - in response to Message 7033.
Last modified: 1 Mar 2009 | 9:48:20 UTC

a fast look at the server status page tells theres no work avilable for nvidia gpu

But this is confusing with 1 wu running and 1 wu in queue :

03/01/09 10:44:10|GPUGRID|Sending scheduler request: Requested by user. Requesting 1371478 seconds of work, reporting 0 completed tasks
03/01/09 10:44:15|GPUGRID|Scheduler request completed: got 0 new tasks
03/01/09 10:44:15|GPUGRID|Message from server: No work sent
03/01/09 10:44:15|GPUGRID|Message from server: (reached per-CPU limit of 1 tasks)
03/01/09 10:44:15|GPUGRID|Message from server: (Project has no jobs available)

Boinc 6.5.0 windows x86_64
____________

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 7071 - Posted: 1 Mar 2009 | 10:33:50 UTC

@Jeremy

The server looked at the run times and the time it takes for you to process a task and decided that you could not make the deadlines based on the work it had available. That is what the last message is telling you.

The message says that you only have your computer on two thirds of the day and only run BOINC 93% of THAT time, leaving not enough hours in the day to do the work. Messages like that can be looked up in the UBW where just about every message (as of two years ago) was documented as best as was known. A good place to start when you have a question like that...

If you have run off all your work you could try a project reset, but I think the only "cure" would be to leave the system on longer or get a faster GPU ... or take your chances with trying to catch the shorter tasks as they are available.

@Gama^Ray

Your GPU ... GPU Grid has a "bad habit" of running to that message when it runs out of work ... as we are seeing at Milky Way, at times the numbers on the queue do not always comport with what the scheduler "sees" at the time. Because of the way the work flows, completed tasks feeding forward to create new task (shades of Milky Way again) the queue tends to run "light" with I think the highest number i have ever seen being about 300 ... The only redeeming news is that the back-off should be less and you should be able to get work ... hopefully before you run out ...

@Ulf Ohlsson

More of less the same as what I said above to Gamma^Ray ... only the scheduler knows for sure why it emits different messages for basically the same situation ...

The only good news is that there seems to be work listed again ... hopefully you an GR got some ...

Ulf Ohlsson
Send message
Joined: 1 Jan 09
Posts: 20
Credit: 616,384
RAC: 0
Level
Gly
Scientific publications
watwatwat
Message 7078 - Posted: 1 Mar 2009 | 19:03:44 UTC - in response to Message 7071.
Last modified: 1 Mar 2009 | 19:04:34 UTC

This not answering my question why I not getting more then one new wu when requesting work for 8 days buffer.

03/01/09 16:42:52|GPUGRID|Sending scheduler request: Requested by user. Requesting 1368180 seconds of work, reporting 0 completed tasks
03/01/09 16:42:57|GPUGRID|Scheduler request completed: got 0 new tasks
03/01/09 16:42:57|GPUGRID|Message from server: No work sent
03/01/09 16:42:57|GPUGRID|Message from server: (reached per-CPU limit of 1 tasks)
03/01/09 16:42:57|GPUGRID|Message from server: (Project has no jobs available)

everytime I get new WU's I only get one new


@Gama^Ray

Your GPU ... GPU Grid has a "bad habit" of running to that message when it runs out of work ... as we are seeing at Milky Way, at times the numbers on the queue do not always comport with what the scheduler "sees" at the time. Because of the way the work flows, completed tasks feeding forward to create new task (shades of Milky Way again) the queue tends to run "light" with I think the highest number i have ever seen being about 300 ... The only redeeming news is that the back-off should be less and you should be able to get work ... hopefully before you run out ...

@Ulf Ohlsson

More of less the same as what I said above to Gamma^Ray ... only the scheduler knows for sure why it emits different messages for basically the same situation ...

The only good news is that there seems to be work listed again ... hopefully you an GR got some ...[/quote]
____________

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 7082 - Posted: 1 Mar 2009 | 19:49:23 UTC - in response to Message 7078.

This not answering my question why I not getting more then one new wu when requesting work for 8 days buffer.

03/01/09 16:42:57|GPUGRID|Message from server: (reached per-CPU limit of 1 tasks)


You've got a dual core CPU and the server tells you that there is a limit for the number of concurrent WUs, 1 per CPU (core). You've reached that with 2 WUs, so you've got to send one result back to get more WUs. This behaviour is not ideal, but it's the setting the project team choose.

MrS
____________
Scanning for our furry friends since Jan 2002

Ulf Ohlsson
Send message
Joined: 1 Jan 09
Posts: 20
Credit: 616,384
RAC: 0
Level
Gly
Scientific publications
watwatwat
Message 7083 - Posted: 1 Mar 2009 | 20:11:16 UTC - in response to Message 7082.

so with the ongoing errors with the boinc client i have to pause 2 other projects wich are running on pure cpu-core everytime i send request to fetch new work for gpugrid, this is not a ideal situation for me with CUDA running on at most 0,2 CPU there would be no problem to fetch work to fill for at least 5 days workbuffer

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 7085 - Posted: 1 Mar 2009 | 20:50:56 UTC - in response to Message 7083.

Not sure what you mean, but it sounds like you got something wrong.

For you there is a limit of 2 WUs, which you can have at any time, no matter how much you jump up and down or what ever you set in BOINC manager.

MrS
____________
Scanning for our furry friends since Jan 2002

Scott Brown
Send message
Joined: 21 Oct 08
Posts: 144
Credit: 2,973,555
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwat
Message 7087 - Posted: 1 Mar 2009 | 21:02:56 UTC - in response to Message 7078.

This is the key line for understanding why you are not getting work


03/01/09 16:42:57|GPUGRID|Message from server: (reached per-CPU limit of 1 tasks)


The number of tasks is tied to the number of CPU's (or more specifically number of cores), which in your case is two. Unlike CPU-based projects, increasing cache, increasing resource share, etc. will have no effect on number of workunits received at GPUGRID. This is set at the server.

Thus, if you are crunching one task with another waiting, when you complete the first task you will only be able to get a single replacement. If you were to reset the project, you would download two workunits.

Hopefully, this will change in the future to an allocation of work based on number and speed of GPU's, but my guess is that we are some ways off (at least a few months) from this being a stable reality.




Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 7091 - Posted: 1 Mar 2009 | 22:50:47 UTC - in response to Message 7083.

so with the ongoing errors with the boinc client i have to pause 2 other projects wich are running on pure cpu-core everytime i send request to fetch new work for gpugrid, this is not a ideal situation for me with CUDA running on at most 0,2 CPU there would be no problem to fetch work to fill for at least 5 days workbuffer


Now I have the feeling that there are actually two questions. The first has been answered by Scott and ETA that you will only have a limited number of tasks on hand regardless of the settings of BOINC on your client. On my 8 CPU system I have 4 in flight and 4 pending most of the time. On occasion, I will turn in a couple and be down to one or two pending until I get another request to fill my queue. Again, this is a server side setting that is not likely to change (as previously stated) anytime soon.

And yes, the project is well aware that we the people are unhappy with it ... :)

BUt, the task you complete is used to generate another task, that when complete is used to generate another task ... on and on ...

The second question it seems to me is that you are having to "fiddle" with the system to even keep this happy situation. This *IS* an artifact of the BOINC Manager and there are a couple things you can try. In your case with a dual CPU system you need to set the resource share of GPU Grid so that it is above 50% ... this will make BOINC Manager think that GPU Grid is "starved" and, with version 6.5.0 will tend to automatically fetch work for you.

Since the BOINC Manager works on a proportional basis, all you have to do is make suer that the shares for your other projects are set accordingly ... if you want 50/50 on two other projects, then set a share for them to be 25% each ... and so on ...

I use a share to set my i7 so that it is 20% of my resources ...

This unhappy situation is, or should be, fixed in the upcoming 6.7.x release of BOINC manager where the CUDA resources should be "properly" managed separately from the CPU Projects.

[B^S] Gamma^Ray
Send message
Joined: 24 Aug 07
Posts: 7
Credit: 1,354,047
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwatwat
Message 7094 - Posted: 2 Mar 2009 | 2:29:09 UTC

Well, I finially tried removing the Cuda installation I had and reinstalled it from scratch, once done that seem to fix my error as I picked up a work unit right after. :)

GR
____________

Post to thread

Message boards : Graphics cards (GPUs) : Not gettting work, how to see LTD?

//