Advanced search

Message boards : Graphics cards (GPUs) : A bit overeager scheduler

Author Message
Profile Saenger
Avatar
Send message
Joined: 20 Jul 08
Posts: 134
Credit: 23,657,183
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwat
Message 1363 - Posted: 26 Jul 2008 | 13:59:31 UTC

I got 4 new WUs sitting on my puter, all with quite wrong duration estimates (far too short) and a very short deadline: 29 Jul 2008 18:36:19 UTC +/- an hour. They will probably take quite some time into august to complete, should I rather kill them prematurely or wait how far I get?

One more obstacle is this strange messages I receive from your server:

Sa 26 Jul 2008 14:18:32 CEST|PS3GRID|Sending scheduler request: To fetch work. Requesting 1470 seconds of work, reporting 0 completed tasks
Sa 26 Jul 2008 14:18:37 CEST|PS3GRID|Scheduler request completed: got 0 new tasks
Sa 26 Jul 2008 14:18:37 CEST|PS3GRID|Message from server: No work sent
Sa 26 Jul 2008 14:18:37 CEST|PS3GRID|Message from server: (reached per-CPU limit of 1 tasks)


Can anyone enlighten me what the number of CPUs has to do with GPU crunching? I need one per GPU, and as I only have one those four will never run in parallel (or at least I think so).

I've set the project to NNW and will manage it by hand for the time being, it's alpha, so such things are expected and manual handling is one of the fun things in alpha ;)
____________
Gruesse vom Saenger

For questions about Boinc look in the BOINC-Wiki

Profile UBT - NaRyan
Avatar
Send message
Joined: 16 Jul 08
Posts: 68
Credit: 1,242,980
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 1367 - Posted: 26 Jul 2008 | 23:07:09 UTC - in response to Message 1363.

I got 4 new WUs sitting on my puter, all with quite wrong duration estimates (far too short) and a very short deadline: 29 Jul 2008 18:36:19 UTC +/- an hour. They will probably take quite some time into august to complete, should I rather kill them prematurely or wait how far I get?

One more obstacle is this strange messages I receive from your server:
Sa 26 Jul 2008 14:18:32 CEST|PS3GRID|Sending scheduler request: To fetch work. Requesting 1470 seconds of work, reporting 0 completed tasks
Sa 26 Jul 2008 14:18:37 CEST|PS3GRID|Scheduler request completed: got 0 new tasks
Sa 26 Jul 2008 14:18:37 CEST|PS3GRID|Message from server: No work sent
Sa 26 Jul 2008 14:18:37 CEST|PS3GRID|Message from server: (reached per-CPU limit of 1 tasks)


Can anyone enlighten me what the number of CPUs has to do with GPU crunching? I need one per GPU, and as I only have one those four will never run in parallel (or at least I think so).

I've set the project to NNW and will manage it by hand for the time being, it's alpha, so such things are expected and manual handling is one of the fun things in alpha ;)


You get 4 days to do each workunit.
The EST time is based on your CPU speed not on your GPU speed, guess that's a Boinc limitation at the moment.
And the project gives you 1 task per cpu/core so on a quad core you can end up with 4 at a time :(
That's ok if you have a card that can do 4 in 4 days, but if not, best to abort the ones you won't do.

What I done when I was running it on my quad (it took about 12 hours 40 minutes per workunit), was let it 2 at most and then set it to no new work. wait till it was near finishing the 2nd of the 2 workunits, and let it get more work.

If you have any workunits with the name FASTTEST in them, you can leave them, as they run fast, for me they took about 47 seconds to complete.

____________

Down with the Kredit Kops!!!

Profile Saenger
Avatar
Send message
Joined: 20 Jul 08
Posts: 134
Credit: 23,657,183
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwat
Message 1368 - Posted: 26 Jul 2008 | 23:28:44 UTC - in response to Message 1367.

You get 4 days to do each workunit.
The EST time is based on your CPU speed not on your GPU speed, guess that's a Boinc limitation at the moment.
And the project gives you 1 task per cpu/core so on a quad core you can end up with 4 at a time :(
That's ok if you have a card that can do 4 in 4 days, but if not, best to abort the ones you won't do.

What I done when I was running it on my quad (it took about 12 hours 40 minutes per workunit), was let it 2 at most and then set it to no new work. wait till it was near finishing the 2nd of the 2 workunits, and let it get more work.

If you have any workunits with the name FASTTEST in them, you can leave them, as they run fast, for me they took about 47 seconds to complete.

The one currently crunching is at 40% and 20h, running in panic mode already. It was estimated with 5h, like two of the others as well, the forth is estimated with 13h. If the estimate <-> reality factor stays it will take 50h for the "short" ones and 130h for the longer one, it won't fit in 4 days ;)

The 1/core should be dropped immediately to 1/host (or 2/host at max), as usually there is only one graphic card in a host, regardless of the number of cores.
____________
Gruesse vom Saenger

For questions about Boinc look in the BOINC-Wiki

Profile UBT - NaRyan
Avatar
Send message
Joined: 16 Jul 08
Posts: 68
Credit: 1,242,980
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 1370 - Posted: 27 Jul 2008 | 1:49:00 UTC - in response to Message 1368.

Yeah I agree the 1 task per host would be a better option, since there is no multi GPU app at the moment, and probably even fewer users who have multi GPU systems. As when I was running it on the quad if I got 4 workunits, Boinc would start to painc, with it thinking each one would take 38 hours :(

Surprised that boinc estimated it at 5 hours for the workunit.
For me it est them at 38 hours for the normal ones and 13 hours for the "shorties"

With the 2 workunits on my 8800GT the AVG time for the 2 of them are 11 hours 44 minutes 50 seconds, for the "normal" length ones, and 42 Seconds for the "shorties"
Of course depending on what type of Nvidia card you have your results are going to be different.

I don't think the shorties are of much or any importance project wise, I think they were just added to try to find out why a lot of multicore (mostly quads) were having problems with an error on a workunit just after 1 finnished.
____________

Down with the Kredit Kops!!!

Profile Saenger
Avatar
Send message
Joined: 20 Jul 08
Posts: 134
Credit: 23,657,183
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwat
Message 1372 - Posted: 27 Jul 2008 | 12:24:57 UTC - in response to Message 1370.

Yeah I agree the 1 task per host would be a better option, since there is no multi GPU app at the moment, and probably even fewer users who have multi GPU systems. As when I was running it on the quad if I got 4 workunits, Boinc would start to painc, with it thinking each one would take 38 hours :(

Surprised that boinc estimated it at 5 hours for the workunit.
For me it est them at 38 hours for the normal ones and 13 hours for the "shorties"

With the 2 workunits on my 8800GT the AVG time for the 2 of them are 11 hours 44 minutes 50 seconds, for the "normal" length ones, and 42 Seconds for the "shorties"
Of course depending on what type of Nvidia card you have your results are going to be different.

I don't think the shorties are of much or any importance project wise, I think they were just added to try to find out why a lot of multicore (mostly quads) were having problems with an error on a workunit just after 1 finnished.

I've got a 8600GT, that's probably some big difference. Here's my system (it's a bit german, but I don't think it's a problem).
The first one took 50h, the current one is planning to take 50h, it's not that big sample, but it's coherent ;)
____________
Gruesse vom Saenger

For questions about Boinc look in the BOINC-Wiki

Profile UBT - NaRyan
Avatar
Send message
Joined: 16 Jul 08
Posts: 68
Credit: 1,242,980
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 1374 - Posted: 27 Jul 2008 | 19:23:40 UTC - in response to Message 1372.

The first one took 50h, the current one is planning to take 50h, it's not that big sample, but it's coherent ;)


For the 23 or so that I have done on the 8800GT the time has been about the same.
Only varied by a few minutes, and that was probably caused by what I was doing on the computer during at that time. (or normal Boinc behavior)

As you can see here, not realy much variation in workunit times, apart from those fast running workunits :)

____________

Down with the Kredit Kops!!!

Post to thread

Message boards : Graphics cards (GPUs) : A bit overeager scheduler

//