Advanced search

Message boards : Number crunching : workload not optimum

Author Message
Jorge Alberto Ramos Olive...
Send message
Joined: 13 Aug 09
Posts: 24
Credit: 156,684,745
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwat
Message 27893 - Posted: 31 Dec 2012 | 23:34:57 UTC

Hello everyone! So here's my problem:

I have noticed that I get a little more than 1 day off full workload from the servers for my 2 GTX 690's (4 tasks at a time), but then get only 2 tasks (half the full potential) for the next day. I have set network preferences as recomended:

Computer is connected to the Internet about every 0 days
Leave blank or 0 if always connected.
BOINC will try to maintain at least this much work (max 10 days).
Additional work buffer 0.05 days


How can I get 4 tasks at all times?

thanks!

Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 27901 - Posted: 1 Jan 2013 | 15:28:16 UTC - in response to Message 27893.
Last modified: 1 Jan 2013 | 15:32:03 UTC

Going by the descriptions I think you are using old Boinc software.
I suggest you upgrade Boinc to 7.0.28, and also set your preferences to 0.05/0.05 or slightly higher (0.10).

If you don't already have <use_all_gpus>1</use_all_gpus> set in a cc_config file, do so,
FAQ - Best configurations for GPUGRID
____________
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help

Jorge Alberto Ramos Olive...
Send message
Joined: 13 Aug 09
Posts: 24
Credit: 156,684,745
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwat
Message 27905 - Posted: 1 Jan 2013 | 19:57:31 UTC

Sorry, I should have told that info before. I am running the latest BOINC client (7.0.28) and also have use_all_gpus flag to 1 in config file. I actually have "5" GPUs: the equivalent of 4 GTX 680 (two GTX 690, non-SLI) and 1 640 GT. I have also set GPUGrid to use only the four 680's (the two 690) by setting

<exclude_gpu>
<url>http://www.gpugrid.net/</url>
[<device_num>1</device_num>]
</exclude_gpu>


in the config file. All goes well, GPUGrid never utilizes the GT 640 (device 1 in my system) and the 4 remaining GPUs are used. Nevertheless, it uses the two 690s for about a day, and then "rests" for another day, using only 2 GPU's (I mean, 1 GPU in 1 690 and 1 GPU in the other 690). When this happens, the client is "smart" enough that it uses 1 GPU in each 690 card. I never leaves 1 card totally iddle.

I never "use" the 690s (for display, like games): their only reason for existence is crunching GPUGrid workunits ;). I'm stumped :S

P.S. The two 690 cards are actually physically linked for SLI, but is not enabled in the NVIDIA control panel. I am positive that this is not the cause of the problem as the it persists in a periodic, repeatable and totally predictable fashion. If this was the cause, like some kind of software glitch in the NVIDIA control panel, it would be totally random, dont you think?

This is as if there is some kind of GPU protection from the server side, and sending only half the workload after 1 day, but have never heard of this before.

Dylan
Send message
Joined: 16 Jul 12
Posts: 98
Credit: 386,043,752
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwat
Message 27907 - Posted: 1 Jan 2013 | 21:46:47 UTC

I don't know if this helps or not, but I also have my two 670's in a SLI bridge, but also crunching/folding etc with SLI disabled and I have had no issues.

Maybe try restarting the BOINC client or going to the advanced options and clicking read config file?

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,206,655,749
RAC: 261,147
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 27908 - Posted: 1 Jan 2013 | 22:35:12 UTC - in response to Message 27905.

This is as if there is some kind of GPU protection from the server side, and sending only half the workload after 1 day, but have never heard of this before.

No such protection is in operation on the server side.
Why don't you remove the GT640?

Jorge Alberto Ramos Olive...
Send message
Joined: 13 Aug 09
Posts: 24
Credit: 156,684,745
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwat
Message 27909 - Posted: 1 Jan 2013 | 22:42:07 UTC - in response to Message 27907.

I don't know if this helps or not, but I also have my two 670's in a SLI bridge, but also crunching/folding etc with SLI disabled and I have had no issues.

Maybe try restarting the BOINC client or going to the advanced options and clicking read config file?


This has happened for days and computer restarts. I started researchign in the BOINC forums as this might be tied to a client issue. Will get back here if I find something usefull...

Jorge Alberto Ramos Olive...
Send message
Joined: 13 Aug 09
Posts: 24
Credit: 156,684,745
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwat
Message 27910 - Posted: 1 Jan 2013 | 22:42:55 UTC - in response to Message 27908.

This is as if there is some kind of GPU protection from the server side, and sending only half the workload after 1 day, but have never heard of this before.

No such protection is in operation on the server side.
Why don't you remove the GT640?


I use that as the primary display device and for SETI@home too ;)

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,206,655,749
RAC: 261,147
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 27911 - Posted: 1 Jan 2013 | 23:30:01 UTC - in response to Message 27910.
Last modified: 1 Jan 2013 | 23:31:43 UTC

Why don't you remove the GT640?

I use that as the primary display device and for SETI@home too ;)

A GTX 690 isn't lagging if it is used for crunching and as the primary display device at the same time (I know it from my experience).
You lose much more credit by the idle GPUs, than you gain by a GT640.

Dylan
Send message
Joined: 16 Jul 12
Posts: 98
Credit: 386,043,752
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwat
Message 27913 - Posted: 2 Jan 2013 | 1:22:08 UTC

What Retvari said is true about using the same GPU for crunching and for display at the same time. I do this with my computer and even play games while crunching, although this does slow down WU completion times, but I haven't had any issues.

Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 27917 - Posted: 2 Jan 2013 | 15:53:25 UTC - in response to Message 27913.

I would still disable the physical SLI link. What's the point in having it, if it's not used?
I had trouble in the distant past with 4GPUs. I had to do things like extend the desktop to all 4 cards and use an omni-port. Recent drivers might have improved this situation, for two cards but not for four/five GPU's.
____________
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help

Jorge Alberto Ramos Olive...
Send message
Joined: 13 Aug 09
Posts: 24
Credit: 156,684,745
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwat
Message 27967 - Posted: 7 Jan 2013 | 23:51:37 UTC

It's been a week since my fisrt post and I haven't found a solution. I decided then to disattach the seti project in favor of this one and thus having only one project to work on. Hopefully, this will permit my system crunch 24/7 at full workload without strange work-fetch BOINC behaviour.

I have noticed that since BOINC 7.x, the algorithms for fetching work had a major overhaull (see e.g. http://boincfaq.mundayweb.com/index.php?language=1&view=592 and http://boinc.berkeley.edu/wiki/Release_Notes#BOINC_7_incompatible_with_Domain_Controllers) and some users have reverted back to pre 7.x versions or even configuring their systems for working with 2 clients on the same host.

Shouldn't BOINC developers be noticing this? For many other users, though, this is not a problem as many are attached to only one project and/or not using GPU as coprocessor, but from my perspective, the previous fetch algorithms did not cause trouble for them either. I would be happy to hear from someone who actually knows about this and hear from him/her the reasoning behind the changes made to 7.x

BOINC's 7.x work fetch algorithm - NO LIKE

werdwerdus
Send message
Joined: 15 Apr 10
Posts: 123
Credit: 1,004,473,861
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 27973 - Posted: 8 Jan 2013 | 17:19:14 UTC

attaching to 2 GPU projects with one client is not recommended. Causes problems getting correct amount of work from both.
____________
XtremeSystems.org - #1 Team in GPUGrid

Post to thread

Message boards : Number crunching : workload not optimum

//