Advanced search

Message boards : News : Probable access problems on 9th Dec

Author Message
Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1893
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 42274 - Posted: 1 Dec 2015 | 23:32:19 UTC

On 9th Dec we are moving the IP of gpugrid to another network. This means changing the DNS. While the dns update is propagating around the world you might experience that the server is unreachable.

Early next year we are going also to upgrade the server, but this is another story.

GDF

John C MacAlister
Send message
Joined: 17 Feb 13
Posts: 177
Credit: 131,725,186
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwat
Message 42276 - Posted: 2 Dec 2015 | 0:45:44 UTC - in response to Message 42274.

Thanks for the notice!

[CSF] Thomas H.V. Dupont
Send message
Joined: 20 Jul 14
Posts: 523
Credit: 55,288,675
RAC: 52,362
Level
Thr
Scientific publications
watwatwat
Message 42279 - Posted: 2 Dec 2015 | 15:48:57 UTC - in response to Message 42276.

Thanks for the notice!

+1 :) Thanks Gianni !
____________
[CSF] Thomas H.V. Dupont
Founder of the team CRUNCHERS SANS FRONTIERES
www.crunchersansfrontieres.org

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1893
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 42347 - Posted: 9 Dec 2015 | 21:54:46 UTC - in response to Message 42279.

the transition is over and it should be all working like before.

gdf

Profile Logan Carr
Send message
Joined: 12 Aug 15
Posts: 193
Credit: 25,979,525
RAC: 19
Level
Val
Scientific publications
wat
Message 42349 - Posted: 10 Dec 2015 | 0:56:33 UTC - in response to Message 42347.

Hi,

why is it that my gpugrid keeps saying "communication deferred"?

It was working fine before. Any way I can fix this? :(

ALAIN_13013
Avatar
Send message
Joined: 11 Sep 08
Posts: 10
Credit: 1,192,969,088
RAC: 414,413
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42353 - Posted: 10 Dec 2015 | 5:45:52 UTC - in response to Message 42349.

même chose ici

10/12/2015 06:45:22 | | Project communication failed: attempting access to reference site
10/12/2015 06:45:23 | | Internet access OK - project servers may be temporarily down.


____________

Profile [PUGLIA] kidkidkid3
Send message
Joined: 23 Feb 11
Posts: 48
Credit: 331,170,342
RAC: 195,496
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 42354 - Posted: 10 Dec 2015 | 6:55:04 UTC - in response to Message 42353.

même chose ici
10/12/2015 06:45:22 | | Project communication failed: attempting access to reference site
10/12/2015 06:45:23 | | Internet access OK - project servers may be temporarily down.



All is working now .... but zero WU !
K.
____________
Dreams do not always come true. But not because they are too big or impossible. Why did we stop believing.
(Martin Luther King)

kain
Send message
Joined: 3 Sep 14
Posts: 104
Credit: 127,669,166
RAC: 18,278
Level
Cys
Scientific publications
watwatwatwat
Message 42355 - Posted: 10 Dec 2015 | 12:15:07 UTC

Yup, we need more WU. Short runs are totally forgotten...

Profile Zarck
Send message
Joined: 16 Aug 08
Posts: 135
Credit: 235,424,643
RAC: 98,099
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42356 - Posted: 10 Dec 2015 | 17:10:03 UTC
Last modified: 10 Dec 2015 | 17:10:56 UTC

An alternative to GPUGRID, Poem, which also offers Bio GPU units.

http://boinc.fzk.de/poem/

http://boinc.fzk.de/poem/gpu_list.php

@+
*_*
____________

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 1814
Credit: 9,952,045,994
RAC: 6,316,474
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42363 - Posted: 11 Dec 2015 | 8:30:21 UTC
Last modified: 11 Dec 2015 | 8:30:41 UTC

FYI: I'm still having network access problems:
1. Stalled up/downloads
2. There are 3 tasks in progress according to my host's status page, but in reality there's only two (this host has only one GPU)
These problems were severe on 9th Dec, but it seems that the reason of these problems are not related to DNS replication.

Vagelis Giannadakis
Send message
Joined: 5 May 13
Posts: 187
Credit: 349,254,454
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwat
Message 42364 - Posted: 11 Dec 2015 | 9:42:42 UTC - in response to Message 42363.

Zoltan, perhaps your host has not refreshed its DNS cache. A reboot would help in that case.
____________

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 1814
Credit: 9,952,045,994
RAC: 6,316,474
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42373 - Posted: 12 Dec 2015 | 9:52:57 UTC - in response to Message 42364.

Zoltan, perhaps your host has not refreshed its DNS cache. A reboot would help in that case.
I've checked it by
ipconfig /displaydns
and it was ok. The address in the cache was 84.89.134.145, to make sure the cache refresh itself I did an
ipconfig /flushdns
to clear the cache.
However, it seems that the problem is gone in the meantime.
There's nothing I can do about the extra workunits assigned to my hosts, they will be reassigned to another host after their deadline (5 days).

Puyjalon
Send message
Joined: 12 Dec 15
Posts: 1
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 42380 - Posted: 12 Dec 2015 | 20:06:11 UTC

Cc je suis en France je parle juste français

kain
Send message
Joined: 3 Sep 14
Posts: 104
Credit: 127,669,166
RAC: 18,278
Level
Cys
Scientific publications
watwatwatwat
Message 42381 - Posted: 12 Dec 2015 | 20:16:25 UTC - in response to Message 42380.

Apprenez donc

Bedrich Hajek
Send message
Joined: 28 Mar 09
Posts: 332
Credit: 3,759,688,409
RAC: 392,906
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42384 - Posted: 13 Dec 2015 | 12:05:40 UTC - in response to Message 42363.

FYI: I'm still having network access problems:
1. Stalled up/downloads
2. There are 3 tasks in progress according to my host's status page, but in reality there's only two (this host has only one GPU)
These problems were severe on 9th Dec, but it seems that the reason of these problems are not related to DNS replication.


I am have been having the same problems since the change to the new network. But for me, it happens occasionally during downloads only (definitely more often than before), and it involves only one file in the WU downloading. To continue with the download, I can either press "renter now " when the status is "Download reentry in xx:xx:xx" or exit boinc and then run it again. Sometimes, I have to do this more than once. But most of the time, the downloads go smoothly.

I haven't lost a WU yet, due to this problem. Knock wood!



klepel
Send message
Joined: 23 Dec 09
Posts: 126
Credit: 1,699,934,037
RAC: 1,840,431
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42387 - Posted: 14 Dec 2015 | 1:17:37 UTC - in response to Message 42384.

I am have been having the same problems since the change to the new network. But for me, it happens occasionally during downloads only (definitely more often than before), and it involves only one file in the WU downloading. To continue with the download, I can either press "renter now " when the status is "Download reentry in xx:xx:xx" or exit boinc and then run it again.

Same behavior here as well!

Profile Blurf
Send message
Joined: 20 Dec 11
Posts: 9
Credit: 28,974,143
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwat
Message 42390 - Posted: 15 Dec 2015 | 6:34:00 UTC - in response to Message 42354.
Last modified: 15 Dec 2015 | 6:44:45 UTC

même chose ici
Project communication failed: attempting access to reference site
Internet access OK - project servers may be temporarily down.



Still receiving this error and I have flushed DNS/rebooted. Other projects running normally

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1893
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 42400 - Posted: 15 Dec 2015 | 21:29:51 UTC - in response to Message 42390.

which domain name are you attaching to?
Do you use an account manager?

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1893
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 42401 - Posted: 15 Dec 2015 | 21:29:56 UTC - in response to Message 42390.

which domain name are you attaching to?
Do you use an account manager?

Bedrich Hajek
Send message
Joined: 28 Mar 09
Posts: 332
Credit: 3,759,688,409
RAC: 392,906
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42414 - Posted: 16 Dec 2015 | 20:32:34 UTC - in response to Message 42384.

FYI: I'm still having network access problems:
1. Stalled up/downloads
2. There are 3 tasks in progress according to my host's status page, but in reality there's only two (this host has only one GPU)
These problems were severe on 9th Dec, but it seems that the reason of these problems are not related to DNS replication.


I am have been having the same problems since the change to the new network. But for me, it happens occasionally during downloads only (definitely more often than before), and it involves only one file in the WU downloading. To continue with the download, I can either press "renter now " when the status is "Download reentry in xx:xx:xx" or exit boinc and then run it again. Sometimes, I have to do this more than once. But most of the time, the downloads go smoothly.

I haven't lost a WU yet, due to this problem. Knock wood!






I just want to add that this is happening on my windows xp computer, only. The windows 10 machine is downloading WUs with no problems, so far.

And sometimes, more than one file gets stuck.


Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 1814
Credit: 9,952,045,994
RAC: 6,316,474
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42488 - Posted: 24 Dec 2015 | 12:40:04 UTC - in response to Message 42414.

I just want to add that this is happening on my windows xp computer, only. The windows 10 machine is downloading WUs with no problems, so far.

And sometimes, more than one file gets stuck.

I have another "ghost" task on one of my hosts.

Vagelis Giannadakis
Send message
Joined: 5 May 13
Posts: 187
Credit: 349,254,454
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwat
Message 42558 - Posted: 4 Jan 2016 | 10:13:21 UTC

I too just noticed I had a task that timed out without a response on me. I went over the BOINC log and couldn't find its name. I did notice the following in the log for December 29th however (when the task was assigned to me):

29-Dec-2015 13:30:40 [GPUGRID] Sending scheduler request: To fetch work.
29-Dec-2015 13:30:40 [GPUGRID] Requesting new tasks for NVIDIA GPU
29-Dec-2015 13:35:47 [GPUGRID] Scheduler request failed: Timeout was reached
29-Dec-2015 13:35:47 [GPUGRID] Sending scheduler request: To fetch work.
29-Dec-2015 13:35:47 [GPUGRID] Requesting new tasks for NVIDIA GPU
29-Dec-2015 13:35:49 [GPUGRID] Scheduler request completed: got 0 new tasks
29-Dec-2015 13:35:49 [GPUGRID] No tasks sent
29-Dec-2015 13:35:49 [GPUGRID] No tasks are available for Long runs (8-12 hours on fastest card)
29-Dec-2015 13:35:49 [GPUGRID] Project has no tasks available
29-Dec-2015 13:35:51 [---] Project communication failed: attempting access to reference site
29-Dec-2015 13:35:52 [---] Internet access OK - project servers may be temporarily down.


So, it seems to me the request for new tasks did go through to the scheduler, but its response never reached my machine.

I am also having the download / upload problems mentioned in this thread. Files eventually do get down / up, but with several retries.

This is definitely a network problem on the GPUGRID side of the network - maybe a router close to the project servers has not had its DNS and / or routing tables refreshed?

I am wondering how this issue with phantom WU assignments is affecting WU availability and the overall computation progress, especially in this WU season of drought. Just imagine hosts requesting tasks, getting them without knowing it, and after some minutes requesting again. This issue does not need to happen many times to many users to make many tasks disappear...
____________

captainjack
Send message
Joined: 9 May 13
Posts: 109
Credit: 734,440,997
RAC: 49,537
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwat
Message 42571 - Posted: 7 Jan 2016 | 20:48:32 UTC

Still getting these errors on most of my downloads. Eventually they come through.

Thu 07 Jan 2016 02:42:14 PM CST | GPUGRID | Temporarily failed download of e20s36_e16s2p1f382-GERARD_CXCL12_DIMPROTO1-0-pdb_file: transient HTTP error
Thu 07 Jan 2016 02:42:14 PM CST | GPUGRID | Backing off 00:02:24 on download of e20s36_e16s2p1f382-GERARD_CXCL12_DIMPROTO1-0-pdb_file
Thu 07 Jan 2016 02:42:15 PM CST | GPUGRID | Temporarily failed download of e20s36_e16s2p1f382-GERARD_CXCL12_DIMPROTO1-0-psf_file: transient HTTP error
Thu 07 Jan 2016 02:42:15 PM CST | GPUGRID | Backing off 00:02:44 on download of e20s36_e16s2p1f382-GERARD_CXCL12_DIMPROTO1-0-psf_file
Thu 07 Jan 2016 02:42:17 PM CST | | Internet access OK - project servers may be temporarily down.
Thu 07 Jan 2016 02:42:30 PM CST | | Project communication failed: attempting access to reference site
Thu 07 Jan 2016 02:42:30 PM CST | GPUGRID | Temporarily failed download of e20s3_e16s18p1f222-GERARD_CXCL12_DIMPROTO1-0-pdb_file: transient HTTP error
Thu 07 Jan 2016 02:42:30 PM CST | GPUGRID | Backing off 00:02:54 on download of e20s3_e16s18p1f222-GERARD_CXCL12_DIMPROTO1-0-pdb_file
Thu 07 Jan 2016 02:42:31 PM CST | | Internet access OK - project servers may be temporarily down.

Jim1348
Send message
Joined: 28 Jul 12
Posts: 446
Credit: 1,102,915,752
RAC: 2,480,741
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwat
Message 42572 - Posted: 7 Jan 2016 | 23:59:32 UTC - in response to Message 42571.

Now that you mention it, I am too. There are several earlier entries, this is just the most recent. I never paid any attention to it before. Whether it is a big problem or not I have no idea.

i7-4790-PC

194 GPUGRID 1/7/2016 4:39:35 PM Temporarily failed download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-coor_file: transient HTTP error
195 GPUGRID 1/7/2016 4:39:35 PM Backing off 00:03:10 on download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-coor_file
196 GPUGRID 1/7/2016 4:39:35 PM Started download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-par_file
197 GPUGRID 1/7/2016 4:39:39 PM Finished download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-par_file
198 GPUGRID 1/7/2016 4:39:39 PM Started download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-conf_file_enc
199 1/7/2016 4:39:40 PM Project communication failed: attempting access to reference site
200 GPUGRID 1/7/2016 4:39:40 PM Finished download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-conf_file_enc
201 GPUGRID 1/7/2016 4:39:40 PM Started download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-metainp_file
202 1/7/2016 4:39:41 PM Internet access OK - project servers may be temporarily down.
203 GPUGRID 1/7/2016 4:39:41 PM Finished download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-metainp_file
204 GPUGRID 1/7/2016 4:39:41 PM Started download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-hills_file
205 GPUGRID 1/7/2016 4:39:42 PM Finished download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-hills_file
206 GPUGRID 1/7/2016 4:39:42 PM Started download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-xsc_file
207 GPUGRID 1/7/2016 4:39:43 PM Finished download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-xsc_file
208 GPUGRID 1/7/2016 4:39:43 PM Started download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-prmtop_file
209 GPUGRID 1/7/2016 4:39:44 PM Finished download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-prmtop_file
210 GPUGRID 1/7/2016 4:39:57 PM Temporarily failed download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-psf_file: transient HTTP error
211 GPUGRID 1/7/2016 4:39:57 PM Backing off 00:02:26 on download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-psf_file
212 1/7/2016 4:40:01 PM Project communication failed: attempting access to reference site
213 1/7/2016 4:40:02 PM Internet access OK - project servers may be temporarily down.
214 GPUGRID 1/7/2016 4:42:24 PM Started download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-psf_file
215 GPUGRID 1/7/2016 4:42:28 PM Finished download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-psf_file
216 GPUGRID 1/7/2016 4:42:46 PM Started download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-coor_file
217 GPUGRID 1/7/2016 4:42:51 PM Finished download of e18s56_e12s22p1f460-GERARD_CXCL12_DIMPROTO3-0-coor_file

nanoprobe
Send message
Joined: 26 Feb 12
Posts: 181
Credit: 221,824,715
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwat
Message 42578 - Posted: 8 Jan 2016 | 21:16:34 UTC

Still getting these errors on most of my downloads. Eventually they come through.


Same here. After the initial download times out I hit "retry" from BoincTasks transfers tab and the download resumes and finishes. Been doing this for a couple of weeks.

Bedrich Hajek
Send message
Joined: 28 Mar 09
Posts: 332
Credit: 3,759,688,409
RAC: 392,906
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42603 - Posted: 14 Jan 2016 | 2:25:26 UTC

I have lost 2 WUs while downloading on my windows xp machine:

https://www.gpugrid.net/result.php?resultid=14844544

https://www.gpugrid.net/result.php?resultid=14843167


The WUs were both GERARD_A2AR_luf6806.


Here is the event log:

1/13/2016 9:14:52 PM | GPUGRID | Requesting new tasks for NVIDIA GPU
1/13/2016 9:14:54 PM | GPUGRID | Scheduler request completed: got 1 new tasks
1/13/2016 9:14:56 PM | GPUGRID | Started download of e1s42_4-GERARD_A2AR_luf6806_b1-0-LICENSE
1/13/2016 9:14:56 PM | GPUGRID | Started download of e1s42_4-GERARD_A2AR_luf6806_b1-0-COPYRIGHT
1/13/2016 9:14:58 PM | GPUGRID | Giving up on download of e1s42_4-GERARD_A2AR_luf6806_b1-0-LICENSE: permanent HTTP error
1/13/2016 9:14:58 PM | GPUGRID | Giving up on download of e1s42_4-GERARD_A2AR_luf6806_b1-0-COPYRIGHT: permanent HTTP error
1/13/2016 9:14:58 PM | GPUGRID | Started download of e1s42_4-GERARD_A2AR_luf6806_b1-0-coor_file
1/13/2016 9:14:58 PM | GPUGRID | Started download of e1s42_4-GERARD_A2AR_luf6806_b1-0-vel_file
1/13/2016 9:14:59 PM | GPUGRID | Giving up on download of e1s42_4-GERARD_A2AR_luf6806_b1-0-coor_file: permanent HTTP error
1/13/2016 9:14:59 PM | GPUGRID | Giving up on download of e1s42_4-GERARD_A2AR_luf6806_b1-0-vel_file: permanent HTTP error
1/13/2016 9:14:59 PM | GPUGRID | Started download of e1s42_4-GERARD_A2AR_luf6806_b1-0-idx_file
1/13/2016 9:14:59 PM | GPUGRID | Started download of e1s42_4-GERARD_A2AR_luf6806_b1-0-pdb_file
1/13/2016 9:15:00 PM | GPUGRID | Giving up on download of e1s42_4-GERARD_A2AR_luf6806_b1-0-idx_file: permanent HTTP error
1/13/2016 9:15:00 PM | GPUGRID | Giving up on download of e1s42_4-GERARD_A2AR_luf6806_b1-0-pdb_file: permanent HTTP error
1/13/2016 9:15:00 PM | GPUGRID | Started download of e1s42_4-GERARD_A2AR_luf6806_b1-0-psf_file
1/13/2016 9:15:00 PM | GPUGRID | Started download of e1s42_4-GERARD_A2AR_luf6806_b1-0-par_file
1/13/2016 9:15:01 PM | GPUGRID | Giving up on download of e1s42_4-GERARD_A2AR_luf6806_b1-0-psf_file: permanent HTTP error
1/13/2016 9:15:01 PM | GPUGRID | Giving up on download of e1s42_4-GERARD_A2AR_luf6806_b1-0-par_file: permanent HTTP error
1/13/2016 9:15:01 PM | GPUGRID | Started download of e1s42_4-GERARD_A2AR_luf6806_b1-0-conf_file_enc
1/13/2016 9:15:01 PM | GPUGRID | Started download of e1s42_4-GERARD_A2AR_luf6806_b1-0-metainp_file
1/13/2016 9:15:02 PM | GPUGRID | Giving up on download of e1s42_4-GERARD_A2AR_luf6806_b1-0-conf_file_enc: permanent HTTP error
1/13/2016 9:15:02 PM | GPUGRID | Giving up on download of e1s42_4-GERARD_A2AR_luf6806_b1-0-metainp_file: permanent HTTP error
1/13/2016 9:15:02 PM | GPUGRID | Started download of e1s42_4-GERARD_A2AR_luf6806_b1-0-hills_file
1/13/2016 9:15:02 PM | GPUGRID | Started download of e1s42_4-GERARD_A2AR_luf6806_b1-0-xsc_file
1/13/2016 9:15:03 PM | GPUGRID | Giving up on download of e1s42_4-GERARD_A2AR_luf6806_b1-0-hills_file: permanent HTTP error
1/13/2016 9:15:03 PM | GPUGRID | Giving up on download of e1s42_4-GERARD_A2AR_luf6806_b1-0-xsc_file: permanent HTTP error
1/13/2016 9:15:03 PM | GPUGRID | Started download of e1s42_4-GERARD_A2AR_luf6806_b1-0-prmtop_file
1/13/2016 9:15:04 PM | GPUGRID | Giving up on download of e1s42_4-GERARD_A2AR_luf6806_b1-0-prmtop_file: permanent HTTP error
1/13/2016 9:16:04 PM | GPUGRID | Sending scheduler request: To report completed tasks.
1/13/2016 9:16:04 PM | GPUGRID | Reporting 1 completed tasks


klepel
Send message
Joined: 23 Dec 09
Posts: 126
Credit: 1,699,934,037
RAC: 1,840,431
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42776 - Posted: 9 Feb 2016 | 16:59:00 UTC

I would like to report, that I have occasionally download problems until this date (individual files get stuck). This was not a concern, when there have not been many WUs around, but now when the pipeline is full, it is quite boring.

nanoprobe
Send message
Joined: 26 Feb 12
Posts: 181
Credit: 221,824,715
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwat
Message 42781 - Posted: 9 Feb 2016 | 20:04:45 UTC - in response to Message 42776.

I would like to report, that I have occasionally download problems until this date (individual files get stuck). This was not a concern, when there have not been many WUs around, but now when the pipeline is full, it is quite boring.

Same here. I've asked about it several times but never got a reply. Hours wasted that could be used crunching.

kashi
Send message
Joined: 29 Jan 15
Posts: 3
Credit: 55,526,887
RAC: 8,150
Level
Thr
Scientific publications
wat
Message 42793 - Posted: 10 Feb 2016 | 13:25:28 UTC - in response to Message 42776.

Yes me too, stuck file usually downloads after a few hours before the task running is finished however 2 times recently it has been stuck for over 4 hours and this left GPU idle for a few hours. I hate that. Crunching computer is running using electricity, belching fire into our skies and no work is being done.

A single upload file also often gets stuck. Haven't lost bonus credit yet because of it but gone close a few times. Not good.

If I had a slower GPU this server malfunction would make me consider crunching another project where the download/upload server works properly. Perhaps even F@H.

rama
Send message
Joined: 12 Feb 16
Posts: 1
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 42803 - Posted: 12 Feb 2016 | 11:58:30 UTC
Last modified: 12 Feb 2016 | 12:06:49 UTC

recently i saw one article For All Portable issue problems.But after one week the content got changed to some game content...may be be that is because of my browser issue ..pls try read this article that gives exact solutions...also inform me about the issue i am facing....the link is http://bit.do/solveportableissues

nanoprobe
Send message
Joined: 26 Feb 12
Posts: 181
Credit: 221,824,715
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwat
Message 42808 - Posted: 12 Feb 2016 | 22:01:59 UTC

Still having the same download issue. Recently brought my XP machine back to crunch here. It has the same issue. Downloads get stuck for hours. This is the only project of 7 that I'm currently running that does this and since it's On 2 different machines/OSs the problem is not on my end. PLEASE FIX THIS!

klepel
Send message
Joined: 23 Dec 09
Posts: 126
Credit: 1,699,934,037
RAC: 1,840,431
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42811 - Posted: 13 Feb 2016 | 0:39:22 UTC

I can confirm this; it is not on my end! The problem has aroused when the project changed the network. It seems to me that the new network cannot cope with the size of data transferred from the server to the user and vice versa.

I had up-load problems before, but assumed this is caused by the ADSL contracted. But since the network change, it happens also when downloading files from the server.

Two comments:
First, another project is now happy with the spare GPU time.
Second, although it was lengthy discussed in another forum, because of this download problem, I suggest, the maximal WUs per GPU should be increased to three as the fastest cards get a better load with parallel crunching.

Jim1348
Send message
Joined: 28 Jul 12
Posts: 446
Credit: 1,102,915,752
RAC: 2,480,741
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwat
Message 42822 - Posted: 15 Feb 2016 | 20:54:45 UTC

I routinely see delays of 10 to 20 minutes or so on downloads and a few uploads. I see it on both wired and wireless connections. It is annoying when I am running my GTX 750 Tis and am trying to make the 24 hour limit.

Maybe their servers are just overloaded?

Gerard
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 26 Mar 14
Posts: 99
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 42825 - Posted: 16 Feb 2016 | 12:52:32 UTC - in response to Message 42822.

I've forwarded your complaints to our IT service. Indeed delays in download/upload could be caused by the new network. I'll keep you updated!

nanoprobe
Send message
Joined: 26 Feb 12
Posts: 181
Credit: 221,824,715
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwat
Message 42827 - Posted: 18 Feb 2016 | 20:43:16 UTC - in response to Message 42825.

I've forwarded your complaints to our IT service. Indeed delays in download/upload could be caused by the new network. I'll keep you updated!

Thanks.

Bedrich Hajek
Send message
Joined: 28 Mar 09
Posts: 332
Credit: 3,759,688,409
RAC: 392,906
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42829 - Posted: 21 Feb 2016 | 23:52:08 UTC - in response to Message 42414.
Last modified: 21 Feb 2016 | 23:53:24 UTC

FYI: I'm still having network access problems:
1. Stalled up/downloads
2. There are 3 tasks in progress according to my host's status page, but in reality there's only two (this host has only one GPU)
These problems were severe on 9th Dec, but it seems that the reason of these problems are not related to DNS replication.


I am have been having the same problems since the change to the new network. But for me, it happens occasionally during downloads only (definitely more often than before), and it involves only one file in the WU downloading. To continue with the download, I can either press "renter now " when the status is "Download reentry in xx:xx:xx" or exit boinc and then run it again. Sometimes, I have to do this more than once. But most of the time, the downloads go smoothly.

I haven't lost a WU yet, due to this problem. Knock wood!






I just want to add that this is happening on my windows xp computer, only. The windows 10 machine is downloading WUs with no problems, so far.

And sometimes, more than one file gets stuck.






WU file downloading problem is now happening on both my windows xp and 10 computers, occasionally. See log:


2/21/2016 6:15:03 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-par_file
2/21/2016 6:15:03 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-conf_file_enc
2/21/2016 6:15:04 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-conf_file_enc
2/21/2016 6:15:04 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-metainp_file
2/21/2016 6:15:05 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-metainp_file
2/21/2016 6:15:05 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-hills_file
2/21/2016 6:15:06 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-hills_file
2/21/2016 6:15:06 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-xsc_file
2/21/2016 6:15:07 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-xsc_file
2/21/2016 6:15:07 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-prmtop_file
2/21/2016 6:15:08 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-prmtop_file
2/21/2016 6:20:02 PM | GPUGRID | Temporarily failed download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file: transient HTTP error
2/21/2016 6:20:02 PM | GPUGRID | Backing off 00:02:40 on download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file
2/21/2016 6:20:03 PM | | Project communication failed: attempting access to reference site
2/21/2016 6:20:04 PM | | Internet access OK - project servers may be temporarily down.
2/21/2016 6:22:42 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file
2/21/2016 6:27:55 PM | | Project communication failed: attempting access to reference site
2/21/2016 6:27:55 PM | GPUGRID | Temporarily failed download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file: transient HTTP error
2/21/2016 6:27:55 PM | GPUGRID | Backing off 00:04:30 on download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file
2/21/2016 6:27:56 PM | | Internet access OK - project servers may be temporarily down.
2/21/2016 6:28:51 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file
2/21/2016 6:29:06 PM | GPUGRID | Temporarily failed download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file: transient HTTP error
2/21/2016 6:29:06 PM | GPUGRID | Backing off 00:13:42 on download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file
2/21/2016 6:29:07 PM | | Project communication failed: attempting access to reference site
2/21/2016 6:29:08 PM | | BOINC can't access Internet - check network connection or proxy configuration.
2/21/2016 6:29:18 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file
2/21/2016 6:29:35 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file

Another trick to get the download to restart is to disconnect and reconnect the network internet connection, and then in the boinc manager under the transfer tab press the renter now button with the stalled file highlighted. Of course, you can wait for it to restart on its own, this merely speeds up the download.


This was not happening in such frequency before the network upgrade.

nanoprobe
Send message
Joined: 26 Feb 12
Posts: 181
Credit: 221,824,715
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwat
Message 42831 - Posted: 23 Feb 2016 | 19:52:59 UTC
Last modified: 23 Feb 2016 | 19:54:58 UTC

7 tries and 5 1/2 hours wasted trying to download 1 file because every time the download fails the wait period for the next try gets longer. This is ridiculous. I thinks it's time to move somewhere else until this issue is resolved. 2+ months is long enough for me.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 1814
Credit: 9,952,045,994
RAC: 6,316,474
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42832 - Posted: 23 Feb 2016 | 20:47:10 UTC - in response to Message 42831.

7 tries and 5 1/2 hours wasted trying to download 1 file because every time the download fails the wait period for the next try gets longer. This is ridiculous. I thinks it's time to move somewhere else until this issue is resolved. 2+ months is long enough for me.
Our complaints were forwarded to the IT service a week ago, however this problem exists since the changes in the network. I guess it's a misconfigured routing table (or more of them), which is quite hard to spot, especially when not all traffic is affected by it. A spare project could help to reduce the idle GPU time, so when the network issues will be fixed at GPUGrid's campus, your host will automatically stop downloading from the other (spare, 0 resource share) project.

nanoprobe
Send message
Joined: 26 Feb 12
Posts: 181
Credit: 221,824,715
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwat
Message 42834 - Posted: 24 Feb 2016 | 12:40:44 UTC - in response to Message 42832.
Last modified: 24 Feb 2016 | 13:25:41 UTC

7 tries and 5 1/2 hours wasted trying to download 1 file because every time the download fails the wait period for the next try gets longer. This is ridiculous. I thinks it's time to move somewhere else until this issue is resolved. 2+ months is long enough for me.
Our complaints were forwarded to the IT service a week ago, however this problem exists since the changes in the network. I guess it's a misconfigured routing table (or more of them), which is quite hard to spot, especially when not all traffic is affected by it. A spare project could help to reduce the idle GPU time, so when the network issues will be fixed at GPUGrid's campus, your host will automatically stop downloading from the other (spare, 0 resource share) project.


Something else blew up last night. I awoke this morning to find 4 tasks ready to report and no new tasks running. That had to be at least 12+ dead hours of no crunching. The projects tab showed the next update would not be for 12 more hours. After doing a manual update the 4 tasks reported and new tasks were requested. Below is a partial copy of the messages:

672710 GPUGRID 2/24/2016 7:24:35 AM update requested by user
672711 GPUGRID 2/24/2016 7:24:40 AM Fetching scheduler list
672712 GPUGRID 2/24/2016 7:24:43 AM Master file download succeeded
672713 GPUGRID 2/24/2016 7:24:48 AM Sending scheduler request: Requested by user.
672714 GPUGRID 2/24/2016 7:24:48 AM Reporting 4 completed tasks
672715 GPUGRID 2/24/2016 7:24:48 AM Requesting new tasks for NVIDIA GPU
672716 GPUGRID 2/24/2016 7:24:50 AM Scheduler request completed: got 1 new tasks

What would cause the master file to be needed again? I'm assuming that was a/the reason for the 12 hour delay. New tasks were received but there are 7 files stuck again. *bangs head on desk*

Also I don't think using a 0 share standby will work because if I remember correctly BOINC will not allow new tasks from another project to download if it detects stuck downloads from the higher priority project.

FWIW if the current IT service can't get this resolved after 2 months maybe GPUGrid might consider switching to another service provider.

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 775
Credit: 1,314,197,470
RAC: 1,423,197
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42835 - Posted: 24 Feb 2016 | 18:55:04 UTC - in response to Message 42834.
Last modified: 24 Feb 2016 | 18:56:52 UTC

What would cause the master file to be needed again?

Ten consecutive failures to contact the scheduler. Note that's the request work/report work contact attempt, not the file download attempts this thread has mainly been about.

Check the full log in stdoutdae.txt - see when the problem started/ended. Unless you've suppressed it, BOINC will try to contact a 'neutral' web host (google.com) after each failure: if google is OK but gpugrid fails, then the project server is the suspect. But if google fails as well, then your own network connection and ISP may be at fault.

To test a little theory of mine - what OS is having these problems? Linux, Windows, OS X? Or all three? I'm Windows, and I see the downloads stalling sometimes - but the work is usually fully downloaded by the time I need it.

[Edit - OK, we don't support OS X here. Forget that one.]

Jim1348
Send message
Joined: 28 Jul 12
Posts: 446
Credit: 1,102,915,752
RAC: 2,480,741
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwat
Message 42836 - Posted: 24 Feb 2016 | 19:47:10 UTC - in response to Message 42835.
Last modified: 24 Feb 2016 | 20:27:51 UTC

To test a little theory of mine - what OS is having these problems? Linux, Windows, OS X? Or all three? I'm Windows, and I see the downloads stalling sometimes - but the work is usually fully downloaded by the time I need it.

I am on Win7 64-bit. It is not an actual operational problem for me at the moment. For the past four days, even with one or two backoffs, I get the downloads in less than 20 minutes, and usually about 10 minutes. With a little overlap (buffer setting of 0.01 + 0.01 days), it is working OK for my GTX 960, though the problem to some degree is still there.

nanoprobe
Send message
Joined: 26 Feb 12
Posts: 181
Credit: 221,824,715
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwat
Message 42837 - Posted: 24 Feb 2016 | 23:01:07 UTC - in response to Message 42835.



To test a little theory of mine - what OS is having these problems? Linux, Windows, OS X? Or all three? I'm Windows, and I see the downloads stalling sometimes - but the work is usually fully downloaded by the time I need it.


Both my XP and Win7 boxes are having the issue. Win7 is the most problematic because I'm running dual cards with 2 tasks each. Because of the 2 task per GPU limit I have no buffer to cover the stuck downloads. A 3 tasks per card limit would probably eliminate the issue but I doubt that will happen.

Nick Name
Send message
Joined: 3 Sep 13
Posts: 15
Credit: 276,798,256
RAC: 288,491
Level
Asn
Scientific publications
watwatwatwatwat
Message 42838 - Posted: 25 Feb 2016 | 2:19:51 UTC - in response to Message 42835.

To test a little theory of mine - what OS is having these problems?


Windows 7 Pro here. It's a minor problem, usually, however I am not crunching at the scale some are.

____________
Team USA forum | Team USA page

Gerard
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 26 Mar 14
Posts: 99
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 42856 - Posted: 28 Feb 2016 | 18:06:34 UTC - in response to Message 42831.

Hi nanoprobe and the gpugrid community,

I have forwarded this issue already several times to our local IT service but apparently they also have to forward it to some other higher IT service and I'm not sure to which extent we have power to prioritize our problem. I kindly ask you to be patient. I particularly share your frustration and I will try to keep insisting them to fix the problem.

Vagelis Giannadakis
Send message
Joined: 5 May 13
Posts: 187
Credit: 349,254,454
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwat
Message 42876 - Posted: 2 Mar 2016 | 14:36:30 UTC

I just noticed another phantom task that was assigned to me, but my BOINC client never got the scheduler's response:

02-Mar-2016 14:35:22 [GPUGRID] Sending scheduler request: Requested by project.
02-Mar-2016 14:35:22 [GPUGRID] Requesting new tasks for NVIDIA GPU
02-Mar-2016 14:40:27 [GPUGRID] Scheduler request failed: Timeout was reached
02-Mar-2016 14:40:27 [GPUGRID] Sending scheduler request: Requested by project.
02-Mar-2016 14:40:27 [GPUGRID] Requesting new tasks for NVIDIA GPU
02-Mar-2016 14:40:29 [GPUGRID] Scheduler request completed: got 1 new tasks

After the timeout, my BOINC client merrily requested once more for new tasks...

I wish there was a way to cancel tasks using the project's site, e.g. a Cancel button on the task list.
____________

Vagelis Giannadakis
Send message
Joined: 5 May 13
Posts: 187
Credit: 349,254,454
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwat
Message 42883 - Posted: 4 Mar 2016 | 10:28:22 UTC

Intrigued by this post by Bjarke I decided to do some trace-routing for GPUGRID and my other projects (WCG and POEM). Here's the output from tracert, having appended the geographic location of each hop using http://www.ipligence.com/geolocation:

C:\Users\vagelis>tracert www.gpugrid.org

Tracing route to www.gpugrid.org [84.89.134.145]
over a maximum of 30 hops:

## Skipping trace of my own ISP ##

8 77 ms 75 ms 76 ms nl-sar.nordu.net [80.249.209.203] -- NETHERLANDS
9 80 ms 80 ms 98 ms uk-hex.nordu.net [109.105.102.97] -- SWEDEN
10 96 ms 105 ms 95 ms ndn-gw.mx1.lon.uk.geant.net [109.105.102.98] -- SWEDEN
11 85 ms 98 ms 86 ms ae0.mx1.par.fr.geant.net [62.40.98.77] -- UK
12 81 ms 81 ms 81 ms 83.97.88.129 -- UK
13 104 ms * 105 ms 83.97.88.130 -- UK
14 139 ms 139 ms 120 ms TELMAD.AE4.uv.rt1.val.red.rediris.es [130.206.245.89] -- SPAIN - MADRID
15 118 ms 120 ms 121 ms anella-val1-router.red.rediris.es [130.206.211.70] -- SPAIN - MADRID
16 * * * Request timed out.
17 126 ms 126 ms 126 ms grosso.upf.edu [84.89.134.145] -- SPAIN - BARCELONA
18 216 ms 126 ms 126 ms grosso.upf.edu [84.89.134.145]
19 118 ms 117 ms 120 ms grosso.upf.edu [84.89.134.145]

Trace complete.

Note that I took traces from two locations using different ISPs to determine the entry point to GPUGRID's ISP network. The trace above is the common part.

Comparing GPUGRID's route trace to my other projects, it is evident that there's a lot of hopping around across Europe: Netherlands to Sweden to the UK to finally reach Spain. In contrast, WCG's trace shows a hop in the UK and then it goes to the USA. POEM's again has a hop in the UK and then goes to Germany.

Now, I'm not saying that hopping across Europe is a bad thing, even for an IP packet :D, but more hops does mean more points that can cause network problems.

It would be interesting to have a route trace from before the GPUGRID ISP switch to compare...
____________

Jim1348
Send message
Joined: 28 Jul 12
Posts: 446
Credit: 1,102,915,752
RAC: 2,480,741
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwat
Message 42885 - Posted: 4 Mar 2016 | 16:01:13 UTC - in response to Message 42883.
Last modified: 4 Mar 2016 | 16:02:59 UTC

A tracert from eastern Pennsylvania seems simple enough. My guess is that it is a local problem near UPF.
Or maybe there is more than one problem? It seems to be worse for some people than others.


3 9 ms 13 ms 11 ms 207.172.196.203
4 11 ms 11 ms 11 ms xe-7-0-2.bar2.Philadelphia1.Level3.net [4.30.46.33]
5 107 ms 105 ms 107 ms ae-1-3101.bar1.Madrid2.Level3.net [4.69.210.222]
6 107 ms 111 ms 107 ms ae-1-3101.bar1.Madrid2.Level3.net [4.69.210.222]
7 106 ms 108 ms 113 ms 213.242.113.78
8 113 ms 118 ms 113 ms TELMAD.AE4.uv.rt1.val.red.rediris.es [130.206.245.89]
9 121 ms 119 ms 119 ms anella-val1-router.red.rediris.es [130.206.211.70]
10 * * * Request timed out.
11 119 ms 118 ms 119 ms grosso.upf.edu [84.89.134.145]

Jim1348
Send message
Joined: 28 Jul 12
Posts: 446
Credit: 1,102,915,752
RAC: 2,480,741
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwat
Message 42890 - Posted: 5 Mar 2016 | 15:39:27 UTC

I wonder if it is possible for BOINC, or some ancillary program, to do a tracert on a file as it is being downloaded? That would be more useful in finding the sticking points than doing a tracert after the fact, when conditions have changed.

klepel
Send message
Joined: 23 Dec 09
Posts: 126
Credit: 1,699,934,037
RAC: 1,840,431
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 43002 - Posted: 14 Mar 2016 | 14:49:26 UTC

Did I miss something?

I installed my old GTX970 GPU on a Linux Mint Computer a few days ago:
Operating system: Linux Mint 17.3 Cinnamon 32-bit
Cinnamon Version: 2.8.6
Linux Kernel: 3.19.0-32-generic
AMD FX™-6100 Six-Core Priocessorx3
Memory: 7.8GiB
Hard Drive: 114.7 GB
Graphics Card: Nvidia Corporation GF110 [GeForce GTX570 Rev.2]

I have not received any work until today, although the Server Status indicates that there are “Tasks ready to send: 181”. However, I do receive PRIMEGRID.net Wus for my GTX570 without any problem! And all my Windows machines are loaded with work. So I do not understand what is wrong.

Please find the relevant log below:

Mon 14 Mar 2016 09:25:38 AM PET | | Starting BOINC client version 7.2.42 for i686-pc-linux-gnu
Mon 14 Mar 2016 09:25:38 AM PET | | log flags: file_xfer, sched_ops, task
Mon 14 Mar 2016 09:25:38 AM PET | | Libraries: libcurl/7.35.0 OpenSSL/1.0.1f zlib/1.2.8 libidn/1.28 librtmp/2.3
Mon 14 Mar 2016 09:25:38 AM PET | | Data directory: /var/lib/boinc-client
Mon 14 Mar 2016 09:25:38 AM PET | | CUDA: NVIDIA GPU 0: GeForce GTX 570 (driver version unknown, CUDA version 7.5, compute capability 2.0, 1279MB, 1144MB available, 1530 GFLOPS peak)
Mon 14 Mar 2016 09:25:38 AM PET | | OpenCL: NVIDIA GPU 0: GeForce GTX 570 (driver version 352.63, device version OpenCL 1.1 CUDA, 1279MB, 1144MB available, 1530 GFLOPS peak)
Mon 14 Mar 2016 09:25:38 AM PET | | Host name: kle1boinc-GA-970A-D3
Mon 14 Mar 2016 09:25:38 AM PET | | Processor: 6 AuthenticAMD AMD FX(tm)-6100 Six-Core Processor [Family 21 Model 1 Stepping 2]
Mon 14 Mar 2016 09:25:38 AM PET | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nonstop_tsc extd_apicid aperfmperf pni pclmulqdq monitor ssse3 cx16 sse4_1 sse4_2 popcnt aes xsave avx lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs xop skinit wdt lwp fma4 nodeid_msr topoext perfctr_core perfctr_nb arat cpb hw_pstate npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold vmmcall
Mon 14 Mar 2016 09:25:38 AM PET | | OS: Linux: 3.19.0-32-generic
Mon 14 Mar 2016 09:25:38 AM PET | | Memory: 7.89 GB physical, 7.98 GB virtual
Mon 14 Mar 2016 09:25:38 AM PET | | Disk: 109.15 GB total, 98.89 GB free
Mon 14 Mar 2016 09:25:38 AM PET | | Local time is UTC -5 hours
Mon 14 Mar 2016 09:25:38 AM PET | | Config: GUI RPCs allowed from:
Mon 14 Mar 2016 09:25:38 AM PET | SETI@home | URL http://setiathome.berkeley.edu/; Computer ID 7953465; resource share 100
Mon 14 Mar 2016 09:25:38 AM PET | PrimeGrid | URL http://www.primegrid.com/; Computer ID 511403; resource share 0
Mon 14 Mar 2016 09:25:38 AM PET | | General prefs: from http://www.malariacontrol.net/ (last modified 08-May-2012 21:54:17)
Mon 14 Mar 2016 09:25:38 AM PET | | Host location: none
Mon 14 Mar 2016 09:25:38 AM PET | | General prefs: using your defaults
Mon 14 Mar 2016 09:25:38 AM PET | | Reading preferences override file
Mon 14 Mar 2016 09:25:38 AM PET | | Preferences:
Mon 14 Mar 2016 09:25:38 AM PET | | max memory usage when active: 4037.73MB
Mon 14 Mar 2016 09:25:38 AM PET | | max memory usage when idle: 7267.92MB
Mon 14 Mar 2016 09:25:38 AM PET | | max disk usage: 54.57GB
Mon 14 Mar 2016 09:25:38 AM PET | | max CPUs used: 5
Mon 14 Mar 2016 09:25:38 AM PET | | suspend work if non-BOINC CPU load exceeds 50%
Mon 14 Mar 2016 09:25:38 AM PET | | max download rate: 409600 bytes/sec
Mon 14 Mar 2016 09:25:38 AM PET | | max upload rate: 204800 bytes/sec
Mon 14 Mar 2016 09:25:38 AM PET | | (to change preferences, visit a project web site or select Preferences in the Manager)
Mon 14 Mar 2016 09:25:38 AM PET | | gui_rpc_auth.cfg is empty - no GUI RPC password protection
Mon 14 Mar 2016 09:25:38 AM PET | | Not using a proxy
Mon 14 Mar 2016 09:25:39 AM PET | SETI@home | Sending scheduler request: To fetch work.
Mon 14 Mar 2016 09:25:39 AM PET | SETI@home | Requesting new tasks for NVIDIA
Mon 14 Mar 2016 09:25:43 AM PET | SETI@home | Scheduler request completed: got 0 new tasks
Mon 14 Mar 2016 09:29:48 AM PET | | Fetching configuration file from http://www.gpugrid.net/get_project_config.php
Mon 14 Mar 2016 09:30:35 AM PET | GPUGRID | Master file download succeeded
Mon 14 Mar 2016 09:30:40 AM PET | GPUGRID | Sending scheduler request: Project initialization.
Mon 14 Mar 2016 09:30:40 AM PET | GPUGRID | Requesting new tasks for CPU and NVIDIA
Mon 14 Mar 2016 09:30:45 AM PET | GPUGRID | Scheduler request completed: got 0 new tasks
Mon 14 Mar 2016 09:30:45 AM PET | GPUGRID | No tasks sent
Mon 14 Mar 2016 09:30:45 AM PET | GPUGRID | No tasks are available for Long runs (8-12 hours on fastest card)
Mon 14 Mar 2016 09:30:47 AM PET | GPUGRID | Started download of logogpugrid.png
Mon 14 Mar 2016 09:30:47 AM PET | GPUGRID | Started download of project_1.png
Mon 14 Mar 2016 09:30:49 AM PET | GPUGRID | Finished download of logogpugrid.png
Mon 14 Mar 2016 09:30:49 AM PET | GPUGRID | Started download of project_2.png
Mon 14 Mar 2016 09:30:52 AM PET | GPUGRID | Finished download of project_1.png
Mon 14 Mar 2016 09:30:52 AM PET | GPUGRID | Finished download of project_2.png
Mon 14 Mar 2016 09:30:52 AM PET | GPUGRID | Started download of project_3.png
Mon 14 Mar 2016 09:31:20 AM PET | GPUGRID | Sending scheduler request: To fetch work.
Mon 14 Mar 2016 09:31:20 AM PET | GPUGRID | Requesting new tasks for NVIDIA
Mon 14 Mar 2016 09:31:25 AM PET | GPUGRID | Scheduler request completed: got 0 new tasks
Mon 14 Mar 2016 09:31:25 AM PET | GPUGRID | No tasks sent
Mon 14 Mar 2016 09:31:25 AM PET | GPUGRID | No tasks are available for Long runs (8-12 hours on fastest card)
Mon 14 Mar 2016 09:44:11 AM PET | GPUGRID | Sending scheduler request: To fetch work.
Mon 14 Mar 2016 09:44:11 AM PET | GPUGRID | Requesting new tasks for NVIDIA
Mon 14 Mar 2016 09:44:14 AM PET | GPUGRID | Scheduler request completed: got 0 new tasks
Mon 14 Mar 2016 09:44:14 AM PET | GPUGRID | No tasks sent
Mon 14 Mar 2016 09:44:14 AM PET | GPUGRID | No tasks are available for Long runs (8-12 hours on fastest card)

klepel
Send message
Joined: 23 Dec 09
Posts: 126
Credit: 1,699,934,037
RAC: 1,840,431
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 43060 - Posted: 21 Mar 2016 | 15:05:06 UTC

Dear all,

As I have not received an answer to my post above. I would like to repeat my request. Does anybody have an idea, why I can't get WUs on this Linux System? Is it because of the type of the card? Shall I try with a newer generation GTX670? Or is it because of the 32 Bit Linux?

Any comment would be highly appreciated! Thanks.

fractal
Send message
Joined: 16 Aug 08
Posts: 86
Credit: 633,657,950
RAC: 1,187,783
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 43067 - Posted: 22 Mar 2016 | 22:37:47 UTC

Your 32 bit linux is a problem. There are no 32 bit linux apps listed on https://www.gpugrid.net/apps.php

I also don't know if compute capability 2.0 is enough any more. There was a thread listing the minimum but I can't find it any more.

And, if that's not enough, your driver version is not listed.

Your log says

Mon 14 Mar 2016 09:25:38 AM PET | | CUDA: NVIDIA GPU 0: GeForce GTX 570 (driver version unknown, CUDA version 7.5, compute capability 2.0, 1279MB, 1144MB available, 1530 GFLOPS peak)

but mine says

14-Mar-2016 22:04:40 [---] CUDA: NVIDIA GPU 0: GeForce GTX 750 Ti (driver version 355.11, CUDA version 7.5, compute capability 5.0, 2047MB, 2010MB available, 1388 GFLOPS peak)


and it is useful to know what driver version you are running.

Profile skgiven
Volunteer moderator
Project tester
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3967
Credit: 1,803,940,389
RAC: 502,063
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 43068 - Posted: 23 Mar 2016 | 13:28:53 UTC - in response to Message 43067.

fractal spotted it; 3.19.0-32-generic

x86 Linux is a non-starter and the GPU needs to be CC3.0 or above:

https://www.gpugrid.net/join.php


    Supported OS

      Linux 64-bit
      Windows 32/64-bit



    GPU: NVIDIA Kepler GPU (CC3.0) (Geforce 600 series and later)



FAQ - Recommended GPUs for GPUGrid crunching

Your system might still be useful at one of the other Boinc GPU projects.
____________
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help

Post to thread

Message boards : News : Probable access problems on 9th Dec