Advanced search

Message boards : Server and website : Unable to load units

Author Message
Profile Zarck
Send message
Joined: 16 Aug 08
Posts: 144
Credit: 327,844,195
RAC: 13,278
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 51498 - Posted: 15 Feb 2019 | 23:47:19 UTC

Unable to load units.

Loading starts and then stops ... a server problem ?

https://www.dropbox.com/s/p1y5n4rjoi9b5r6/Capture%20d%27%C3%A9cran%202019-02-16%2000.40.01.png?dl=0

____________

Profile [AF>Amis des Lapins] Phil...
Send message
Joined: 16 Jul 13
Posts: 53
Credit: 1,522,226,036
RAC: 611
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 51503 - Posted: 16 Feb 2019 | 11:38:19 UTC - in response to Message 51498.

Same problem. loading speed below 1kbps

jp de malo
Send message
Joined: 3 Jun 10
Posts: 4
Credit: 1,142,601,314
RAC: 261,327
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 51520 - Posted: 17 Feb 2019 | 19:34:44 UTC

Same problem. loading speed below 1kbps

jp de malo
Send message
Joined: 3 Jun 10
Posts: 4
Credit: 1,142,601,314
RAC: 261,327
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 51619 - Posted: 11 Mar 2019 | 20:37:22 UTC

C'etait revenue puis c'est repartie depuis 3 jours telechargement au ralenti
mes cartes graphiques sont au chomage tous les autres projet fonctionnent mormalement

Profile Zarck
Send message
Joined: 16 Aug 08
Posts: 144
Credit: 327,844,195
RAC: 13,278
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 51625 - Posted: 13 Mar 2019 | 9:11:56 UTC

Again very low...

Profile Zarck
Send message
Joined: 16 Aug 08
Posts: 144
Credit: 327,844,195
RAC: 13,278
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 51626 - Posted: 13 Mar 2019 | 13:15:01 UTC
Last modified: 13 Mar 2019 | 13:16:23 UTC

Téléchargement encore très lent.

https://forum.boinc-af.org

Bienvenue à tous les François et Françoises.

@+
*_*
____________

Jim1348
Send message
Joined: 28 Jul 12
Posts: 733
Credit: 1,478,749,566
RAC: 142,659
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 51627 - Posted: 13 Mar 2019 | 13:16:20 UTC

I haven't seen the problem yet in the U.S.

Erich56
Send message
Joined: 1 Jan 15
Posts: 697
Credit: 3,294,215,935
RAC: 399,269
Level
Arg
Scientific publications
watwatwatwatwatwat
Message 51637 - Posted: 17 Mar 2019 | 6:47:19 UTC

here, too, I had download problems just a few minutes ago.

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 9,783
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 51639 - Posted: 18 Mar 2019 | 15:17:01 UTC - in response to Message 51627.

I haven't seen the problem yet in the U.S.

I sure have. It's a consistent problem. Have tried all kinds of DNS servers hoping to alleviate the problem somewhat. To no avail... :-(
It's not on our end. This is another problem that's been going on for years and is still not fixed. There was some chatter a few years ago about having the university IT department look into it but as far as I can see nothing has improved.

Jim1348
Send message
Joined: 28 Jul 12
Posts: 733
Credit: 1,478,749,566
RAC: 142,659
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 51641 - Posted: 18 Mar 2019 | 18:04:53 UTC - in response to Message 51639.

I have problems accessing the forums occasionally. Sometimes I can barely post. And I have problems accessing my statistics or setup page.

But the work units upload and download fine whenever I check them. If there are problems, they are hidden from me, which is just as well.

Erich56
Send message
Joined: 1 Jan 15
Posts: 697
Credit: 3,294,215,935
RAC: 399,269
Level
Arg
Scientific publications
watwatwatwatwatwat
Message 51649 - Posted: 24 Mar 2019 | 6:39:38 UTC

The download problems still persist.
Last night and this morning, the download of tasks took up to almost one hour.

Jim1348
Send message
Joined: 28 Jul 12
Posts: 733
Credit: 1,478,749,566
RAC: 142,659
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 51650 - Posted: 24 Mar 2019 | 16:14:57 UTC - in response to Message 51649.

The download problems still persist.
Last night and this morning, the download of tasks took up to almost one hour.

There is a router problem somewhere. It affects you differently than me. But it seems to affect everyone somehow.

Billy Ewell 1931
Send message
Joined: 22 Oct 10
Posts: 28
Credit: 330,865,974
RAC: 531
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 52622 - Posted: 13 Sep 2019 | 3:08:03 UTC
Last modified: 13 Sep 2019 | 3:16:53 UTC

I reinitiated coverage of GPUGrid about the end of August with four computers and each with its own GPU: Intel Dual Core, Windows XP and GTX 750ti; I3, Windows 10 and GTX1060; I7, Windows 10 and RTX 2080; And Intel Xeon, Windows Vista and GTX1060.

The problem is: The "XEON" powered machine and its GTX 1060 GPU has not downloaded a single task even though dozens of requests to do so occured and the same answers are always given: Either there are (1) No tasks available to meet my specifications OR (2)as I requested:

9/12/2019 9:39:26 PM | GPUGRID | No tasks are available for Short runs (2-3 hours on fastest card)
9/12/2019 9:39:26 PM | GPUGRID | No tasks are available for Long runs (8-12 hours on fastest card)

In the latter answer posted above there are many tasks available and have been available on and off for the last few days, including concurrently with the writing of this observation. I naturally have checked that all boxes are correctly checked, and all preferences are correctly applied.Also I ran tests on several other Nvidia GPU project sites to verify downloading capability from at least 3-4 and found no problem. Would a technician at Grid check this out please as certainly it appears to be a server problem; at least the server is absolutely reporting the status of tasks incorrectly!

Should someone have an idea, I am ready to receive suggestions.By the way, the other three machines are downloading tasks as they become available: The New ACEMDs for the 2080 and the Long and Short tasks for the others.

a Great Project.
PS: Over 100 Long tasks are currently available.

Profile robertmiles
Send message
Joined: 16 Apr 09
Posts: 482
Credit: 554,467,553
RAC: 15,976
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 52623 - Posted: 13 Sep 2019 | 14:19:34 UTC - in response to Message 52622.

I reinitiated coverage of GPUGrid about the end of August with four computers and each with its own GPU: Intel Dual Core, Windows XP and GTX 750ti; I3, Windows 10 and GTX1060; I7, Windows 10 and RTX 2080; And Intel Xeon, Windows Vista and GTX1060.

The problem is: The "XEON" powered machine and its GTX 1060 GPU has not downloaded a single task even though dozens of requests to do so occured and the same answers are always given: Either there are (1) No tasks available to meet my specifications OR (2)as I requested:

[snip]

Some of the Xeon processors include graphics capabilities. You may need to check how many GPUs BOINC thinks you have, and if it sees more than one, tell it to turn off use of the one inside the Xeon processor.

Aurum
Avatar
Send message
Joined: 12 Jul 17
Posts: 252
Credit: 9,791,563,847
RAC: 3,936,285
Level
Tyr
Scientific publications
wat
Message 52624 - Posted: 13 Sep 2019 | 14:38:26 UTC - in response to Message 52623.

Some of the Xeon processors include graphics capabilities.
None of my v3 & v4 Xeons have CPU graphics. There's a couple of v2 Xeons with graphics. Which Xeons are you referring to???

____________

Profile robertmiles
Send message
Joined: 16 Apr 09
Posts: 482
Credit: 554,467,553
RAC: 15,976
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 52625 - Posted: 13 Sep 2019 | 16:42:14 UTC - in response to Message 52624.

I was referring to the one Billy Ewell has.

That computer shows:

GenuineIntel
Intel(R) Xeon(R) CPU W3520 @ 2.67GHz [Family 6 Model 26 Stepping 5]

I looked up that CPU, and found nothing on whether it has any CPU graphics capabilities or not. I assume that it doesn't.

Two more things to check:

Make sure that the GTX 1060 driver in use was downloaded from Nvidia, rather than from Microsoft. The version from Microsoft usually has CUDA and OpenCL sections removed.

The most recent such driver that is compatible with Vista is known to has at least one bug in the OpenCL section. It may be necessary to find and install a version of the driver that was released, but not the most recent release.

Billy Ewell 1931
Send message
Joined: 22 Oct 10
Posts: 28
Credit: 330,865,974
RAC: 531
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 52626 - Posted: 13 Sep 2019 | 18:58:08 UTC

Thanks to: Auram and Robert Miles.

The only thing I changed on the Xeon/GTX1060 computer is to do a "clean install" on the driver(from EVGA) which honestly is several years old. Of course now, there is no way to check on download progress because no tasks are available. But recall what I reported below before: Namely, my machine sends a request for GPUGrid tasks of the Short Run or Long Run category, just like the ones that have been running on two of my other computers, and the answer from the GG server is(shortened): No Short Run tasks and or No Long Run tasks are available. And as I pointed out previously, in many cases when this exact circumstance occurs there are over a combined 100 or so tasks of those types requested waiting in the server inventory. It makes no sense or logic for the server to report incorrectly to a valid request. And, therefore, I honestly don't see how my equipment would be the problem.Should the limitation be within my system then the reply from the server, typically printed in RED, is paraphrasing: "No Tasks are available that meet your preferences", etc.

Another point concerning graphics: When I bought this refurbished equipment, an EVGA GTX 46X GPU was already installed so I can rationalize that any graphic conflict would not exist.

Again thanks and hopefully GG Admin will reply; if not then I will move on.

Aurum
Avatar
Send message
Joined: 12 Jul 17
Posts: 252
Credit: 9,791,563,847
RAC: 3,936,285
Level
Tyr
Scientific publications
wat
Message 52627 - Posted: 13 Sep 2019 | 19:14:05 UTC

What Nvidia driver are you using on that 1060???
As I recall current wisdom is that:
431.30 Good
436.xx Bad
Also Xeon CPUs must have the latest BIOS dated summer 2018 or later.
____________

Billy Ewell 1931
Send message
Joined: 22 Oct 10
Posts: 28
Credit: 330,865,974
RAC: 531
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 52628 - Posted: 13 Sep 2019 | 20:45:08 UTC - in response to Message 52627.

The Driver is 368.81 which I derived directly from NVIDIA. I will check the BIOS later over the next hours and update accordingly. And thank you bunches!
The driver was a clean install.
But I am not sure that the driver or bios will solve the problem. One of my machines is half as old as Moses and it is processing GPUGrid just perfectly.

Thanks again. I really enjoy the BOINC driven distributed computing activity.
By the way, the numeric value in my ID is my year of birth. Lucky man!

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2185
Credit: 15,823,750,346
RAC: 732,335
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 52632 - Posted: 15 Sep 2019 | 17:03:59 UTC - in response to Message 52628.

The Driver is 368.81 which I derived directly from NVIDIA.

I'm sure this old driver version is the reason for the server to not send work for your host. NVidia dropped Windows Vista support in July 2016.
You should install Windows 10 or Linux on that PC.

jiipee
Send message
Joined: 4 Jun 15
Posts: 4
Credit: 2,244,048,638
RAC: 2,235,875
Level
Phe
Scientific publications
watwatwatwatwatwat
Message 53374 - Posted: 19 Dec 2019 | 11:04:33 UTC

Some WU's have downloaded veeeeery slowly. This seems to affect only my single Windows box, not Linux boxes (three Linuxes on same network with Win10).

My Win10 usually works like this:

Task / WU / Sent / Reported / Status / Run time / CPU time / Credit / App

21568109 / 16907517 / 16 Dec 2019 | 17:48:59 UTC / 16 Dec 2019 | 21:19:01 UTC / Completed and validated / 7,958.60 / 7,862.38 / 91,500.00 / New version of ACEMD v2.10 (cuda101)

but when downloading starts to take time, tasks start to look like this:

21568706 / 16908048 / 17 Dec 2019 | 5:42:39 UTC / 19 Dec 2019 | 7:42:10 UTC / Completed and validated / 7,959.27 / 7,860.33 / 61,000.00 / New version of ACEMD v2.10 (cuda101)

If I recall correctly, suspended download retry time is something like several hours. If I manually request it to continue, it usually advances normally. Is there a way to edit download properties somewhere to get the retry interval shorter?

Br, Jukka

Keith Myers
Send message
Joined: 13 Dec 17
Posts: 508
Credit: 524,027,144
RAC: 1,662,925
Level
Lys
Scientific publications
wat
Message 53375 - Posted: 19 Dec 2019 | 15:01:41 UTC - in response to Message 53374.

Is there a way to edit download properties somewhere to get the retry interval shorter?


No. The server sets the backoff interval. Not the client.

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1003
Credit: 2,522,838,291
RAC: 3,286,983
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 53376 - Posted: 19 Dec 2019 | 16:58:00 UTC - in response to Message 53375.

Is there a way to edit download properties somewhere to get the retry interval shorter?

No. The server sets the backoff interval. Not the client.

That's a different backoff (scheduler for update / request work, not download).

The client sets the download backoff, and it starts small with the first failure, then gets longer after each attempt: the idea is to reduce congestion if it's a general problem and everyone starts hammering on the server at once.

To reach an elapsed time of 50 hours instead of 4 hours, there must have been multiple failures (they'll be in your Event Log).

One observation I have with multiple Windows machines is that the GPUGrid server doesn't like it when different computers try to connect from the same LAN / IP address in quick succession. It only takes a couple of minutes to clear, but it's very annoying: just had it at the beginning of this forum session. I can only suggest that you check the Win 10 download status any time you happen to be passing: click update on any stalled download, and keep clicking until all the stalled files have transferred.

Keith Myers
Send message
Joined: 13 Dec 17
Posts: 508
Credit: 524,027,144
RAC: 1,662,925
Level
Lys
Scientific publications
wat
Message 53377 - Posted: 20 Dec 2019 | 1:36:45 UTC - in response to Message 53376.

One observation I have with multiple Windows machines is that the GPUGrid server doesn't like it when different computers try to connect from the same LAN / IP address in quick succession. It only takes a couple of minutes to clear, but it's very annoying: just had it at the beginning of this forum session.

Thanks for that observation. And the client correction. I wondered if it was just me or does everyone have that problem of multiple hosts trying to connect from the same LAN IP address range and the schedulers just ignoring the host request.

I've learned to wait a couple of minutes after a host has had the double scheduler connect and gone quiescent again before I attempt an update on another host.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2185
Credit: 15,823,750,346
RAC: 732,335
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 53378 - Posted: 20 Dec 2019 | 17:47:32 UTC - in response to Message 53377.

One observation I have with multiple Windows machines is that the GPUGrid server doesn't like it when different computers try to connect from the same LAN / IP address in quick succession. It only takes a couple of minutes to clear, but it's very annoying: just had it at the beginning of this forum session.

Thanks for that observation. And the client correction. I wondered if it was just me or does everyone have that problem of multiple hosts trying to connect from the same LAN IP address range and the schedulers just ignoring the host request.

I've learned to wait a couple of minutes after a host has had the double scheduler connect and gone quiescent again before I attempt an update on another host.
+1

Erich56
Send message
Joined: 1 Jan 15
Posts: 697
Credit: 3,294,215,935
RAC: 399,269
Level
Arg
Scientific publications
watwatwatwatwatwat
Message 53379 - Posted: 20 Dec 2019 | 17:54:01 UTC - in response to Message 53378.

One observation I have with multiple Windows machines is that the GPUGrid server doesn't like it when different computers try to connect from the same LAN / IP address in quick succession. It only takes a couple of minutes to clear, but it's very annoying: just had it at the beginning of this forum session.

Thanks for that observation. And the client correction. I wondered if it was just me or does everyone have that problem of multiple hosts trying to connect from the same LAN IP address range and the schedulers just ignoring the host request.

I've learned to wait a couple of minutes after a host has had the double scheduler connect and gone quiescent again before I attempt an update on another host.
+1

+1

Killersocke
Send message
Joined: 18 Oct 13
Posts: 51
Credit: 333,404,147
RAC: 3,355
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 53395 - Posted: 27 Dec 2019 | 7:47:05 UTC - in response to Message 51498.

Same here...

27.12.2019 08:39:38 | GPUGRID | Requesting new tasks for CPU and NVIDIA GPU
27.12.2019 08:39:41 | | Project communication failed: attempting access to reference site
27.12.2019 08:39:42 | | Internet access OK - project servers may be temporarily down.
27.12.2019 08:40:00 | GPUGRID | Scheduler request failed: Couldn't connect to server
27.12.2019 08:40:01 | | Project communication failed: attempting access to reference site
27.12.2019 08:40:02 | | Internet access OK - project servers may be temporarily down.

Erich56
Send message
Joined: 1 Jan 15
Posts: 697
Credit: 3,294,215,935
RAC: 399,269
Level
Arg
Scientific publications
watwatwatwatwatwat
Message 53396 - Posted: 27 Dec 2019 | 12:58:38 UTC - in response to Message 53395.

Same here...

The GPUGRID server as well as the website were down this morning for some short time.
But everything okay since then.

jjch
Send message
Joined: 10 Nov 13
Posts: 38
Credit: 12,950,209,824
RAC: 6,169,250
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 53400 - Posted: 27 Dec 2019 | 21:48:53 UTC - in response to Message 52624.

Only a few of the Xeon E3 - V2, V5 and V6's include the integrated Intel P4000, HD P530 or Iris Pro P580 graphics.

https://www.intel.com/content/dam/www/public/us/en/documents/guides/hd-graphics-p530-p580-performance-guide.pdf

Profile ServicEnginIC
Avatar
Send message
Joined: 24 Sep 10
Posts: 198
Credit: 1,456,178,911
RAC: 932,972
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 53629 - Posted: 9 Feb 2020 | 12:10:05 UTC - in response to Message 53379.

One observation I have with multiple Windows machines is that the GPUGrid server doesn't like it when different computers try to connect from the same LAN / IP address in quick succession. It only takes a couple of minutes to clear, but it's very annoying: just had it at the beginning of this forum session.

Thanks for that observation. And the client correction. I wondered if it was just me or does everyone have that problem of multiple hosts trying to connect from the same LAN IP address range and the schedulers just ignoring the host request.

I've learned to wait a couple of minutes after a host has had the double scheduler connect and gone quiescent again before I attempt an update on another host.

+1

+1

+1
It affects not only to scheduler requests, but also to the other GPUGrid's Web services.

Example: One of my hosts has recently asked for new tasks, and I try to access GPUGrid's webpage from another host in the same network, then access will not be possible until a certain delay is past.
Possibly it is due to some DoS (Denial of Service attack) protection running on GPUGrid's server or firewall.

It's been commented in some other posts like this:
It's not just the speed.
There's some DDOS prevention algorithm in operation, because my hosts gets blocked if they try to contact the server one by one in rapid succession (from the same public IP address).

What can we do to mitigate this effect???

There's no easy way to fix this in our end.


Edit:
Second example: I've had to publish this post twice, with a pause in between. First time was not attended by server...

Post to thread

Message boards : Server and website : Unable to load units