Message boards : Server and website : Server status

Profile robertmiles
Joined: 16 Apr 09
Posts: 482
Credit: 554,467,553
RAC: 15,976
Level
Lys
Message 53523 - Posted: 28 Jan 2020 | 16:40:25 UTC

Problems on your server status web page:

The "Detailed computing status" section only shows Long runs and Short runs, which are no longer being issued. Could you add "New version of ACEMD"?

Also, could you mention which type of run is used for the new MDAD experiment?

Toni
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 9 Dec 08
Posts: 947
Credit: 4,353,973
RAC: 71
Level
Ala
Message 53524 - Posted: 28 Jan 2020 | 16:44:21 UTC - in response to Message 53523.

That will need updating, when we have time. MDAD are on the short-ish side but with much variability. See "news".

Profile Retvari Zoltan
Joined: 20 Jan 09
Posts: 2185
Credit: 15,823,741,735
RAC: 734,144
Level
Trp
Message 53528 - Posted: 28 Jan 2020 | 18:03:26 UTC - in response to Message 53524.
Last modified: 28 Jan 2020 | 18:04:55 UTC

That will need updating, when we have time.
Is it viable to deprecate the old long-run and short-run applications and put acemd3 in their place, then queue the new tasks as long runs? That way the server status and performance pages would work as they were intended to.
EDIT: next time maybe...

jth
Joined: 30 Nov 19
Posts: 1
Credit: 1,332,049
RAC: 0
Level
Ala
Message 53542 - Posted: 29 Jan 2020 | 10:21:54 UTC - in response to Message 53524.

That will need updating, when we have time

May I point out that this is not a one-way street? Out here we are donating computing time = money on our electricity bill.
I would expect an attitude that expresses some thankfulness towards the users and gives them something back. You cannot expect people to continue giving you gifts if you do not give them something back.

Your status pages are really bad, with data dating back to 2014!



Spatzthecat
Joined: 26 Nov 09
Posts: 33
Credit: 1,282,327,306
RAC: 0
Level
Met
Message 53556 - Posted: 29 Jan 2020 | 20:13:11 UTC - in response to Message 53542.

WOW!

Profile robertmiles
Joined: 16 Apr 09
Posts: 482
Credit: 554,467,553
RAC: 15,976
Level
Lys
Message 53886 - Posted: 10 Mar 2020 | 23:36:41 UTC

Is the server currently down? When I try to check the server state, I get a white screen, with no information.

Toni
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 9 Dec 08
Posts: 947
Credit: 4,353,973
RAC: 71
Level
Ala
Message 53887 - Posted: 11 Mar 2020 | 8:56:40 UTC - in response to Message 53886.

It's a tad slow from here, but not stuck at all.

Keith Myers
Joined: 13 Dec 17
Posts: 508
Credit: 523,955,637
RAC: 1,662,235
Level
Lys
Message 53891 - Posted: 12 Mar 2020 | 1:08:27 UTC

Uploads have been sticky all day and going into backoff.

Keith Myers
Joined: 13 Dec 17
Posts: 508
Credit: 523,955,637
RAC: 1,662,235
Level
Lys
Message 53892 - Posted: 12 Mar 2020 | 2:34:36 UTC

GPUGRID 1gzhB01_450_0-TONI_MDADpr4sg-7-10-RND8412_0_0 0.000 3.17 K 00:04:00 - 01:17:51 0.00 Kbps Upload pending (Project backoff: 04:47:46) Numbskull
GPUGRID 1gzhB01_450_0-TONI_MDADpr4sg-7-10-RND8412_0_1 0.000 677.98 K 00:04:00 - 01:16:12 0.00 Kbps Upload pending (Project backoff: 04:47:46) Numbskull
GPUGRID 1gzhB01_450_0-TONI_MDADpr4sg-7-10-RND8412_0_2 0.000 677.98 K 00:04:00 - 01:15:19 0.00 Kbps Upload pending (Project backoff: 04:47:46) Numbskull
GPUGRID 1gzhB01_450_0-TONI_MDADpr4sg-7-10-RND8412_0_8 0.000 3390.39 K 00:04:00 - 01:15:39 0.00 Kbps Upload pending (Project backoff: 04:47:46) Numbskull
GPUGRID 1gzhB01_450_0-TONI_MDADpr4sg-7-10-RND8412_0_9 0.000 1199.68 K 00:04:00 - 01:17:31 0.00 Kbps Upload pending (Project backoff: 04:47:46) Numbskull
GPUGRID 1gzhB01_450_0-TONI_MDADpr4sg-7-10-RND8412_0_10 0.000 0.27 K 00:04:00 - 01:18:45 0.00 Kbps Upload pending (Project backoff: 04:47:46) Numbskull
GPUGRID 1kx3D00_450_0-TONI_MDADpr4sk-6-10-RND1178_0_0 0.000 3.17 K 00:04:01 - 00:21:24 0.00 Kbps Upload pending (Project backoff: 04:47:46) Numbskull
GPUGRID 1kx3D00_450_0-TONI_MDADpr4sk-6-10-RND1178_0_1 0.000 583.29 K 00:04:01 - 00:18:37 0.00 Kbps Upload pending (Project backoff: 04:47:46) Numbskull
GPUGRID 1kx3D00_450_0-TONI_MDADpr4sk-6-10-RND1178_0_2 0.000 583.29 K 00:02:01 0.00 Kbps Upload pending (Project backoff: 04:47:46) Numbskull
GPUGRID 1kx3D00_450_0-TONI_MDADpr4sk-6-10-RND1178_0_8 0.000 2916.95 K 00:02:01 0.00 Kbps Upload pending (Project backoff: 04:47:46) Numbskull
GPUGRID 1kx3D00_450_0-TONI_MDADpr4sk-6-10-RND1178_0_9 0.000 1031.27 K 00:02:01 0.00 Kbps Upload pending (Project backoff: 04:47:46) Numbskull
GPUGRID 1kx3D00_450_0-TONI_MDADpr4sk-6-10-RND1178_0_10 0.000 0.27 K 00:02:01 0.00 Kbps Upload pending (Project backoff: 04:47:46) Numbskull
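For anyone who wants to total up how long a host is locked out, the backoff can be pulled out of transfer lines like the ones above. This is only a rough sketch: it assumes the "(Project backoff: HH:MM:SS)" text appears exactly as pasted here, and the function name `backoff_of` is made up for illustration, not part of BOINC.

```python
import re
from datetime import timedelta

# Matches the "(Project backoff: HH:MM:SS)" part of a pasted transfer line.
# Assumption: the text format is exactly as shown in the listing above.
BACKOFF_RE = re.compile(r"Project backoff: (\d+):(\d{2}):(\d{2})")

def backoff_of(line):
    """Return the project backoff found in a transfer line, or None if absent."""
    match = BACKOFF_RE.search(line)
    if match is None:
        return None
    hours, minutes, seconds = (int(group) for group in match.groups())
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)

# One line copied from the listing above.
sample = (
    "GPUGRID 1gzhB01_450_0-TONI_MDADpr4sg-7-10-RND8412_0_0 0.000 3.17 K "
    "00:04:00 - 01:17:51 0.00 Kbps Upload pending "
    "(Project backoff: 04:47:46) Numbskull"
)
print(backoff_of(sample))
```

Lines without a backoff simply return None, so the same function can be run over a whole pasted transfers list.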

Keith Myers
Joined: 13 Dec 17
Posts: 508
Credit: 523,955,637
RAC: 1,662,235
Level
Lys
Message 53894 - Posted: 12 Mar 2020 | 14:17:52 UTC

Out of work on one host and pages of backed off uploads on the others. Cache dropping because unable to report and retrieve new work.

Keith Myers
Joined: 13 Dec 17
Posts: 508
Credit: 523,955,637
RAC: 1,662,235
Level
Lys
Message 53909 - Posted: 13 Mar 2020 | 15:09:04 UTC

Looks like the web server crash also pulled down the upload and download servers.

Finally was able to clear the backlog this morning and retrieve new work.

Aurum
Joined: 12 Jul 17
Posts: 252
Credit: 9,791,563,847
RAC: 3,936,285
Level
Tyr
Message 53926 - Posted: 15 Mar 2020 | 15:05:21 UTC - in response to Message 53542.

That will need updating, when we have time

May I point out that this is not a one-way street? Out here we are donating computing time = money on our electricity bill.
I would expect an attitude that expresses some thankfulness towards the users and gives them something back. You cannot expect people to continue giving you gifts if you do not give them something back.

Your status pages are really bad, with data dating back to 2014!

Don't let the door spank you on your way out.

Richard Haselgrove
Joined: 11 Jul 09
Posts: 1003
Credit: 2,522,603,430
RAC: 3,278,279
Level
Phe
Message 53928 - Posted: 15 Mar 2020 | 22:08:37 UTC - in response to Message 53909.

Looks like the web server crash also pulled down the upload and download servers.

Finally was able to clear the backlog this morning and retrieve new work.

Er - there's only one server here, it's called grosso. If it's down, it's down.

Keith Myers
Joined: 13 Dec 17
Posts: 508
Credit: 523,955,637
RAC: 1,662,235
Level
Lys
Message 53930 - Posted: 16 Mar 2020 | 20:50:18 UTC - in response to Message 53928.

Thanks for straightening me out, Richard. I never noticed there was only one server for everything. Or is there? The server page lists the web server as www.gpugrid.net and not as grosso. Is it a separate host?

Richard Haselgrove
Joined: 11 Jul 09
Posts: 1003
Credit: 2,522,603,430
RAC: 3,278,279
Level
Phe
Message 53931 - Posted: 16 Mar 2020 | 22:53:14 UTC - in response to Message 53930.

I'd say they were the same.

C:\>ping www.gpugrid.net

Pinging www.gpugrid.net [84.89.134.145] with 32 bytes of data:


C:\>ping -a 84.89.134.145

Pinging grosso.upf.edu [84.89.134.145] with 32 bytes of data:
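The same check can be scripted without ping. The sketch below does the forward lookup and then the reverse lookup with Python's standard `socket` module; the function name `forward_and_reverse` and its injectable resolver parameters are illustrative choices, there only so the logic can be tried without a network connection.

```python
import socket

def forward_and_reverse(name,
                        resolve=socket.gethostbyname,
                        reverse=socket.gethostbyaddr):
    """Resolve a host name to an IPv4 address, then look that address back up.

    Returns (ip_address, reverse_name). If two names share an address and the
    reverse name is one of them, they are very likely the same machine.
    """
    ip_address = resolve(name)
    # gethostbyaddr returns (hostname, alias_list, ip_address_list).
    reverse_name, _aliases, _addresses = reverse(ip_address)
    return ip_address, reverse_name

# Example call (needs network access, and the result may differ today):
# forward_and_reverse("www.gpugrid.net")
```

At the time of the pings above, this lookup would have corresponded to 84.89.134.145 and grosso.upf.edu. A reverse lookup only reports what the address owner has published in DNS, so treat a match as strong evidence rather than proof that the two names are one host.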
