Advanced search

Message boards : Number crunching : Error PAOLA_RNP (cancel freely) // Addressed & fixed

Author Message
Bedrich Hajek
Send message
Joined: 28 Mar 09
Posts: 485
Credit: 11,087,837,097
RAC: 15,639,702
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 24795 - Posted: 8 May 2012 | 10:02:01 UTC

Is there something wrong with these units? I had all 3 crash, with this error message.

5/8/2012 3:26:52 AM | GPUGRID | Computation for task 1H46_11_8-PAOLA_RNP-0-5-RND3163_0 finished
5/8/2012 3:26:52 AM | GPUGRID | Output file 1H46_11_8-PAOLA_RNP-0-5-RND3163_0_4 for task 1H46_11_8-PAOLA_RNP-0-5-RND3163_0 exceeds size limit.
5/8/2012 3:26:52 AM | GPUGRID | File size: 131283476.000000 bytes. Limit: 128000000.000000 bytes

See links:


http://www.gpugrid.net/result.php?resultid=5339678
http://www.gpugrid.net/result.php?resultid=5340152
http://www.gpugrid.net/result.php?resultid=5340071

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,206,655,749
RAC: 261,147
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 24796 - Posted: 8 May 2012 | 11:43:00 UTC - in response to Message 24795.
Last modified: 8 May 2012 | 11:47:24 UTC

It seems to be a server side upload size limitation. This is really bad, considering that these units running for 12-14 hours on the fastest cards.

I received the same error message.

Twice.


2012. 05. 08. 13:24:52 GPUGRID Computation for task 1H46_33_1-PAOLA_RNP-0-5-RND9113_0 finished
2012. 05. 08. 13:24:52 GPUGRID Output file 1H46_33_1-PAOLA_RNP-0-5-RND9113_0_4 for task 1H46_33_1-PAOLA_RNP-0-5-RND9113_0 exceeds size limit.
2012. 05. 08. 13:24:52 GPUGRID File size: 131283476.000000 bytes. Limit: 128000000.000000 bytes

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,206,655,749
RAC: 261,147
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 24797 - Posted: 8 May 2012 | 11:50:02 UTC - in response to Message 24796.

I got 3 other running right now.
If this problem won't be fixed very soon, I will abort all of these workunits.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,206,655,749
RAC: 261,147
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 24799 - Posted: 8 May 2012 | 12:02:28 UTC

Until further notification from the project staff, I suspended all of my running PAOLA_RNP-0-5 workunits, and I suggest you to do the same, if you don't want to loose a lot of GPU time. Also, I will abort all of these kind of workunits I will receive, until this problem is fixed.

ignasi
Send message
Joined: 10 Apr 08
Posts: 254
Credit: 16,836,000
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 24800 - Posted: 8 May 2012 | 12:16:39 UTC - in response to Message 24799.

Guys,

The outputs are certainly too large. We are stopping and cancelling all of this batch.

We appologize for the mess.

i

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,206,655,749
RAC: 261,147
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 24801 - Posted: 8 May 2012 | 12:22:03 UTC - in response to Message 24800.

The outputs are certainly too large. We are stopping and cancelling all of this batch.

That's too bad....
Can't you simply increase this upload limit?

5pot
Send message
Joined: 8 Mar 12
Posts: 411
Credit: 2,083,882,218
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 24803 - Posted: 8 May 2012 | 12:54:49 UTC

Yea I saw I had the same issue this morning. Roughly 47,000 sec.... At least I'm not the only one, thought it was my rig for a sec. I can deal with the loss in points, glad it was caught.

Are the HGAbis ones ok to run? Had several of these run ok so far, they're a little longer than FAX4, but more points are given. Have one running currently.
Want to know if they're ok....

ignasi
Send message
Joined: 10 Apr 08
Posts: 254
Credit: 16,836,000
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 24804 - Posted: 8 May 2012 | 13:13:36 UTC - in response to Message 24803.

Upload limit could be increased, yes. But then that would leave too little for torrent seeding and we don't want that. :P

No seriously, [system size]+[sim duration] was far too high. There's no real need to go over 128Mb per WU for now. We then have to operate on all these files too.

Paola is soon sending a comonly-sized one (for LONG), aimed at ~10h on GTX580s and upload file sizes of ~80Mb. As compensation, we are adding an extra 5% credit on the new ones.

cheers,
i

ignasi
Send message
Joined: 10 Apr 08
Posts: 254
Credit: 16,836,000
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 24805 - Posted: 8 May 2012 | 13:14:46 UTC - in response to Message 24803.
Last modified: 8 May 2012 | 13:35:53 UTC

Are the HGAbis ones ok to run? (...)
Want to know if they're ok....


They are ok. Problems are for *PAOLA_RNP* batch only.

Rantanplan
Send message
Joined: 22 Jul 11
Posts: 166
Credit: 138,629,987
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 24807 - Posted: 8 May 2012 | 14:16:18 UTC - in response to Message 24805.
Last modified: 8 May 2012 | 14:16:31 UTC

What means this ? :

Paola aborted by server

what went wrong ?

tomast
Send message
Joined: 5 Oct 10
Posts: 1
Credit: 193,350
RAC: 0
Level

Scientific publications
wat
Message 24808 - Posted: 8 May 2012 | 14:32:21 UTC

Are you kidding ?
14+ hours running then canceled with no credit ?

7 May 2012 | 21:52:52 UTC 8 May 2012 | 13:51:50 UTC Cancelled by server 50,552.97 3,267.69 --- Long runs (8-12 hours on fastest card) v6.16 (cuda31)

Wow :-(

Greg Beach
Avatar
Send message
Joined: 5 Jul 10
Posts: 21
Credit: 50,844,220
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwat
Message 24812 - Posted: 8 May 2012 | 16:58:15 UTC - in response to Message 24804.

Paola is soon sending a comonly-sized one (for LONG), aimed at ~10h on GTX580s and upload file sizes of ~80Mb. As compensation, we are adding an extra 5% credit on the new ones.

Won't really benefit me. Too many problems lately and inconsistent credit ratios. I'll be looking for a more stable/consistent project for my GPU.

Best of luck.

ignasi
Send message
Joined: 10 Apr 08
Posts: 254
Credit: 16,836,000
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 24814 - Posted: 8 May 2012 | 18:04:43 UTC - in response to Message 24807.

PAOLA_RNP error out after (ok) computation due to an incorrect output sizes.

We appologize for it and recommend cancelling them as soon as possible.

thanks

Michael Kingsford Gray
Avatar
Send message
Joined: 28 Nov 11
Posts: 21
Credit: 121,646,463
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwat
Message 24820 - Posted: 9 May 2012 | 6:02:02 UTC - in response to Message 24814.

I am glad to discover that it was not the fault of my GPU!
Points/schmoints. I can deal with that.
It's not like the winner gets a free Maserati!
(Mmmm... now there's an idea...)

Dagorath
Send message
Joined: 16 Mar 11
Posts: 509
Credit: 179,005,236
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 24821 - Posted: 9 May 2012 | 7:59:08 UTC - in response to Message 24820.
Last modified: 9 May 2012 | 8:01:52 UTC

Points/schmoints. I can deal with that.


+1

If I win the Maserati give it to Paola. Keep up the good work team!

Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 24891 - Posted: 10 May 2012 | 19:02:06 UTC - in response to Message 24821.

5pot
Send message
Joined: 8 Mar 12
Posts: 411
Credit: 2,083,882,218
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 24897 - Posted: 10 May 2012 | 19:44:27 UTC

Wow Amazing

Nice pic

Michael Kingsford Gray
Avatar
Send message
Joined: 28 Nov 11
Posts: 21
Credit: 121,646,463
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwat
Message 24906 - Posted: 11 May 2012 | 9:01:19 UTC - in response to Message 24891.

Nah, not in that colour.

Bedrich Hajek
Send message
Joined: 28 Mar 09
Posts: 485
Credit: 11,087,837,097
RAC: 15,639,702
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 24970 - Posted: 11 May 2012 | 23:35:52 UTC - in response to Message 24957.

And this known bug will be fixed when ?????


To prevent this from re-occuring again you'll have to check the wu-deadline before going on.



Refer to this thread:

http://www.gpugrid.net/forum_thread.php?id=2795

Complain long enough and loud enough, and it will get fixed, sooner or later.

Post to thread

Message boards : Number crunching : Error PAOLA_RNP (cancel freely) // Addressed & fixed

//