Advanced search

Message boards : Graphics cards (GPUs) : Progress or state file refresh time?

Author Message
Profile X-Files 27
Avatar
Send message
Joined: 11 Oct 08
Posts: 95
Credit: 68,023,693
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 3901 - Posted: 19 Nov 2008 | 20:36:22 UTC

How often does it write to disk that current status of WU?

if my system crash, either it will restart from the beginning or stall at its current state whc give me no choice but to abort.

I think the science app has some bugs detecting client errors. I can take the restart but stalling??? arrgg.

Restart:
WU 122494
WU 121687

Stall:
WU 117874
WU 120420

Profile X-Files 27
Avatar
Send message
Joined: 11 Oct 08
Posts: 95
Credit: 68,023,693
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4006 - Posted: 24 Nov 2008 | 19:31:37 UTC

any ideas?

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4008 - Posted: 24 Nov 2008 | 21:12:43 UTC

Sorry, no. I guess most people try to avoid frequent system crashes, so they don't observe what you're seeing.

MrS
____________
Scanning for our furry friends since Jan 2002

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4010 - Posted: 24 Nov 2008 | 21:21:31 UTC - in response to Message 4006.

any ideas?

Put this flag in your cc_config file, it will tell you how often an app checkpoints. I assume it writes to disk every checkpoint. Then reread cc_config file in manager's advanced menu. Messages will appear in managers message tab.

<cc_config>
<flags>
<checkpoint_debug>1</checkpoint_debug>
</flags>
</cc_config>

Remove or change the 1 to a zero to stop.

Profile X-Files 27
Avatar
Send message
Joined: 11 Oct 08
Posts: 95
Credit: 68,023,693
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4020 - Posted: 25 Nov 2008 | 0:52:48 UTC - in response to Message 4010.

Put this flag in your cc_config file, it will tell you how often an app checkpoints. I assume it writes to disk every checkpoint. Then reread cc_config file in manager's advanced menu. Messages will appear in managers message tab.

<cc_config>
<flags>
<checkpoint_debug>1</checkpoint_debug>
</flags>
</cc_config>

Remove or change the 1 to a zero to stop.


it says:
Unrecognized tag in cc_config.xml: <checkpoint_debug>

and btw, what's the meaning of this error:
MDIO ERROR: read error for file "restart.vel", byte number 4: number of atoms (0) != (29042) expected

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4023 - Posted: 25 Nov 2008 | 8:58:38 UTC

Sounds like the checkpoint file (or what has been read in as such) is completely garbage.

MrS
____________
Scanning for our furry friends since Jan 2002

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 135,911,881
RAC: 56
Level
Cys
Scientific publications
watwatwatwatwatwatwatwat
Message 4032 - Posted: 25 Nov 2008 | 14:00:39 UTC

Hmmm, that's just a guess, but maybe problems with the HDD? Maybe check your HDD with checkdisk?
____________

pixelicious.at - my little photoblog

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4033 - Posted: 25 Nov 2008 | 17:59:37 UTC - in response to Message 4020.

Put this flag in your cc_config file, it will tell you how often an app checkpoints. I assume it writes to disk every checkpoint. Then reread cc_config file in manager's advanced menu. Messages will appear in managers message tab.

<cc_config>
<flags>
<checkpoint_debug>1</checkpoint_debug>
</flags>
</cc_config>

Remove or change the 1 to a zero to stop.


it says:
Unrecognized tag in cc_config.xml: <checkpoint_debug>

Oops, Sorry, I was in a hurry in typing that,

should be log_flags not flags

Correct is:
<cc_config>
<log_flags>
<checkpoint_debug>1</checkpoint_debug>
</log_flags>
</cc_config>


and btw, what's the meaning of this error:
MDIO ERROR: read error for file "restart.vel", byte number 4: number of atoms (0) != (29042) expected

Post to thread

Message boards : Graphics cards (GPUs) : Progress or state file refresh time?

//