Advanced search

Message boards : Number crunching : Error message after updating to OpenSUSE 11.4.

Author Message
ashrader330
Send message
Joined: 8 Dec 09
Posts: 3
Credit: 40,163,224
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwat
Message 20664 - Posted: 14 Mar 2011 | 21:43:00 UTC

I have been having issues running GPUGRID after I updated my system to OpenSUSE 11.4. I got an error message on one of my tasks that I have never seen before. Anybody got an idea? My system is 64 bit and I am using SWAN_SYNC=0. The driver version is
260.19.36.

<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
process exited with code 216 (0xd8, -40)
</message>
<stderr_txt>
# Using device 0
# There is 1 device supporting CUDA
# Device 0: "GeForce GTX 275"
# Clock rate: 1.40 GHz
# Total amount of global memory: 939196416 bytes
# Number of multiprocessors: 30
# Number of cores: 240
SWAN: Using synchronization method 0
SWAN: FATAL : Unable to create context


</stderr_txt>
]]>

Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20665 - Posted: 15 Mar 2011 | 11:44:38 UTC - in response to Message 20664.
Last modified: 15 Mar 2011 | 12:38:42 UTC

Most of the failures are immediate, ended inside 10sec. Although TONI_AB1 tasks tended to fail, some ran to completion. While I have read suggestions that this is memory related (for other projects), a Linux issue, and that in Windows this error is sometimes associated with a MACHINE_TYPE_MISMATCH, it was most likely caused by the upgrade. Perhaps Boinc or the operating system is expecting something that is no longer there. Someone else with Suse might have experienced this and be able to help you tweek it. If not I suggest you upgrade the driver to the latest version and reinstall Boinc.

As well as "process exited with code 216 (0xd8, -40)" you are also seeing "code 193" errors.

Code 193 is a segmentation violation error.

You either have problems with your memory or swap file, or the application attempts to access a memory location that it is not allowed to access, or attempts to access a memory location in a way that is not allowed (for example, attempting to write to a read-only location, or to overwrite part of the operating system).

Good luck,

Profile nenym
Send message
Joined: 31 Mar 09
Posts: 137
Credit: 1,308,230,581
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20666 - Posted: 15 Mar 2011 | 17:13:55 UTC - in response to Message 20665.

Linux 260.xx driver has a bug, try to upgrade to 270 beta. Unable to crunch DistrRTgen, Einstein staff has made fullCPU app dute to that bug (Einstein running on 270 driver takes about 25% core and is quicker). That one affects the GPUGRID maybe.

Post to thread

Message boards : Number crunching : Error message after updating to OpenSUSE 11.4.

//