Author |
Message |
lohphatSend message
Joined: 21 Jan 10 Posts: 44 Credit: 788,587,359 RAC: 65,852 Level
![Glutamic Acid - More than 750M credits Glu](img/badges/aa/badge_glu.png) Scientific publications
![Top 25% (703rd/3113) contribution to Giorgino et al, J. Chem. Theory Comput, 2012 wat](img/badges/papers/badge_pub_ruby.png) ![Top 50% (1391st/4410) contribution to Buch et al, PNAS 2011 wat](img/badges/papers/badge_pub_gold.png) ![Top 50% (651st/2450) contribution to Giorgino et al, J. Chem. Theory Comput. 2011 wat](img/badges/papers/badge_pub_gold.png) ![Top 25% (1074th/9662) contribution to Buch et al, J. Chem. Theory Comput. 2011 wat](img/badges/papers/badge_pub_ruby.png) ![Top 50% (2875th/5798) contribution to Sadiq et al, PNAS 2012 wat](img/badges/papers/badge_pub_gold.png) ![Top 75% (1366th/1995) contribution to Venken et al, JCTC 2013 wat](img/badges/papers/badge_pub_silver.png) ![Top 90% (2579th/3349) contribution to Buch et al, JCIM 2013 wat](img/badges/papers/badge_pub_bronze.png) ![Top 100% (3581st/3864) contribution to Dainese et al, Biochem. J. 2013 wat](img/badges/papers/badge_pub_white.png) ![Top 25% (822nd/4477) contribution to Pérez-Hernández et al, JCP 2013 wat](img/badges/papers/badge_pub_ruby.png) ![Top 25% (257th/2163) contribution to Bisignano et al. JCIM 2014 wat](img/badges/papers/badge_pub_ruby.png) ![Top 10% (126th/1283) contribution to Doerr et al. JCTC 2014 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (221st/2838) contribution to Stanley et al, Nat Commun 2014 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (80th/3183) contribution to Lauro et al., JCIM 2014 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (337th/3611) contribution to Ferruz et al., JCIM 2015 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (217th/4128) contribution to Ferruz et al., Sci Rep 2016 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (380th/4815) contribution to Stanley et al., Sci Rep 2016 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (254th/4730) contribution to Noe et al., Nat Chem 2017 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (218th/4634) contribution to Martinez-Rosell et al, JCIM 2018 wat](img/badges/papers/badge_pub_emerald.png) ![Top 25% (280th/1656) contribution to Kapoor et al., Sci Rep 2017 wat](img/badges/papers/badge_pub_ruby.png) ![Top 50% (617th/1885) contribution to Ferruz et al., Sci Rep 2018 wat](img/badges/papers/badge_pub_gold.png) ![Top 90% (834th/1022) contribution to Wang et al., ACS Cent. Sci. 2019 wat](img/badges/papers/badge_pub_bronze.png) ![Top 75% (876th/1541) contribution to Rodriguez-Espigares et al., Nat Meth 2020 wat](img/badges/papers/badge_pub_silver.png) ![Top 25% (155th/1450) contribution to Herrera-Nieto et al, Sci Rep 2020 wat](img/badges/papers/badge_pub_ruby.png) ![Top 10% (495th/6232) contribution to Herrera-Nieto et al, JCIM 2020 wat](img/badges/papers/badge_pub_emerald.png) |
https://www.gpugrid.net/workunit.php?wuid=27339139
I just had a WU crash the GPU and it took 105 minutes of reboots to get the monitors to not be full green screens and boot to the desktop.
I then updated the driver from 522.25 to 526.47 to see if that will be better.
However all three hosts trying that WU errored out.
Bad CUDA app?
https://www.gpugrid.net/result.php?resultid=33130348
https://www.gpugrid.net/show_host_detail.php?hostid=581748 |
|
|
|
sounds like something wrong with the GPU honestly.
____________
|
|
|
jjchSend message
Joined: 10 Nov 13 Posts: 98 Credit: 15,429,700,388 RAC: 174,090 Level
![Tryptophan - More than 10B credit - Honorary cruncher Trp](img/badges/aa/badge_trp.png) Scientific publications
![Top 90% (1094th/1283) contribution to Doerr et al. JCTC 2014 wat](img/badges/papers/badge_pub_bronze.png) ![Top 10% (253rd/2838) contribution to Stanley et al, Nat Commun 2014 wat](img/badges/papers/badge_pub_emerald.png) ![Top 25% (627th/3183) contribution to Lauro et al., JCIM 2014 wat](img/badges/papers/badge_pub_ruby.png) ![Top 25% (376th/3611) contribution to Ferruz et al., JCIM 2015 wat](img/badges/papers/badge_pub_ruby.png) ![Top 1% (27th/4128) contribution to Ferruz et al., Sci Rep 2016 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 10% (377th/4815) contribution to Stanley et al., Sci Rep 2016 wat](img/badges/papers/badge_pub_emerald.png) ![Top 1% (4th/4730) contribution to Noe et al., Nat Chem 2017 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 1% (4th/1348) contribution to Doerr et al, JCTC 2017 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 1% (5th/4634) contribution to Martinez-Rosell et al, JCIM 2018 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 1% (5th/1656) contribution to Kapoor et al., Sci Rep 2017 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 1% (4th/1885) contribution to Ferruz et al., Sci Rep 2018 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 10% (8th/672) contribution to Martinez-Rosell et al, JCIM 2020 wat](img/badges/papers/badge_pub_emerald.png) ![Top 1% (7th/1541) contribution to Rodriguez-Espigares et al., Nat Meth 2020 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 10% (48th/1450) contribution to Herrera-Nieto et al, Sci Rep 2020 wat](img/badges/papers/badge_pub_emerald.png) ![Top 1% (5th/6232) contribution to Herrera-Nieto et al, JCIM 2020 wat](img/badges/papers/badge_pub_sapphire.png) |
A faulty WU should not cause that much of a problem with your GPU. Many of the Python WU's fail but it is somewhat inherent in the type of computing they do.
A couple of the other failures referenced occurred due to insufficient swap space or resources which is usually memory.
Upgrading to the latest driver may help but if you are still having problems I would suggest running DDU to fully remove the driver and try installing it again.
Seems that the 980Tis might run a bit on the hot side. Make sure it is cooling properly. I would suggest setting an aggressive fan curve with EVGA Precision X or similar.
I would also suggest checking for Windows 11 updates sometimes there can be weird problems caused by Windows itself.
Just do some basic health check and cleanup activities. Defrag or trim your disk. Run virus scans etc. Blow out the dust bunnies too.
Other than that it might be time to upgrade your GPU to something a bit newer. It doesn't have to be the latest but I would suggest looking for at least a GTX 1060 6GB or better if you can.
|
|
|
lohphatSend message
Joined: 21 Jan 10 Posts: 44 Credit: 788,587,359 RAC: 65,852 Level
![Glutamic Acid - More than 750M credits Glu](img/badges/aa/badge_glu.png) Scientific publications
![Top 25% (703rd/3113) contribution to Giorgino et al, J. Chem. Theory Comput, 2012 wat](img/badges/papers/badge_pub_ruby.png) ![Top 50% (1391st/4410) contribution to Buch et al, PNAS 2011 wat](img/badges/papers/badge_pub_gold.png) ![Top 50% (651st/2450) contribution to Giorgino et al, J. Chem. Theory Comput. 2011 wat](img/badges/papers/badge_pub_gold.png) ![Top 25% (1074th/9662) contribution to Buch et al, J. Chem. Theory Comput. 2011 wat](img/badges/papers/badge_pub_ruby.png) ![Top 50% (2875th/5798) contribution to Sadiq et al, PNAS 2012 wat](img/badges/papers/badge_pub_gold.png) ![Top 75% (1366th/1995) contribution to Venken et al, JCTC 2013 wat](img/badges/papers/badge_pub_silver.png) ![Top 90% (2579th/3349) contribution to Buch et al, JCIM 2013 wat](img/badges/papers/badge_pub_bronze.png) ![Top 100% (3581st/3864) contribution to Dainese et al, Biochem. J. 2013 wat](img/badges/papers/badge_pub_white.png) ![Top 25% (822nd/4477) contribution to Pérez-Hernández et al, JCP 2013 wat](img/badges/papers/badge_pub_ruby.png) ![Top 25% (257th/2163) contribution to Bisignano et al. JCIM 2014 wat](img/badges/papers/badge_pub_ruby.png) ![Top 10% (126th/1283) contribution to Doerr et al. JCTC 2014 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (221st/2838) contribution to Stanley et al, Nat Commun 2014 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (80th/3183) contribution to Lauro et al., JCIM 2014 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (337th/3611) contribution to Ferruz et al., JCIM 2015 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (217th/4128) contribution to Ferruz et al., Sci Rep 2016 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (380th/4815) contribution to Stanley et al., Sci Rep 2016 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (254th/4730) contribution to Noe et al., Nat Chem 2017 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (218th/4634) contribution to Martinez-Rosell et al, JCIM 2018 wat](img/badges/papers/badge_pub_emerald.png) ![Top 25% (280th/1656) contribution to Kapoor et al., Sci Rep 2017 wat](img/badges/papers/badge_pub_ruby.png) ![Top 50% (617th/1885) contribution to Ferruz et al., Sci Rep 2018 wat](img/badges/papers/badge_pub_gold.png) ![Top 90% (834th/1022) contribution to Wang et al., ACS Cent. Sci. 2019 wat](img/badges/papers/badge_pub_bronze.png) ![Top 75% (876th/1541) contribution to Rodriguez-Espigares et al., Nat Meth 2020 wat](img/badges/papers/badge_pub_silver.png) ![Top 25% (155th/1450) contribution to Herrera-Nieto et al, Sci Rep 2020 wat](img/badges/papers/badge_pub_ruby.png) ![Top 10% (495th/6232) contribution to Herrera-Nieto et al, JCIM 2020 wat](img/badges/papers/badge_pub_emerald.png) |
I process other CUDA WU from other projects without issue and get decent credits. Just wondering if the task software isn't heeding available resources and forging ahead destined to crash.
Yes, the plan is to update to a newer GPU but the last few years have made that improbable -- I refuse to be gouged.
I also have to make a decision to move away from nVidia or not -- their market behavior isn't my cup of tea. |
|
|
Keith Myers Send message
Joined: 13 Dec 17 Posts: 1313 Credit: 6,009,917,459 RAC: 9,579,893 Level
![Tyrosine - More than 5B credits Tyr](img/badges/aa/badge_tyr.png) Scientific publications
![Top 10% (64th/1022) contribution to Wang et al., ACS Cent. Sci. 2019 wat](img/badges/papers/badge_pub_emerald.png) ![Top 50% (273rd/672) contribution to Martinez-Rosell et al, JCIM 2020 wat](img/badges/papers/badge_pub_gold.png) ![Top 75% (1040th/1541) contribution to Rodriguez-Espigares et al., Nat Meth 2020 wat](img/badges/papers/badge_pub_silver.png) ![Top 10% (413th/6232) contribution to Herrera-Nieto et al, JCIM 2020 wat](img/badges/papers/badge_pub_emerald.png) ![Top 100% (294th/315) contribution to Cossu et al, JCIM 2020 wat](img/badges/papers/badge_pub_white.png) |
If you read these forums regularly you should know that these tasks use mainly just the cpu and not much of any gpu.
And that you need to have a large amount of storage space for each task to properly crunch and finish correctly.
I would look for weaknesses in the rest of your system ignoring any gpu first. |
|
|
|
I also have to make a decision to move away from nVidia or not -- their market behavior isn't my cup of tea.
just be aware that if you decide to move to AMD (or even Intel), you wont be able to contribute to this project anymore. GPUGRID only has CUDA apps, and only Nvidia can process CUDA.
____________
|
|
|