Message boards : Number crunching : Too many errors!!
Author | Message |
---|---|
I suspect this is happening to everyone, something needs to be done right away. | |
ID: 52173 | Rating: 0 | rate:
![]() ![]() ![]() | |
Maybe so but nothing happens at this project 'right away'. | |
ID: 52178 | Rating: 0 | rate:
![]() ![]() ![]() | |
Getting many frozen units, usually around 6% or so the time runs, but after 2-3 hours nothing is happening. | |
ID: 52321 | Rating: 0 | rate:
![]() ![]() ![]() | |
The GPU 0 in your PC is quite hot (85°C=185°F), that may cause workunits to freeze. | |
ID: 52322 | Rating: 0 | rate:
![]() ![]() ![]() | |
More errors and time wasted. I will wait until I have an answer/correction is made. Shame since I would like to crunch for medical science. | |
ID: 52324 | Rating: 0 | rate:
![]() ![]() ![]() | |
Retvari Zoltan gave you the answer. No one else responded as it is probably correct and the 1st thing that should be fixed. | |
ID: 52325 | Rating: 0 | rate:
![]() ![]() ![]() | |
Though temperature may have something to do with it, it may be other things as well. | |
ID: 52326 | Rating: 0 | rate:
![]() ![]() ![]() | |
The GPU 0 in your PC is quite hot (85°C=185°F), that may cause workunits to freeze. My personal experience is that this temperature doesn't cause problems. I have been crunching GPUGrid for over a year now (24/7) with a 1070 in a case with airflow problems that crunches GPUGrid WUs at between 82 and 85 degrees. Feel free to check my recent tasks to confirm. I might be decreasing the life of the GPU (time will tell), but at least in my case everything is working OK. (I am now looking at cooling options for this one. Because of my experience from my first build (Linux only), I have gained some confidence in trying modifications for the Dell machine with the 1070.) | |
ID: 52327 | Rating: 0 | rate:
![]() ![]() ![]() | |
GPUGrid long work units especially may 'freeze' as you describe if interrupted frequently.That's correct. But his tasks (except for the last ones) haven't been suspended (interrupted) at all. So this left the high GPU temperature as the most probable cause. The cause could be also: - overcomitted CPU - inadequate PSU (maybe the PSU got too hot too) - overclocked GPU memory - overclocked GPU - overclocked CPU memory - overclocked CPU - SLI cable - burnt PCIe power connector(s) - burnt M/B power connector (the 12V pins) - interfering application which use GPU acceleration (for example: browsers, video playback software, games) So all the usual stuff. Of course it could be something we can't think of, as we don't know his system. | |
ID: 52328 | Rating: 0 | rate:
![]() ![]() ![]() | |
Recent GPUs tolerate high temperatures much better than the GPUs tolerated it 5-6 years ago, but the physics haven't changed since then. Thermal expansion and contraction stayed the same over the years. The damage it causes is the same. The capacitors are more advanced now than they were 5-6 years ago, but they still prefer lower temperatures.The GPU 0 in your PC is quite hot (85°C=185°F), that may cause workunits to freeze.My personal experience is that this temperature doesn't cause problems. I have been crunching GPUGrid for over a year now (24/7) with a 1070 in a case with airflow problems that crunches GPUGrid WUs at between 82 and 85 degrees. Feel free to check my recent tasks to confirm.I did. Some of your tasks hit 90°C. I might be decreasing the life of the GPU (time will tell),You do. but at least in my case everything is working OK.For now. | |
ID: 52329 | Rating: 0 | rate:
![]() ![]() ![]() | |
85° is just too hot no matter which way you look at it, there's no margin for error. I would feel very uncomfortable running my GPU's that hot, I'm in the mid 70's now and keep a very close watch on them. | |
ID: 52336 | Rating: 0 | rate:
![]() ![]() ![]() | |
Not only prone to errors when very hot, but they also run less efficiently. Heated metal has a higher resistivity than colder metal. The GPU will use more power at 85-90C than 60-70C | |
ID: 52338 | Rating: 0 | rate:
![]() ![]() ![]() | |
Message boards : Number crunching : Too many errors!!