Message boards : Multicore CPUs : Updates to the QMML app
Author | Message |
---|---|
Two changes were made yesterday: | |
ID: 48835 | Rating: 0 | rate: / Reply Quote | |
I have a 2600K processor with 6 logical cores available for the QC app. <app_config> <app> <name>QC</name> <max_concurrent>3</max_concurrent> </app> <app_version> <app_name>QC</app_name> <plan_class>mt</plan_class> <avg_ncpus>2</avg_ncpus> <cmdline>--nthreads 2</cmdline> </app_version> </app_config> | |
ID: 48841 | Rating: 0 | rate: / Reply Quote | |
It seems one task of 3 is progressing much more slowly than the other 2 so I'll reduce max concurrent tasks to 2 in the app_config. | |
ID: 48844 | Rating: 0 | rate: / Reply Quote | |
Toni, there has been an other reason, I abstained from the QMML app on the AMD1700x computer apart from failing WUs and freezing the computer: * CPU threads are limited to 4 (you should still be able to crunch multiple WUs at once, please check) This would not have been necessary as it could easily changed by app-config. And a working app_config is already circulating in the forums. This now limits power user to make there own adjustments, means let run all threads on one single QMML app WU. See problem above. | |
ID: 48846 | Rating: 0 | rate: / Reply Quote | |
I haven't had any issue with the QC tasks running alongside my usual SETI tasks. I reduce the number of Seti cpu tasks when I run the QC task set to use 4 cores. The Seti and Einstein projects download and run tasks normally without any manual intervention. | |
ID: 48848 | Rating: 0 | rate: / Reply Quote | |
Thanks Keith for your comments. | |
ID: 48849 | Rating: 0 | rate: / Reply Quote | |
I've gotten pretty good at tuning Ryzen for 24/7 distributed computing. I have had the 1700X since launch in March of last year. | |
ID: 48852 | Rating: 0 | rate: / Reply Quote | |
Black screens, (computer appears frozen, no display or keyboard or mouse input recognized) “with no error logs generated” are caused by cpu lockup because of insufficient VDDCR cpu voltage for the desired cpu clocks. It is a black screen with the symptoms you describe. I would even say, I am not overclocking at all: I do have a ASUS Prime X370-Pro motherboard and in Bios Settings it asks me, if I am on Water-Cooling, which I am (Corsair Liquid CPU Cooler H60) and then it gives me 3770 MHz, that is all what I did. Similar with RAM: I just adjusted the frequency in BIOS to the Frequency of the RAM specification nothing else. So if you might help with overclocking or with stabilizing the system at these frequencies, would be highly appreciated. Then I will try to switch back to the QMML app. | |
ID: 48854 | Rating: 0 | rate: / Reply Quote | |
We probably should converse via PM so as to not pollute or hijack the thread. | |
ID: 48855 | Rating: 0 | rate: / Reply Quote | |
Wow, how did you manage to get 1200-1300 credits for your QC tasks today. What's your secret? | |
ID: 48856 | Rating: 0 | rate: / Reply Quote | |
So far with testing, I have only been able to run QMML work units on systems with up to 4 cores. | |
ID: 48860 | Rating: 0 | rate: / Reply Quote | |
So far with testing, I have only been able to run QMML work units on systems with up to 4 cores. Probably because there are issues if two tasks start up at the same time. You'll have to limit QC tasks to 1 concurrent task at a time. | |
ID: 48861 | Rating: 0 | rate: / Reply Quote | |
@Keith: Thanks for you PM. | |
ID: 48865 | Rating: 0 | rate: / Reply Quote | |
@klepel: Well Toni didn't change the credits for me it appears. | |
ID: 48868 | Rating: 0 | rate: / Reply Quote | |
I just downloaded a couple more QC tasks hoping that the one I did yesterday was a fluke or carryover from the "old" tasks with the tiny credit. | |
ID: 48875 | Rating: 0 | rate: / Reply Quote | |
Haven't a clue why I get such little credit and others are getting 24 times more for the same cpu elapsed times. Not consistently. They vary all over the place. Your values are a little low for the moment, but you need more data points to draw much of a conclusion. Mine vary a lot too (Ryzen 1700, not overclocked). http://www.gpugrid.net/results.php?hostid=452287&offset=0&show_names=0&state=3&appid= Note that those are with two cores per work unit, but that should not affect the credit per work unit, in a perfect world at least. And note that the longer work units often get less credit than the shorter ones, so the credit system is strange in any case. I think the points are a little more consistent on my Intel machines, and probably a little higher than on the Ryzen machine on average, though I have not tried to calculate it yet. i7-3770: http://www.gpugrid.net/results.php?hostid=433866&offset=0&show_names=0&state=3&appid= i7-4790: http://www.gpugrid.net/results.php?hostid=334241&offset=0&show_names=0&state=3&appid= However, I normally pay no attention to credits, and the Ryzen seems to run comparably fast as the Intels insofar as I can see at the moment, which is the only thing that matters to me. | |
ID: 48878 | Rating: 0 | rate: / Reply Quote | |
Just give me one QC task that gets as much credit as yours or klebel's and I would have hope. Alas the 110 credits I got yesterday for this Task 16998146 is the most I've ever seen. My credits have ranged from 6-47 over 35 tasks so far with the one above the only outlier. | |
ID: 48882 | Rating: 0 | rate: / Reply Quote | |
OK, so I once again crunched some more QC task. This time I reduced the core count to two to see if it made any difference. Nope. Still extremely low credit compared to everyone else that has posted in these threads. | |
ID: 48885 | Rating: 0 | rate: / Reply Quote | |
The problem with multiple WUs starting at the same time, should be fixed (or otherwise a lot of calculation errors will be produced). | |
ID: 48886 | Rating: 0 | rate: / Reply Quote | |
Or just set max_concurrent to 1 and avoid the issue entirely until the applications and software for the problem gets resolved. | |
ID: 48889 | Rating: 0 | rate: / Reply Quote | |
Or just set max_concurrent to 1 and avoid the issue entirely until the applications and software for the problem gets resolved. Until the BM queue fills up with just QC tasks and all other cores go idle. Better to just avoid the entire application in the 1st place. If it's not worth the admins time to fix known issues that cause errors 100% of the time in known situations then its not worth the time for donors to run. | |
ID: 48890 | Rating: 0 | rate: / Reply Quote | |
Every donor is different. I don't have GPUGrid as my sole project so the crunchers never go idle, there is always work being done for someone. | |
ID: 48891 | Rating: 0 | rate: / Reply Quote | |
I'll be watching for updates, but for now I'm also turning off CPU tasks. | |
ID: 48925 | Rating: 0 | rate: / Reply Quote | |
I set up the quantum chem app on a second computer. I started with the following app_config.xml file: <app_config> <app> <name>QC</name> <max_concurrent>1</max_concurrent> </app> <app_version> <app_name>QC</app_name> <plan_class>mt</plan_class> <avg_ncpus>4</avg_ncpus> <cmdline>--nthreads 4</cmdline> </app_version> </app_config> Once the Work Units downloaded and 1 WU started w/4 threads, I edited the app_config.xml to change <max_concurrent>1</max_concurrent> to <max_concurrent>2</max_concurrent>. Then I restarted the boinc client and now I have 2 work units running simultaneously with 4 threads each. So far so good. | |
ID: 48939 | Rating: 0 | rate: / Reply Quote | |
But there is still the issue that whenever your QC tasks finish and more than 2 tasks are downloaded, that your system can try to start both tasks at the same time and then both will fail. There is no guaranteed method to stagger starting of multiple QC tasks in an unattended system on auto. | |
ID: 48948 | Rating: 0 | rate: / Reply Quote | |
I am not seeing this situation at all. | |
ID: 48950 | Rating: 0 | rate: / Reply Quote | |
Quick update: I checked my logs and this thread again to see if I could resume tasks. To my surprise, tasks were resumed while I had still opted out. | |
ID: 48954 | Rating: 0 | rate: / Reply Quote | |
I am not seeing this situation at all. Several posters have reported the problem of starting two QC tasks at the same time or within 5 seconds of each other causes the first task to error out. See Message 48589 | |
ID: 48955 | Rating: 0 | rate: / Reply Quote | |
Dayle Diamond said: Quick update: I checked my logs and this thread again to see if I could resume tasks. To my surprise, tasks were resumed while I had still opted out. Dayle, do you perchance have the preference box checked for If no work for selected applications is available, accept work from other applications? | |
ID: 48956 | Rating: 0 | rate: / Reply Quote | |
If one has more than 4 cores and running max concurrent = 1, there is still the chance that boinc manager will flood your queue with all QC tasks and nothing from the other project. Esp when just starting up that setup before BM gets a better handle of the resource share it is a long term resource share. | |
ID: 48959 | Rating: 0 | rate: / Reply Quote | |
Dayle, do you perchance have the preference box checked for Oops. Thank you ><. Happy Crunching, I'll be back with these tasks once things have stabilized. | |
ID: 48982 | Rating: 0 | rate: / Reply Quote | |
I set up the quantum chem app on a second computer. I started with the following app_config.xml file: This approach, although a bit cumbersome, works. I've completed 189 Work Units with no errors. I think special badges for Quantum Chemistry contribution would be a draw for more users. | |
ID: 48993 | Rating: 0 | rate: / Reply Quote | |
So, forgive me if this has been answered. | |
ID: 49065 | Rating: 0 | rate: / Reply Quote | |
No it hasn't been answered. Or even addressed by the developer as far as I can tell. Seems the resources lately have been in deploying and debugging the WSL QC app. | |
ID: 49066 | Rating: 0 | rate: / Reply Quote | |
Message boards : Multicore CPUs : Updates to the QMML app