Message boards : Graphics cards (GPUs) : Maxwell now
Author | Message |
---|---|
There's a rumor going around that Maxwell is coming out next month. I wonder if this was planned of if AMD's sales are hurting them? | |
ID: 34732 | Rating: 0 | rate:
![]() ![]() ![]() | |
It looks more like a delaying action to hold off AMD until the 20 nm process arrives, probably later than they had originally hoped. A GTX 750 Ti won't set the world on fire in performance, and won't make them a ton of money. But it gives them a chance to see how well the design works in practice, and to give the software developers a head start before the real Maxwell arrives. | |
ID: 34736 | Rating: 0 | rate:
![]() ![]() ![]() | |
It likely is just a false rumor. No prove has been shown that these cards use Maxwell chips, despite relatively complete benchmarks already appeared. It's probably just GK106 with 768 shaders. | |
ID: 34807 | Rating: 0 | rate:
![]() ![]() ![]() | |
Producing a Maxwell on 28nm process would be a complete change of direction for NVidia, so I agree this is likely to be a false rumor. There are two revision models (Rev. 2) of GPU's in the GF600 lineup (GT630 and GT640), so perhaps NVidia want to fill out their GF700 range with a lower end card, so if there is a Rev2 version of the GK650Ti en route, it makes more sense to shift it to the GF700 range. | |
ID: 34823 | Rating: 0 | rate:
![]() ![]() ![]() | |
ARM can already be used to support an OS, so IMO it's inevitable that ARM will bolster their CPU with an NVidia GPU. That's what the market really wants; sufficient CPU processing power to start up and run an OS and a high end GPU for the video-interface, gaming... isn't it? I guess a fairly large part of the market wants that. I would be happy with a motherboard with a BIOS that can boot from PXE, no SuperIO (USB, RS-232, parallel port, PS/2), ISC bus for onboard temperature sensors, no IDE or SATA (no disks), just lots of RAM, an RJ-45 connector and gigabit ethernet, no wifi, enough CPU processing power to startup and run a minimal OS that has a good terminal, SSH and can run BOINC client and project apps. Don't need a desktop or anything to do with a GUI, no TWAIN or printer drivers/daemons, no PnP or printer service, no extra fonts (just a decent terminal and 1 font), network services required, Python or some other scripting language would be nice but not much more. If they could fit all that onto a slightly larger video card I'd be happy, otherwise put it on a 2" x 5" board with a PCIe slot and power connectors and call it a cruncher. Something so no frills IKEA would consider stocking it. What else would be unnecessary... no RTC (get the time off the LAN), no sound, no disk activity LED. ____________ BOINC <<--- credit whores, pedants, alien hunters | |
ID: 34825 | Rating: 0 | rate:
![]() ![]() ![]() | |
Seems like the cat is out of the bag.. and we were all wrong, as usual for a new generation ;) | |
ID: 35023 | Rating: 0 | rate:
![]() ![]() ![]() | |
Sounds like a big performance per watt increase will be coming too. I think I'll put planned purchases on hold, build savings and see what the picture looks like 4 months from now. | |
ID: 35024 | Rating: 0 | rate:
![]() ![]() ![]() | |
That's not what nVidia would like you to do.. but I agree ;) | |
ID: 35029 | Rating: 0 | rate:
![]() ![]() ![]() | |
For GPUGrid, performance touting is premature - we don't even know if it will work with the current app. It could take 6 months of development and debugging. It took ages before the Titan's worked. | |
ID: 35030 | Rating: 0 | rate:
![]() ![]() ![]() | |
I still feel like I'm stuck between a rock and a hard place. Haswell e will have an 8 core variant in q3. So this is definitely going to be bought. However, I would like this to be my last system build for more than a year, as pumping 5k annually is something I can not continue. Every other year, sure. | |
ID: 35038 | Rating: 0 | rate:
![]() ![]() ![]() | |
But with Volta and it's stacked dram.... I'm very cautious about dropping 1.8k+ on gpus that most likely won't be that large of a change. Well see I suppose Volta will still take some time, as GPUs have matured quite a bit (compared to the wild early days of a new chip every 6 months!) and progress is generally slower. That's actually not so bad, because we can keep GPUs longer and the software guys have some time to actually think about using those beasts properly. If you still have Fermis or older running, get rid of them as long as you can still find (casual) gamers willing to pay something for them. If you think about upgrading from Kepler to Maxwell and don't want to spend too much I propose the following: replace 2 Keplers by 1 Maxwell for about the same throughput, which should hopefully be possible with 20 nm and the architectural improvements. This way you don't have to spend as much and reduce power usage significantly (further savings). You throughput won't increase, but so what? If you feel like spending again you could always add another GPU. MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 35097 | Rating: 0 | rate:
![]() ![]() ![]() | |
There is no ARM CPU on the block diagram of the GM107: | |
ID: 35103 | Rating: 0 | rate:
![]() ![]() ![]() | |
As far as I remember this "ARM on chip" was still a complete rumor. Could well be that someone confused some material about future nVidia server chips with GPU (project Denver) for the regular GPUs. | |
ID: 35104 | Rating: 0 | rate:
![]() ![]() ![]() | |
I would be a bit concerned about the 1306GFlops rating for the GTX750Ti. That's actually below the GTX650Ti (1420). The 750Ti also has a 128bit bus and bandwidth of 86.4GB/s. While the theoretical GFLOPS/W SP is 21.8, it's still an entry level card; it would talk 4 of these card to have the overall performance of a GTX780Ti. There should be plenty of OC models and potential for these GPU's to boost further. | |
ID: 35114 | Rating: 0 | rate:
![]() ![]() ![]() | |
I see EVGA are selling a GTX750Ti with a 1268MHz Boost. In theory that's 16.8% faster than the reference model, though I would expect the reference card to boost higher than the quoted 1085MHz (if it works)! | |
ID: 35128 | Rating: 0 | rate:
![]() ![]() ![]() | |
I have some GTX750Tis on order; should have them in my hands next week. | |
ID: 35139 | Rating: 0 | rate:
![]() ![]() ![]() | |
I read that the 128bit bus is a bottleneck, but as the card uses 6GHz GDDR5 a 10% OC is a given. The GPU also OC's well (as the temps are low). So these cards could be tweeked to be significantly more competitive than the reference model. | |
ID: 35147 | Rating: 0 | rate:
![]() ![]() ![]() | |
Don't be fooled by the comparably low maximum Flops. We got many of those with Kepler, and complained initially that we couldn't make proper use of them, as the performance per shader per clock was significantly below non-superscalar Fermis. Now we're going non-superscalar again and gain some efficiency through that, as well as through other tweaks. | |
ID: 35157 | Rating: 0 | rate:
![]() ![]() ![]() | |
OK no purchases but I would rather a professional or a Ph.D. test the pretend Maxwells so we can be sure of what we're looking at ;-) | |
ID: 35158 | Rating: 0 | rate:
![]() ![]() ![]() | |
Professional enough? Or shall I search for a review written by someone with a PhD in ancient greek history? ;) | |
ID: 35159 | Rating: 0 | rate:
![]() ![]() ![]() | |
EVGA (Europe) just announced they have Titan Black and GTX 750 and 750Ti for sale. The latter for 150 euro, really cheap with 2GB. However not in stock so can not be ordered yet, but I won't. | |
ID: 35162 | Rating: 0 | rate:
![]() ![]() ![]() | |
The GPU memory latency is supposedly better, so the GTX750Ti's memory bandwidth bottleneck might not be as bad as I first suspected. That said, compute performances are a bit 'all over the place'. It's definitely a wait and see situation for here. | |
ID: 35165 | Rating: 0 | rate:
![]() ![]() ![]() | |
Some more info on Maxwell. | |
ID: 35171 | Rating: 0 | rate:
![]() ![]() ![]() | |
NVidia are comparing the GTX750Ti to a 3 generation old GTX480 for performance and a GT640 for power usage, but not a GTX650Ti! For some games its roughly equal to a GTX480 and in terms of performance/Watt the GTX750Ti is 1.7times better than a GT640 (and similar). While it is a GX107 product, the name suggests its an upgrade to a GTX650Ti. | |
ID: 35173 | Rating: 0 | rate:
![]() ![]() ![]() | |
One of my team mates has a couple 750Ti's and they keep failing here. They are running good at Einstein. I have encouraged our team members to post in here that have the new Maxwell cards. | |
ID: 35236 | Rating: 0 | rate:
![]() ![]() ![]() | |
Thanks. That is important information to share! | |
ID: 35237 | Rating: 0 | rate:
![]() ![]() ![]() | |
I think I might be the one Coleslaw is referring about. I installed a pair of 750Ti cards in my computer yesterday and tried to run GPUGRID. No go, instant fail within a couple seconds. Einstein seems to run just fine. I bought these cards to run GPUGRID and I'm not to happy that they can't. I'm not that interested in running Einstein and other than F@H there isn't much else out there for GPUs. I refuse to participate in F@H anymore. If you need any info feel free to ask. | |
ID: 35238 | Rating: 0 | rate:
![]() ![]() ![]() | |
I think I might be the one Coleslaw is referring about. I installed a pair of 750Ti cards in my computer yesterday and tried to run GPUGRID. No go, instant fail within a couple seconds. Einstein seems to run just fine. I bought these cards to run GPUGRID and I'm not to happy that they can't. I'm not that interested in running Einstein and other than F@H there isn't much else out there for GPUs. I refuse to participate in F@H anymore. If you need any info feel free to ask. Yes you are. Thanks for volunteering. :) Gilthanis ____________ ![]() | |
ID: 35239 | Rating: 0 | rate:
![]() ![]() ![]() | |
That's pretty annoying - it likely means that we'll not be able to use them until CUDA 6 goes public. Matt | |
ID: 35240 | Rating: 0 | rate:
![]() ![]() ![]() | |
Sorry for being annoying. | |
ID: 35241 | Rating: 0 | rate:
![]() ![]() ![]() | |
I bought these cards to run GPUGRID and I'm not to happy that they can't. Well, it's a new architecture (or more precisely: significantly tweaked and rebalanced) so some "unexpected problems" can almost be expected. Be a bit patient, I'm sure this can be fixed. Maxwell is the new architecture for all upcoming nVidia chips in the next 1 - 2 years, after all. MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 35270 | Rating: 0 | rate:
![]() ![]() ![]() | |
Don't worry, support is coming just as soon as possible. These new cards are very exciting for us! Unfortunately, because of the way we build our application, we need to wait for the next version of CUDA which contains explicit Maxwell support. The other GPU-using projects don't have this limitation because they build their applications in a different way. The other side of that is that they aren't able to make GPU-specific optimisations the way we do. Matt | |
ID: 35272 | Rating: 0 | rate:
![]() ![]() ![]() | |
Don't worry, support is coming just as soon as possible. These new cards are very exciting for us! Unfortunately, because of the way we build our application, we need to wait for the next version of CUDA which contains explicit Maxwell support. I hope Nvidia comes out with the new version of CUDA soon. I expect GPUs that use much less electricity will become very popular quickly. I'll be switching back to GPUGRID when you get the Maxwell compatible client out. | |
ID: 35275 | Rating: 0 | rate:
![]() ![]() ![]() | |
The new Haswell E 6-core is available in the Netherlands, but pricy. Any idea when the real Maxwell is launched. I read "soon" on the net in some articles, but did not find any date. | |
ID: 37914 | Rating: 0 | rate:
![]() ![]() ![]() | |
The new Haswell E 6-core is available in the Netherlands, but pricy. Any idea when the real Maxwell is launched. I read "soon" on the net in some articles, but did not find any date. Rumor has it- GTX980 (or whatever board will be called) will be showcased (or released) at NVidia's Game24 event on September 18th, along with a 343 branch driver. GTX 970/960 could be released by early/mid October. Leaked benchmarks (if there not fake) show GM204 Maxwell to be at reference GTX780ti (5teraFlops) performance levels with a lower TDP. Maxwell's Integer/256AES/TMU/ROP performance is higher then Kelper's core. GTX 980 will have 256bit memory interface. Float (double/single) will be similar to a disabled DP core GK110 (GTX780/780ti) cards. A Titan with 64DP core SMX enabled for double precision tasks won't be replaced until another Maxwell stack is created for Titan's market position. A dual Maxwell board with 11/12 single teraflops' and 3/4 Teraflops for double would be an ultimate board. | |
ID: 37915 | Rating: 0 | rate:
![]() ![]() ![]() | |
| |
ID: 37919 | Rating: 0 | rate:
![]() ![]() ![]() | |
Thanks for this Jozef J. | |
ID: 37920 | Rating: 0 | rate:
![]() ![]() ![]() | |
So here is deciding which card would be best for GPUgrid The GTX 780Ti is superscalar, so not all of the 2880 CUDA cores can be utilized by the GPUGrid client. The actual number of the utilized CUDA cores of the GTX 780Ti is somewhere between 1920 and 2880 (most likely near the lower end). This could be different for each workunit batch. If they really manufacture the GM204 on 28nm lithography, than this is only a half step towards a new GPU generation. The performance per power ratio will be slightly better of the new GPUs, and (if the data in this chart are correct) I expect the GTX980 could be 15~25% faster than the GTX780Ti (here at GPUGrid). When we'll have the real GPUGrid performance of the GTX980, we'll know how much of the 2880 CUDA cores of the GTX780Ti is actually utilized by the GPUGrid client. But as NVidia choose to move back to scalar architecture, I expect that the superscalar architecture of the Keplers (and the later Fermis) wasn't as successful as expected. | |
ID: 37921 | Rating: 0 | rate:
![]() ![]() ![]() | |
So here is deciding which card would be best for GPUgrid Is NVidia skipping 20nm for 16nm? After couple years of development, TSMC is struggling badly to find proper Die size(s) for 20nm. Nvidia changes lithography every two years or so. Now after Two and half years, boards are still at 28nm, after three series releases(600,700,800m) of 28nm generations, while GTX980 will be the fourth 28nm released. What could be the problem with finding a pattern to fit cores on 20nm? The change from superscalar to scalar? How does a 5 SMM, 640core/40TMU/60W-TDP GTX750ti perform (7%~) better than a 4SMX, 768 core/ 110/130W-TDP Kelper with more TMU(64), while smashing GTX650ti/boost compute time/power consumption ratios? Core/memory speed differences'? GTX 750ti is close (~5%) to GTX660 (5SMX/960Core/140w-TDP) compute times. Is Maxwell's cache sub system architecture, TMU rendering that much better than Kelper's, running GPUGRID code? Maxwell's core architecture may be more efficient than Kepler's, but is Maxwell's really more advanced, when Float processing is similar to Kelper? Maxwell Integer performance is higher, due to having more integer cores in SMM vs. SMX, and the added barrel shifter, which is missing in Kepler. | |
ID: 37923 | Rating: 0 | rate:
![]() ![]() ![]() | |
How does a 5 SMM, 640core/40TMU/60W-TDP GTX750ti perform (7%~) better than a 4SMX, 768 core/ 110/130W-TDP Kelper with more TMU(64), while smashing GTX650ti/boost compute time/power consumption ratios? Core/memory speed differences'? GTX 750ti is close (~5%) to GTX660 (5SMX/960Core/140w-TDP) compute times. That's very easy to answer: The SMXes of the GTX650Ti and the GTX660 are superscalar, so only (approximately) 2/3rd of their cores can be utilized (512 and 640, respectively). | |
ID: 37924 | Rating: 0 | rate:
![]() ![]() ![]() | |
How does a 5 SMM, 640core/40TMU/60W-TDP GTX750ti perform (7%~) better than a 4SMX, 768 core/ 110/130W-TDP Kelper with more TMU(64), while smashing GTX650ti/boost compute time/power consumption ratios? Core/memory speed differences'? GTX 750ti is close (~5%) to GTX660 (5SMX/960Core/140w-TDP) compute times. If this is the case, then why do GPU utilization (MSI afterburner, eVGA precision) programs show +90% for most GPUGRID tasks? Are these programs not accounting for type of (scalar or superscalar) architecture? If only 2/3rd of cores are active, won't GPU utilization be at ~66%, instead of the typical 90%? These programs are capable of monitoring Bus usage, memory control (frame buffer), Video processing, amount of power, and much more. | |
ID: 37925 | Rating: 0 | rate:
![]() ![]() ![]() | |
I estimate that the new GM204 will be about 45% faster than a 780ti. | |
ID: 37926 | Rating: 0 | rate:
![]() ![]() ![]() | |
How does a 5 SMM, 640core/40TMU/60W-TDP GTX750ti perform (7%~) better than a 4SMX, 768 core/ 110/130W-TDP Kelper with more TMU(64), while smashing GTX650ti/boost compute time/power consumption ratios? Core/memory speed differences'? GTX 750ti is close (~5%) to GTX660 (5SMX/960Core/140w-TDP) compute times. The "GPU utilization" is not equivalent of the "CUDA cores utilization". These monitoring utilities are right in showing that high GPU utilization, as they showing the utilization of the untis which feeding the CUDA cores with work. I think the actual CUDA cores utilization can't be monitored. | |
ID: 37928 | Rating: 0 | rate:
![]() ![]() ![]() | |
I estimate that the new GM204 will be about 45% faster than a 780ti. That's a bit optimistic estimation as (1216/928)*(16/15)=1.3977. but... 1. my GTX780Ti is always boosting to 1098MHz, 2. the 1219MHz boost clock seems to be a bit high, as the GTX750Ti's boost clock is only 1085MHz, and it's a lesser chip. We'll see it soon. BTW there's an error in the chart, as the GTX780Ti has 15*192 CUDA cores. | |
ID: 37929 | Rating: 0 | rate:
![]() ![]() ![]() | |
[url]http://images.anandtech.com/doci/7764/SMMrecolored_575px.png | |
ID: 37931 | Rating: 0 | rate:
![]() ![]() ![]() | |
http://images.anandtech.com/doci/7764/SMMrecolored_575px.png | |
ID: 37933 | Rating: 0 | rate:
![]() ![]() ![]() | |
http://images.anandtech.com/doci/7764/SMMrecolored_575px.png Thank you for fixing links. | |
ID: 37934 | Rating: 0 | rate:
![]() ![]() ![]() | |
The GTX980 does quite well in the Folding@home benchmarks. | |
ID: 37941 | Rating: 0 | rate:
![]() ![]() ![]() | |
The GTX980 does quite well in the Folding@home benchmarks. Wow! Than it's possible that the GTX980's performance improvement over the GTX780Ti will be in the 25-45% range. | |
ID: 37942 | Rating: 0 | rate:
![]() ![]() ![]() | |
The GTX980 does quite well in the Folding@home benchmarks. Well if I read the graph correct then its 6.2 and the 780Ti is 11. The GTX 980 is available in the Netherlands and about €80 cheaper then a GTX 780Ti. However no EVGA boards are available yet. I am anxious to see the results of GTX 980 here. ____________ Greetings from TJ | |
ID: 37943 | Rating: 0 | rate:
![]() ![]() ![]() | |
The GTX980 does quite well in the Folding@home benchmarks. Double precision performance is lower but SP which is most common for folding is higher. | |
ID: 37946 | Rating: 0 | rate:
![]() ![]() ![]() | |
OK guys, how's got the first one up and running? I'd like to pull the trigger on a GTX970, but would prefer to know beforehand that it works as good, or better than expected, over here. | |
ID: 37948 | Rating: 0 | rate:
![]() ![]() ![]() | |
Well I am waiting on the 20nm Maxwell. However I will buy a GTX980 as soon as there is one from EVGA with 1 radial fan, to replace my GTX770. As soon as I have it installed I will let you all know. | |
ID: 37949 | Rating: 0 | rate:
![]() ![]() ![]() | |
Hopefully they develop an XP Driver too for these. | |
ID: 37951 | Rating: 0 | rate:
![]() ![]() ![]() | |
Aha you changed the name of the thread ETA? Yes :) MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 37952 | Rating: 0 | rate:
![]() ![]() ![]() | |
Hopefully they develop an XP Driver too for these. The 344.11 driver is released for Windows XP (and for x64 too), and there are GTX 980 and 970 in it (I've checked the nv_dispi.inf file). However if you search for drivers on the NVidia homepage, it won't display any results for WinXP / GTX 980. | |
ID: 37953 | Rating: 0 | rate:
![]() ![]() ![]() | |
GTX970 with driver 344.11 Stderr output I also tried with latest beta driver 344.16, but had same error. Is that a problem with my computer, or is GTX970 not supported yet? | |
ID: 37954 | Rating: 0 | rate:
![]() ![]() ![]() | |
GTX970 with driver 344.11 I think that the problem is the Compute Capability 5.2 is not supported yet. | |
ID: 37956 | Rating: 0 | rate:
![]() ![]() ![]() | |
I've sent Matt a PM. Hopefully this is easy to fix! | |
ID: 37958 | Rating: 0 | rate:
![]() ![]() ![]() | |
ext2097, did you also try other projects like Einstein@Home? | |
ID: 37962 | Rating: 0 | rate:
![]() ![]() ![]() | |
It's a new compute capability. CC 5.2 | |
ID: 37963 | Rating: 0 | rate:
![]() ![]() ![]() | |
SETI@home - http://setiathome.berkeley.edu/results.php?hostid=7376773 | |
ID: 37965 | Rating: 0 | rate:
![]() ![]() ![]() | |
It's a bit off topic, but it's a quite interesting Maxwell advertisement: | |
ID: 37966 | Rating: 0 | rate:
![]() ![]() ![]() | |
It's a bit off topic, but it's a quite interesting Maxwell advertisement: Cool. However, the folks at the flat earth society are not impressed with Nvidia's effort. :) http://forum.tfes.org/index.php?topic=1914.0 While we are waiting for GPUgrid GTX980/970 numbers, F@H performance looks encouraging. http://forums.evga.com/Someone-needs-to-post-980-andor-970-folding-numbers-here-when-they-get-one-m2218148.aspx | |
ID: 37969 | Rating: 0 | rate:
![]() ![]() ![]() | |
Ok gang, | |
ID: 37972 | Rating: 0 | rate:
![]() ![]() ![]() | |
..or not. The current CUDA release seems not to support that architecture yet. | |
ID: 37974 | Rating: 0 | rate:
![]() ![]() ![]() | |
Have you tried this yet?? [url] https://developer.nvidia.com/cuda-downloads-geforce-gtx9xx Driver 343.98 is included offering support for C.C 5.2 cards (GTX980/970) [/url] | |
ID: 37975 | Rating: 0 | rate:
![]() ![]() ![]() | |
Yeah, looked straight through that. | |
ID: 37976 | Rating: 0 | rate:
![]() ![]() ![]() | |
It's a bit off topic, but it's a quite interesting Maxwell advertisement: NVidia showing off VX Global illuminati... err illumination power. The Metro series Redux games utilize this tech with Kepler and above generations. (Even X-box/PS4 ports) GM204 technical advances compared to Kelper is rather striking. Fermi 480/580 compared to 680: GPU-GRID performance jump won't be nearly the GTX980 jump compared to GTX 680. Filtering and sorting performance for images, or to create programs with atoms and DNA/RNA strands- are higher than Kelper. Maxwell also has more internal memory bandwidth, and Third generation Color Compression enhancements offers more ROP performance, with better Cache latency. A single GM204 CUDA core is 25-40% faster compared to a single CUDA core Gk104, from the added atomic memory enhancements, and more registers for a thread. For Gaming: a GTX 980 is equal to [2]GTX 680. In every area GP(GPU) performance jump from GK104 to GM204 is significant. GK110 only now offers higher TMU performance, but without new type of filtering offer by GM204, unless Nvidia offers this for the Kelper Generation. GK110 has higher Double precision. A full fat Maxwell (?20nm/16nm FinFET ?/250W/3072~CUDA cores)that offers 1/3 SP/DP core ratio like GK110 is going to be a card for the ages. (Will first be a Tesla, Quadro, Titan variant?) Maybe AMD will come out with board shortly to challenge GM204 with a similar power consumption. Continuous performance competition should raise standards for each company. These next few years are key. GM204 replaces the GK104 stack, not GK110 [GTX 780](ti)disabled 64DP core SMX stack.(Driver allows 8 be active in SMX) GM204 per nm transistor density is only few percent more than GK104 (3.54B transistors/294mm2) and (7.1B transistors/551mm2) for GK110. Kepler's GK110 (4500-6000GFLOPS Desktop card are near [Single]20GFOLPS/W, while GK104 (2500-3500GFLOPS) is 15-18GFLOPS/W depending on clock speeds and voltage. Maxwell GM107(1400~FLOPS) is 20-23GFLOPS/W. Maxwell GM204(5000~FLOPS) is 30-35GFLOPS/W, depending on voltage and GDDR5/CORE speeds. GK104 highest rated mobile (GTX880m/2900~FLOPS) card is 27-29GFLOPS/W. GM204 compute time/ power usage ratios with a new app will be world class compared to more power hungry cards. Crunchers whose States/Countries higher taxes rates for power bills- a GTX970/980 is top notch choice. | |
ID: 37978 | Rating: 0 | rate:
![]() ![]() ![]() | |
Yeah, looked straight through that. 6.5.19 for 343.98 driver. Are 344 drivers the same? Updated Documents for PTX, programming guide, many others are included with 343.98/6.5 CUDA SDK. Before updating to 6.5.19 driver , I completed GPU-GRID tasks was CUDA 6.5.12 Linux, also has a new CUDA 6.5 driver also for download. | |
ID: 37979 | Rating: 0 | rate:
![]() ![]() ![]() | |
Right. New app version cuda65 for acemdlong. Windows only, needs driver version 344. | |
ID: 37981 | Rating: 0 | rate:
![]() ![]() ![]() | |
I ordered a GTX980 from Newegg. Zotac and gigabyte were my only options so I went with gigabyte. All other manufacturers cards were "out of stock". | |
ID: 37983 | Rating: 0 | rate:
![]() ![]() ![]() | |
Time to replace my trusty GTX 460, which has been GPUGrid-ing for years! At ~£100 the GTX 750Ti fits my budget nicely but I need some guidance. I installed a pair of 750Ti cards in my computer yesterday and tried to run GPUGRID. No go, instant fail within a couple seconds. This quote is from January. I hope the problem has been fixed! | |
ID: 37984 | Rating: 0 | rate:
![]() ![]() ![]() | |
The 750tis are great and work just fine. | |
ID: 37985 | Rating: 0 | rate:
![]() ![]() ![]() | |
I ordered a GTX980 from Newegg. Zotac and gigabyte were my only options so I went with gigabyte. All other manufacturers cards were "out of stock". Good luck with your card biodoc. In the Netherlands only Asus and MSI, but I will wait for EVGA. They are not out of stock, but just in production. A few weeks more is no problem. Moreover I first want to see some results. ____________ Greetings from TJ | |
ID: 37986 | Rating: 0 | rate:
![]() ![]() ![]() | |
http://www.techpowerup.com/ | |
ID: 37989 | Rating: 0 | rate:
![]() ![]() ![]() | |
Does anybody already have a working GTX 980 or 970? | |
ID: 37990 | Rating: 0 | rate:
![]() ![]() ![]() | |
ext2097, would you mind giving GPU-Grid another try? | |
ID: 37991 | Rating: 0 | rate:
![]() ![]() ![]() | |
I had a CUDA6.5 task on one of my GTX680s, but it's failed after 6 sec with the following error: # The simulation has become unstable. Terminating to avoid lock-up (1) 40x35-NOELIA_5bisrun2-2-4-RND5486_0Does anybody had a successful CUDA6.5 task on any older card? | |
ID: 37992 | Rating: 0 | rate:
![]() ![]() ![]() | |
And there's another failed CUDA6.5 task on my GTX780Ti: # Simulation unstable. Flag 11 value 1
# The simulation has become unstable. Terminating to avoid lock-up
# The simulation has become unstable. Terminating to avoid lock-up (2) | |
ID: 37993 | Rating: 0 | rate:
![]() ![]() ![]() | |
There were two more CUDA6.5 workunits on my GTX780Ti, both of them failed the same way: # Simulation unstable. Flag 11 value 1
# The simulation has become unstable. Terminating to avoid lock-up
# The simulation has become unstable. Terminating to avoid lock-up (2) I16R23-SDOERR_BARNA5-32-100-RND7031_0 I12R83-SDOERR_BARNA5-32-100-RND2687_0 Now this host received a CUDA6.0 task, so I don't want to try again, but I think that the CUDA6.5 app has a bug. | |
ID: 37994 | Rating: 0 | rate:
![]() ![]() ![]() | |
I ordered a GTX980 from Newegg. Zotac and gigabyte were my only options so I went with gigabyte. All other manufacturers cards were "out of stock". Yes, my preference would have been EVGA or PNY (lifetime warranty but fixed core voltage). This will be my first Gigabyte card so I hope it works out. The F@H numbers sold me. I think Nvidia GPU performance on F@H generally translates to GPUGrid. Besides, I usually spend the month of December folding, so the GTX980 will be a nice companion to my 780Ti. | |
ID: 37995 | Rating: 0 | rate:
![]() ![]() ![]() | |
I have more failed CUDA6.5 tasks on my GTX680: # The simulation has become unstable. Terminating to avoid lock-up (1) Actually all CUDA6.5 tasks are failing on my GTX680 (OC) and GTX780Ti (Non-OC) | |
ID: 37996 | Rating: 0 | rate:
![]() ![]() ![]() | |
I haven't found any successfully finished CUDA6.5 tasks (obviously these are too fresh). # Simulation unstable. Flag 11 value 1
# The simulation has become unstable. Terminating to avoid lock-up
# The simulation has become unstable. Terminating to avoid lock-up (2) I2R31-SDOERR_BARNA5-30-100-RND8191_0 (GTX770 OC) I11R57-SDOERR_BARNA5-32-100-RND3266_0 (GTX770 non-OC) # The simulation has become unstable. Terminating to avoid lock-up (1) | |
ID: 37997 | Rating: 0 | rate:
![]() ![]() ![]() | |
I just had one of these errors on a 780Ti as well. | |
ID: 37998 | Rating: 0 | rate:
![]() ![]() ![]() | |
I7R110-SDOERR_BARNA5-32-100-RND5097_1 10082727 181608 22 Sep 2014 | 17:59:10 UTC 23 Sep 2014 | 0:04:53 UTC Error while computing 6.07 2.70 --- Long runs (8-12 hours on fastest card) v8.41 (cuda65) | |
ID: 37999 | Rating: 0 | rate:
![]() ![]() ![]() | |
Jozef, there's a CUDA 6.0 task among these. Maybe you need to reboot the host after those CUDA 6.5 failures? Or is it running normally again? | |
ID: 38000 | Rating: 0 | rate:
![]() ![]() ![]() | |
CUDA 65s should only have been going to the new Maxwells, sorry about that. | |
ID: 38001 | Rating: 0 | rate:
![]() ![]() ![]() | |
I had two CUDA65 tasks, but same "FATAL: cannot find image for module [.nonbonded.cu.] for device version 520" error. | |
ID: 38002 | Rating: 0 | rate:
![]() ![]() ![]() | |
Right. The new 65 app is failing for non-obvious reasons, so I've moved it to the acemdbeta queue. If you have a GTX9x0, please get some work from that queue. | |
ID: 38003 | Rating: 0 | rate:
![]() ![]() ![]() | |
I have GTX970 and checked at "Run test applications?" and "ACEMD beta", but BOINC says "No tasks are available for ACEMD beta version". | |
ID: 38004 | Rating: 0 | rate:
![]() ![]() ![]() | |
I have GTX970 and checked at "Run test applications?" and "ACEMD beta", but BOINC says "No tasks are available for ACEMD beta version". On the GPUGRID preference page, about in the middle is a option called: Run test applications? You have to set it to yes as well. But perhaps you did then sorry about this post. ____________ Greetings from TJ | |
ID: 38005 | Rating: 0 | rate:
![]() ![]() ![]() | |
Use NVIDIA GPU : yes | |
ID: 38006 | Rating: 0 | rate:
![]() ![]() ![]() | |
Should I open that ESD bag? :) | |
ID: 38016 | Rating: 0 | rate:
![]() ![]() ![]() | |
These are two GK104/GK110 time from you're hosts to compared with new GM204. | |
ID: 38017 | Rating: 0 | rate:
![]() ![]() ![]() | |
The good news is that I've successfully installed the GTX980 under Windows XP x64. 23/09/2014 19:38:23 | GPUGRID | Requesting new tasks for NVIDIA
23/09/2014 19:38:25 | GPUGRID | Scheduler request completed: got 0 new tasks
23/09/2014 19:38:25 | GPUGRID | No tasks sent
23/09/2014 19:38:25 | GPUGRID | No tasks are available for ACEMD beta version
23/09/2014 19:38:25 | GPUGRID | No tasks are available for the applications you have selected.
23/09/2014 19:41:53 | GPUGRID | update requested by user
23/09/2014 19:41:55 | GPUGRID | Sending scheduler request: Requested by user.
23/09/2014 19:41:55 | GPUGRID | Requesting new tasks for NVIDIA
23/09/2014 19:41:57 | GPUGRID | Scheduler request completed: got 0 new tasks Before you ask: I did all the necessary settings. | |
ID: 38018 | Rating: 0 | rate:
![]() ![]() ![]() | |
Seems like Matt has to fill the beta queue, or already got enough failed results from the batch he submitted :p | |
ID: 38019 | Rating: 0 | rate:
![]() ![]() ![]() | |
Seems like Matt has to fill the beta queue, or already got enough failed results from the batch he submitted :p According to the server status page, there are 100 unsent beta workunits, and the application page shows that there is only the v8.42 CUDA6.5 beta app. Somehow these didn't bound together. I think this could be another scheduler issue. | |
ID: 38022 | Rating: 0 | rate:
![]() ![]() ![]() | |
Nice card you have there Zoltan. Would love to see the results as soon as Matt has got it working. | |
ID: 38024 | Rating: 0 | rate:
![]() ![]() ![]() | |
Nice card you have there Zoltan. Would love to see the results as soon as Matt has got it working. I second that! BTW: what are you currently running on the card? Any results from other projects to share? :) MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 38026 | Rating: 0 | rate:
![]() ![]() ![]() | |
Thank you TJ & ETA! | |
ID: 38027 | Rating: 0 | rate:
![]() ![]() ![]() | |
Thank you TJ & ETA! Any comment on you're phenomenal card's wattage usage for tasks, or temps? | |
ID: 38028 | Rating: 0 | rate:
![]() ![]() ![]() | |
Any comment on you're phenomenal card's wattage usage for tasks, or temps? It's awesome! :) The Einstein@home app is CUDA3.2 - ancient in terms of GPU computing, as this version is released for the GTX 2xx series - so the data you've asked for is almost irrelevant, but here it is: Ambient temperature: 24.8°C Task: p2030.20140610.G63.60-00.95.S.b6s0g0.00000_3648_1 Binary Radio Pulsar Search (Arecibo, GPU) v1.39 (BRP4G-cuda32-nv301) GPU temperature: 53°C GPU usage: 91-92% (muhahaha) GPU wattage: 90W (the difference between the idle GPU and the GPU in use, but the CPU is consuming a little to keep the GPU busy) GPU clock: 1240MHz GPU voltage: 1.218V GPU power 55% | |
ID: 38030 | Rating: 0 | rate:
![]() ![]() ![]() | |
Any comment on you're phenomenal card's wattage usage for tasks, or temps? Integer task? 90 watts (91-92%) for 1024 cores at 3.2 CUDA API shows Maxwell(2) GM204 internal core structure enhancements. Other components will be under less "stress" from energy usage drop. Percentage of taxes risen is off. Any efficiency updates help. Running 24/7 for weeks/months/years at time-- 250TDP card or 175TDP? 50W-105W TDP GM204 wattage change compared to 225W/250W GK110? 145TDP for 1664 core GTX970. GTX980 TDP 30 watts away from a 6/8 core Haswell-E @140watts. (A few 6/8 core E5 Haswell Xeons are 85W) Having multiple cards- energy savings add up. Higher MB/PSU efficiency, included. | |
ID: 38031 | Rating: 0 | rate:
![]() ![]() ![]() | |
I think I know why we don't receive beta tasks. | |
ID: 38032 | Rating: 0 | rate:
![]() ![]() ![]() | |
Zoltan - how many WU are You crunching at once at Einstein? | |
ID: 38036 | Rating: 0 | rate:
![]() ![]() ![]() | |
Zoltan - how many WU are You crunching at once at Einstein? Only one. Now I've changed my settings to run two simultaneously, but the power consumption haven't changed, only the GPU usage risen to 97%. | |
ID: 38039 | Rating: 0 | rate:
![]() ![]() ![]() | |
May I quote all this data at Einstein forum? | |
ID: 38040 | Rating: 0 | rate:
![]() ![]() ![]() | |
May I quote all this data at Einstein forum? Sure. | |
ID: 38041 | Rating: 0 | rate:
![]() ![]() ![]() | |
Any comment on you're phenomenal card's wattage usage for tasks, or temps? The Einstein numbers look great. Congrats on the new card Zoltan! | |
ID: 38042 | Rating: 0 | rate:
![]() ![]() ![]() | |
What does Boinc say, about amount of (peak) FLOPS in event log for GTX980? Near 5TeraFLOPS? Over at Mersenne trial-factoring--- a GTX980 is listed @ 1,126GHz and 4,710 GFLOPS. | |
ID: 38045 | Rating: 0 | rate:
![]() ![]() ![]() | |
What does Boinc say, about amount of (peak) FLOPS in event log for GTX980? Near 5TeraFLOPS? Over at Mersenne trial-factoring--- a GTX980 is listed @ 1,126GHz and 4,710 GFLOPS. Could somebody running a Maxwell-aware version of BOINC check and report this, please, and do a sanity-check of whether BOINC's figure is correct from what you know of the card's SM count, cores per SM, shader clock, flops_per_clock etc. etc? We got the figures for the 'baby Maxwell' 750/Ti into BOINC on 24 February (3edb124ab4b16492d58ce5a6f6e40c2244c97ed6), but I think that was just too late to catch v7.2.42 We're in a similar position this time, with v7.4.22 at release-candidate stage - I'd say that one was safe to test with, if nobody here has upgraded yet. TIA. | |
ID: 38048 | Rating: 0 | rate:
![]() ![]() ![]() | |
No idea why the scheduler wasn't giving out the 842 beta app. Look out for 843 now. | |
ID: 38049 | Rating: 0 | rate:
![]() ![]() ![]() | |
no, that's deliberate. It's a Maxwell-only build | |
ID: 38050 | Rating: 0 | rate:
![]() ![]() ![]() | |
There's now a linux build on acemdbeta. You'll definitely be needing to use a Linux client that reports the right driver version. | |
ID: 38051 | Rating: 0 | rate:
![]() ![]() ![]() | |
No idea why the scheduler wasn't giving out the 842 beta app. Look out for 843 now. I still could no get beta work. 24/09/2014 18:16:35 | GPUGRID | update requested by user
24/09/2014 18:16:38 | GPUGRID | Sending scheduler request: Requested by user.
24/09/2014 18:16:38 | GPUGRID | Requesting new tasks for NVIDIA
24/09/2014 18:16:41 | GPUGRID | Scheduler request completed: got 0 new tasks
24/09/2014 18:16:41 | GPUGRID | No tasks sent
24/09/2014 18:16:41 | GPUGRID | No tasks are available for ACEMD beta version
24/09/2014 18:16:41 | GPUGRID | No tasks are available for the applications you have selected. | |
ID: 38052 | Rating: 0 | rate:
![]() ![]() ![]() | |
I gave Folding@home a try, and the power consumption risen by 130W when I started folding on the GPU (GTX980) only. When I started folding on the CPU also, the power consumption went up by 68W (Core i7-870@3.2GHz, 7 threads). | |
ID: 38054 | Rating: 0 | rate:
![]() ![]() ![]() | |
Thanks Zoltan! Those numbers are really encouraging and show GM204 power consumption to be approximately where we expected them to be. This is in stark contrast to the ~250 W THG has measured under "some GP-GPU load". Maybe it was FurMark? With these results we can rest assured that the cards won't draw more than their power target to run GPU-Grid. | |
ID: 38055 | Rating: 0 | rate:
![]() ![]() ![]() | |
What does Boinc say, about amount of (peak) FLOPS in event log for GTX980? Near 5TeraFLOPS? Over at Mersenne trial-factoring--- a GTX980 is listed @ 1,126GHz and 4,710 GFLOPS. GPU info in sched_request/(projects)file/ or slot init_data file. Also, client_state provides working size? | |
ID: 38056 | Rating: 0 | rate:
![]() ![]() ![]() | |
Time to replace my trusty GTX 460, which has been GPUGrid-ing for years! At ~£100 the GTX 750Ti fits my budget nicely but I need some guidance. Something you didn't mention: If possible, get one that blows the hot air out of the case rather than blowing it around within the case. That should reduce the temperature for both the graphics board and the CPU, and therefore make both of them last longer. | |
ID: 38057 | Rating: 0 | rate:
![]() ![]() ![]() | |
Something I'm having trouble finding: How well do the new cards using PCIE3 work if the motherboard has only PCIE2 sockets? | |
ID: 38058 | Rating: 0 | rate:
![]() ![]() ![]() | |
Something I'm having trouble finding: How well do the new cards using PCIE3 work if the motherboard has only PCIE2 sockets? Physically, the sockets are the same. Electrically, they're compatible. | |
ID: 38059 | Rating: 0 | rate:
![]() ![]() ![]() | |
Something I'm having trouble finding: How well do the new cards using PCIE3 work if the motherboard has only PCIE2 sockets? We'll find out when there will be a working GPUGrid app, as I will move my GTX 980 to another host wich has PCIe3. | |
ID: 38060 | Rating: 0 | rate:
![]() ![]() ![]() | |
And while we're at it: what about memory controller load? Folding@home: 23% Einstein@home 2 tasks: 62-69% (Perseus arm survey/BRP5 & Arecibo, GPU/BRP4G) Einstein@home 1 task : 46-48% (Perseus arm survey/BRP5) Einstein@home 1 task : 58-64% (Arecibo, GPU/BRP4G) | |
ID: 38061 | Rating: 0 | rate:
![]() ![]() ![]() | |
I've got my GTX980 running on linux. | |
ID: 38062 | Rating: 0 | rate:
![]() ![]() ![]() | |
Just got two GTX 980's, will install tomorrow. Should be interesting to see how we go! | |
ID: 38066 | Rating: 0 | rate:
![]() ![]() ![]() | |
What does Boinc say, about amount of (peak) FLOPS in event log for GTX980? Near 5TeraFLOPS? Over at Mersenne trial-factoring--- a GTX980 is listed @ 1,126GHz and 4,710 GFLOPS. Here's what boinc 7.4.22 (64bit-linux version) is reporting: Starting BOINC client version 7.4.22 for x86_64-pc-linux-gnu CUDA: NVIDIA GPU 0: GeForce GTX 980 (driver version 343.22, CUDA version 6.5, compute capability 5.2, 4096MB, 3557MB available, 4979 GFLOPS peak) OpenCL: NVIDIA GPU 0: GeForce GTX 980 (driver version 343.22, device version OpenCL 1.1 CUDA, 4096MB, 3557MB available, 4979 GFLOPS peak) | |
ID: 38067 | Rating: 0 | rate:
![]() ![]() ![]() | |
What does Boinc say, about amount of (peak) FLOPS in event log for GTX980? Near 5TeraFLOPS? Over at Mersenne trial-factoring--- a GTX980 is listed @ 1,126GHz and 4,710 GFLOPS. OpenCl 1.1 ! Spec from 2010 (Fermi) 2.0 OpenCL spec been released for almost a year. This is Nvidia telling Intel and AMD, they don't give a hoot about OpenCL, because of CUDA. | |
ID: 38068 | Rating: 0 | rate:
![]() ![]() ![]() | |
............. Any more thoughts on when we might see a revised app for the 980 - mine looks very nice, but I'd like to put it to work!! | |
ID: 38069 | Rating: 0 | rate:
![]() ![]() ![]() | |
Just got two GTX 980's, will install tomorrow. Should be interesting to see how we go! We're waiting for a working app, so prepare a spare project for awhile. | |
ID: 38070 | Rating: 0 | rate:
![]() ![]() ![]() | |
There's now a linux build on acemdbeta. You'll definitely be needing to use a Linux client that reports the right driver version. I've got the latest boinc client for linux but am still getting no tasks for my GTX 980. Thu 25 Sep 2014 07:29:53 AM EDT | | Starting BOINC client version 7.4.22 for x86_64-pc-linux-gnu Thu 25 Sep 2014 07:29:53 AM EDT | | log flags: file_xfer, sched_ops, task Thu 25 Sep 2014 07:29:53 AM EDT | | Libraries: libcurl/7.35.0 OpenSSL/1.0.1f zlib/1.2.8 libidn/1.28 librtmp/2.3 Thu 25 Sep 2014 07:29:53 AM EDT | | Data directory: /home/mark/BOINC Thu 25 Sep 2014 07:29:53 AM EDT | | CUDA: NVIDIA GPU 0: GeForce GTX 980 (driver version 343.22, CUDA version 6.5, compute capability 5.2, 4096MB, 3566MB available, 4979 GFLOPS peak) Thu 25 Sep 2014 07:29:53 AM EDT | | OpenCL: NVIDIA GPU 0: GeForce GTX 980 (driver version 343.22, device version OpenCL 1.1 CUDA, 4096MB, 3566MB available, 4979 GFLOPS peak) Thu 25 Sep 2014 09:48:27 AM EDT | GPUGRID | Sending scheduler request: Requested by user. Thu 25 Sep 2014 09:48:27 AM EDT | GPUGRID | Requesting new tasks for NVIDIA GPU Thu 25 Sep 2014 09:48:29 AM EDT | GPUGRID | Scheduler request completed: got 0 new tasks Thu 25 Sep 2014 09:48:29 AM EDT | GPUGRID | No tasks sent Thu 25 Sep 2014 09:48:29 AM EDT | GPUGRID | No tasks are available for ACEMD beta version Thu 25 Sep 2014 09:48:29 AM EDT | GPUGRID | No tasks are available for the applications you have selected. | |
ID: 38071 | Rating: 0 | rate:
![]() ![]() ![]() | |
Regarding the power consumption of the new BigMaxwell: | |
ID: 38073 | Rating: 0 | rate:
![]() ![]() ![]() | |
Regarding the power consumption of the new BigMaxwell: Project 7621 uses the GPU "core 15" (Fahcore:0x15) version which is the oldest GPU client and runs exclusively on Windows machines. Those Wus generally run hot and use very little CPU as you've noticed. They are fixed credit WUs so PPD is low. Core 17 WUs are more efficient since they use a more recent version of openMM and are distributed to both windows and linux via an OpenCL app. They generally use 100% of a core on machines with an Nvidia card. These WUs offer a quick return bonus (QRB) and are very popular because the faster the card the higher the bonus. My GTX980 has finished several 9201 project (core17) WUs and is averaging 330,000 ppd. Amazing. Linux users have an advantage in that only core 17 WUs are delivered to linux machines. There are core 18 WUs now available to windows users. I don't know anything about them yet. | |
ID: 38074 | Rating: 0 | rate:
![]() ![]() ![]() | |
The acemd.841-65.exe file is 3.969.024 bytes long, but the acemd.842-65.exe is only 1.112.576 bytes long, so something went wrong with the latter. I've made my BOINC manager to start this acemd.842-65.exe as an acemd.841-60.exe by overwriting the latter and setting <dont_check_file_sizes> in the cc_config.xml, and I've modified the client_state.xml to copy the cudart32_65.dll and the cufft32_65.dll to the slot with the app, but I've got the same result as before with the 841-65 client. #SWAN: FATAL: cannot find image for module [.nonbonded.cu.] for device version 520 http://www.gpugrid.net/result.php?resultid=13130843 http://www.gpugrid.net/result.php?resultid=13132835 http://www.gpugrid.net/result.php?resultid=13135543 | |
ID: 38075 | Rating: 0 | rate:
![]() ![]() ![]() | |
I have data comparing my 780Ti with the 980 at Folding@home. | |
ID: 38079 | Rating: 0 | rate:
![]() ![]() ![]() | |
Well, I've not fixed the scheduler, but would you like to try that trick again with the new version 844? | |
ID: 38080 | Rating: 0 | rate:
![]() ![]() ![]() | |
Well, I've not fixed the scheduler, but would you like to try that trick again with the new version 844? At once, sire. :) | |
ID: 38081 | Rating: 0 | rate:
![]() ![]() ![]() | |
Well, I've not fixed the scheduler, but would you like to try that trick again with the new version 844? ...aaaaand we have a lift-off! It's crunching. # GPU [GeForce GTX 980] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GTX 980
# ECC : Disabled
# Global mem : 4095MB
# Capability : 5.2
# PCI ID : 0000:03:00.0
# Device clock : 1215MHz
# Memory clock : 3505MHz
# Memory width : 256bit
# Driver version : r343_98 : 34411
# GPU 0 : 41C
# GPU 0 : 43C
# GPU 0 : 44C
# GPU 0 : 46C
# GPU 0 : 47C
# GPU 0 : 49C
# GPU 0 : 50C
# GPU 0 : 52C
# GPU 0 : 53C
# GPU 0 : 54C
# GPU 0 : 55C
# GPU 0 : 56C
# GPU 0 : 57C
# GPU 0 : 58C
# GPU 0 : 59C
# GPU 0 : 60C
# GPU 0 : 61C
# GPU 0 : 62C
# GPU 0 : 63C 709-NOELIA_20MGWT-1-5-RND4766_0 GPU usage: 93-97% (CPU 100%, PCIe2.0x16) GPU power 93% (~160W increase measured at the wall outlet) GPU temperature: 64°C (ambient: 24°C) GPU memory controller load: 50% GPU memory used: 825MB GPU voltage: 1.218V GPU core clock: 1240MHz I estimate it will take 19.200 sec to finish this workunit (5h20m), which is more than it takes on a GTX780Ti (16.712), so I really should move this card to another host with PCIe3.0. | |
ID: 38082 | Rating: 0 | rate:
![]() ![]() ![]() | |
Good news that the app is working but disappointing performance. | |
ID: 38083 | Rating: 0 | rate:
![]() ![]() ![]() | |
Good news that the app is working but disappointing performance. Disappointing compared to GK110? Or GK104 boards? GTX980 (64DP cores/4DPperSMM/1DPper32coreblock) is replacement for GTX680 (64DP/8DPperSMX), NOT 96DPcore GTX780 or 120DPcore GTX780ti. Titan(Black)250TDP have 896/960 DP cores (64DPperSMX) Compared to GTX680, I'd say GTX980 is an excellent performer, other than Double Float. | |
ID: 38084 | Rating: 0 | rate:
![]() ![]() ![]() | |
Good news that the app is working but disappointing performance. I believe the GPUGrid app uses SP floating point calculations. F@H also uses SP. | |
ID: 38085 | Rating: 0 | rate:
![]() ![]() ![]() | |
Good news that the app is working but disappointing performance. You're right about GPUGrid. I'll swap my GTX670 and GTX980 and we'll see how's its performance in a PCIe3.0x16 slot. I expect that a GTX980 should be faster than a GTX780Ti at least by 10%. Maybe it won't be faster in the beginning, but in time the GPUGrid app could be refined for Maxwells. Besides different workunit batches will gain different performance (it could be even a loss of performance). | |
ID: 38086 | Rating: 0 | rate:
![]() ![]() ![]() | |
Good news that the app is working but disappointing performance. What do you think difference between PCIe2x16/PCIe3x16 is for GPUGRID, and similar programs? Also, do have idea how many of those "scalar" GM204 cores are cooking? Earlier in this thread-- You estimated 1920-2880 cores are being utilized for "superscalar" GK110. | |
ID: 38087 | Rating: 0 | rate:
![]() ![]() ![]() | |
Could you crop me the Performance information the *0_0 output file, please? | |
ID: 38088 | Rating: 0 | rate:
![]() ![]() ![]() | |
biodoc, I send you a off topic PM in this very moment. | |
ID: 38090 | Rating: 0 | rate:
![]() ![]() ![]() | |
Could you crop me the Performance information the *0_0 output file, please? It's already finished, and uploaded. I'll swap my cards when I get home. 709-NOELIA_20MGWT-1-5-RND4766_0 18,458.00 sec | |
ID: 38091 | Rating: 0 | rate:
![]() ![]() ![]() | |
Could you crop me the Performance information the *0_0 output file, please? I've successfully swapped my GTX670 ans GTX980 and hacked this client, so now I have another workunit in progress. The workunit is 13.103% completed at 40 minutes, the estimated total computing time is 18.316 sec (5h5m) A similar workunit took 16.616 sec (4h37m) to finish on my GTX780Ti (@1098MHz) CPU: Core i7-4770K @4.3GHz, 8GB DDR3 1866MHz GPU usage: 98% (CPU thread 100%, PCIe3.0x16) GPU Temperature: 62°C GPU Memory Controller load: 52% GPU Memory usage: 804MB GPU Voltage: 1.218V GPU Power: 95% (Haven't measured at the wall outlet) GPU Core Clock: 1240MHz # Simulation rate 83.10 (ave) 83.10 (inst) ns/day. Estimated completion Sat Sep 27 11:06:30 2014
# Simulation rate 88.80 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 10:19:41 2014
# Simulation rate 91.00 (ave) 95.75 (inst) ns/day. Estimated completion Sat Sep 27 10:03:10 2014
# Simulation rate 92.05 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:55:35 2014
# Simulation rate 92.69 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:51:02 2014
# Simulation rate 93.18 (ave) 95.75 (inst) ns/day. Estimated completion Sat Sep 27 09:47:33 2014
# Simulation rate 93.49 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:45:27 2014
# Simulation rate 93.76 (ave) 95.75 (inst) ns/day. Estimated completion Sat Sep 27 09:43:32 2014
# Simulation rate 93.94 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:42:21 2014
# Simulation rate 94.03 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:41:41 2014
# Simulation rate 94.19 (ave) 95.75 (inst) ns/day. Estimated completion Sat Sep 27 09:40:38 2014
# Simulation rate 94.28 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:39:59 2014
# Simulation rate 94.39 (ave) 95.75 (inst) ns/day. Estimated completion Sat Sep 27 09:39:13 2014
# Simulation rate 94.46 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:38:46 2014
# Simulation rate 94.49 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:38:33 2014
# Simulation rate 94.57 (ave) 95.75 (inst) ns/day. Estimated completion Sat Sep 27 09:38:02 2014
# Simulation rate 94.64 (ave) 95.75 (inst) ns/day. Estimated completion Sat Sep 27 09:37:34 2014
# Simulation rate 94.68 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:18 2014
# Simulation rate 94.73 (ave) 95.75 (inst) ns/day. Estimated completion Sat Sep 27 09:36:55 2014
# Simulation rate 94.76 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:43 2014
# Simulation rate 94.79 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:32 2014
# Simulation rate 94.83 (ave) 95.75 (inst) ns/day. Estimated completion Sat Sep 27 09:36:15 2014
# Simulation rate 94.86 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:06 2014
# Simulation rate 94.88 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:35:58 2014
# Simulation rate 94.88 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:35:57 2014
# Simulation rate 94.85 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:36:09 2014
# Simulation rate 94.82 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:36:20 2014
# Simulation rate 94.84 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:12 2014
# Simulation rate 94.84 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:11 2014
# Simulation rate 94.83 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:36:15 2014
# Simulation rate 94.81 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:36:25 2014
# Simulation rate 94.67 (ave) 90.65 (inst) ns/day. Estimated completion Sat Sep 27 09:37:20 2014
# Simulation rate 94.57 (ave) 91.40 (inst) ns/day. Estimated completion Sat Sep 27 09:38:01 2014
# Simulation rate 94.51 (ave) 92.55 (inst) ns/day. Estimated completion Sat Sep 27 09:38:26 2014
# Simulation rate 94.52 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:38:21 2014
# Simulation rate 94.54 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:38:12 2014
# Simulation rate 94.56 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:38:03 2014
# Simulation rate 94.59 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:55 2014
# Simulation rate 94.60 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:47 2014
# Simulation rate 94.61 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:37:44 2014
# Simulation rate 94.63 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:37 2014
# Simulation rate 94.65 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:30 2014
# Simulation rate 94.66 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:24 2014
# Simulation rate 94.68 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:18 2014
# Simulation rate 94.68 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:37:15 2014
# Simulation rate 94.71 (ave) 95.75 (inst) ns/day. Estimated completion Sat Sep 27 09:37:06 2014
# Simulation rate 94.72 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:01 2014
# Simulation rate 94.72 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:59 2014
# Simulation rate 94.74 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:54 2014
# Simulation rate 94.74 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:52 2014
# Simulation rate 94.75 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:48 2014
# Simulation rate 94.75 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:36:49 2014
# Simulation rate 94.74 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:36:54 2014
# Simulation rate 94.72 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:36:59 2014
# Simulation rate 94.73 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:57 2014
# Simulation rate 94.74 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:53 2014
# Simulation rate 94.75 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:49 2014
# Simulation rate 94.76 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:45 2014
# Simulation rate 94.76 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:43 2014
# Simulation rate 94.74 (ave) 93.33 (inst) ns/day. Estimated completion Sat Sep 27 09:36:53 2014
# Simulation rate 94.73 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:36:55 2014
# Simulation rate 94.74 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:51 2014
# Simulation rate 94.75 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:47 2014
# Simulation rate 94.76 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:46 2014
# Simulation rate 94.76 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:45 2014
# Simulation rate 94.76 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:44 2014
# Simulation rate 94.77 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:40 2014
# Simulation rate 94.77 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:36:42 2014
# Simulation rate 94.76 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:36:46 2014
# Simulation rate 94.74 (ave) 93.72 (inst) ns/day. Estimated completion Sat Sep 27 09:36:52 2014
# Simulation rate 94.72 (ave) 93.33 (inst) ns/day. Estimated completion Sat Sep 27 09:37:00 2014
# Simulation rate 94.73 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:56 2014
# Simulation rate 94.73 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:36:58 2014
# Simulation rate 94.73 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:56 2014
# Simulation rate 94.73 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:36:58 2014
# Simulation rate 94.73 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:56 2014
# Simulation rate 94.73 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:55 2014
# Simulation rate 94.72 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:36:59 2014
# Simulation rate 94.72 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:00 2014
# Simulation rate 94.72 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:01 2014
# Simulation rate 94.73 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:58 2014
# Simulation rate 94.73 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:57 2014
# Simulation rate 94.73 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:56 2014
# Simulation rate 94.73 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:36:57 2014
# Simulation rate 94.73 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:56 2014
# Simulation rate 94.73 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:55 2014
# Simulation rate 94.73 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:36:56 2014
# Simulation rate 94.73 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:36:57 2014
# Simulation rate 94.73 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:56 2014
# Simulation rate 94.72 (ave) 93.72 (inst) ns/day. Estimated completion Sat Sep 27 09:37:00 2014
# Simulation rate 94.71 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:37:03 2014
# Simulation rate 94.72 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:37:02 2014
# Simulation rate 94.72 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:37:01 2014
# Simulation rate 94.72 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:02 2014
# Simulation rate 94.71 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:03 2014
# Simulation rate 94.71 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:04 2014
# Simulation rate 94.71 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:05 2014
# Simulation rate 94.72 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:02 2014
# Simulation rate 94.71 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:03 2014
# Simulation rate 94.71 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:04 2014
# Simulation rate 94.71 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:37:06 2014
# Simulation rate 94.71 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:03 2014
# Simulation rate 94.71 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:04 2014
# Simulation rate 94.71 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:37:03 2014
# Simulation rate 94.71 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:04 2014
# Simulation rate 94.71 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:05 2014
# Simulation rate 94.71 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:37:04 2014
# Simulation rate 94.71 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:05 2014
# Simulation rate 94.71 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:37:04 2014
# Simulation rate 94.70 (ave) 93.72 (inst) ns/day. Estimated completion Sat Sep 27 09:37:08 2014
# Simulation rate 94.70 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:37:07 2014
# Simulation rate 94.71 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:04 2014
# Simulation rate 94.71 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:37:07 2014
# Simulation rate 94.70 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:07 2014
# Simulation rate 94.70 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:08 2014
# Simulation rate 94.71 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:06 2014
# Simulation rate 94.71 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:03 2014
# Simulation rate 94.72 (ave) 95.75 (inst) ns/day. Estimated completion Sat Sep 27 09:37:00 2014
# Simulation rate 94.72 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:01 2014
# Simulation rate 94.71 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:37:03 2014
# Simulation rate 94.71 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:37:05 2014
# Simulation rate 94.70 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:37:07 2014
# Simulation rate 94.70 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:07 2014
# Simulation rate 94.71 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:05 2014
# Simulation rate 94.71 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:06 2014
# Simulation rate 94.71 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:04 2014
# Simulation rate 94.72 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:02 2014
# Simulation rate 94.72 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:37:02 2014
# Simulation rate 94.72 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:37:00 2014
# Simulation rate 94.72 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:37:00 2014
# Simulation rate 94.72 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:59 2014
# Simulation rate 94.73 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:57 2014
# Simulation rate 94.73 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:55 2014
# Simulation rate 94.74 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:54 2014
# Simulation rate 94.74 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:53 2014
# Simulation rate 94.73 (ave) 93.72 (inst) ns/day. Estimated completion Sat Sep 27 09:36:56 2014
# Simulation rate 94.73 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:56 2014
# Simulation rate 94.73 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:55 2014
# Simulation rate 94.74 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:53 2014
# Simulation rate 94.74 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:53 2014
# Simulation rate 94.74 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:51 2014
# Simulation rate 94.74 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:36:52 2014
# Simulation rate 94.75 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:50 2014
# Simulation rate 94.75 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:48 2014
# Simulation rate 94.75 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:48 2014
# Simulation rate 94.75 (ave) 94.52 (inst) ns/day. Estimated completion Sat Sep 27 09:36:48 2014
# Simulation rate 94.75 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:48 2014
# Simulation rate 94.76 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:46 2014
# Simulation rate 94.76 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:45 2014
# Simulation rate 94.76 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:44 2014
# Simulation rate 94.76 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:44 2014
# Simulation rate 94.77 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:42 2014
# Simulation rate 94.77 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:41 2014
# Simulation rate 94.77 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:39 2014
# Simulation rate 94.76 (ave) 92.55 (inst) ns/day. Estimated completion Sat Sep 27 09:36:45 2014
# Simulation rate 94.76 (ave) 95.75 (inst) ns/day. Estimated completion Sat Sep 27 09:36:43 2014
# Simulation rate 94.77 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:41 2014
# Simulation rate 94.77 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:40 2014
# Simulation rate 94.77 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:38 2014
# Simulation rate 94.78 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:38 2014
# Simulation rate 94.78 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:37 2014
# Simulation rate 94.78 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:35 2014
# Simulation rate 94.79 (ave) 95.34 (inst) ns/day. Estimated completion Sat Sep 27 09:36:34 2014
# Simulation rate 94.79 (ave) 95.75 (inst) ns/day. Estimated completion Sat Sep 27 09:36:31 2014
# Simulation rate 94.79 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:31 2014
# Simulation rate 94.79 (ave) 94.12 (inst) ns/day. Estimated completion Sat Sep 27 09:36:33 2014
# Simulation rate 94.78 (ave) 93.72 (inst) ns/day. Estimated completion Sat Sep 27 09:36:35 2014
# Simulation rate 94.77 (ave) 93.33 (inst) ns/day. Estimated completion Sat Sep 27 09:36:39 2014
# Simulation rate 94.77 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:39 2014
# Simulation rate 94.78 (ave) 94.93 (inst) ns/day. Estimated completion Sat Sep 27 09:36:38 2014 | |
ID: 38094 | Rating: 0 | rate:
![]() ![]() ![]() | |
My GTX980 is crunching fine, a little slower than a GTX780Ti, while consuming much less power. So probably the GPUGrid client can use more than 1920 CUDA cores of the GTX780Ti (or it can't use all CUDA cores in Maxwell). | |
ID: 38097 | Rating: 0 | rate:
![]() ![]() ![]() | |
Great work, Zoltan! biodoc wrote: Good news that the app is working but disappointing performance. I would say it's only disappointing if your expectations were set really high. So far GM204 is not performing miracles here, but it's performing solidly at almost the performance level of GK110 for far less power used. biodoc wrote: I believe the GPUGrid app uses SP floating point calculations. Correct. eXaPower wrote: Also, do have idea how many of those "scalar" GM204 cores are cooking? Earlier in this thread-- You estimated 1920-2880 cores are being utilized for "superscalar" GK110. It was always hard for GPU-Grid to use the superscalar shaders, which amounts to 1/3 of all shaders in "all but the high-end Fermis" and all Keplers. That's where this number comes from. Maxwell has no such restrictions, hence all shaders can be used in principle. This says nothing about other potential bottlenecks, however: PCIe bus, memory bandwidth, CPU support etc. Translating these limitations into statments along the lines of "can only use xxxx shaders" would be misleading. Edit: BTW, what's the memory controller load for GTX780Ti running such tasks? MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 38098 | Rating: 0 | rate:
![]() ![]() ![]() | |
There are more potential variables in Zoltan's tests so far: | |
ID: 38099 | Rating: 0 | rate:
![]() ![]() ![]() | |
[url]http://www.anandtech.com/show/8568/the-geforce-gtx-970-review-feat-evga/13 [/url] | |
ID: 38100 | Rating: 0 | rate:
![]() ![]() ![]() | |
http://www.anandtech.com/show/8568/the-geforce-gtx-970-review-feat-evga/13 | |
ID: 38101 | Rating: 0 | rate:
![]() ![]() ![]() | |
@Biodoc: valid points. Regarding the clockspeed Zoltan said his GTX780Ti was running at 1098MHz, so it's got a "typical" overclock. And the new app claims to be CUDA 6.5. However, I don't think Matt changed the actual crunching code for this release, so any differences would come from changes in built-in functions. During the last few CUDA releases we haven't seen any large changes of GPU-Grid performance, so I don't expect it this time either. Anyway, for the best comparison both cards should run the new version. | |
ID: 38102 | Rating: 0 | rate:
![]() ![]() ![]() | |
@Biodoc: valid points. Regarding the clockspeed Zoltan said his GTX780Ti was running at 1098MHz, so it's got a "typical" overclock. And the new app claims to be CUDA 6.5. However, I don't think Matt changed the actual crunching code for this release, so any differences would come from changes in built-in functions. During the last few CUDA releases we haven't seen any large changes of GPU-Grid performance, so I don't expect it this time either. Anyway, for the best comparison both cards should run the new version. Has dynamic parallelism (C.C 3.5/5.0/5.2) been introduced to ACEMD? Or Unified Memory from CUDA 6.0? Unified memory is a C.C 3.0+ feature. Quoted from newest CUDA programming guide-- "new managed memory space in which all processors see a single coherent memory image with a common address space. A processor refers to any independent execution unit with a dedicated MMU. This includes both CPUs and GPUs of any type and architecture. " | |
ID: 38104 | Rating: 0 | rate:
![]() ![]() ![]() | |
I posted some power consumption data for my GTX980 (+/- overclock) at the F@H forum. | |
ID: 38106 | Rating: 0 | rate:
![]() ![]() ![]() | |
Has dynamic parallelism (C.C 3.5/5.0/5.2) been introduced to ACEMD? Or Unified Memory from CUDA 6.0? Unified memory is a C.C 3.0+ feature. Dynamic parallelism: no. It would break compatibility with older cards or require two separate code paths. Besides, GPU-Grid doesn't have much of a problem occupying all shader multiprocessors (SM, SMX etc.). Unified memory: this is only meant to ease programming for new applications, at the cost of some performance. For any existing code with optimized manual memory management (e.g. GPU-Grid) this would actually be a drawback. MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 38108 | Rating: 0 | rate:
![]() ![]() ![]() | |