ATI says that Cat9.2 now allows for Multi-GPU folding.
Anyone w/ multiple ATI video cards care to try this out and get back to us?
I'll be happy to help anyone out w/ the setup.
1: Main Rig - ASUS P6T Deluxe | i7 960 D0 @ 4.5GHz | 6GB OCZ PC3-1600 DDR3 | 2 x Intel X25-M RAID 0 | 2 x eVGA GTX295s in Quad-SLI | Dual Dell 3007WFP-HC Monitors
2: Server - ASUS P6T Deluxe | i7 960 D0 @ 4.5GHz | 6GB G.Skill PC3-1600 DDR3 | 2 x Seagate 73GB 10k RPM SAS RAID 0 | 6 x 1.5TB Seagate 7200.11 RAID 10
3: Folding Rig 1 - Intel Smackover | i7 920 D0 @ 3.95GHz | 6GB Patriot PC3-1600 DDR3
4: Folding Rig 2 - Intel Smackover | i7 920 D0 @ 3.8GHz | G.Skill PC3-1600 DDR3
5: Folding Rig 3 - MSI X58M | i7 920 D0 @ 3.5GHz | 6GB OCZ PC3-1600 DDR
will do later today Brent!
Folding@Home: Fighting diseases 1 WU at a time.
DFI LP Jr. P45 | Xeon X3320 | XFX 4770 | Corsair HX620 | Thermalright SI128SE | Other stuff | Lian-Li PC-V350B
Yeah I can set up dual cards in my pc and try this out. I'll do it in a few hours.
It's Goodbye To The Shortcuts,
Hello To The Grind.
Nobody Ever Said It
Would Be An Easy Ride.
Suffer For Your Art
E8400 @ 4.5GHz / Gigabyte EP45-DS3R / Custom Water / 9800GTX / X-Fi XtremeGamer / Klipsch RB-10s
E8400 / P35-DS3P / Ultima-90i / HD3870 / Win7 RC
I wonder if it improves client stability for even one client... in its current state, it is unusable IMO.
"Never skimp on the Power Supply" -Me
Core i7 920 D0 B-batch (4.1) (for now) | DFI X58 T3eH8 | Patriot 1600 (9-9-9-24) (for now) | XFX HD 4890 (971/1065) (for now) | 80GB X25-m G2 | WD 640GB | PCP&C 750 | Dell 2408 LCD | NEC 1970GX LCD | Win7 Pro | CoolerMaster ATCS 840 {Modded to reverse-ATX, WC'ing internal}
CPU Loop: MCP655 > HK 3.0 LT > ST 320 (3x Scythe G's) > ST Res >Pump
GPU Loop: MCP655 > MCW-60 > PA160 (1x YL D12SH) > ST Res > BIP 220 (2x YL D12SH) >Pump
Read this over at the F@H forum...
If you want to utilize the new drivers, some special steps need to be taken (no need to fear!).
Folding Forum • View topic - ATI Catalyst 9.2 is out!
You must do a clean uninstall of your previous ATI drivers. Download 9.2 and run it, but instead of selecting "Install", select the second option, "Uninstall". Reboot when it finishes, then re-open 9.2 and install. Reboot again, and then comes the only tricky part.
You can find the aticalcl.dll and aticalrt.dll files in your System32 folder. You must copy them into the folder that you have the GPU client installed in and, for the time being, rename the amdcalcl.dll and amdcalrt.dll (just put a 1 at the very front). Why the sudden name change? I have no clue.
In the link above, mhouston (big GPGPU guy with AMD), explains.
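For anyone who would rather script the copy step than drag files around, here is a minimal sketch in Python. It only covers the copy described above (aticalcl.dll and aticalrt.dll from System32 into the GPU client folder). It does not attempt the rename, and it does not replace the clean uninstall and reinstall. The function name `copy_cal_dlls` and the example paths in the comments are assumptions made up for illustration, not anything from ATI or Stanford.

```python
import shutil
from pathlib import Path

# DLL names as given in the post above.
CAL_DLLS = ("aticalcl.dll", "aticalrt.dll")


def copy_cal_dlls(system_dir, client_dir):
    """Copy the CAL DLLs from system_dir into client_dir.

    Both files must exist before anything is copied, so a half-finished
    copy is never left behind. Returns the list of destination paths.
    """
    system_dir = Path(system_dir)
    client_dir = Path(client_dir)

    sources = [system_dir / name for name in CAL_DLLS]
    missing = [src.name for src in sources if not src.is_file()]
    if missing:
        raise FileNotFoundError(f"not found in {system_dir}: {', '.join(missing)}")

    copied = []
    for src in sources:
        dst = client_dir / src.name
        shutil.copy2(src, dst)  # copy2 also keeps the file timestamps
        copied.append(dst)
    return copied


# Example call (hypothetical paths, adjust to your own install):
# copy_cal_dlls(r"C:\Windows\System32", r"C:\FAH\GPU")
```

Stop the GPU client before running it, and start the client again afterwards.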
I also found in that subforum, mhouston suggests that very soon we will see a new client for ATI cards.
And in this thread: http://foldingforum.org/viewtopic.php?f=51&t=8245
mhouston suggests that either this next release or the one after will address the performance deficit between ATI and nvidia cards with the GPU2 client. He also mentioned a possibility of a 2.5x speed increase.
Last edited by ColonelCain; 02-21-2009 at 12:11 PM.
So will this mean Stanford can quit gimpin the NV client?
C’est magnifique, mais ce n’est pas la guerre. C’est de la folie...
Main Gamer |Core i7 920|EVGA Classified 760|6x2GB Corsair Dominator|Sapphire HD5870|Corsair HX1000|Asus Xonar D2X|CM ATCS 840|2x 60GB Vertex RAID0|2x WD Raptor RAID0|Samsung F1|
Alternate |Core 2 Duo E8400 E0|DFI P45-T2RS+|2x2GB OCZ ReaperX|Asus HD5870|Silverstone 700W|Asus Xonar DX|Antec 1200 Modded|60GB Vertex|4x Samsung F1|
HTPC |Atom N330|Zotac IONITX-D-E|Geil Black Dragon 2x2GB|hec 200W Mini-ITX PSU|InWin BP655|30GB Apex|Samsung F3 1.5TB|
We can only hope...
Additionally, according to mhouston, their optimisation of the ATI GPU2 client is aimed at all product families, and then they will expand to optimizing for the 48xx series.
According to a blog that I was reading, the main cause of the CPU utilization by the ATI GPU2 client was the lack of local share cache for the SP arrays.
So, in other words, this is a limitation that is left from the R600 and RV670 generations.
The RV770 has local share cache, but ATI has been slow as molasses to expose the functionality to programmers.
This is why nvidia kicks *****: ever since the G80, they have had local share cache.
AMD’s Folding performance explained, future development revealed Theo’s Bright Side Of IT
mhouston remarked that while this article jumped to many conclusions, they do recognize this as something holding back the 48xx series in folding, and Stanford may include access to local share cache in one of the next releases.
Hooray! I now have an even better reason to show off my HD 4830! (soon)
e4300 3Ghz| 4GB DDR-887 Corsair XMS2| Sapphire HD 4770 820 Core/800 GDDR5| Gigabyte G31M- E2SL| WD Black 500GB| Ninja Mini (120mm Antec tricool)| Louise
Ok, went through and quoted the text that I found this in (I am not going to highlight much, as when people do you ONLY read that, and ignore everything else... the un-highlighted can change the significance):
The thread is titled: ATI Client limited to utilizing only 320 Shaders?!?!?!
Personally, I would ignore what I have quoted below and just go to the link below; it is from that post on that things get significant. mhouston is a big GPGPU guy working for AMD right now.
Folding Forum • View topic - ATI Client limited to utilizing only 320 Shaders?!?!?! [No]
That is all that I will quote, or my post will go on forever. Just go and start reading at the post I linked.
Read it. Good stuff.
Makes me happy that ATI users will be rewarded eventually, even if it is much later than when nvidia users were rewarded.
OHHHHHH!!!!!
I have been reading some more, and according to mhouston, there will be a new core coming out soon that will "reduce cpu usage on larger WU's dramatically".
Folding Forum • View topic - Frequent VPU Recover events with F@H 6.23 and Cat 9.1
Hopefully that is true, Cain, because my GPU client is really holding my 2nd SMP back.
{HTPC / Temp Gaming} PII 250@3.4 | GA-MA74GM-S2 | 4 Gb Gskill DDR2-1000 | palit 4850 Worst RMA service ever | 3x1TB WD Blacks | Antec Mini P180 | Corsair HX-1000w RMA' ING| Samsung 206BW | G15 | G5 |SupremeFxII
Politicians and diapers have one thing in common. They should be changed frequently for the same reason
Stay tuned, I am going to post a new thread regarding my findings using a tweak that reportedly reduces CPU load.
Until then, I would recommend closing the second SMP, as the PPD of the 4850 is higher than the PPD of the 4850+SMP. So, run one SMP and the GPU2 client. But again, watch.
OK, I will, thanks for the tip.
I had noticed with my 4870 that one GPU2 instance loads the card less than 50% even though the fahcore process doesn't max out the respective CPU core. That's why I've been running dual GPU2 clients on my 4870 all along, which keeps the GPU utilization around 90%. I can add a third instance to max it out, but I've found that my PPD is better if I instead load the rest of my CPU up with dual SMPs.
I look forward to any improvements they make, although I won't hold my breath for anything incredible.
I also want to point out that the 2.5x improvement figure is a misstatement (and a very typical one at that). In the thread you linked with that figure, it was stated correctly in the OP and then misstated elsewhere. They state in the OP that for a 4870, current PPD is 5000, and best case is 10000 to 12500 (12.5/5=2.5, or going by shaders, 800/320=2.5). That means the new performance would be 2 to 2.5 times as much as current performance, and is therefore only 1 to 1.5 times more than current performance. So a maximum of 1.5x increase according to the numbers they gave, not a 2.5x increase. Sorry to nitpick, but incorrect usage of 'as much' versus 'more than' when stating figures is possibly my biggest pet peeve.
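The "as much as" versus "more than" distinction can be checked with a few lines of Python. This is only a sketch using the figures quoted above (5000 PPD current, 10000 to 12500 PPD best case for a 4870); the helper names `times_as_much` and `times_more` are made up for illustration.

```python
def times_as_much(new, old):
    """Ratio of new to old: 'new is N times as much as old'."""
    return new / old


def times_more(new, old):
    """Relative increase: 'new is N times more than old'."""
    return (new - old) / old


current_ppd = 5000                 # current 4870 figure from the OP
best_case_ppd = (10000, 12500)     # best-case range from the OP

for best in best_case_ppd:
    print(
        f"{best} PPD is {times_as_much(best, current_ppd):.1f}x as much as, "
        f"or {times_more(best, current_ppd):.1f}x more than, {current_ppd} PPD"
    )

# The shader ratio gives the same 'as much as' figure: 800 / 320 = 2.5
print(f"shader ratio: {800 / 320:.1f}x")
```

Running it prints 2.0x and 2.5x for "as much as" but 1.0x and 1.5x for "more than", which is the whole difference between the two ways of stating the figure.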
Last edited by CapnBFG; 02-21-2009 at 10:14 PM.
Core2Quad 9550 | ASUS P5Q Pro | 8GB Patriot DDR2-800 | XFX Radeon HD 5870 | OCZ GameXStream 700W | Antec P180B
Phenom X2 7750 | MSI K9A2 Platinum | 2GB OCZ DDR2-1066 | GeCube Radeon HD 3870 | OCZ GameXStream 700W | Thermaltake Shark
Athlon X2 5600+ | EPoX MF570SLI | 4GB GeIL DDR2-800 | 2x EVGA GeForce 8800GT | Thermaltake ToughPower 700W | Apple PowerMac G5 case
Athlon XP 2800+ | MSI KT4V-L | 2GB Kingston DDR-333 | HIS Radeon 3850 AGP | Antec NeoPower 480W | custom painted case
Athlon 1.333GHz | MSI K7T266Pro | 1.4GB Kingston DDR-266 | 3dfx Voodoo 3000 PCI | generic 250W | custom Lexan SFF case
Preshottt 3.2GHz | Gigabyte GA-945GCM | 2GB Mushkin DDR2-800 | VisionTek Radeon HD 4870 | FSP Saga+ 450W | Systemax Tiger μATX case
Remember, you're a wreck, an accident. Forget the freak, you're just nature. Keep the gun oiled and the temple cleaned, $#!&, snort, and blaspheme.
Let the heads cool and the engine run. Because in the end, everything we do is just everything we've done.
Oh, that isn't nitpicking at all, I am glad that you pointed it out.
That is odd that it doesn't use the full GPU. Have you tried starting up just one instance of GPU2 and letting it run for a bit w/out any SMP clients on?
When I was folding on my 3870 I had the same problem. Had to run 2 instances on one card to fully utilize it. I do not know about the first GPU client, but GPU2 and ATI cards have had this problem from the beginning.
Antec 900, 750w Corsair, Biostar T-force TA790gx 128m, Ph II 940BE @ 3.6ghz, 8gb G-skill ddr2 1000, 750gb Samsung F1,1TB Seagate, Sapphire HD 4870 1gb ;
Rocketfish FT, 600w Rosewill, Abit IP35 Pro, E8190 Wolfdale @ 3.2ghz, 4gb Gskill ddr2 800, 200gb Seagate, Diamond HD3870 Ruby Edition
*Originally posted by tool_462: Yeah, Farhang does all his threatening from a Graco stool, wearing a pink princess outfit, in between trips to his Easy Bake Oven.
*To fail at an attempt to SF something, is to succeed at life in general. -Tool 2008
*[Today 01:34 PM] Farhang: Easybake Oven > Playing Crysis @1920x1200 Max Details +60 FPS
*[Today 10:44 AM] SEAL: I was hot black leather speedo gay for 8 hours yesterday... and no one caught that?
*BaldEagle: How fast can Congress spend money? It's like sending a bunch of drunk sailors into a brothel.
*[Today 11:01 PM] TurboFly03: I remember this mountain... I took a hooker roughly on.... wait... nope that was a fantasy
*[Today 03:37 PM] Ches111: Why the heck am I carrying around this hypnotized monkey?
*[Today 10:29 PM] DaSickNinja: I'd have better luck than SF if I had remembered to pull out
Yes, that was what I did when I first got the card. I let it run for about a day, and the points generated were hardly more than my 3870 (except that with my 3870 a single instance showed higher GPU usage). Then I added on a second GPU2 instance to load it down, which almost doubled the PPD. Then since I had two CPU cores idling I decided to add SMP units on. I used Brent's info on the SMP affinity locker to get two SMPs running. After playing around with various configurations of just SMP, just GPU2, and both, I settled for two of each since that seemed to give the best PPD.