OK here’s a head scratcher for the fellow HW geeks.
Experienced system integrator here with a head-scratching intermittent fault on my otherwise rock solid rig, reliable for 5 years. All I changed were the graphics cards.
Rig specs:
ASUS ROG Rampage V Extreme
EVGA Supernova 1600 P2
Corsair Dominator DDR 4, 4*16
Core i7 6850k w/ EK plexi top copper block
2* Samsung Evo 850
3* Seagate 2TB spinners
Dual custom loops.
Outgoing GFX - 3* EVGA 980Ti 6G Hydro Copper
Incoming GFX - 2* ASUS 3090 EKWB 24G, (nVidia brand 3090 SLI bridge)
The Rampage V has 4 SMD PCIe device presence LEDs next to the 4-switch DIP bank for PCIe slot control.
On first boot, no POST, with the legacy option ROM code showing on the Q-Code display. The machine had been unplugged from mains for 4 weeks, so I cleared CMOS once and got a successful POST.
Top card in slot 1 - no presence LED. Clear CMOS once more, boot to Windows (display on 2nd card), verify operation of the card in the “4th” (physically 3rd) slot, shut down, fit the missing SLI bridge, get presence LED on slot 1. Happy days. All operational and performing.
Next day, during a game title launch, the machine halts dead and CPU idle ceases. Hard reset and the machine boots. I did not note the presence LED state, but the nVidia driver refuses to load automatically or manually. Reinstall drivers clean - all good, but SLI options are gone from the nVidia control panel and the second GPU is gone from the ASUS GPU Tweak tool. Because of this I check the presence LED on slot 1 - it’s out again.
Obviously, completely removing and reseating the card, or swapping it, is difficult without a teardown. It would also result in a lot of coolant wastage. Today I will partially reseat the card a few times, as much as the coolant pipe flex allows, and see if I can reproduce the intermittency - or even get it operating at this point. The PCIe slots had the EVGA 980s seated for 5 years and never removed. The slots and the edge connectors on the 3090s were visually checked during install and are in pristine, clean condition. The machine is fully filtered with marginal dust, and is serviced about every 6 months.
Any in situ diag ideas other than the usual PSU volt checks and swaptronics? (PSU volts look good - in spec.)
The cards were second hand but in excellent physical condition. I’m starting to get suspicious here, as the seller listed several of these cards at the same time - like 6 or more. I wonder if they were abused in a mining rig, or if the seller somehow got their hands on a stack of working but RMA'd cards and sold them off, gambling that buyers would not find or recognise an intermittent fault or a cosmetic fault like bad ARGB bus traces. There was coloured coolant residue in the blocks, but they were *surprisingly* clean. The seller is being evasive on the card history question (I asked where these came from twice and got no answer, though they replied to the rest of the query) and maintains they should not be faulty. I note 2/3 of the ARGB diodes on the “good” card are dead - which does nothing for my suspicion ...
If it needs to be swaptronics I’ll get to that in a week or so - has anyone else seen symptoms like the above? Dodgy water cooled parts in hard tube loops are a right pain due to the integration labour and the inability to test in a rig without a water loop.