Hey all, I've got a Sapphire r9 295x2 card that I need some advice on how to fix. The card has undergone fairly moderate use and is a few years old, as a preface. Right now, I have a couple issues with it:
1) Progressively worse overheating that causes immediate PC shutdown, as if it were unplugged. The card has little other simultaneous issues: no artifacting and only a couple BSODs in the past, not recently in tandem with the overheating, with driver failures that did not correlate to new drivers being installed. I can try to grab more info on this if relevant (would need instructions on how). Today was the worst, idle temps were 65C on the first card and 45C on the second - more on that below. When loading a simple 1080p stream, temps would spike up to 80C+, then the PC would shut off. Second card remained inactive, see 2). I am in the process of getting more compressed air to clean out the dust as there is some present. With these temps in mind, I'm wondering if I should reapply thermal paste as it looks like there is a fair amount left but not covering the entire core on both gpus.
2) Only one of two of the gpus appears to be working when idle and when under any amount of load, for a good chunk of time now (I don't use the card heavily anymore, so this hasn't been a major issue - both functioned fully in the past. I have mined some with this card, the moment one of them stopped behaving properly was during this time, I recall temperatures around 85-90C for mining, no forced shutdowns ever occurred during this period). Both gpus are recognized in device manager as functioning properly. Crossfire is on, have not tried turning frame pacing off yet. Wattman (adrenaline 1.9.3.3 - previous versions no difference, maybe a version before 1.7.x would work? - not sure where to get this) shows both gpus running with the same settings, 1250mhz memory and 1018mhz core. The first GPU uses exactly that memory, and functions properly in any scenario. The second gpu sits at 300mhz memory, 150mhz core speed (? Might have reversed the two values) and at 45C-ish idle as mentioned earlier. Have tried adjusting settings within wattman to give preference to the second gpu, no change. Tomorrow after I do a preliminary cleaning of the card without thermal paste, I'll try reseating it with one of the two 8pin PCI-e cables in a different port on my PSU, as I read that may be a possible fix for this issue.
I want to see if just a basic cleaning out of the card without more paste, a switch of PCI-e ports on the PSU, and potentially a previous version of Wattman/drivers will fix the issues I'm having. I'm worried they just seem a bit extreme and that the card is dying, as I definitely don't have the funds for another GPU. Please let me know your thoughts on these issues.
Specs:
Windows 10 64bit
AMD Ryzen 7 1700, 3.5ghz @ 1.27V
G.Skill Ripjaws 2x8 GB DDR4
Seasonic Focus+ Gold 850W PSU
MSi B350 Gaming Pro MB (BIOS is current)
PNY 120GB SSD
1) Progressively worse overheating that causes immediate PC shutdown, as if it were unplugged. The card has little other simultaneous issues: no artifacting and only a couple BSODs in the past, not recently in tandem with the overheating, with driver failures that did not correlate to new drivers being installed. I can try to grab more info on this if relevant (would need instructions on how). Today was the worst, idle temps were 65C on the first card and 45C on the second - more on that below. When loading a simple 1080p stream, temps would spike up to 80C+, then the PC would shut off. Second card remained inactive, see 2). I am in the process of getting more compressed air to clean out the dust as there is some present. With these temps in mind, I'm wondering if I should reapply thermal paste as it looks like there is a fair amount left but not covering the entire core on both gpus.
2) Only one of two of the gpus appears to be working when idle and when under any amount of load, for a good chunk of time now (I don't use the card heavily anymore, so this hasn't been a major issue - both functioned fully in the past. I have mined some with this card, the moment one of them stopped behaving properly was during this time, I recall temperatures around 85-90C for mining, no forced shutdowns ever occurred during this period). Both gpus are recognized in device manager as functioning properly. Crossfire is on, have not tried turning frame pacing off yet. Wattman (adrenaline 1.9.3.3 - previous versions no difference, maybe a version before 1.7.x would work? - not sure where to get this) shows both gpus running with the same settings, 1250mhz memory and 1018mhz core. The first GPU uses exactly that memory, and functions properly in any scenario. The second gpu sits at 300mhz memory, 150mhz core speed (? Might have reversed the two values) and at 45C-ish idle as mentioned earlier. Have tried adjusting settings within wattman to give preference to the second gpu, no change. Tomorrow after I do a preliminary cleaning of the card without thermal paste, I'll try reseating it with one of the two 8pin PCI-e cables in a different port on my PSU, as I read that may be a possible fix for this issue.
I want to see if just a basic cleaning out of the card without more paste, a switch of PCI-e ports on the PSU, and potentially a previous version of Wattman/drivers will fix the issues I'm having. I'm worried they just seem a bit extreme and that the card is dying, as I definitely don't have the funds for another GPU. Please let me know your thoughts on these issues.
Specs:
Windows 10 64bit
AMD Ryzen 7 1700, 3.5ghz @ 1.27V
G.Skill Ripjaws 2x8 GB DDR4
Seasonic Focus+ Gold 850W PSU
MSi B350 Gaming Pro MB (BIOS is current)
PNY 120GB SSD
Last edited: