[SOLVED] Causes of PC black screening under load ?

May 8, 2025
4
0
10
Apologies in advance for the wall of text, but I want to give as much info in case it's helpful.

Current specs:
Motherboard: ASRock X870E Taichi
CPU: Ryzen 9950X3D - with AIO Cooler
GPU: ASUS 4090 TUF OG - Stock Cooler
PSU: EVGA Supernova G7 1000
RAM: G.Skill 64 GB DDR5 6000 (F5-6000J3040G32GX2-TZ5NR) w/ EXPO
I've also tried F5-6400J3239G32GX2-TZ5RK (XMP 6400) RAM, but I didn't realize it wasn't on QVL, so I figured I'd test one that was

I recently upgraded my CPU, mobo, and RAM for a 64 gb 9950x3d build. For the most part, it's been nice, but recently, I picked up Expedition 33 and started encountering black screens and freezes. I had been daily driving Linux, so I was used to NVIDIA issues, but I started noticing 2 new issues:

1. Game would freeze after some time in random spots. Fans spin down, indicating that the DXVK device was likely killed. Video in web browser was unaffected, probably because it was software accelerated.

2. Worst case: screens go black at random intervals, monitors turn off, GPU fans maxed, system audio still playing, and I can connect using SSH as well. Linux logs say Xid 59 and "GPU has fallen off the bus". From my understanding Xid 79 is thermals/power related, but temps seemed fairly normal ~65-70 C.
To check whether it might be Linux driver related, I tried both open and closed-source drivers.

I also have a dual-boot Windows setup, so I tested on various different driver versions, with 576.15 being the latest, and 560.70 being the earliest. Across the 3 versions I tested, the black screen issue persisted, and instead of freezing the game, I would see a UE5 fatal error popup. To check if maybe other games had issues as well, I tried running Oblivion Remastered. Luckily no black screen, instead the game will just crash with the UE5 reporter.

I've tried reseating the GPU multiple times, replugging the 12VHPWR connector, reconnecting my PSU-side. I tried turning off EXPO to no avail. I even tried my old 2070 super, with somewhat similar results, though I didn't test enough to see black screens. I feel like this shouldn't be a GPU issue, as I've had the card for ~2 years now, and never had this issue. I bought the PSU at the same time, so I similarly doubt that's the issue, as I'd expect the system to entirely crash.

I do notice with GPU-Z that my GPU hotspot temps get up to ~120 C, and as PerfCap reasons it's usually Therm, but sometimes Vrel, which I understand should be normal when under load.

At this point, I'm at a loss, I feel like I've tried so many things, and unfortunately my old setup was a 3700X AM4 socket, so it's hard for me to test that many older parts. I appreciate any and all help I could get.
 
Solution
Your hot spot should not reach 120°c while the core is running at 70°c. This is a 50° delta t, way higher than the typical 15-20°. The card might have some thermal coupling issues and likely needs to be repasted. Not saying this is the cause of your problem, but your card is likely struggling right now.
Welcome to the forums, newcomer!

Causes?
1| Driver issue
2| OS corruption
3| GPU overheating
4| Refresh rate mismatch with driver/software/monitor/GPU

I recently upgraded my CPU, mobo, and RAM for a 64 gb 9950x3d build.
Did you reinstall the OS after your platform swap?

Motherboard: ASRock X870E Taichi
BIOS version for your motherboard?

I also have a dual-boot Windows setup, so I tested on various different driver versions, with 576.15 being the latest, and 560.70 being the earliest.
Might want to use DDU to remove all GPU drivers(Intel, AMD and Nvidia) in Safe Mode, then manually reinstall with 566.36 in an elevated command, i.e, Right click installer>Run as Administrator and see if the issue is alleviated.

As for the OS, I'd try and source a blank spare drive, install one copy of Windows OS in offline mode onto it and then manually install all relevant drivers to see if the issue persists. Dual boot setups tend to have anomalies when they're least expected.
 
Did you reinstall the OS after your platform swap?
Not initially, but I have done a full Windows reinstall recently just in case.

BIOS version for your motherboard?
3.20 which is the latest.

Might want to use DDU to remove all GPU drivers(Intel, AMD and Nvidia) in Safe Mode, then manually reinstall with 566.36 in an elevated command, i.e, Right click installer>Run as Administrator and see if the issue is alleviated.
I'll give this a shot and see if that fixes my issue.

As for the OS, I'd try and source a blank spare drive, install one copy of Windows OS in offline mode onto it and then manually install all relevant drivers to see if the issue persists. Dual boot setups tend to have anomalies when they're least expected.
Not sure if this would affect anything, but I do have my Windows entirely on a separate 1TB SSD as my main Linux drive. I think they share the same EFI partitions, but not sure if that could affect it.

I'm kind of curious if it may also be motherboard related, since this also happens with my older card? Current ASRock + 9800X3D issues aside, I wonder if, even after reseating multiple times, my PCIe slot might be messed up?
 
Your hot spot should not reach 120°c while the core is running at 70°c. This is a 50° delta t, way higher than the typical 15-20°. The card might have some thermal coupling issues and likely needs to be repasted. Not saying this is the cause of your problem, but your card is likely struggling right now.
 
  • Like
Reactions: Phaaze88
Solution
I'll try repasting and seeing if it at least improves the thermals. I wouldn't be surprised if it was shutting itself off to protect itself, but it seems odd that a ~2 year old GPU would have these issues?
 
Alright, it's been a couple days, and it seems like repasting (I used PTM instead), has drastically reduced temperatures and no black screens or freezes since. I'll also probably looking into doing some undervolting, since I've heard the 4090 can go kinda low and still have relatively similar performance with less heat. Thanks all!
 
Alright, it's been a couple days, and it seems like repasting (I used PTM instead), has drastically reduced temperatures and no black screens or freezes since. I'll also probably looking into doing some undervolting, since I've heard the 4090 can go kinda low and still have relatively similar performance with less heat. Thanks all!
That’s great news!