I want begin by stating I'm not particularly computer savvy, but ill try my best to list everything as clearly as I can. I guess I need to start from the beginning, roughly around 2022, I got a prebuilt PC from NZXT. I've since changed the ram and memory, but all the other components listed below (besides the GPU for obvious reasons) are what it originally came with and still has currently installed:
AMD Ryzen 9 5900X 12-Core
NVIDIA GeForce RTX 3080 - GIGABYTE
Gigabyte X570S Aorus Master Wi-Fi
Team T-FORCE XTREEM ARGB 4000MHz DDR4 - 16 GB x 4
EVGA SuperNOVA 1000W G5 Gold
boot drive 1TB Samsung SSD
and the cooler is a NZXT Kraken X73
Things were perfectly fine for around a year, no issues that I could recall. I was 3 hours into play War Thunder at max settings, when my PC seemingly did what I will refer to as a "partial shutdown". Why partial? Because the PSU and Motherboard still seemed to be on. GPU and all the case fans would be off. To fix this, all I had to do was turn off the PSU, and turn it back on.
This happened again 3 days later, same game. Then started happening every 2 days, same game. Then it happened 2 days later on a different game. This is when it raised a alarm to me, it became clear the issue wasn't being caused by a game, but instead of something with the PC. The issue began happening without even having games running, once every single day. Having YouTube open, or not even any applications. At this point, sometimes it would do a full shown as well, with everything being completely off. IT seemed roughly 50/50 whether it would go into a full or partial shutdown.
When you google random shutdowns, the main issue is overheating. I never had my CPU go above 65, and GPU past 70. I have done a max 100% stress test on the computer for 8 hours twice, and never hit a issue. I have also conducted individual stress tests of the GPU, CPU, PSU, UPS, and Ram. To me this proved it was not a temperature issue, or a component failing from being at max use
The 2nd biggest cause of shutdowns I heard of was regarding power. My system normally draws around 450 watts, and never really gets past 650. The PSU is 1000W, and is plugged into a 1200 UPS. I have ran the entire build with just the battery of the UPS alone. This showed to be that power was not the concern.
The shutdowns started to occur every few hours, then every hour, then every half hour. You can see where this is going. It got to the point where it was every few minutes. Then, it shutdown before I could even login. This almost seemed to of "reset" the issue. With shutdowns occurring every 3 days or so, and progressively decreasing in time. The pattern became incredible apparent.
What I did to attempt to resolve the issue
Regarding Software:
-Updated everything to the latest drivers
-Downgraded to older drivers
-Reset BIOS
-Updated BIOS
-Clean Reinstall of Windows
Regarding Physical Stuff:
-Unplugged each peripherals one at a time (unplugged a different one every shutdown)
-Unplugged all peripherals (including mouse and keyboard)
-Disconnected all peripherals AND monitor after logging in
-Reset CMOS on motherboard
-Plugged PSU into different outlets, as well as outlets in different buildings
Nothing worked. Due to the Motherboard and PSU staying on during the "partial shutdowns", I concluded the GPU was at fault. So, I used it as a excuse to upgrade
2nd GPU
I got a brand new MSI 4080 from amazon, as well as the appropriate cables for it to function with my now previous gen PSU. Performance was far better, but more importantly the issue seemed to of been completely resolved
Fast forward 4 months...
Random shutdown, oh no. the issue was back, and it was following the same exact pattern. As far as I could tell, nothing related to performance seemed to trigger it. By now, I knew how this would go down. This relatively new GPU was still in warranty, so I would send it to RNA. I could not afford to have my computer for the weeks, or even month it would take for that to be handled. So I got another GPU
3rd GPU
I got another XLR8 4080 from amazon. 400$ cheaper then the MSI one, and the performance was notably worse, but who cares, as the PC was back running.
This lasted a month? maybe 2 at most.
My PC randomly shutdown on YouTube two days ago. The problem is back, it also reminded me that I actually forgot to send the MSI GPU to RMA, so I'm getting ready to do that now. Now there was I have noticed before these shutdowns actually appeared. Things got slower. But not in terms of performance, or at least not ones I was able to detect. I know that does not make a ton of sense, but I'll try to explain it as best I can. Loading into a game went from taking 2 seconds, to 4 seconds, to 10 seconds, then shutdown issue popped up. Thumbnails on YouTube videos would take longer to load as a I scrolled down the page. Neither of these were tied to internet.
But, my performance in the actual games was not changed, still the same loads, temperatures, and FPS. Login on the computer also got slower. When powering on, the screen would first show my login with like a 30% dark filter, before going to the normal view 1-2 seconds later, showing that it was struggling to load in the login screen? Im not sure. But across the board things seemed to have gotten slower, despite performance in applications seemingly not taking a hit whatsoever.
Event History shows these shutdowns as "Unexpected Shutdown Occurred" and never points towards any direction. Even History also shows multiple critical errors, but again, they are all from random unexpected shutdown, being classified as "Stopped working" or "Stopped responding and was closed". Systems never stating more then "a problem stopped this program from interacting with Windows". As far as I can tell, all of these errors are simply triggered by the random shutdown, with nothing being able to determine the actual trigger of the shutdown itself.
I have been broken attempting to trouble shoot this issue, and its become evident that buying new GPUs no longer even works as a band air solution. My new theory is it is something related to the motherboard. Why? because that's what the GPU is directly connected to, and my build has otherwise slowly deteriorated a 3080, and 2 perfectly good 4080s. I do not have another system or friend with a build that can test the GPUs, or really of my components in either. Something is happening that no built in or third party software is picking up. Which to me means that something is faulty on a hardware component that cant be detected by software. I would be very gracious for any type of support that could be provided. I have truly hit the bottom of the barrel in terms of what I'm capable of to resolve this issue,
Here's two old images of the shutdowns being recorded:
AMD Ryzen 9 5900X 12-Core
NVIDIA GeForce RTX 3080 - GIGABYTE
Gigabyte X570S Aorus Master Wi-Fi
Team T-FORCE XTREEM ARGB 4000MHz DDR4 - 16 GB x 4
EVGA SuperNOVA 1000W G5 Gold
boot drive 1TB Samsung SSD
and the cooler is a NZXT Kraken X73
Things were perfectly fine for around a year, no issues that I could recall. I was 3 hours into play War Thunder at max settings, when my PC seemingly did what I will refer to as a "partial shutdown". Why partial? Because the PSU and Motherboard still seemed to be on. GPU and all the case fans would be off. To fix this, all I had to do was turn off the PSU, and turn it back on.
This happened again 3 days later, same game. Then started happening every 2 days, same game. Then it happened 2 days later on a different game. This is when it raised a alarm to me, it became clear the issue wasn't being caused by a game, but instead of something with the PC. The issue began happening without even having games running, once every single day. Having YouTube open, or not even any applications. At this point, sometimes it would do a full shown as well, with everything being completely off. IT seemed roughly 50/50 whether it would go into a full or partial shutdown.
When you google random shutdowns, the main issue is overheating. I never had my CPU go above 65, and GPU past 70. I have done a max 100% stress test on the computer for 8 hours twice, and never hit a issue. I have also conducted individual stress tests of the GPU, CPU, PSU, UPS, and Ram. To me this proved it was not a temperature issue, or a component failing from being at max use
The 2nd biggest cause of shutdowns I heard of was regarding power. My system normally draws around 450 watts, and never really gets past 650. The PSU is 1000W, and is plugged into a 1200 UPS. I have ran the entire build with just the battery of the UPS alone. This showed to be that power was not the concern.
The shutdowns started to occur every few hours, then every hour, then every half hour. You can see where this is going. It got to the point where it was every few minutes. Then, it shutdown before I could even login. This almost seemed to of "reset" the issue. With shutdowns occurring every 3 days or so, and progressively decreasing in time. The pattern became incredible apparent.
What I did to attempt to resolve the issue
Regarding Software:
-Updated everything to the latest drivers
-Downgraded to older drivers
-Reset BIOS
-Updated BIOS
-Clean Reinstall of Windows
Regarding Physical Stuff:
-Unplugged each peripherals one at a time (unplugged a different one every shutdown)
-Unplugged all peripherals (including mouse and keyboard)
-Disconnected all peripherals AND monitor after logging in
-Reset CMOS on motherboard
-Plugged PSU into different outlets, as well as outlets in different buildings
Nothing worked. Due to the Motherboard and PSU staying on during the "partial shutdowns", I concluded the GPU was at fault. So, I used it as a excuse to upgrade
2nd GPU
I got a brand new MSI 4080 from amazon, as well as the appropriate cables for it to function with my now previous gen PSU. Performance was far better, but more importantly the issue seemed to of been completely resolved
Fast forward 4 months...
Random shutdown, oh no. the issue was back, and it was following the same exact pattern. As far as I could tell, nothing related to performance seemed to trigger it. By now, I knew how this would go down. This relatively new GPU was still in warranty, so I would send it to RNA. I could not afford to have my computer for the weeks, or even month it would take for that to be handled. So I got another GPU
3rd GPU
I got another XLR8 4080 from amazon. 400$ cheaper then the MSI one, and the performance was notably worse, but who cares, as the PC was back running.
This lasted a month? maybe 2 at most.
My PC randomly shutdown on YouTube two days ago. The problem is back, it also reminded me that I actually forgot to send the MSI GPU to RMA, so I'm getting ready to do that now. Now there was I have noticed before these shutdowns actually appeared. Things got slower. But not in terms of performance, or at least not ones I was able to detect. I know that does not make a ton of sense, but I'll try to explain it as best I can. Loading into a game went from taking 2 seconds, to 4 seconds, to 10 seconds, then shutdown issue popped up. Thumbnails on YouTube videos would take longer to load as a I scrolled down the page. Neither of these were tied to internet.
But, my performance in the actual games was not changed, still the same loads, temperatures, and FPS. Login on the computer also got slower. When powering on, the screen would first show my login with like a 30% dark filter, before going to the normal view 1-2 seconds later, showing that it was struggling to load in the login screen? Im not sure. But across the board things seemed to have gotten slower, despite performance in applications seemingly not taking a hit whatsoever.
Event History shows these shutdowns as "Unexpected Shutdown Occurred" and never points towards any direction. Even History also shows multiple critical errors, but again, they are all from random unexpected shutdown, being classified as "Stopped working" or "Stopped responding and was closed". Systems never stating more then "a problem stopped this program from interacting with Windows". As far as I can tell, all of these errors are simply triggered by the random shutdown, with nothing being able to determine the actual trigger of the shutdown itself.
I have been broken attempting to trouble shoot this issue, and its become evident that buying new GPUs no longer even works as a band air solution. My new theory is it is something related to the motherboard. Why? because that's what the GPU is directly connected to, and my build has otherwise slowly deteriorated a 3080, and 2 perfectly good 4080s. I do not have another system or friend with a build that can test the GPUs, or really of my components in either. Something is happening that no built in or third party software is picking up. Which to me means that something is faulty on a hardware component that cant be detected by software. I would be very gracious for any type of support that could be provided. I have truly hit the bottom of the barrel in terms of what I'm capable of to resolve this issue,
Here's two old images of the shutdowns being recorded: