Need help with PC crashes

ddrsquirrelz

Distinguished
Jul 17, 2008
16
0
18,510
Hi all,

My computer crashes once or twice a day and I'm trying to figure out how to fix it. At first, in event viewer, the warning showed up as "Windows Kernel event ID 41 error The system has rebooted without cleanly shutting down first". I googled it and read a lot of the solutions. I was using a Xion 700W PSU, which has the worst reviews I've ever seen for a PSU (got it from a friend), so I thought it was my PSU that was busted. I invested in a brand new PSU and bought an EVGA 80+ Gold Certified G3 750W. The issue continued to persist, so it wasnt a power issue.

The second thing I tried was to upgrade the BIOS' firmware. My mobo is a MSI X99A Raider ATX and I upgraded the firmware from P10 to P50. This definitely did something: my computer crashes a lot less frequently now but I'm getting new errors. Here's the summary of errors that I'm getting now:

1. Display driver nvlddmkm stopped responding and has successfully recovered: only happens while gaming. Screen turns black as the driver fails and restarts.
2. Event viewer WHEA error "event id 19": happens at the same time as the above.
3. Windows Kernel event ID 41 error: same as before, PC still randomly crashes (ingame or out of game, sometime crashes when I'm staring at the desktop with nothing open except the usual background programs)

I tried to tackle each of these issues. Here's a list of the things I tried (that I remember, I've been trying to fix this for months now...)

1. I tried playing with the TdrDelay, increasing it to 10 seconds. Didn't work.
2. I tried turning off TDR altogether by setting TdrLevel to 0. Didn't work.
3. I tried to disable Realtek Audio Driver and using only the Nvidia Audio Driver. Didn't work.
4. I turned off overclocking and am running the PC at stock. Didn't work.
5. I have DDR4-3200 RAM. I had XMP enabled to run it at 3200 MHz. I turned it off. Didn't work.
6. I'm using a GTX 1060 Gigabyte Windforce 6gb GPU. My friend has the same model and his PC works fine. We swapped for a week to see if the issue was with the GPU. Didn't work. PC continued to have issues even when using his GPU.
7. I set the BIOS options back to the recommended default. Now the back fan is blowing up a storm since the default seems to set it at 1500 RPM. Didn't work.
8. I cleaned up the whole PC (it was already very clean to begin with, I have a good case that's great at keeping dust out). Didn't work.
9. All drivers were already up to date.

It's also not a temperature issue. The GPU is running at stock and the temperature is around 50 degrees (stays around 35 degrees when idle). I also had installed a ton of fans in my case for no particular reason. The front has a 200mm fan, the back and bottom have a 140mm each, and the top is loaded with triple 120mm fans. The CPU is at roughly the same temperatures as well.

The only solution that I read that I remember not trying is to underclock my GPU. I didn't really feel like using an underclocked GPU, and I don't think I should need to with my current build. I also plan as a last ditch effort to reformat my PC, but I would like to avoid this option if possible. I'm running Windows 10.

Here's a list of all my PC parts

1. Intel Core i7-5930K
2. MSI X99A RAIDER
3. G.SKILL Ripjaws V Series F4-3200C16D-16GVK DDR4 3200MHZ 16GB 2X8GB 16-18-18-38 Memory Kit Black
4. Noctua NF-A15x2 PWM Retail Cooling D-Type Premium CPU Cooler Fan NH-D15
5. Phanteks Enthoo Pro Series PH-ES614P_BK Black Steel / Plastic ATX Full Tower Computer Case
6. Asus 250gb SSD
7. EVGA 80+ Gold Certified PSU G3 750W
6. GTX 1060 Gigabyte Windforce 6gb
7. Asus MG279Q Monitor (not that this would affect anything, putting it on the list just in case)


Thanks
 
Solution
Hello... You can clear or delete those past errors/lists and start a "fresh" new log of them... But typically there is a "Time stamp" for you to correlate the time it happen... typically the most resent ones are more relevant or you might have multiple "triggers" from more than one problem. B /
Hello... 1) Do you have any red or yellow marks in your Control panel-Device manager?
2) go back and look at the Event viewer- "APP" section and post img/digital of the current errors for us.
3) go back and look at the Event viewer- "Security" section and post img/digital of the current errors for us.
4) go back and look at the Event viewer- "System" section and post img/digital of the current errors for us.
 

ddrsquirrelz

Distinguished
Jul 17, 2008
16
0
18,510


Hi,

Thanks for the quick reply. I'm not sure if this is normal, but I literally have around 30k problems on each of those categories; it doesn't really fit in a screenshot...
 
Hello... You can clear or delete those past errors/lists and start a "fresh" new log of them... But typically there is a "Time stamp" for you to correlate the time it happen... typically the most resent ones are more relevant or you might have multiple "triggers" from more than one problem. B /
 
Solution