New PC build has lots of BSOD's

thedude79

Proper
Nov 2, 2018
51
0
130
Hello, I recently built a new gaming rig and i'm having a lot of trouble with system stability. The system seems to run fine for normal web browsing activities, but once i start gaming on it, there's a chance for some random bluescreen error to pop up.

OS Name Microsoft Windows 10 Home
Version 10.0.17134 Build 17134
Other OS Description Not Available
OS Manufacturer Microsoft Corporation
System Name DESKTOP-G3M0DMS
System Type x64-based PC
System SKU SKU
Processor AMD Ryzen 7 2700X Eight-Core Processor, 3700 Mhz, 8 Core(s), 16 Logical
BIOS Version/Date American Megatrends Inc. 0804, 2018-07-09
SMBIOS Version 3.1
Embedded Controller Version 255.255
BIOS Mode UEFI
BaseBoard Manufacturer ASUSTeK COMPUTER INC.
It's running a Crosshair hero VII (wi-fi)
Platform Role Desktop
Secure Boot State Off
PCR7 Configuration Binding Not Possible
Windows Directory C:\WINDOWS
System Directory C:\WINDOWS\system32
Boot Device \Device\HarddiskVolume2
Locale United States
Hardware Abstraction Layer Version = "10.0.17134.285"
User Name DESKTOP-G3M0DMS\Michael
Time Zone Eastern Daylight Time
Installed Physical Memory (RAM) 16.0 GB
Total Physical Memory 15.9 GB
Available Physical Memory 11.2 GB
Total Virtual Memory 18.3 GB
Available Virtual Memory 10.9 GB
Page File Space 2.38 GB
Page File C:\pagefile.sys
Kernel DMA Protection Off
Virtualization-based security Not enabled
Device Encryption Support Reasons for failed automatic device encryption: TPM is not usable, PCR7 binding is not supported, Hardware Security Test Interface failed and device is not InstantGo, Un-allowed DMA capable bus/device(s) detected, TPM is not usable
Hyper-V - VM Monitor Mode Extensions Yes
Hyper-V - Second Level Address Translation Extensions Yes
Hyper-V - Virtualization Enabled in Firmware No
Hyper-V - Data Execution Protection Yes

I'm running an xc 2080 from EVGA, it's not overclocked or anything.
i've got 16gb of g skill ripjaws ddr4 it's supposed to clock to 3200mhz but i found the system is a bit more stable with 2933mhz, though this could just be my own bias.
All of the parts are brand new and not used. The system has a 500gb samsung evo 860 for it's boot drive and a 1tb samsuing 860 evo for it's second hard drive for game storage.

The computer has been throwing blue screen errors at me pretty much from day one, i've tried clean installing the nvidia drivers, running the default bios settings, flashing the bios. Generally speaking the BSOD's come when i'm gaming, ARK really seems to blue screen a lot but the witcher 3 and skyrim SE have also had bluescreen happen during gaming. I'm at my witts end with how to repair this pc.

Another thing the system does is randomly shut off. This usually seems to happen when i'm stress testing it or gaming, the system will just power off completely, and the only way to get the power button to function again is to flip the power supply's switch on and then off. I'm not sure whether or not the motherboard or some component is causing the issue. All of the fans the system has seem to be running smoothly and the 850 watt power supply is more than enough considering the fact i'm not overclocking the cpu or graphics card.


so the first system failure had the following information
Description
A problem with your hardware caused Windows to stop working correctly.

Problem signature
Problem Event Name: LiveKernelEvent
Code: ab
Parameter 1: 2
Parameter 2: 2e0
Parameter 3: 0
Parameter 4: 17
OS version: 10_0_17134
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.17134.2.0.0.768.101
Locale ID: 4105

followed by this bluescreen
Problem signature
Problem Event Name: BlueScreen
Code: 4e
Parameter 1: 99
Parameter 2: 3ade7b
Parameter 3: 2
Parameter 4: a0003a0003ade7a
OS version: 10_0_17134
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.17134.2.0.0.768.101
Locale ID: 4105

following this it had another couple of bluescreens
Problem signature
Problem Event Name: BlueScreen
Code: 19
Parameter 1: 20
Parameter 2: ffffc58c9c2f5000
Parameter 3: ffffc58c9c2f5730
Parameter 4: 5730100
OS version: 10_0_17134
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.17134.2.0.0.768.101
Locale ID: 4105

Extra information about the problem
Bucket ID: 0x19_20_nt!EtwpEnumerateAddressSpace

and then this blue screen
Problem signature
Problem Event Name: BlueScreen
Code: c2
Parameter 1: 7
Parameter 2: 666e6477
Parameter 3: 4050004
Parameter 4: ffffcc0371dfd160
OS version: 10_0_17134
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.17134.2.0.0.768.101
Locale ID: 4105

Extra information about the problem
Bucket ID: 0xc2_7_wdnf_nvlddmkm!CMemoryAllocator::freeMemoryWithTag

which was then followed by this blue screen
Problem signature
Problem Event Name: BlueScreen
Code: 119
Parameter 1: 10000
Parameter 2: ffff958aef468000
Parameter 3: ffff958af41c3ac0
Parameter 4: 0
OS version: 10_0_17134
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.17134.2.0.0.768.101
Locale ID: 4105

Extra information about the problem
Bucket ID: 0x119_10000_UNKNOWN_dxgmms2!VidSchiSetFlipDevice

then after another couple of days i got this blue screen
Problem signature
Problem Event Name: BlueScreen
Code: c2
Parameter 1: 4
Parameter 2: 51bb36a3
Parameter 3: 681a4d27
Parameter 4: ffffb206fdac53b8
OS version: 10_0_17134
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.17134.2.0.0.768.101
Locale ID: 4105

Extra information about the problem
Bucket ID: 0xc2_4_dxgmms2!operator_delete

followed by this one
Problem signature
Problem Event Name: BlueScreen
Code: 19
Parameter 1: 20
Parameter 2: ffffbd043faa6e20
Parameter 3: ffffbd043faa6e90
Parameter 4: 407042b
OS version: 10_0_17134
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.17134.2.0.0.768.101
Locale ID: 4105

Extra information about the problem
Bucket ID: 0x19_20_nt!MiRemoveSecureEntry

followed by
Problem signature
Problem Event Name: BlueScreen
Code: c2
Parameter 1: 7
Parameter 2: 4d52564e
Parameter 3: 4050004
Parameter 4: ffff8506af875730
OS version: 10_0_17134
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.17134.2.0.0.768.101
Locale ID: 4105

Extra information about the problem
Bucket ID: 0xc2_7_NVRM_nvlddmkm!CMemoryAllocator::freeMemoryWithTag

this was then followed by
Problem signature
Problem Event Name: BlueScreen
Code: 139
Parameter 1: 1d
Parameter 2: ffff8c07d19870d0
Parameter 3: ffff8c07d1987028
Parameter 4: 0
OS version: 10_0_17134
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.17134.2.0.0.768.101
Locale ID: 4105

Extra information about the problem
Bucket ID: 0x139_1d_INVALID_BALANCED_TREE_nt!RtlAvlRemoveNode

this issue was then followed by
Problem signature
Problem Event Name: BlueScreen
Code: c2
Parameter 1: 7
Parameter 2: 4d52564e
Parameter 3: 50005
Parameter 4: ffffd38363745e30
OS version: 10_0_17134
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.17134.2.0.0.768.101
Locale ID: 4105

Extra information about the problem
Bucket ID: CORRUPT_MODULELIST_0xc2_7

another bluescreen followed
Problem signature
Problem Event Name: BlueScreen
Code: 3b
Parameter 1: c0000005
Parameter 2: fffff803b08224d8
Parameter 3: ffff9b8f58b7fab0
Parameter 4: 0
OS version: 10_0_17134
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.17134.2.0.0.768.101
Locale ID: 4105

Extra information about the problem
Bucket ID: 0x3B_nt!MiRemoveSharedCommitNode

with this one following
Problem signature
Problem Event Name: BlueScreen
Code: c2
Parameter 1: 4
Parameter 2: ffffac05
Parameter 3: a8b50280
Parameter 4: ffffac05a78859c0
OS version: 10_0_17134
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.17134.2.0.0.768.101
Locale ID: 4105

Extra information about the problem
Bucket ID: 0xc2_4_nvlddmkm!CMemoryAllocator::freeMemoryWithTag

Today the computer didn't bluescreen but it shut down randomly and required me to flip the power button switch to turn it on. I don't really know what to make of these blue screens as multiple components are pointed at. I was wondering if the GPU's vram could be at fault for the errors because i ran a prime 95 memory and cpu test for about 20 hours consecutively and the system didn't bluescreen or freeze at all during that time period. Let me know if i can provide any other information, i'll be watching this thread fairly frequently, and i'd appreciate any help i could get with the issues. Thank you in advance.
 
check to see if there is a firmware update for you GPU.
google nvidia firmware update tool. (just guessing as to the cause)

all of the bugs are related to memory corruption most likely because of a driver.

otherwise you would update the bios and motherboard driver, then see if you still get bugchecks.
if you do, you should provide a memory dump from c:\windows\minidump directory.
so it can be looked at with a windows debugger for common issues.

after that, you might need to change the memory dump type to kernel, and turn on verifier flags to get the system to do driver checking. the larger memory dump contains more debug info and internal error logs.
verifier will do a lot of error checking and will bugcheck the system faster. it help identify the driver that is causing the problem when it tries to access memory it does not own.
 
This kind of reminds me of my AMD build I kept having similar issues and I never figured it out till years later since I replaced every piece of hardware and still had BSODs it ended up being the RAm conflicting with the rest of the hardware make sure it's 100% compatible with the cpu and motherboard. I actually was reading a long build thread for ryzen building a streaming HTPC which I needed a lot of cores. The person making the build has the same type of bsods stability issues and low performance. I no longer purchase anything from AMD and never have issues like this anymore. I only recently purchased AMD GPUS for a mining rig because there were no other cards available and guess what? My mining rig keeps having stability issues the AMD cards always crash the drivers constantly need to be reinstalled while my 4 nvidia cards run perfect.
 
I looked into updating the nvidia firmware, there didn't seem to be any updates for my card. What's the best way to post minidump files here? From what i could tell, there wasn't a way to attach files here.

I have the d ram voltage set to spec, and the timings are set to spec as well. I lowered the clock speeds because i thought it improved system stability. I think this may have been a red herring though. I did check the qvl list for the ram and it turns out the ram isn't compatible, so I've ordered new ram to see if this is the issue.

I'm going to try and run memtest86 shortly, is there a specific duration i should run it for?
 
Not all DDR4 ram will work.
It is not clear what your ram make/model is.
Verify that it is explicitly supported on your motherboard ram QVL list or by the ram vendor selection app for your motherboard.

Check to see if there is a relevant motherboard bios update available.
 
So i ran memtest86 for 1.5 hours, i had the memory overclocked to 3200mhz, to test if the ram speed was the issue. it ran every test 0-10 sequentially on the cpus testing all of the memory the system had. Memtest86 didn't find any errors with the ram in that time, i can try running it again overnight to see if that's the problem but for the duration i ran it, it didn't detect any errors. the ram model specifically is f4-3200c16d-16gvgb, PC4-25600, 8192MB x 2, CL 16-18-18-38 @ 1.35 volts. I have new ram coming that hasn't arrived that is on the QVL list for the motherboard, but judging by the fact both memtest and the windows built in memory test didn't find any errors I don't think it's the ram.

As for updating the system, i updated all of the drivers I could find on the asus website for this motherboard. I updated the firmware on the bios about a week ago and it's still throwing bluescreens and errors. I also clean installed the latest nvidia drivers in safe mode and this didn't seem to resolve the problems either.

Here's a shareable link to the system reliability checker report
https://drive.google.com/file/d/1RDObanuPGtmWuDwoeqcDELFDq4XGKF7O/view?usp=sharing
 
the files should be in c:\window\minidump directory
or kernel memory dumps should be at c:\windows\memory.dmp file
certain programs will cleanup the system and delete the files.
also, the locations are user specified, so your system might store them in another place or have them disabled.





 


I did a default install of windows 10 and I don't think i have any programs that do that. But it seems like it'd likely be the case because I have a bluescreen viewer app that could find the bluescreen error files one at a time, but once the computer restarted or was turned off I could no longer access the minidump files. Is there another place on the computer where they would be stored or backed up?
 


The power supply is brand new. It's an EVGA supernova 850 watt powersuply G3, it's rated at 80 plus gold efficiency. I don't know of any friends that would have a similar power supply, is there any other way to test if the power supply is good?
 


So i ran memtest86 for 1.5 hours, i had the memory overclocked to 3200mhz, to test if the ram speed was the issue. it ran every test 0-10 sequentially on the cpus testing all of the memory the system had. Memtest86 didn't find any errors with the ram in that time, i can try running it again overnight to see if that's the problem but for the duration i ran it, it didn't detect any errors. the ram model specifically is f4-3200c16d-16gvgb, PC4-25600, 8192MB x 2, CL 16-18-18-38 @ 1.35 volts. I have new ram coming that hasn't arrived that is on the QVL list for the motherboard, but judging by the fact both memtest and the windows built in memory test didn't find any errors I don't think it's the ram.
 


No need for a similar PSU, but one that is 650W or so and 80+ Gold as well. Test out the system there.
 


Check this link to validate your setup. There are certain circumstances in which you won't be getting minidump file generated. You'll want those to help drive into the nature of those BSODs (kernel panics).

https://answers.microsoft.com/en-us/windows/forum/windows_10-performance/windows-10-bsod-with-ssd-no-mini-dump-files/1134fcb0-d85d-446f-aff6-a757ff9a4d53

Also, you mentioned running Prime95 for 20 hours with no problems and Memtest86 passed. I'm going to guess at this point the source of your BSODs are either device driver based or the AV if you have one installed.

Short of knowing exactly what's tripping them, you might want to install Windows 10 on another drive (trial mode) and test again. That way you won't have to mess with your existing drive if at all possible.
 


I've got an empty disc hard drive in the case now, I can try to install windows 10 on that for sure. Currently I don't have any AV installed, and i haven't really been downloading or doing any web browsing that would get me any viruses so i'm fairly certain the computer is fine in that regard. It seems like a lot of the crashes happen when the system is under a heavy load.'

Also the computer does generate the minidump files, but it seems like something is removing them when the computer is restarted.
 
As others have pointed out, it could be PSU or MB related. If it's not getting fed enough current at the right voltages, that could cause the symptoms; specifically if it's HW related to where it can't write out a minidump file in time.

PSUs are vexing because they don't pass or fail. They can kinda sorta fail, or partial failure. Ditto with bad VRMs and capacitors on a MB.
 


So if i were to put the system under a really heavy load with a different but functional power supply, would that be a good way to determine what is going on?
 
FYI, if you've got a UPS, disconnect the PSU from it and instead plug directly to the wall. Some high efficient PSUs really hate synthetic sine wave generation that most consumer UPS put out. True sine wave is the way to go