Help with BSOD while playing The Witcher 3

lemelecode

Prominent
Jul 28, 2017
Hi everyone,

My PC has been acting up lately. I bought The Witcher 3 a couple of weeks back, and the BSODs began as soon as I started playing it. My PC is pretty beefy and I haven't had any trouble so far, except when I play TW3. Sometimes it BSODs after a couple of hours, sometimes after 8-10 hours.

At first I was getting the error "CLOCK_WATCHDOG_TIMEOUT". I tried googling it, and it turns out it's most likely related to overclocked CPUs and possibly not enough voltage?

I had my CPU overclocked a bit, so I decided to go back to the default values (I had to enter them myself because there is no easy Clear CMOS feature; it seems I have to short the jumpers manually and I really don't know how to do that). It still crashed.

So I tried running a Win10 memtest, I updated the BIOS, I uninstalled XTU, looked for conflicting devices/drivers (of which it appears I have none).

After this, the BSOD changed. It still happens with the same sort of random-ish frequency, but now it's a "WHEA_UNCORRECTABLE_ERROR".

What am I supposed to do?

My Specs:

Motherboard: MSI Z170a Gaming M3
CPU: Intel i7 6700k
RAM: Corsair Vengeance LPX DDR4 4x8GB @2133MHz (stock speed, my CPU doesn't allow XMP profiles if I understood correctly)
VGA: NVidia GeForce GTX 1060 from ASUS Strix
OS SSD: Samsung 840 PRO 512 GB
PSU: Modular Corsair HX850 850W 80Plus Gold
CPU Cooling: Noctua NH-D9L
Windows 10 64bit
 
I should specify that the crash happens ONLY when playing The Witcher 3.

I have a dump file, although not sure how to post it.

EDIT: I'm almost positive my CPU Core Voltage should be 1.2V, although from the BIOS, when in AUTO mode, it stays between 1.296 and 1.301. Not really sure why.
 
Upload the dump file to a file sharing web site and share a link here, and someone will decode it for us :)
WHEA and Clock Watchdog Timeout are both hardware errors reported by the CPU.

run this on CPU: https://downloadcenter.intel.com/download/19792/Intel-Processor-Diagnostic-Tool

Vcore should be 1.2V

clear CMOS:

1. power off and unplug PC
2. use a jumper cap to short jbat1 for 5-10 seconds
3. remove jumper cap from jbat1
4. plug in power cord and restart PC

jbat1 appears to be next to PCI E3 - I found the instructions in your manual on page 39
 
Here is the link for the minidump.

Minidump

As for the clear CMOS, where can I get a jumper cap? I fail to understand how, but I've lost my motherboard box, so even if one was in there I have lost it.
I had to manually enter the Vcore value to keep it from spiking up. I've now set it to 1.25V.

I'll run the CPU test and will let you know what comes out of it.

EDIT: Intel Diagnostics Results:

--- IPDT64 - Revision: 4.0.0.29
--- IPDT64 - Start Time: 29-Jul-17 3:47:20 PM

CPU1
Intel(R) spbc Module Test
Module Version: 1.0.8.64b.W
Start Time: Sat Jul 29 15:47:20 2017
Test Result - PASS
..!!..Processor under test is verified as 'PRODUCTION'..!!..

Processor Manufacturer: GenuineIntel
Processor Brand String: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz

End Time: Sat Jul 29 15:47:20 2017
Total Time: seconds: 0
--------------------------------------------------------------------
CPU1
Genuine Intel CPU Test
Module Version: 1.0.12.64b.W
Start Time: Sat Jul 29 15:47:20 2017
Test Result - PASS
Expected: GenuineIntel
Detected: GenuineIntel
End Time: Sat Jul 29 15:47:20 2017
Total Time: seconds: 0
--------------------------------------------------------------------
CPU1
CPU Brand String Test
Module Version: 1.0.14.64b.W
Start Time: Sat Jul 29 15:47:20 2017
Test Result - PASS
Expected: Intel(R) Core(TM) CPU @
Detected: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz
Intel(R) Core(TM) processor detected..!!..
End Time: Sat Jul 29 15:47:20 2017
Total Time: seconds: 0
--------------------------------------------------------------------
CPU1
Floating Point Test
Module Version: 1.0.11.64b.W
Start Time: Sat Jul 29 15:47:21 2017
AVX is supported in your OS
Max AVX supported = AVX2
FMA3 supported
Test Result - PASS
Million Floating Points per Second, MFLOPS: 585.13
Error: 0
--- Floating Point Test Passed!!!---
End Time: Sat Jul 29 15:47:23 2017
Total Time: seconds: 2
--------------------------------------------------------------------
CPU1
Prime Number Generation Test
Module Version: 1.0.13.64b.W
Start Time: Sat Jul 29 15:47:23 2017
AVX is supported in your OS
Max AVX supported = AVX2
Test Result - PASS
Operation Per Second: 110726
Error: 0
--- Prime Number Generation Test Passed!!!---
End Time: Sat Jul 29 15:47:25 2017
Total Time: seconds: 2
--------------------------------------------------------------------
CPU1
Cache Test
Module Version: 1.0.11.64b.W
Start Time: Sat Jul 29 15:47:25 2017
Test Result - PASS

--- Reading Cache Size ---

- Detected L1 Data Cache Size --> 32
- Detected L1 Inst Cache Size --> 32

- Detected L2 Cache Size --> 256
- Detected L3 Cache Size --> 8192

Cache Size Test Passed!!!

End Time: Sat Jul 29 15:47:25 2017
Total Time: seconds: 0
--------------------------------------------------------------------
CPU1
MMXSSE Test
Module Version: 1.0.11.64b.W
Start Time: Sat Jul 29 15:47:25 2017
Test Result - PASS

--- Determining MMX - SSE capabilities ---
--- CPU FEATURES DETECTION FOR ---
--- MMX SSE ---
MMX - MMX Supported --> Yes
SSE - SSE Supported --> Yes
SSE2 - SSE2 Supported --> Yes
SSE3 - SSE3 Supported --> Yes
SSSE3 - SSSE3 Supported --> Yes
SSE4.1 - SSE4.1 Supported --> Yes
SSE4.2 - SSE4.2 Supported --> Yes
--- MMX SSE - capabilities check complete ---
MMX Test Result --- PASS
SSE Test Result --- PASS
SSE2 Test Result --- PASS
SSE3 Test Result --- PASS
SSSE3 Test Result --- PASS
SSE4.1 Test Result --- PASS
SSE4.2 Test Result --- PASS

End Time: Sat Jul 29 15:47:25 2017
Total Time: seconds: 0
--------------------------------------------------------------------
CPU1
FMA3 Test
Module Version: 1.0.10.64b.W
Start Time: Sat Jul 29 15:47:25 2017
Test Result - PASS

--- CPU Features Detection ---
FMA3 - Supported --> Yes
FMA3 Test Result --- PASS

End Time: Sat Jul 29 15:47:31 2017
Total Time: seconds: 6
--------------------------------------------------------------------
CPU1
AVX Test
Module Version: 2.0.8.64b.W
Start Time: Sat Jul 29 15:47:31 2017
Test Result - PASS

--- CPU Features Detection ---
AVX - Supported --> Yes
AVX2 - Supported --> Yes
AVX512 - Supported --> No
AVX OS Support - Supported --> Yes
AES - Supported --> Yes
PCLMULQDQ - Supported --> Yes
AVX2 Test Result --- PASS

End Time: Sat Jul 29 15:47:37 2017
Total Time: seconds: 6
--------------------------------------------------------------------
CPU1
IMC Test
Module Version: 1.0.8.64b.W
Start Time: Sat Jul 29 15:47:37 2017
Test Result - PASS
--- Reading Memory Size ---
Detected Memory Size is --> 32.00GB
--- Subtest - Memory Size Test Passed!!! ---
--- Integrated Memory Controller Stress Test ---
Memory to be allocated = 1048576 byte
Memory Allocated.
Test 1 Ones and Zeros Moving Inversions write operations - Passed
Test 1 Ones and Zeros Moving Inversions verification operations - Passed
Test 2 32Bits Sliding Ones write operations - Passed
Test 2 32Bits Sliding Ones verification operations - Passed
Test 3 32Bits Sliding Zero write operations - Passed
Test 3 32Bits Sliding Zero verification operations - Passed
Memory Deallocated.
--- Subtest - Memory Stress Test Passed!!! ---
Integrated Memory Controller Stress Test Passed!!!
End Time: Sat Jul 29 15:47:37 2017
Total Time: seconds: 0
--------------------------------------------------------------------
CPU1
PCH Test
Module Version: 1.0.10.64b.W
Start Time: Sat Jul 29 15:47:37 2017
Test Result - PASS
LPC deviceID = 0xa145

Chipset : Skylake Platform Controller Hub

No device connected to PCI Express Port 1

1 device connected to PCI Express Port 3
1 : PCI-to-PCI Bridge

1 device connected to PCI Express Port 4
1 : Killer E2400 Gigabit Ethernet Controller

No device connected to PCI Express Port 9

1 device connected to PCI Express Port 11
1 : ASMedia USB3.1 eXtensible Host Controller

3 device connected to SATA Controller 1
1 : Samsung SSD 840 PRO Series
2 : HL-DT-ST BDDVDRW CH10LS28
3 : WDC WD10EARX-00N0YB0

3USB devices detected
1 : USB Composite Device
2 : USB Composite Device
3 : USB Composite Device

1 device connected to Intel High Definition Audio Controller
1 : Realtek High Definition Audio

Platform Controller Hub Test Completed!!!
End Time: Sat Jul 29 15:47:38 2017
Total Time: seconds: 1
--------------------------------------------------------------------
CPU1
IGD Module Test
Module Version: 1.0.12.64b.W
Start Time: Sat Jul 29 15:47:38 2017
Test Result: Intel(R) Integrated Graphics Device Not Detected, Test Skipped
End Time: Sat Jul 29 15:47:38 2017
Total Time: seconds: 0
--------------------------------------------------------------------
CPU1
GFX Module Test
Module Version: 1.0.12.64b.W
Start Time: Sat Jul 29 15:47:38 2017
GFX Module Test: PASS
End Time: Sat Jul 29 15:48:06 2017
Total Time: seconds: 28
--------------------------------------------------------------------
CPU1
CPU Load
Module Version: 1.0.12.64b.W
Start Time: Sat Jul 29 15:48:06 2017
CPU Load: PASS
End Time: Sat Jul 29 15:52:26 2017
Total Time: seconds: 260
--------------------------------------------------------------------
CPU1
CPU Frequency Test
Module Version: 1.0.16.64b.W
Start Time: Sat Jul 29 15:52:26 2017
Test Result - PASS
Processor Name: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz
Expected CPU Frequency: 4.00
Detected CPU Frequency: 4.00044
Expected Base Clock: 100
Detected Base Clock: 100.011
QPI link rate Test - Not supported.
Skipping QPI link rate Test.
End Time: Sat Jul 29 15:52:27 2017
Total Time: seconds: 1
--------------------------------------------------------------------
--- IPDT64 - Temperature Monitor Results
Temperature Monitor will begin when CPULoad Test starts
--- Temperature Test ---
...
Version 1.0.13.64b.W
...

70 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 70 degrees below max

--- Temperature Test ---
...
Version 1.0.13.64b.W
...


Entering run forever
65 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 65 degrees below max
Test PASSED
52 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 52 degrees below max
Test PASSED
51 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 51 degrees below max
Test PASSED
52 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 52 degrees below max
Test PASSED
51 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 51 degrees below max
Test PASSED
51 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 51 degrees below max
Test PASSED
51 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 51 degrees below max
Test PASSED
51 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 51 degrees below max
Test PASSED
51 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 51 degrees below max
Test PASSED
51 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 51 degrees below max
Test PASSED
48 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 48 degrees below max
Test PASSED
61 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 61 degrees below max
Test PASSED
60 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 60 degrees below max
Test PASSED
50 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 50 degrees below max
Test PASSED
59 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 59 degrees below max
Test PASSED
60 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 60 degrees below max
Test PASSED
51 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 51 degrees below max
Test PASSED
60 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 60 degrees below max
Test PASSED
61 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 61 degrees below max
Test PASSED
61 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 61 degrees below max
Test PASSED
60 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 60 degrees below max
Test PASSED
49 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 49 degrees below max
Test PASSED
60 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 60 degrees below max
Test PASSED
49 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 49 degrees below max
Test PASSED
49 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 49 degrees below max
Test PASSED
49 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 49 degrees below max
Test PASSED
49 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 49 degrees below max
Test PASSED
49 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 49 degrees below max
Test PASSED
49 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 49 degrees below max
Test PASSED
61 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 61 degrees below max
Test PASSED
49 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 49 degrees below max
Test PASSED
50 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 50 degrees below max
Test PASSED
59 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 59 degrees below max
Test PASSED
60 degrees C below maximum
Test Result - PASS
Expected Greater than 1 degrees below max
Received 60 degrees below max
Test PASSED

Temperature Monitor Stopped

--------------------------------------------------------------------

System Information
------------------
Processor Name: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz
Processor Information: Intel64 Family 6 Model 94 Stepping 3
Number of Physical Cores: 4
Number of Logical Cores: 8
Installed System Memory: 32 GB
Operating System: Microsoft Windows 10 Enterprise 64-bit
Graphics Information: NVIDIA GeForce GTX 1060 6GB
System Product: Z170A GAMING M3 (MS-7978)
System BIOS: A.B0
System Serial Number: Default string
--------------------------------------------------------------------

--- IPDT64 - End Time: 29-Jul-17 3:52:27 PM
--- IPDT64 - Result: Pass
--------------------------------------------------------------------
 
Hi, I ran the dump file(s) through the debugger and got the following information: https://pastebin.com/wNbF2iNT

File: 072917-5687-01.dmp (Fri Jul 28 20:16:34.964 2017)
BugCheck: [WHEA_UNCORRECTABLE_ERROR (124)] {0, ffffd902c0346028, be000000, 800400}
Probably caused by: GenuineIntel (Process: System)

!errrec on 2nd argument returned: Error : Internal timer (Proc 1 Bank 4)
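
For anyone wondering how the bug check line maps to "Internal timer": for a 0x124 machine check, parameters 3 and 4 are documented as the high and low 32 bits of the CPU's MCi_STATUS register. Below is a rough Python sketch of that decoding. The function name and the tiny lookup table are my own illustration (only the one error code seen here is listed), and the bit positions are taken from my reading of Intel's machine check architecture documentation, so treat it as a sanity check rather than a replacement for the debugger.

```python
# Sketch: rebuild MCi_STATUS from bug check 0x124 parameters 3 and 4.
# Assumption: bit layout follows Intel's machine check architecture
# (VAL=63, OVER=62, UC=61, EN=60, MISCV=59, ADDRV=58, PCC=57).
FLAG_BITS = {
    "VAL": 63, "OVER": 62, "UC": 61, "EN": 60,
    "MISCV": 59, "ADDRV": 58, "PCC": 57,
}

# Deliberately incomplete table: only the code seen in this dump.
SIMPLE_MCA_CODES = {0x0400: "Internal timer error"}


def decode_mci_status(high32, low32):
    """Combine the two 32-bit halves and pull out the main fields."""
    status = (high32 << 32) | low32
    flags = [name for name, bit in FLAG_BITS.items() if (status >> bit) & 1]
    mca_code = status & 0xFFFF            # architectural error code
    model_code = (status >> 16) & 0xFFFF  # model-specific error code
    return {
        "status": status,
        "flags": flags,
        "mca_code": mca_code,
        "model_code": model_code,
        "description": SIMPLE_MCA_CODES.get(mca_code, "not in this small table"),
    }


if __name__ == "__main__":
    # Parameters 3 and 4 from the bug check line in this thread.
    result = decode_mci_status(0xBE000000, 0x00800400)
    print(hex(result["status"]), result["flags"], result["description"])
```

The UC and PCC flags being set means the CPU itself flagged the error as uncorrected with its context possibly corrupted, which fits a hardware-level cause (such as voltage) more than a game bug.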

I can't help you with this. Wait for someone else to reply. Good luck.
 


Thank you, I shall wait for someone to shed more light on the matter :)
 
Intel test seems okay...

WHEA errors can be caused by driver software. I am not an expert on reading the dumps created by the debugger, but I can see a few old drivers there that might be the cause.

Check that you have the latest Samsung Magician installed, as I see you are running RAPID mode; the drivers are dated Nov 16.
Update Logitech Gaming Software, your version is from 2015 - http://support.logitech.com/en_us/software/lgs

asmtxhci.sys (dated Thu Feb 4 2016) - ASMEDIA XHCI Host Controller (find it on the Others tab of the driver page below)
Update the Killer network drivers from the LAN page here: https://www.msi.com/Motherboard/support/Z170A-GAMING-M3.html#down-driver

Or install and run MSI Live Update 6 from the utility page and see what it finds, as it should help with the ASMEDIA driver as well.
 


All right, I'm downloading all of the drivers from the MSI page, since I can't remember which are updated and which are not. MSI Live Update is pretty bad; it didn't even prompt me about 4 new BIOS versions. I only found out last week when I updated the BIOS myself.

I have been playing for two days with no BSODs though, ever since I manually set the Vcore voltage to 1.25V.
I can't be certain the problem is fixed, so I will test further (as in: I will have to play more, since the BSODs only happen when playing The Witcher 3).
 
I do have to ask: I have stress tested the CPU with multiple programs (XTU, RealBench, and some others I can't quite recall now). The max temp I've seen has been 83° if I am not mistaken. I believe that is a bit too high?

It's really hot here now, around 35° and even more during the day, so that might be an explanation.
Idle temps are really low, around 30°-40° max. The case fans are working properly, maybe a bit too slowly because they are 6 years old, but the air comes out cool from the back. The cooler is a Noctua, which should be good. I had to apply the thermal paste myself, but I followed a guide for it.
Plus I don't really see temperature being the cause of the BSODs: as I stated in the OP, the crashes happen quite randomly, so my thought process excludes temperature build-up.

I don't know. It seems weird.
 
Not sure about the RAM not being able to run XMP, but I am not an expert on that.

It depends on what recorded that 83. There are different sensors that can show temps: the max temp for the CPU heat spreader is 64C, but the max temp for the individual cores themselves is 100C.

You might want to check the thermal paste, but if the ambient temperature is warm, the CPU is not going to get any cooler than ambient. My PC sits at 30C idle in summer. Check that the fans aren't dusty as well.
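
Side note on the Intel tool's temperature lines earlier in the thread: they are reported as "degrees C below maximum" rather than as absolute temperatures. Here is a small Python sketch for converting them, assuming the tool's "maximum" is the 100C per-core limit mentioned above (that is my assumption, the log does not say so). The function name `margins_to_temps` is just for illustration.

```python
import re

# Assumption: "maximum" in the IPDT log means the 100C core limit.
TJ_MAX_C = 100


def margins_to_temps(log_text, tj_max=TJ_MAX_C):
    """Turn 'NN degrees C below maximum' lines into absolute core temps."""
    margins = re.findall(
        r"^(\d+) degrees C below maximum", log_text, flags=re.MULTILINE
    )
    return [tj_max - int(m) for m in margins]


if __name__ == "__main__":
    # A few lines copied from the log in this thread.
    sample = (
        "70 degrees C below maximum\n"
        "Test Result - PASS\n"
        "Received 70 degrees below max\n"
        "48 degrees C below maximum\n"
    )
    print(margins_to_temps(sample))
```

Under that assumption, the smallest margin in the posted log (48 below maximum) would correspond to roughly 52C during the tool's CPU load test, well under the 83 seen in RealBench.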
 
I believe it was the Core Package temp recorded with HWMonitor during a RealBench stress test that was around 83°.
With XTU, the max temp during a stress test was 70°-ish. I don't remember exactly, as it was a couple of months ago.

I might have to buy a new case, and clean all the dust off of the components when switching over. Do you suggest doing it with compressed air or with a soft brush?
 
I have heard it's both: you might need a soft brush on the CPU/GPU fans, but case fans should be okay with compressed air. I generally just use a brush on everything; it's a little slower though.

I really need to clean my case, but it's still winter here, so 30C or below is average on components now. Getting to the fans through their grills is difficult on my case, so I might need compressed air after all.
 