Hi, all. I hope someone can help me out here because I cannot find any info on this myself to fix whatever is going on.
Here's how this happened: I've got a relatively new PC built around two months ago (ASUS z690 d4 + wifi, 12700k, 2x16 gb FURY RAM, ASUS TUF 3080 v2, and an AIO cooler with temps never exceeding 55-60c on any component). It was fine for all this time until Win11 got installed. I tried to update my iGPU drivers before and after, but they wouldn't update properly. The setup would start, go halfway, and disappear. I'd reboot my PC and the intel driver & support assistant would say I have everything updated. Weird. At this point, I decided to use DDU to remove those drivers while in safe mode.
Anyway, recently I've been playing a game mostly at night called 7 days to die. Now, this game has stuttering where the entire PC freezes for a second or two (happens only online) because the server needs to save everything. Sometimes this causes the game to crash locally, but the PC is definitely unusable when this happens. Apparently this is normal according to players because it's just the way it's coded (and FWIW, I don't have stuttering issues outside of this game on any other relatively intensive game). However, I've seen something more from that: the PC freezes, BSODs without creating a memory dump properly (it ends up being corrupt) and then reboots with no display (shows no signal on screen). Weirdly, it reboots & runs like normal (apparently it loads Windows because I see my aura software load up). I have to force turn it off, though, by holding it down or shutting it off at the PSU because the display issue doesn't fix itself.
So, I used to have everything run through my displayport which was directly hooked into the mobo instead of the GPU, but then I switched to the dedicated one. And now I'm able to kind of get more info on why this is happening, I think. Before, I'd just get corrupt dump files. Now, using whocrashed, the latest dump produced something of value:
I also analyzed it with windbg, but depending on which dump I'm looking at, it's a bit more inconclusive. Note that the strange thing about this is that it happens every other day and not every day. And, from my experience so far, it only happens with this game (7 days to die) and never outside of it if it's not running. I have no idea why. Can anyone please tell me what's going on? Here's the info from windbg:
Some stuff I see in event viewer:
Errors
There are dcom warnings, but I will take care of those right now (I'm guessing they're not responsible for the crashes).
Here's how this happened: I've got a relatively new PC built around two months ago (ASUS z690 d4 + wifi, 12700k, 2x16 gb FURY RAM, ASUS TUF 3080 v2, and an AIO cooler with temps never exceeding 55-60c on any component). It was fine for all this time until Win11 got installed. I tried to update my iGPU drivers before and after, but they wouldn't update properly. The setup would start, go halfway, and disappear. I'd reboot my PC and the intel driver & support assistant would say I have everything updated. Weird. At this point, I decided to use DDU to remove those drivers while in safe mode.
Anyway, recently I've been playing a game mostly at night called 7 days to die. Now, this game has stuttering where the entire PC freezes for a second or two (happens only online) because the server needs to save everything. Sometimes this causes the game to crash locally, but the PC is definitely unusable when this happens. Apparently this is normal according to players because it's just the way it's coded (and FWIW, I don't have stuttering issues outside of this game on any other relatively intensive game). However, I've seen something more from that: the PC freezes, BSODs without creating a memory dump properly (it ends up being corrupt) and then reboots with no display (shows no signal on screen). Weirdly, it reboots & runs like normal (apparently it loads Windows because I see my aura software load up). I have to force turn it off, though, by holding it down or shutting it off at the PSU because the display issue doesn't fix itself.
So, I used to have everything run through my displayport which was directly hooked into the mobo instead of the GPU, but then I switched to the dedicated one. And now I'm able to kind of get more info on why this is happening, I think. Before, I'd just get corrupt dump files. Now, using whocrashed, the latest dump produced something of value:
On Sun 2/27/2022 1:01:50 AM your computer crashed or a problem was reported
crash dump file: C:\WINDOWS\Minidump\022722-11625-01.dmp
This was probably caused by the following module: 0n87.sys (0n87)
Bugcheck code: 0x101 (0xA, 0x0, 0xFFFF8000E7B10180, 0x8)
Error: CLOCK_WATCHDOG_TIMEOUT
Bug check description: This indicates that an expected clock interrupt on a secondary processor, in a multi-processor system, was not received within the allocated interval. This can be caused by non-responding hardware or by a overheated CPU (thermal issue).
A third party driver was identified as the probable root cause of this system error. It is suggested you look for an update for the following driver: 0n87.sys .
Google query: 0n87.sys CLOCK_WATCHDOG_TIMEOUT
On Sun 2/27/2022 1:01:50 AM your computer crashed or a problem was reported
crash dump file: C:\WINDOWS\MEMORY.DMP
This was probably caused by the following module: information.sys (information)
Bugcheck code: 0x101 (0xA, 0x0, 0xFFFF8000E7B10180, 0x8)
Error: CLOCK_WATCHDOG_TIMEOUT
Bug check description: This indicates that an expected clock interrupt on a secondary processor, in a multi-processor system, was not received within the allocated interval. This can be caused by non-responding hardware or by a overheated CPU (thermal issue).
A third party driver was identified as the probable root cause of this system error. It is suggested you look for an update for the following driver: information.sys .
Google query: information.sys CLOCK_WATCHDOG_TIMEOUT
On Sun 2/20/2022 10:42:09 PM your computer crashed or a problem was reported
crash dump file: C:\WINDOWS\Minidump\022022-13562-01.dmp
This was probably caused by the following module: 0n87.sys (0n87)
Bugcheck code: 0x101 (0xA, 0x0, 0xFFFFCD8053B89180, 0x8)
Error: CLOCK_WATCHDOG_TIMEOUT
Bug check description: This indicates that an expected clock interrupt on a secondary processor, in a multi-processor system, was not received within the allocated interval. This can be caused by non-responding hardware or by a overheated CPU (thermal issue).
A third party driver was identified as the probable root cause of this system error. It is suggested you look for an update for the following driver: 0n87.sys .
Google query: 0n87.sys CLOCK_WATCHDOG_TIMEOUT
I also analyzed it with windbg, but depending on which dump I'm looking at, it's a bit more inconclusive. Note that the strange thing about this is that it happens every other day and not every day. And, from my experience so far, it only happens with this game (7 days to die) and never outside of it if it's not running. I have no idea why. Can anyone please tell me what's going on? Here's the info from windbg:
Path validation summary *
Response Time (ms) Location
Deferred srv*
Symbol search path is: srv*
Executable search path is:
Unable to load image Unknown_Module_0000000000000000, Win32 error 0n2 *** WARNING: Unable to verify timestamp for Unknown_Module_00000000
00000000
*** ERROR: Module load completed but symbols could not be loaded for Unknown_Module_0000000000000000 Unable to add module at 00000000
00000000
WARNING: .reload failed, module list may be incomplete
Debugger can not determine kernel base address
Windows 10 Kernel Version 22000 MP (20 procs) Free x64
Product: WinNt, suite: TerminalServer SingleUserTS Personal
Machine Name:
Kernel base = 0xfffff8046e200000 PsLoadedModuleList = 0xfffff804
6ee29b90
Debug session time: Sun Feb 27 01:01:50.902 2022 (UTC + 8:00)
System Uptime: 0 days 23:20:45.500
Unable to load image Unknown_Module_0000000000000000, Win32 error 0n2 *** WARNING: Unable to verify timestamp for Unknown_Module_00000000
00000000
*** ERROR: Module load completed but symbols could not be loaded for Unknown_Module_0000000000000000 Unable to add module at 00000000
00000000
WARNING: .reload failed, module list may be incomplete
Debugger can not determine kernel base address
Loading Kernel Symbols
.Unable to load image Unknown_Module_0000000000000000, Win32 error 0n2 *** WARNING: Unable to verify timestamp for Unknown_Module_00000000
00000000
*** ERROR: Module load completed but symbols could not be loaded for Unknown_Module_0000000000000000 Unable to add module at 00000000
00000000
Loading User Symbols
For analysis of this file, run !analyze -v
fffff8046e6164c0 ?? ??? 17: kd> !analyze -v ******************************************************************************* [LIST] [*]* [*]Bugcheck Analysis * [*]* [/LIST] ******************************************************************************* CLOCK_WATCHDOG_TIMEOUT (101) An expected clock interrupt was not received on a secondary processor in an MP system within the allocated interval. This indicates that the specified processor is hung and not processing interrupts. Arguments: Arg1: 000000000000000a, Clock interrupt time out interval in nominal clock ticks. Arg2: 0000000000000000, 0. Arg3: ffff8000e7b10180, The PRCB address of the hung processor. Arg4: 0000000000000008, The index of the hung processor. Debugging Details: ------------------ ***** Debugger could not find nt in module list, module list might be corrupt, error 0x80070057. KEY_VALUES_STRING: 1 Key : Analysis.CPU.mSec Value: 15 Key : Analysis.DebugAnalysisManager Value: Create Key : Analysis.Elapsed.mSec Value: 14 Key : Analysis.Init.CPU.mSec Value: 46 Key : Analysis.Init.Elapsed.mSec Value: 8874 Key : Analysis.Memory.CommitPeak.Mb Value: 45 Key : WER.CorruptModuleList Value: 1 FILE_IN_CAB: 022722-11625-01.dmp BUGCHECK_CODE: 101 BUGCHECK_P1: a BUGCHECK_P2: 0 BUGCHECK_P3: ffff8000e7b10180 BUGCHECK_P4: 8 CUSTOMER_CRASH_COUNT: 1 STACK_TEXT: ffff8000
e82ac9e8 fffff8046e6c9266 : 00000000
00000101 000000000000000a 00000000
00000000 ffff8000e7b10180 : 0xfffff804
6e6164c0
ffff8000e82ac9f0 00000000
00000101 : 000000000000000a 00000000
00000000 ffff8000e7b10180 00000000
00000008 : 0xfffff8046e6c9266 ffff8000
e82ac9f8 000000000000000a : 00000000
00000000 ffff8000e7b10180 00000000
00000008 0000000000000000 : 0x101 ffff8000
e82aca00 0000000000000000 : ffff8000
e7b10180 0000000000000008 00000000
00000000 fffff780`00000320 : 0xa
SYMBOL_NAME: ANALYSIS_INCONCLUSIVE
MODULE_NAME: Unknown_Module
IMAGE_NAME: Unknown_Image
STACK_COMMAND: .cxr; .ecxr ; kb
FAILURE_BUCKET_ID: CORRUPT_MODULELIST_CLOCK_WATCHDOG_TIMEOUT
OSPLATFORM_TYPE: x64
OSNAME: Windows 10
FAILURE_ID_HASH: {505a7e83-2b57-2f1c-5666-16c4a774a5e4}
Followup: MachineOwner
---------
Some stuff I see in event viewer:
Errors
A timeout was reached (30000 milliseconds) while waiting for the Intel(R) SUR QC Software Asset Manager service to connect.
Dump file creation failed due to error during dump creation.
A fatal hardware error has occurred. A record describing the condition is contained in the data section of this event.
There are dcom warnings, but I will take care of those right now (I'm guessing they're not responsible for the crashes).
Last edited: