Question Multiple BSOD errors ?

Sep 14, 2023
5
0
10
Solution: The cpu was at fault, after borrowing one to test it all errors stopped. I am currently going through the Intel warranty process.

So the other evening I updated a number of drivers through the Gigabyte Control Centre app. I believe they were all chipset drivers. Since then I have been getting constant BSOD crashes and I have been unable to fix the issue.

Sorry in advance for the mobile formatting.

System:
  • Gigabyte aorus z790 elite ax
  • I7 13900k
  • 32gb corsair vengeance ram
  • Samsung 980 Pro 2tb
  • Gigabyte aorus 4090
  • Rog strix 1000w gold

The errors I have been getting are:
  • Kmode exception not handled
  • Page fault in non-paged area
  • Whea uncorrectable error - Processor core - APIC ID: 32 - Machine check error - Internal parity error
  • Clock_watchdog-Timeout

What I have tried:
  • Restore to a previous point
  • Rollback drivers in device manager
  • Reinstall windows 10
  • Installed windows 11
  • Updated motherboard bios to F8
  • Reinstall chipset drivers from Gigabyte website
  • Sfc /scannow
  • DISM /online /cleanup-image /restorehealth
  • DDU reinstall of Nvidia drivers
  • Windows memory diagnostic
  • CHKDSK
  • I briefly tried to run driver verifier but it crashed during the check.

I'm definitely losing my mind after a few days of trying to fix this and wishing I hadn't updated anything.

I am currently at work but will try to answer any questions.

Any help would be greatly appreciated, thanks in advance!
 
Last edited:
Can you follow option one on the following link - here - and then do this step below: Small memory dumps - Have Windows Create a Small Memory Dump (Minidump) on BSOD - that creates a file in c windows/minidump after the next BSOD

  1. Open Windows File Explore
  2. Navigate to C:\Windows\Minidump
  3. Copy the mini-dump files out onto your Desktop
  4. Do not use Winzip, use the built in facility in Windows
  5. Select those files on your Desktop, right click them and choose 'Send to' - Compressed (zipped) folder
  6. Upload the zip file to the Cloud (OneDrive, DropBox . . . etc.)
  7. Then post a link here to the zip file, so we can take a look for you . . .

I guess chipset drivers could cause WHEA & Clock watchdog... they both CPU related. The also can be hardware errors.

How did you reinstall windows? clean install or reset?

driver errors shouldn't survive a reinstall, but hardware ones will.

Try running this on CPU - https://www.intel.com/content/www/us/en/download/15951/19792/intel-processor-diagnostic-tool.html?

1kw PSU and a 4090, that seems a bit small. I would have expected it need more.
 
Thank you for your help.

I'll look to do both of your suggestions once I get home.

I've done clean windows installs both times.

I went with the manufacturer recommended 1000w so I'm hoping that's enough.
 
Can you follow option one on the following link - here - and then do this step below: Small memory dumps - Have Windows Create a Small Memory Dump (Minidump) on BSOD - that creates a file in c windows/minidump after the next BSOD

  1. Open Windows File Explore
  2. Navigate to C:\Windows\Minidump
  3. Copy the mini-dump files out onto your Desktop
  4. Do not use Winzip, use the built in facility in Windows
  5. Select those files on your Desktop, right click them and choose 'Send to' - Compressed (zipped) folder
  6. Upload the zip file to the Cloud (OneDrive, DropBox . . . etc.)
  7. Then post a link here to the zip file, so we can take a look for you . . .

I guess chipset drivers could cause WHEA & Clock watchdog... they both CPU related. The also can be hardware errors.

How did you reinstall windows? clean install or reset?

driver errors shouldn't survive a reinstall, but hardware ones will.

Try running this on CPU - https://www.intel.com/content/www/us/en/download/15951/19792/intel-processor-diagnostic-tool.html?

1kw PSU and a 4090, that seems a bit small. I would have expected it need more.
https://www.dropbox.com/scl/fi/u2fg...dump.zip?rlkey=nkvn0nwzenchtbf8v279dat0d&dl=0

Here is the link to the 4 dump files from today (I reinstalled Win11 this morning). I've had slightly different errors so far.

Additionally, from the CPU diagnostic:

--- IPDT64 - Revision: 4.1.8.40
--- IPDT64 - Start Time: 14/09/2023 17:29:42

----------------------------------------------
-- Testing
----------------------------------------------
CPU 1 - Genuine Intel - Pass.
CPU 1 - BrandString - Pass.
CPU 1 - Cache - Pass.
CPU 1 - MMXSSE - Pass.
CPU 1 - IMC - Pass.
CPU 1 - Prime Number - Pass.
CPU 1 - Floating Point - Pass.
CPU 1 - Math - Pass.
CPU 1 - GPUStressW - Fail.

IPDT64 Failed
--- IPDT64 - Revision: 4.1.8.40
--- IPDT64 - End Time: 14/09/2023 17:32:02

----------------------------------------------
FAIL

--- Prime Number Generation Test ---
...
Version 1.0.28.64b.W
...

..DetectUtils64 DLL Version - 1.1.8
AVX is supported in your OS
Max AVX supported AVX2

Ops Per Sec CycleRun Error Timesec

249429 2 0 1
623627 7 0 2
498889 11 0 3
748315 17 0 4
Module Math_PrimeNum.exe Completed - Fail
No valid errorcode returned
Error Code -1

Result - Fail

An earlier test succeeded but on second run through it failed.
 
https://www.dropbox.com/scl/fi/u2fg...dump.zip?rlkey=nkvn0nwzenchtbf8v279dat0d&dl=0

Here is the link to the 4 dump files from today (I reinstalled Win11 this morning). I've had slightly different errors so far.

Additionally, from the CPU diagnostic:

--- IPDT64 - Revision: 4.1.8.40
--- IPDT64 - Start Time: 14/09/2023 17:29:42

----------------------------------------------
-- Testing
----------------------------------------------
CPU 1 - Genuine Intel - Pass.
CPU 1 - BrandString - Pass.
CPU 1 - Cache - Pass.
CPU 1 - MMXSSE - Pass.
CPU 1 - IMC - Pass.
CPU 1 - Prime Number - Pass.
CPU 1 - Floating Point - Pass.
CPU 1 - Math - Pass.
CPU 1 - GPUStressW - Fail.

IPDT64 Failed
--- IPDT64 - Revision: 4.1.8.40
--- IPDT64 - End Time: 14/09/2023 17:32:02

----------------------------------------------
FAIL

--- Prime Number Generation Test ---
...
Version 1.0.28.64b.W
...

..DetectUtils64 DLL Version - 1.1.8
AVX is supported in your OS
Max AVX supported AVX2

Ops Per Sec CycleRun Error Timesec

249429 2 0 1
623627 7 0 2
498889 11 0 3
748315 17 0 4
Module Math_PrimeNum.exe Completed - Fail
No valid errorcode returned
Error Code -1

Result - Fail

An earlier test succeeded but on second run through it failed.
I just cleared CMOS and removed the battery for 5 mins and since then I'm crashing every few minutes. It's getting quite hard to even upload logs.

Got a few more recent ones below:
 
Multiple tests with the Intel Processor Diagnostic Tool seem to lead to inconsistent passes and fails on the GPUStressW portion of the test.

Update: Having said that, it now also failed on the Math test module.