Question WHEA Errors 1, 17, 18 & 19 and Uncorrectable Error ?

Jun 30, 2023
2
0
10
Hello,

I’ve got the following problem (Sorry for the long post, but I want to give as many details as possible):

Since Sunday my pc started to randomly reboot. Following this it was usually caught in a boot-loop for 5 or 6 times, until either starting or crashing completely.

  • It does so both when playing games (it did so the first time it happened), when being idle, or, most common by now, right or some seconds after starting. However, on Tuesday I could use the PC the whole afternoon without crashing (while using the browser, and some other low intensity stuff) only to crash when I opened another tab in the evening.
  • The event viewer contains all kinds of error (which happen more or less consistently in the boot-loop), however most notably are the WHEA errors (I attached a .txt that includes the details of the most prominent errors, although I can check for more if you need them. I’m sorry that they are in German, but I hope the most important parts are equal in English)
  • Sometimes the system crashes with an WHEA_UNCORRECTABLE_ERROR
  • Sometimes there are warnings in the log about a resolved WHEA_ERROR cache_hierarchy_error and PCI-Express root port error.
  • Sometimes these are also errors and not resolved.
  • The CPU idles around 40°C, but spikes at ~70°C. During stress tests it even went to 102°C once.
  • I can do tasks that involve heavy load such as multiple CPU tests/benchmarks without the pc crashing.
The PC was largely upgraded in fall 2022 (CPU, Mainboard, M2, RAM). I had never experienced any troubles since then, so these errors started pretty much out of nowhere (although it was pretty hot in the room that evening). I also can’t find any updates that were installed at the evening.

Im sorry for the weird pictures, but currently my pc keeps crashing so I cannot create a real screenshot)

The PC specs (Also HWiNFO image attached, and HWiNFO export attached):

  • MB: Gigabyte B660M DS3H DDR4, Bios Version F23b
  • CPU: I5 12600 (Boxed)
  • RAM: 2x Kingston KF3200C16D4/8GX (16GB in total, slots 2 and 4, runs with 2400MHz by bios defaults)
  • GPU: Geforce 1060 6GB (GP 106-400)
  • OS: Windows 11 home (x64) Build 22621.1848 (22H2)
  • M2 (with OS): NVMe 4x 16.0 GT/s Kingston SKC3000S 1024G
  • Additionally, 2 SSD (250GB each), 1 HDD, 1 DVD drive
  • I changed almost nothing in the Bios defaults and never did any changes to CPU/RAM configs or any overclocking. XMP is disabled.
  • PSU: BeQuiet System Power 7 BQ SU7-450W
What I tried so far:

  • HWiNFO does not show any issues (besides the system log)
  • Did CPU test by both Intel tool and PassMark, both were passed with 0 errors.
  • Renewed the thermal paste on the CPU.
  • Ran Memtest86 multiple times. Passses in different configurations (also only one stick in other slots) when done with sequential or selected CPUs, but CRASHES when running with the CPU in parallel mode. Didn’t show any errors besides that..
  • Checked SSDs both with tool from Kingston as well as with bios, passed both tests.
  • Disconnected the other SATA devices, still crashed.
  • Removed the GPU, still crashed.
  • Disabled the onboard GPU, still crashed.
  • Booted from a linux stick, still crashed (though there is a small chance this is unrelated)
  • Updated the Bios (Was F3 previously I think, but crashed with both the old and most up to date version).
  • Updated all drivers as far as possible.
  • Reset Bios to default.
  • Disabled all energy saving methods in Bios I could find.
  • Disabled PTT, still crashed.
  • Running Windows in safe mode it didn’t crash so far.
At this point I don’t know any further. I’ve searched in multiple forums for hours and can’t find any solution that works.

The only things left on my Todo List are clean reinstall of Windows and to somehow check the PSU, although I have no idea how to do the later.

I would gladly appreciate any Ideas you have, especially regarding the device that might be defect if this is a hardware issue. I’m not even sure if it is the RAM, the Mainboard, the CPU or something else entirely.

Thank you!

The PC Info and errors in the .txt are in the following Onedrive: https://1drv.ms/f/s!AipS2y7jwUo7iEYu8qQBaeEIVzjF?e=MhvkPe