Question WHEA_UNCORRECTABLE_ERROR Windows 10

Dec 11, 2022
Hello all! I recently got a new laptop after RMAing my previous one, and it's already having issues 🙁.

The laptop has been bluescreening anywhere from once every other day to once a week. The bluescreen always has graphical artifacts on the bottom half of the screen, and the stop code is WHEA_UNCORRECTABLE_ERROR. There are no dump files; all I have to go on are the Event Viewer logs, which state the following (in this order):

Error
Source: EventLog
EventID: 6008
Category: none
The previous system shutdown at time on date was unexpected.

Error
Source: volmgr
EventID: 161
Category: none
Dump file creation failed due to error during dump creation

Critical
Source: Kernel-Power
EventID: 41
Category: 63
The system has rebooted without cleanly shutting down first. This error could be caused if the system stopped responding, crashed, or lost power unexpectedly
From the event details...:
  • BugcheckCode: 292
  • BugcheckParameter1: 0x10
  • BugcheckParameter2: 0x0
  • BugcheckParameter3: 0x0
  • BugcheckParameter4: 0x0
  • SleepInProgress: 0
  • PowerButtonTimestamp: 0
  • BootAppStatus: 0
  • Checkpoint: 0
  • ConnectedStandbyInProgress: false
  • SystemSleepTransitionsToOn: 2
  • CsEntryScenarioInstanceId: 0
  • BugcheckInfoFromEFI: true
  • CheckpointStatus: 0
  • CsEntryScenarioInstanceIdV2: 0
  • LongPowerButtonPressDetected: false
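Event Viewer reports BugcheckCode in decimal, while stop codes are normally written in hexadecimal. As a minimal sketch, the conversion below shows that 292 is 0x124, the bug check value for WHEA_UNCORRECTABLE_ERROR. The lookup table and the function name are illustrative assumptions and deliberately list only the one code relevant here.

```python
# Assumption: this table is illustrative and lists only the code seen in this
# thread; it is not a complete list of Windows bug check codes.
KNOWN_BUGCHECKS = {0x124: "WHEA_UNCORRECTABLE_ERROR"}


def describe_bugcheck(code: int) -> str:
    """Format a decimal bug check code the way stop codes are usually written."""
    name = KNOWN_BUGCHECKS.get(code, "unknown")
    return f"0x{code:X} ({name})"


# BugcheckCode from the Kernel-Power event details.
print(describe_bugcheck(292))
```

This confirms that the Kernel-Power event and the stop code on the bluescreen describe the same crash.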

Error
Source: WHEA-Logger
EventID: 1
Category: none
A fatal hardware error has occurred. A record describing the condition is contained in the data section of this event.
From the event details...:
  • Length: 292
  • RawData:
  • 435045521002FFFFFFFF010001000000070000002A0100002A0A1300090C16143C60C1835215A74887D114D9467D7765000000000000000000000000000000008D7C2157665EFB4480339B74CACEDF5B03F83300702E884E992C6F26DAF3DB7AB5DCC2E25F0BD901080000000000000000000000000000000000000000000000C8000000620000000003020001000000000000000000000000000000000000000000000000000000000000000000000001000000000000000000000000000000000000000000000053544F52504F5254010062000000030001000500110000001C95B73C7249EC1192DF806E6F6E6963730074006F0072006E0076006D006500000000000000000000000000000000004E564D652020202000494E54454C2053534450454B4E55303100

I'm posting this here specifically because, when converted to text, the raw data contains my SSD's name, so I think the SSD might be the issue. However, I have absolutely no idea what to do with that information. I've checked for SSD updates through Device Manager and ASUS's download center for my laptop, but neither has anything.
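For anyone who wants to repeat the hex-to-text conversion, here is a rough Python sketch that pulls readable strings out of the last 98 bytes of the RawData value. The helper names are my own, and the 4-character minimum is an arbitrary choice, so short runs of printable bytes that happen to sit inside binary fields can show up as noise.

```python
import re

# Last 98 bytes of the RawData value from the WHEA-Logger event above.
SECTION_HEX = """
53 54 4F 52 50 4F 52 54
01 00 62 00 00 00 03 00 01 00 05 00 11 00 00 00
1C 95 B7 3C 72 49 EC 11 92 DF 80 6E 6F 6E 69 63
73 00 74 00 6F 00 72 00 6E 00 76 00 6D 00 65 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
4E 56 4D 65 20 20 20 20 00 49 4E 54 45 4C 20 53
53 44 50 45 4B 4E 55 30 31 00
"""


def ascii_strings(data: bytes, min_len: int = 4) -> list[str]:
    """Return runs of at least min_len printable single-byte characters."""
    pattern = rb"[\x20-\x7e]{%d,}" % min_len
    return [m.group().decode("ascii").strip() for m in re.finditer(pattern, data)]


def utf16_strings(data: bytes, min_len: int = 4) -> list[str]:
    """Return runs of printable characters stored as UTF-16LE (char, 0x00)."""
    pattern = rb"(?:[\x20-\x7e]\x00){%d,}" % min_len
    return [m.group().decode("utf-16-le") for m in re.finditer(pattern, data)]


data = bytes.fromhex(SECTION_HEX)
print(ascii_strings(data))
print(utf16_strings(data))
```

The readable strings include STORPORT, stornvme (the name of the Windows NVMe driver), NVMe, and the truncated drive model INTEL SSDPEKNU01, which suggests that the record was reported through the storage driver stack. That alone does not prove the SSD itself is faulty.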

Here are my specs:
  • Type: Laptop
  • Model: 2021 ASUS ROG Zephyrus (GA401QM)
  • Processor: AMD Ryzen 9 5900HS with Radeon Graphics
  • GPU: GeForce RTX 3060 Laptop GPU
  • Drive: Intel SSDPEKNU010TZ SSD
  • RAM: Micron 4atf1g64hz-3g2e2

If there's anything else I can do/provide please lmk! Thank you for helping out :)
 
It does look like an SSD problem:

Code:
Offset(h) 00 01 02 03 04 05 06 07 08 09 0A 0B 0C 0D 0E 0F

00000000  43 50 45 52 10 02 FF FF FF FF 01 00 01 00 00 00  CPER..ÿÿÿÿ......
00000010  07 00 00 00 2A 01 00 00 2A 0A 13 00 09 0C 16 14  ....*...*.......
00000020  3C 60 C1 83 52 15 A7 48 87 D1 14 D9 46 7D 77 65  <`ÁƒR.§H‡Ñ.ÙF}we
00000030  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
00000040  8D 7C 21 57 66 5E FB 44 80 33 9B 74 CA CE DF 5B  .|!Wf^ûD€3›tÊÎß[
00000050  03 F8 33 00 70 2E 88 4E 99 2C 6F 26 DA F3 DB 7A  .ø3.p.ˆN™,o&ÚóÛz
00000060  B5 DC C2 E2 5F 0B D9 01 08 00 00 00 00 00 00 00  µÜÂâ_.Ù.........
00000070  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
00000080  C8 00 00 00 62 00 00 00 00 03 02 00 01 00 00 00  È...b...........
00000090  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
000000A0  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
000000B0  01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
000000C0  00 00 00 00 00 00 00 00 53 54 4F 52 50 4F 52 54  ........STORPORT
000000D0  01 00 62 00 00 00 03 00 01 00 05 00 11 00 00 00  ..b.............
000000E0  1C 95 B7 3C 72 49 EC 11 92 DF 80 6E 6F 6E 69 63  .•·<rIì.’߀nonic
000000F0  73 00 74 00 6F 00 72 00 6E 00 76 00 6D 00 65 00  s.t.o.r.n.v.m.e.
00000100  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
00000110  4E 56 4D 65 20 20 20 20 00 49 4E 54 45 4C 20 53  NVMe    .INTEL S
00000120  53 44 50 45 4B 4E 55 30 31 00                    SDPEKNU01.

Is there any clue in the SMART report? You could use a tool such as CrystalDiskInfo or GSmartControl.
 
Here's the results from CrystalDiskInfo:

[Screenshot: eablil.PNG]


I also ran the command "wmic diskdrive get status", which reported my SSD's status as OK.
 
Hm, I downloaded GSmartControl and, for some reason, it says my disk is unknown and unsupported.

[Screenshot: ymn4sw.PNG]


And the output text from clicking on the drive:

smartctl 7.2 2020-12-30 r5155 [x86_64-w64-mingw32-w10-b19044] (sf-7.2-1)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number: INTEL SSDPEKNU010TZ
Serial Number: BTKA13210PJQ1P0B
Firmware Version: 002C
PCI Vendor/Subsystem ID: 0x8086
IEEE OUI Identifier: 0x5cd2e4
Controller ID: 1
NVMe Version: 1.4
Number of Namespaces: 1
Namespace 1 Size/Capacity: 1,024,209,543,168 [1.02 TB]
Namespace 1 Formatted LBA Size: 512
Local Time is: Sun Dec 11 15:26:51 2022 EST
Firmware Updates (0x14): 2 Slots, no Reset required
Optional Admin Commands (0x0017): Security Format Frmw_DL Self_Test
Optional NVM Commands (0x005f): Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat Timestmp
Log Page Attributes (0x0f): S/H_per_NS Cmd_Eff_Lg Ext_Get_Lg Telmtry_Lg
Maximum Data Transfer Size: 64 Pages
Warning Comp. Temp. Threshold: 77 Celsius
Critical Comp. Temp. Threshold: 80 Celsius

Supported Power States
St Op     Max   Active     Idle   RL RT WL WT  Ent_Lat  Ex_Lat
 0 +     4.00W       -        -    0  0  0  0        0       0
 1 +     3.00W       -        -    1  1  1  1        0       0
 2 +     2.20W       -        -    2  2  2  2        0       0
 3 -   0.0250W       -        -    3  3  3  3     5000    5000
 4 -   0.0040W       -        -    4  4  4  4     3000   11999

Supported LBA Sizes (NSID 0x1)
Id Fmt  Data  Metadt  Rel_Perf
 0 +     512       0         0

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
 
This spec appears to define the error report:

https://uefi.org/specs/UEFI/2.10/Apx_N_Common_Platform_Error_Record.html

The byte offsets below are in decimal, with 10 bytes per row.

Common Platform Error Record (CPER)

Code:
Offset(d) 00 01 02 03 04 05 06 07 08 09

00000000  43 50 45 52 10 02 FF FF FF FF  CPER..ÿÿÿÿ
00000010  01 00 01 00 00 00 07 00 00 00  ..........
00000020  2A 01 00 00 2A 0A 13 00 09 0C  *...*.....
00000030  16 14 3C 60 C1 83 52 15 A7 48  ..<`ÁƒR.§H
00000040  87 D1 14 D9 46 7D 77 65 00 00  ‡Ñ.ÙF}we..
00000050  00 00 00 00 00 00 00 00 00 00  ..........
00000060  00 00 00 00 8D 7C 21 57 66 5E  .....|!Wf^
00000070  FB 44 80 33 9B 74 CA CE DF 5B  ûD€3›tÊÎß[
00000080  03 F8 33 00 70 2E 88 4E 99 2C  .ø3.p.ˆN™,
00000090  6F 26 DA F3 DB 7A B5 DC C2 E2  o&ÚóÛzµÜÂâ
00000100  5F 0B D9 01 08 00 00 00 00 00  _.Ù.......
00000110  00 00 00 00 00 00 00 00 00 00  ..........
00000120  00 00 00 00 00 00 00 00 C8 00  ........È.
00000130  00 00 62 00 00 00 00 03 02 00  ..b.......
00000140  01 00 00 00 00 00 00 00 00 00  ..........
00000150  00 00 00 00 00 00 00 00 00 00  ..........
00000160  00 00 00 00 00 00 00 00 00 00  ..........
00000170  00 00 00 00 00 00 01 00 00 00  ..........
00000180  00 00 00 00 00 00 00 00 00 00  ..........
00000190  00 00 00 00 00 00 00 00 00 00  ..........
00000200  53 54 4F 52 50 4F 52 54 01 00  STORPORT..
00000210  62 00 00 00 03 00 01 00 05 00  b.........
00000220  11 00 00 00 1C 95 B7 3C 72 49  .....•·<rI
00000230  EC 11 92 DF 80 6E 6F 6E 69 63  ì.’߀nonic
00000240  73 00 74 00 6F 00 72 00 6E 00  s.t.o.r.n.
00000250  76 00 6D 00 65 00 00 00 00 00  v.m.e.....
00000260  00 00 00 00 00 00 00 00 00 00  ..........
00000270  00 00 4E 56 4D 65 20 20 20 20  ..NVMe    
00000280  00 49 4E 54 45 4C 20 53 53 44  .INTEL SSD
00000290  50 45 4B 4E 55 30 31 00        PEKNU01.
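As a cross-check on the dump above, here is a minimal Python sketch that unpacks the first 32 bytes of the record header. The field order and the severity names are my reading of the record header table in the linked spec, so treat them as assumptions. The timestamp bytes are treated as plain binary values rather than BCD, because 0x2A is not a valid BCD digit pair; that is also an assumption about how this particular record was written.

```python
import struct
from datetime import datetime

# First 32 bytes of the record, copied from the dump above.
HEADER_HEX = """
43 50 45 52 10 02 FF FF FF FF 01 00 01 00 00 00
07 00 00 00 2A 01 00 00 2A 0A 13 00 09 0C 16 14
"""

# Assumed severity encoding, taken from my reading of the spec.
SEVERITY_NAMES = {0: "Recoverable", 1: "Fatal", 2: "Corrected", 3: "Informational"}


def parse_cper_header(raw: bytes) -> dict:
    """Decode the leading fields of a CPER record header (little-endian)."""
    (signature, revision, signature_end, section_count, severity,
     validation_bits, record_length, ts) = struct.unpack_from("<4sHIHIII8s", raw)
    if signature != b"CPER" or signature_end != 0xFFFFFFFF:
        raise ValueError("not a CPER record")
    # Assumption: each timestamp byte is a plain binary number, not BCD.
    seconds, minutes, hours, _precise, day, month, year, century = ts
    return {
        "revision": revision,
        "section_count": section_count,
        "severity": SEVERITY_NAMES.get(severity, f"unknown ({severity})"),
        "validation_bits": validation_bits,
        "record_length": record_length,
        "timestamp": datetime(century * 100 + year, month, day,
                              hours, minutes, seconds),
    }


for field, value in parse_cper_header(bytes.fromhex(HEADER_HEX)).items():
    print(f"{field}: {value}")
```

With these assumptions the header describes a single-section record of 298 bytes with a severity of Fatal, timestamped 2022-12-09 19:10:42.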

Section Descriptor

Code:
Offset(d) 00 01 02 03 04 05 06 07 08 09

00000000  C8 00 00 00 62 00 00 00 00 03  È...b.....
00000010  02 00 01 00 00 00 00 00 00 00  ..........
00000020  00 00 00 00 00 00 00 00 00 00  ..........
00000030  00 00 00 00 00 00 00 00 00 00  ..........
00000040  00 00 00 00 00 00 00 00 01 00  ..........
00000050  00 00 00 00 00 00 00 00 00 00  ..........
00000060  00 00 00 00 00 00 00 00 00 00  ..........
00000070  00 00                          ..

After decoding the bits and bytes, it appears that there was a fatal error which was not contained within the processor or memory, and which may have propagated to persistent storage or network.

It doesn't tell me much, but then I'm not a programmer.
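For anyone who wants to re-check the decoding, here is a minimal Python sketch that unpacks the 72-byte section descriptor shown above. The field layout and the bit names are my reading of the Section Descriptor table in the linked spec, so they are assumptions to verify against it, not an authoritative decoder.

```python
import struct

# The 72-byte section descriptor, copied from the dump above.
DESCRIPTOR_HEX = """
C8 00 00 00 62 00 00 00 00 03 02 00 01 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00
"""

# Assumed bit assignments, taken from my reading of the spec.
VALIDATION_NAMES = {0x01: "FRU Id valid", 0x02: "FRU Text valid"}
FLAG_NAMES = {
    0x01: "Primary",
    0x02: "Containment Warning",
    0x04: "Reset",
    0x08: "Error threshold exceeded",
    0x10: "Resource not accessible",
    0x20: "Latent error",
}


def names_for(bits: int, table: dict) -> list[str]:
    """List the names of every bit in table that is set in bits."""
    return [name for mask, name in table.items() if bits & mask]


def parse_section_descriptor(raw: bytes) -> dict:
    """Decode one CPER section descriptor (little-endian, 72 bytes)."""
    (offset, length, revision, validation, _reserved, flags,
     _section_type, _fru_id, severity, _fru_text) = struct.unpack_from(
        "<IIHBBI16s16sI20s", raw)
    return {
        "section_offset": offset,
        "section_length": length,
        "revision": revision,
        "validation": names_for(validation, VALIDATION_NAMES),
        "flags": names_for(flags, FLAG_NAMES),
        "severity": severity,
    }


for field, value in parse_section_descriptor(bytes.fromhex(DESCRIPTOR_HEX)).items():
    print(f"{field}: {value}")
```

Under this reading, the section starts at offset 200 (0xC8, where STORPORT begins) and is 98 bytes long. The flags field is 0x00000001 (Primary only), and the 0x02 byte falls in the validation-bits field, not the flags field, so the containment-warning interpretation may be worth re-checking against the spec.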
 