Question: PC has developed stability issues with no clear or consistent cause or behaviour

Jan 14, 2025
This is an ongoing problem that pops up quite inconsistently, taking days to resurface. It could therefore take a while before I can verify whether anything that's been attempted works. I'm keeping a regular log of events. What I've got so far is this:

My girlfriend and I have a pair of almost identical PCs which we've been using for about 1.5 years. Hers has a slightly better GPU, and an M.2 SSD instead of my SATA one. I will post the exact specs later once I can have a look at them. She might also have a better power supply, but I'm not sure. They are both running Windows 10. Hers has recently developed a variety of problems that I can't seem to find an explanation for. The current sequence of events is this (dates are not exact, but as near as I can remember):

- Dec 19th: I updated the BIOS to prevent CPU damage, as per the recent trouble with Intel CPUs. I performed the update on both PCs in exactly the same way, applying default settings on both. There were no immediate problems, and my system runs fine.

- Jan 10th: Her PC bluescreened during regular browsing. The percentage shown on the bluescreen stayed at 0%, and after a while the system rebooted on its own, into the BIOS. The BIOS could not find the boot drive. I messed around with it for a bit (reboots, exploring menus, more reboots, but changed no settings), and finally settled on trying a system recovery/repair, when the problem fixed itself before I did anything. We could just boot into Windows as normal, and I did not do a repair.

There is no dump file for the BSOD. Logs were already enabled, and there is a file for the 4th of January, but nothing for the 10th. I'm positive that it did not happen on the 4th. I'm assuming there is no file because the dump never got past 0%, and had no drive to write to at the time. Some older logs were also present. I have these logs in a zip file, and I can post them if you like.

- Jan 11th: Just in case, I moved the SSD to a different M.2 slot. According to the motherboard manual, this should be fine. The PC booted with no issues. I took the opportunity to run a health check on the drive using Samsung Magician, and it all came back healthy. Maybe the SSD is fine, but the system struggles to reliably access it? I don't know.

- Jan 14th: At 10:00, the PC booted, but was unable to properly load Windows. It got past the login screen, but then entered an endless cycle of explorer crashing and reloading. This was accompanied by the screen flashing black, but with a visible cursor. She tried again at 11:00, but that time only got as far as the login screen. I have videos of both events if you want them. I suspect another boot attempt was made at around 14:00. I'm currently waiting for confirmation on that.

At 18:00 or so, I tried booting the system myself, and was able to boot normally, as if nothing was wrong. After that I did the following things:
- I ran sfc /scannow. This seemed to fix some things, though I can't make much sense of the log. It seemed to just repair some duplicates. I'll try to post the log after this.
- I ran dism /online /cleanup-image /restorehealth. It found nothing.
- I changed the memory dump settings to a small memory dump, and told the PC not to restart automatically on a system failure. This is in case it BSODs again.
- I did a startup repair. This failed because it found nothing to repair, judging by the log file. I have this log, and can post it if you want.
- I explored the Event Viewer, which revealed some things: there are occasional series of memory access errors, mostly (if not all) from the time of the explorer crash loop. There were also a few WHEA-Logger errors, ID 3. One of those dates to May last year, the other 3 are from up to a few weeks after the BIOS update. I have a screenshot of them, with data. My system does not have these events.
- I ran the Windows Memory Diagnostic tool. 2 passes, no errors. I have a screenshot of this result from Event Viewer.
- I updated the graphics drivers, and removed Armoury Crate.
- Given that our systems are largely identical, I swapped our RAM sticks to see if that would transfer the problem to my PC. Both systems booted as normal.

As of the 15th, I am waiting to see if the above actions did anything. Is there anything else I can do to improve the system's health or find the problem? I was personally thinking of doing some more hardware swaps, but I'd rather not. They could be a real pain. Maybe try some driver updates?
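
For anyone keeping a similar event log, the gaps between incidents can be tallied with a short script. This is only a sketch: the dates are the approximate ones listed above, the years are assumed from the post date, and `days_between` is a made-up helper name.

```python
from datetime import date

# Dates taken from the post above; the poster notes they are approximate.
# The years are an assumption based on the post date (Jan 14, 2025).
events = [
    (date(2024, 12, 19), "BIOS updated on both PCs"),
    (date(2025, 1, 10), "BSOD stuck at 0%, boot drive not found"),
    (date(2025, 1, 11), "SSD moved to a different M.2 slot"),
    (date(2025, 1, 14), "explorer crash loop"),
]


def days_between(events):
    """Return (gap_in_days, label) for each event after the first."""
    ordered = sorted(events)
    return [
        ((later[0] - earlier[0]).days, later[1])
        for earlier, later in zip(ordered, ordered[1:])
    ]


gaps = days_between(events)
for gap, label in gaps:
    print(f"+{gap:>2} days: {label}")
```

Adding each new incident to `events` as it happens keeps the spacing visible, which may help when a fix needs several quiet days before it can be trusted.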
 
The first step is to post the full specs and OS information for both systems.

Very important to start with as much information as possible.

Include PSU: make, model, wattage, age, condition (original to build, new, refurbished, used)?

Disk drive(s): make, model, capacity, how full?

= = = =

The common starting point of the problems being the BIOS updates - correct? BIOS update source? Link?

You mentioned Event Viewer. That is good.

However, also look in Reliability History/Monitor. It is much more end-user friendly, and the timeline format may reveal some patterns.

Look in Update History for failed or problem updates.

Keep in mind that "almost identical" means that what worked on one system may not necessarily work on the other.

Treat each system as a separate problem. The problems are mostly with her system - correct?

If swapping components for troubleshooting, change only one thing at a time and keep good notes on what was changed.
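
One possible way to keep such notes is a small change log that flags days on which more than one thing was changed. This is a rough sketch, not a prescribed method: `Change` and `overlapping_changes` are hypothetical names, and the sample entries are taken from the first post with the year assumed from the post date.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Change:
    day: date
    component: str  # e.g. "RAM", "SSD slot", "GPU driver"
    note: str


def overlapping_changes(log):
    """Return the days on which more than one component was changed."""
    seen = {}
    for change in log:
        seen.setdefault(change.day, set()).add(change.component)
    return {day: parts for day, parts in seen.items() if len(parts) > 1}


# Sample entries based on the first post (year is an assumption).
log = [
    Change(date(2025, 1, 11), "SSD slot", "moved SSD to a different M.2 slot"),
    Change(date(2025, 1, 14), "GPU driver", "updated graphics drivers"),
    Change(date(2025, 1, 14), "RAM", "swapped RAM sticks between the two PCs"),
]

flagged = overlapping_changes(log)
for day, parts in flagged.items():
    print(day.isoformat(), "changed together:", ", ".join(sorted(parts)))
```

A flagged day means that if the problem disappears afterwards, it will not be clear which of those changes was responsible.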

More information needed.
 
The first step is to post the full specs and OS information for both systems.

Very important to start with as much information as possible.

Include PSU: make, model, wattage, age, condition (original to build, new, refurbished, used)?

Disk drive(s): make, model, capacity, how full?

All parts were new when the system was built, unless specified. The systems are now 1.5 years old.

Her system (the subject of my post):

GPU: Asus GeForce RTX 4060 Ti DUAL-RTX4060TI-O8G
PSU: Cooler Master MWE Gold 650 Full Modular V2 PSU
Storage: Samsung 980 PRO 2TB M.2 SSD - about 40% full. This is the only storage device.
RAM: Two G.Skill Aegis DDR4 2x8GB 3000MHz kits, so 4 sticks of 8 GB.
Motherboard: Asus PRIME B760-PLUS D4
CPU: Intel Core i5 13400F
Case: Unknown. Came from a previous build.

OS: Windows 10 Home, Version 10.0.19045

My system (doesn't need fixing! I'm just using it for reference and hardware swaps!):

GPU: Asus GeForce RTX 4060 - Don't know the exact model.
PSU: Cooler Master MWE Gold 550 Full Modular V2 PSU
Storage: Samsung V-NAND 850 EVO 1TB SSD. - about 30% full. Used, but fine. This is the only storage device.
RAM: Two G.Skill Aegis DDR4 2x8GB 3000MHz kits, so 4 sticks of 8 GB.
Motherboard: Asus PRIME B760-PLUS D4
CPU: Intel Core i5 13400F
Case: Unknown, but new.

OS: Windows 10 Home, Version 10.0.19045

The common starting point of the problems being the BIOS updates - correct? BIOS update source? Link?

That's the question. I thought I'd mention it, since it's a pretty significant update, but the problems could have a different cause entirely. This is where I got it:

https://www.asus.com/motherboards-c.../helpdesk_bios/?model2Name=PRIME-B760-PLUS-D4

I've checked the BIOS version, and it's 1805 for both systems.

However, also look in Reliability History/Monitor. Much more end-user friendly and the timeline format may reveal some patterns.
Look in Update History for failed or problem updates.

From the reliability monitor, a few things stand out:
18/12: failed Windows update
19/12: failed Windows update
22/12: hardware error - see below:
Source: Windows
Summary: Hardware error
Date: 22/12/2024 13:36
Status: Not reported
Description: A problem with your hardware caused Windows to stop working correctly.

Problem signature
Problem Event Name: LiveKernelEvent
Code: 144
Parameter 1: 3003
Parameter 2: ffffb08ade1b75f0
Parameter 3: 40010000
Parameter 4: 0
OS version: 10_0_19045
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.19045.2.0.0.768.101
Locale ID: 8192

24/12: Armoury Crate stopped working
25/12: ASUS Update Helper - Unsuccessful application reconfiguration
4/1: Windows shut down unexpectedly. That's the source of the log, I guess, though I don't remember seeing such a thing happen. I'm still confident that the bluescreen happened more recently.
14/1: A lot. Mostly explorer.exe & dwm.exe stopped working, with a few other things mixed in. No surprise there.
Various dates: Local Security Authority Process stopped working, from 20/12 onward, after the failed Windows update. It might have also done it before, but I can't look back that far.
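
If more hardware errors show up, comparing their problem signatures by hand gets tedious. Below is a minimal sketch of turning a signature copied from Reliability Monitor into a dictionary so that repeats can be spotted. `parse_signature` is a made-up helper, and the choice of which fields to compare is an assumption, not something Windows documents here.

```python
def parse_signature(text):
    """Turn 'Key: value' lines into a dict, skipping lines without a colon."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


# Signature text as copied from the 22/12 hardware error above.
report = """\
Problem Event Name:     LiveKernelEvent
Code: 144
Parameter 1:      3003
Parameter 2:      ffffb08ade1b75f0
Parameter 3:      40010000
Parameter 4:      0
"""

sig = parse_signature(report)

# Assumption: Parameter 2 looks like a memory address and may differ between
# events, so only the event name, code and first parameter are compared.
key = (sig["Problem Event Name"], sig["Code"], sig["Parameter 1"])
print(key)
```

Two events that produce the same `key` are likely worth reporting together when asking for help.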

There are a few other dates that say that Windows was improperly shut down. This is weird, since we just shut it down through the start menu and then wait, but we've had random weirdness with that before. My PC never goes into low power mode; it just turns off the screen. Hers does, but it also sometimes boots back up on its own. I don't think it's related to our current problems, but it could be related to some of the errors.
 
Disable Armoury Crate.
Disable ASUS Update Helper.
Check Task Manager > Startup and Task Scheduler for any unexpected or unknown apps being launched at startup or triggered to run later via Task Scheduler.

The unexpected shutdowns could simply be a bad/loose component or connection somewhere.

Power down, unplug, open the case.
Clean out dust and debris.
Verify by sight and feel that all connectors, cards, RAM, jumpers, and case connections are fully and firmly in place.
Use a bright flashlight to inspect everywhere for signs of damage.

Run "dism" and "sfc /scannow" again.

The objective is simply to reduce what is started at boot and limit what you open thereafter.

And to eliminate possible culprits such as loose connections.
 
I see you noticed my updates. My previous post now has all the information that was still missing, as well as all the stuff I found in the reliability history. There seems to be some pretty important stuff in there. Sorry for making you go back and forth like this, it just worked out that way.

I uninstalled Armoury Crate on the 14th, so I guess that takes care of that. It might be worth noting that I never installed it on my own system. Did that also take care of the update helper? I don't see it anywhere.

I disabled a few things on startup in Task Manager, but there was nothing unexpected in there. It should now be down to the bare essentials.

As for Task Scheduler, I disabled an MMO-related piece of software. I'm not sure if it'll stay down, but I'll see. It's currently set to 'disabled', but its trigger is still the log on of any user. Is that enough?
There are two ASUS tasks in there: NoiseCancelingEngine and P508PowerAgent_sdk. I don't know what they are, so I've left them alone. I don't know what else to check for. I've never used Task Scheduler before.

I'll check for loose connectors tomorrow, if that's okay. At least the RAM is seated.

Ran "dism" and "sfc /scannow" again. No errors.

I've also got you some logs. I hope you can access these.
WHEA-logger files, saved from event viewer: https://files.catbox.moe/z1k1cs.zip
Minidump logs, including the Jan 4th log: https://files.catbox.moe/s8j8h0.zip
SrtTrail: https://files.catbox.moe/02wsjq.zip
CBS.log: https://files.catbox.moe/q5267k.zip

I do need to stress that while Windows might say that it was improperly shut down, we didn't see such a thing happen, apart from the bluescreen and the explorer crash loop.
 