Question: Conflicts, crashes and deteriorating FPS?

Pandawaffle

Specs
MOBO: MSI PRO B650-P WIFI
CPU: AMD Ryzen 7 7700X 8-Core
GPU: GeForce RTX 2070 8GB
RAM: Corsair XFlares 32 GB DDR5
OS: Win11 Home x64
Disk: Samsung SSD 850 EVO 500GB (OS)
Disk: Fikwot GN960 1TB (Gaming)
Disk: Seagate ST2000VX008-2E3164 (misc storage)
Disk: Western Digital WDS100T2G0A-00JG30 (misc storage)

History
I have been struggling with frame drop issues for months and honestly can't remember if there was a triggering event. I would be playing an intensive game (Helldivers 2/Space Marine 2) and my frames would occasionally plummet from my normal 90-110fps to 50-60fps, along with a chugging pattern where 1-2 seconds of gameplay would run fluidly followed by 1 second of total chug. The longer I played an intensive game, the more likely it was to occur. Restarting the game would not fix the problem; I would have to restart the PC to resolve it. I have been a mix of busy and lazy so I just lived with the issue.
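One way to put numbers on the fluid/chug pattern described above is to bucket per-frame times into one-second windows and flag the slow ones. This is only a rough sketch: it assumes you can export frame times in milliseconds from whatever frame-time capture tool you use, and the function names and the 60 fps floor are hypothetical choices, not taken from any tool.

```python
def fps_per_second(frame_times_ms):
    """Average FPS for each consecutive window of roughly one second."""
    buckets = []
    elapsed = 0.0  # milliseconds accumulated in the current window
    count = 0      # frames seen in the current window
    for frame_time in frame_times_ms:
        elapsed += frame_time
        count += 1
        if elapsed >= 1000.0:
            buckets.append(count * 1000.0 / elapsed)
            elapsed = 0.0
            count = 0
    return buckets  # a trailing partial window is dropped


def chug_seconds(frame_times_ms, fps_floor=60.0):
    """Indices of the one-second windows whose average FPS is below fps_floor."""
    return [i for i, fps in enumerate(fps_per_second(frame_times_ms))
            if fps < fps_floor]
```

If the flagged windows come at a regular interval (every second or third one, say), that matches the "fluid, then chug" rhythm rather than random hitching.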
About a month ago I got fed up with it and looked into it. My research suggested a memory issue, so I used Memtest86.
No errors found, and the problem persisted, but I started to pin the frame rate drops to this warning in Event Viewer:

Nvidia GL Driver (Warning)
Ran out of Memory
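Since this warning points at GPU memory, it may help to log VRAM usage while a game runs and see whether it climbs as the FPS drops. The sketch below assumes `nvidia-smi` (installed with the Nvidia driver) is on the PATH. The `--query-gpu` and `--format` options are real `nvidia-smi` flags, but the helper names are hypothetical, and the numbers reported under Windows may not match what the game itself sees.

```python
import subprocess
import time

# Ask nvidia-smi for used and total VRAM in MiB, without header or units.
QUERY = ["nvidia-smi", "--query-gpu=memory.used,memory.total",
         "--format=csv,noheader,nounits"]


def parse_memory_line(line):
    """Turn one 'used, total' CSV line into a tuple of ints (MiB)."""
    used, total = (int(field.strip()) for field in line.split(","))
    return used, total


def vram_usage_fraction(line):
    """Fraction of VRAM in use, between 0.0 and 1.0."""
    used, total = parse_memory_line(line)
    return used / total


def sample_vram():
    """Run nvidia-smi once and return (used, total) for the first GPU."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_memory_line(result.stdout.strip().splitlines()[0])


def log_vram(interval_s=5.0, samples=120):
    """Print a timestamped VRAM reading every interval_s seconds."""
    for _ in range(samples):
        used, total = sample_vram()
        print(f"{time.strftime('%H:%M:%S')} {used}/{total} MiB")
        time.sleep(interval_s)
```

Calling `log_vram()` in a terminal before launching the game gives a rough timeline to compare against the moment the frame rate falls.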

My research told me the cause might be AMD and Nvidia drivers being in conflict or incomplete. About a week ago I ran DDU in safe mode and reinstalled my graphics drivers, clearing out the old Nvidia drivers and installing fresh ones.
Problem persisted. So with more research, I followed a suggestion to disable my iGPU. I did that 2 days ago and HD2 ran fine & stable. I thought I had fixed it! Only note was that I was previously running my 2nd monitor from the iGPU, so I had to plug it into the 2070 after disabling it.
I left my gaming PC off for 2 days while traveling and came back today, booted HD2 and had a crash on startup, with these Events:

“Nvlddmkm” (Error) x2
The description for Event ID 153 from source nvlddmkm cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

If the event originated on another computer, the display information had to be saved with the event.

The following information was included with the event:

\Device\000000d3
Error occurred on GPUID: 100

The message resource is present but the message was not found in the message table

“Application Error” x15
Faulting application name: helldivers2.exe, version: 1.8.27959.0, time stamp: 0x67b4c753
Faulting module name: ntdll.dll, version: 10.0.26100.3037, time stamp: 0x95e6c489
Exception code: 0xc000000d
Fault offset: 0x0000000000034703
Faulting process id: 0x2594
Faulting application start time: 0x1DB8AE116959C71
Faulting application path: L:\Program Files (x86)\steamapps\common\Helldivers 2\bin\helldivers2.exe
Faulting module path: C:\WINDOWS\SYSTEM32\ntdll.dll
Report Id: dc181075-35c0-4e81-b12e-7992be318bee
Faulting package full name:
Faulting package-relative application ID:
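The hex fields in these events can be decoded to make them easier to search on. As a sketch under a few assumptions: the exception code is treated as an NTSTATUS value (0xC000000D is STATUS_INVALID_PARAMETER), the "start time" is treated as a Windows FILETIME (100 ns ticks since 1601-01-01 UTC), and the application "time stamp" is treated as a Unix-epoch link timestamp. The lookup table is deliberately tiny, and the function names are hypothetical. The time stamp on system DLLs such as ntdll.dll may be a build hash rather than a real date, so only decode the game's.

```python
from datetime import datetime, timedelta, timezone

# A few well-known NTSTATUS values; this is not an exhaustive table.
NTSTATUS_NAMES = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
    0xC000000D: "STATUS_INVALID_PARAMETER",
    0xC0000188: "STATUS_LOG_FILE_FULL",
}

# FILETIME counts 100-nanosecond ticks from this epoch.
FILETIME_EPOCH = datetime(1601, 1, 1, tzinfo=timezone.utc)


def ntstatus_name(code_hex):
    """Map a hex string such as '0xc000000d' to a symbolic name, if known."""
    return NTSTATUS_NAMES.get(int(code_hex, 16), "unknown")


def filetime_to_utc(filetime_hex):
    """Convert a hex FILETIME (assumed format of 'start time') to UTC."""
    ticks = int(filetime_hex, 16)
    return FILETIME_EPOCH + timedelta(microseconds=ticks // 10)


def link_timestamp_to_utc(stamp_hex):
    """Convert a hex Unix-epoch timestamp (assumed format of 'time stamp') to UTC."""
    return datetime.fromtimestamp(int(stamp_hex, 16), tz=timezone.utc)
```

For example, `ntstatus_name("0xc000000d")` gives a name to search for, and `filetime_to_utc("0x1DB8AE116959C71")` shows when the crashed process was started, which can be lined up against the nvlddmkm errors.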

I tried relaunching the game and had a system crash, both monitors going black before an automatic restart. These were the Events:

Kernel-EventTracing (Warning)
The maximum file size for session "PerfDiag Logger" has been reached. As a result, events might be lost (not logged) to file "C:\WINDOWS\system32\WDI\LogFiles\ShutdownPerfDiagLogger.etl". The maximum files size is currently set to 20971520 bytes.

Kernel-EventTracing (Error)
Session "PerfDiag Logger" stopped due to the following error: 0xC0000188

“Application Error” x~100
Very similar to the helldivers crash logs from before

Volmgr (Error)
Dump file generation succeeded.

Kernel-Power (Error)
The system has rebooted without cleanly shutting down first. This error could be caused if the system stopped responding, crashed, or lost power unexpectedly.

Tried launching HD2 & other intensive games and they all start at 100+ FPS and then progressively fall to 30-40fps, along with the same fluid-chugging pattern every 2 seconds.
Went back to DDU, cleaned out and reinstalled the AMD drivers, and then re-enabled the iGPU, set to “Game Mode.” Both monitors are still running from the 2070.

HD2 and other games still lose FPS over time, settling at ~60fps without any fluid-chugging patterns. Restarting the games and/or system did not change this pattern. It’s playable, but definitely less than what the system is capable of.
I’ve stopped getting any memory or Nvidia-related events, so I have nothing left to search on and I seem to have made the problem consistently worse trying to fix it on my own, so I would really appreciate some guidance.
 
You forgot to mention the make, model and age of the PSU in your build.

Disk: Fikwot GN960 1TB (Gaming)
That's probably your weakest link considering it's a cheap no name SSD.

Your specs look like you recently built the system. If so, did you migrate the OS drive without reinstalling the OS? If you did, this is the main reason for all your issues. Recreate your bootable installer for the OS, disconnect all drives except for the one you wish to install the OS onto, install the OS in offline mode, then manually install all drivers while still in offline mode (in an elevated command).

I ran DDU in safe mode and reinstalled my graphics drivers.
Use DDU in Safe Mode to remove all GPU drivers (Intel, AMD and Nvidia), then manually install the latest GPU driver sourced from Nvidia's support site in an elevated command, i.e. right-click the installer > Run as Administrator.

BIOS version for your motherboard?