[SOLVED] Several driver crashes

-Arke-

Distinguished
Dec 26, 2015
197
0
18,710
Hello there!

I've been suffering from Windows crashes for some time now, and even though I feel like I've had some periods in which it didn't happen, eventually, it keeps crashing sooner or later.

It happens mostly while I am playing, and the kind of errors I've experienced have been variating a little bit. Most of the time, the screen just freezes and then turns black. Sound usually keeps working for a little longer but then freezes as well.

A few weeks ago I had two crashes with visual artifacts but It haven't happened ever since. Today I had a crash in which I managed to alt+sup and close my Windows sesion, but right after that it still went into a blue screen and had to reset.

These crashes started a long ago with a Windows reinstall, but I've tried a lot of things (uninstalling the drivers with DDU and getting them again, installing a different version of windows from zero, replacing most hardware pieces...) but still I keep suffering from these errors.

At first they would freeze my computer and I'd had to turn it off by brute force, and no error would appear on the registry. Now I've checked again just in case and I have some crash reports.

All of them seem to be related my GPU drivers, but I've changed my GPU from AMD to Nvidia, and some time later back to AMD again (Rx 580 to gtx 1070 to RX 5700), and every time I've done that I've used DDU to remove the old drivers and install the new ones, to no avail.

Here are some of my errors. Thanks in advance:

On Wed 21/10/2020 15:27:10 your computer crashed or a problem was reported
crash dump file: C:\WINDOWS\Minidump\102120-6875-01.dmp
This was probably caused by the following module: amdkmdag.sys (0xFFFFF8062E831940)
Bugcheck code: 0x116 (0xFFFFDB0FF1735050, 0xFFFFF8062E831940, 0xFFFFFFFFC0000001, 0x3)
Error: VIDEO_TDR_ERROR
file path: C:\WINDOWS\System32\DriverStore\FileRepository\c0359518.inf_amd64_20c585088e1e4b33\B359297\amdkmdag.sys
product: ATI Radeon Family
company: Advanced Micro Devices, Inc.
description: ATI Radeon Kernel Mode Driver
Bug check description: This indicates that an attempt to reset the display driver and recover from a timeout failed.
A third party driver was identified as the probable root cause of this system error. It is suggested you look for an update for the following driver: amdkmdag.sys (ATI Radeon Kernel Mode Driver, Advanced Micro Devices, Inc.).
Google query: amdkmdag.sys Advanced Micro Devices, Inc. VIDEO_TDR_ERROR

On Tue 15/09/2020 11:59:39 your computer crashed or a problem was reported
crash dump file: C:\WINDOWS\Minidump\091520-7812-01.dmp
This was probably caused by the following module: dxgkrnl.sys (dxgkrnl+0x26128E)
Bugcheck code: 0x116 (0xFFFFDC861C329460, 0xFFFFF806372009F0, 0x0, 0xD)
Error: VIDEO_TDR_ERROR
file path: C:\WINDOWS\system32\drivers\dxgkrnl.sys
product: Microsoft® Windows® Operating System
company: Microsoft Corporation
description: DirectX Graphics Kernel
Bug check description: This indicates that an attempt to reset the display driver and recover from a timeout failed.
The crash took place in a Microsoft module. Your system configuration may be incorrect. Possibly this problem is caused by another driver on your system that cannot be identified at this time.

On Sun 23/08/2020 1:40:28 your computer crashed or a problem was reported
crash dump file: C:\WINDOWS\Minidump\082320-6921-01.dmp
This was probably caused by the following module: dxgkrnl.sys (dxgkrnl+0x26129E)
Bugcheck code: 0x116 (0xFFFF9B8B33B49010, 0xFFFFF80219D30818, 0xFFFFFFFFC0000001, 0x3)
Error: VIDEO_TDR_ERROR
file path: C:\WINDOWS\system32\drivers\dxgkrnl.sys
product: Microsoft® Windows® Operating System
company: Microsoft Corporation
description: DirectX Graphics Kernel
Bug check description: This indicates that an attempt to reset the display driver and recover from a timeout failed.
The crash took place in a Microsoft module. Your system configuration may be incorrect. Possibly this problem is caused by another driver on your system that cannot be identified at this

Also, this is my system:
Computer name: DESKTOP-KBJ10TS
Windows version: Windows 10 , 10.0, build: 19041
Windows dir: C:\WINDOWS
Hardware: ASUSTeK COMPUTER INC., ROG STRIX Z370-F GAMING
CPU: GenuineIntel Intel(R) Core(TM) i7-8700 CPU @ 3.20GHz Intel8664, level: 6
12 logical processors, active mask: 4095
RAM: 17110679552 bytes (15,9GB)
PSU: EVGA Supernova G2 750W
GPU: RX 5700

These crashes started to happen around Jan or Feb with some good periods in between. I've replaced not only most of my Software since it started, but also my Hardware, with the RAM and PSU being the unique exceptions, but I've had them tested with Prime96, Aida64 and Memtest and everything felt on spot. I'm baffled at this point and I don't really know what else could I do.

Thanks in advance for your time, guys.

These were my threads earlier this year:
Mars: https://forums.tomshardware.com/threads/gpu-crashing-on-heavy-load.3579355/
July: https://forums.tomshardware.com/thr...time-and-ive-tried-almost-everything.3628488/
 
Last edited:
Solution
were the errors using AMD also TDR errors? maybe fault isn't the cards but something else in pc. swapping from amd to nvidia and getting same errors would make me think it wasn't the gpu.

how old is PSU? if its one of the two parts you haven't replaced, its always possible. PSU are difficult to test but they are crucial part of PC so if they being flaky the entire PC will too.

Some ways to test PSU
the paper clip method - https://forums.tomshardware.com/threads/what-is-the-paperclip-method-of-testing-a-psu.1336402/

or multimeter,

or in the BIOS to check the +3.3V, +5V, and +12V. - https://www.lifewire.com/power-supply-voltage-tolerances-2624583

none of these are perfect as they aren't under load. I would run HWINFO...

Colif

Win 11 Master
Moderator
were the errors using AMD also TDR errors? maybe fault isn't the cards but something else in pc. swapping from amd to nvidia and getting same errors would make me think it wasn't the gpu.

how old is PSU? if its one of the two parts you haven't replaced, its always possible. PSU are difficult to test but they are crucial part of PC so if they being flaky the entire PC will too.

Some ways to test PSU
the paper clip method - https://forums.tomshardware.com/threads/what-is-the-paperclip-method-of-testing-a-psu.1336402/

or multimeter,

or in the BIOS to check the +3.3V, +5V, and +12V. - https://www.lifewire.com/power-supply-voltage-tolerances-2624583

none of these are perfect as they aren't under load. I would run HWINFO - https://www.hwinfo.com/download/
when you run it, click on sensors only and then run
in the next window, on bottom riught are a series of icons.
Click on the icon to right of the clock, that shows "logging start" if you hover mouse over it
this opens file explorer, make a file name you will remember and do the things you know will likely cause error.
You can read the output in excel or google docs, its a CSV file. If you want you can upload it to a file sharing site and show link, I might see something odd.
might need to run logging a few times if you can't recreate problem easily.
 

-Arke-

Distinguished
Dec 26, 2015
197
0
18,710
were the errors using AMD also TDR errors? maybe fault isn't the cards but something else in pc. swapping from amd to nvidia and getting same errors would make me think it wasn't the gpu.

how old is PSU? if its one of the two parts you haven't replaced, its always possible. PSU are difficult to test but they are crucial part of PC so if they being flaky the entire PC will too.

Some ways to test PSU
the paper clip method - https://forums.tomshardware.com/threads/what-is-the-paperclip-method-of-testing-a-psu.1336402/

or multimeter,

or in the BIOS to check the +3.3V, +5V, and +12V. - https://www.lifewire.com/power-supply-voltage-tolerances-2624583

none of these are perfect as they aren't under load. I would run HWINFO - https://www.hwinfo.com/download/
(...)

Hey, thanks for helping me!
I don't know if the errors were the same during my Nvidia time,to be honest. They certainly felt the same, but since I've seen some differences it might as well be something else causing different kind of errors, I don't know. I managed to recrod a video with my mobile phone when the artifact crash happened, I can upload it if you think it could be of use.

I'll try to do those things this weekend. My GPU isn't too old, it's still covered by the EVGA warranty; got it during mid 2015.
I'll try to answer during the weekend with some more info...take care ^^

Edit:
A bit of necroposting but still, since I got my problem solved it's worth something in case face a similar problem. I just got my GPU refunded and started using my girlfriend's RX 570. Not a single problem and been going this good for some months now, so I think we can safely blame my old RX 5700.
 
Last edited:
Solution