[SOLVED] Is my GPu (EVGA 1080ti FTW) going bad?

axwack

Honorable
Feb 1, 2016
10
0
10,510
I have a weird problem happening. I thought it was Citrix but it doesn't seem to be. When I play any Uplay game or any game, my machine doesn't BSOD but the machine completely crashes with the monitors going completely white. I can still hear something going on with the system such as Discord because I can still hear players but then after about 2 minutes the system is unresponsive.

I just changed the thermal paste on the GPU. I used Heaven benchmark and did a memtest. All passed. I'm in the process of moving the system boot to a new harddrive. I'm out of ideas. Is the GPU/ going bad or is it the motherboard. The temps for the GPU are at 30c idle and 50-60 when playing games.

Any thoughts would be greatr.
 
I was about to post an issue that seem quite similar. Difference is that it happens to me at random, about once every day. I have two monitors and they turn different shades of gray, rather than clean white. Guessing my GPU is dying, but I have also no clue at this point.

Edit: Also running the 1080 ti (Gigabyte)
 
axwack I don't know the answer off the top of my head. But I believe in the "shotgun approach" to problems like these, meaning try everything. Am I correct in assuming that you've already run DDU, removed all drivers under Windows safe mode, redownloaded fresh copy of drivers and reinstalled? If so then I would do the following:

1. Post all specs here, including power supply manufacturer and model. Take two photos of of the inside of your build: One of the entire inside of your build and the other of your power supply label. Upload those two images to imgur and share the link here.

2. Update BIOS to latest stable version, according to instructions on motherboard site.

3. Obtain another SSD. Download latest chipset drivers, update SSD firmware using manufacturer's SSD toolbox. Install clean install of Windows 10, Uplay launcher, and game that keeps crashing. Install discord app and whichever applications could be the culprit, and try to reproduce the problem. If you don't have spare SSD laying around then I would consider purchasing a Samsung 860 Evo 500 GB SSD from Amazon for ~$60.

4. Register your 1080 Ti on the EVGA website. If your tests on the new storage and Windows 10 install still crashes, then reach out to EVGA. Even if you're just past the warranty period, they "might" still honor it. I've literally purchased approximately 18 PSUs and GPUs from EVGA over the past five years, and they have fist class support. If it turns out that your GPU is the problem, and EVGA approves the RMA request, then ask for a "advanced RMA" so that you're not without a graphics card during this process.

I know my suggestions above sound extreme but it is what I would actually do, in as short a period of time as possible. If you suspect your GPU is the culprit, then you have no time to waste.

Addendum: The reason that I recommend an entirely new (different) storage device, as opposed to wiping and reinstalling Windows on your current SSD, is in case my suggestion doesn't work. If the problem persists then you can simply pop in your original storage and continue troubleshooting the problem without starting from scratch.
 
Last edited:
  • Like
Reactions: geiroaa
I found I could trigger the crash by entering a game and then quitting. I did a clean wipe (DDU) of Nvidia driver 460.89 and reverted back to 457.51. This seems to have solved the issue for me.

There are a few on the Nvidia forums who had similar issues with 1080 ti on 460.89.
Windows dump gave this for google to pick up LKD_0x141_Tdr:6_IMAGE_nvlddmkm.sys_Pascal_PagingCE

My bad if your issue is entirely different. If you applied thermal paste again I would assume you have already tried reverting the drivers.
 
Last edited:
  • Like
Reactions: spentshells
I found I could trigger the crash by entering a game and then quitting. I did a clean wipe (DDU) of Nvidia driver 460.89 and reverted back to 457.51. This seems to have solved the issue for me.

There are a few on the Nvidia forums who had similar issues with 1080 ti on 460.89.
Windows dump gave this for google to pick up LKD_0x141_Tdr:6_IMAGE_nvlddmkm.sys_Pascal_PagingCE

My bad if your issue is entirely different. If you applied thermal paste again I would assume you have already tried reverting the drivers.
That sounds more like what I am experiencing. How can I go back a few versions of the drivers?
 
  • Like
Reactions: rcald2000
Solution
Uninstall your current driver (460.89) using this application, should be quite straight forward,
https://www.guru3d.com/files-details/display-driver-uninstaller-download.html

Find the 457.51 driver through here (460.79 may also run ok, I have not tested),
https://www.nvidia.com/Download/Find.aspx?lang=en-uk

I downgraded to this version 457.51 and have had no problems (knock on wood). This seems to be the culprit.

I don't know why this doesn't register itself in the event viewer so peope can debug it. Thanks @geiroa