Question: Help with underclocking a GPU that is probably failing

pmacarthur

Distinguished
Apr 27, 2011
18
0
18,510
Hi there,

I have been troubleshooting an issue with my system for several months: it instantly restarts at random when I play more demanding games, though nothing out of the ordinary. I can play LoL and Minecraft with no problem, but if I load up something like Stellaris or PUBG, I eventually hit the restart issue. Long story short: I started eliminating components, reinstalled Windows, and did a whole bunch of troubleshooting with temps that kept leading me to dead ends. I even RMA'd my PSU (Corsair AX760 Platinum) and the issue persisted.

I eventually got to the graphics card and happened to have an old GTX 580 that I could swap in to test. The system didn't shut down with the 580 installed (although it ran a lot slower). I spoke with EVGA, and they basically told me to pound sand because my warranty had expired 6 months before I "officially" reported the issue to them.

I decided to try to figure out what specifically is wrong with the card because, frankly, I don't want to spend the better part of a grand on a new one right now. I installed EVGA Precision and started looking at temps. Now, I'm no expert with overclocking. I could probably follow some guides and find some success, but I don't really understand a lot of the basic concepts around overclocking GPUs. Because of this I have always run my cards at stock settings.

I have no baseline for how my card behaved originally, so I have been relying on the card's spec sheet to tell me what the clock speed should be. When I started monitoring the card with Precision XOC, I noticed that the GPU clock fluctuates wildly. The card sits at around 800 MHz at system idle or when playing LoL. However, when I play more demanding games it shoots up to around 1300 MHz. This seems to be outside what the spec sheet says for the card, so I'm wondering if something is out of whack. I am thinking I could lower the clock speed on the card a bit and stop the restarts from happening.

I don't think the shutdowns are being caused by a thermal issue. I tried cranking the fans up to 100% and the issue still happened while the card was close to 60°C. I understand I can turn on K-Boost to force the card to its max clock. I have done that and will now test whether I can get a reboot to happen while playing LoL.

If anyone has any input on this, I would appreciate it. I don't really know how to decrease my clock speed.

EVGA GTX 980 Ti SC
1328 MHz GPU clock
3505 MHz mem clock
1.162 V voltage
36°C
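
The readings above were taken by eye from the monitoring tool. A minimal sketch of logging them to disk instead, so the last clock, temperature, and power values before an instant restart are still there after the reboot. It assumes `nvidia-smi` (installed with the NVIDIA driver) is on the PATH and that there is a single GPU. The query fields are real `nvidia-smi` fields, but the function names and the `gpu_log.csv` file name are made up for illustration, and some fields may come back as unsupported on a given card.

```python
import csv
import os
import subprocess
import time
from datetime import datetime

# Real nvidia-smi --query-gpu fields; everything else here is illustrative.
FIELDS = ["clocks.gr", "clocks.mem", "temperature.gpu", "power.draw"]


def parse_sample(line):
    """Turn one CSV line from nvidia-smi into a dict of floats (None if unreadable)."""
    values = [v.strip() for v in line.split(",")]
    sample = {}
    for name, value in zip(FIELDS, values):
        try:
            sample[name] = float(value)
        except ValueError:
            # A field the card does not report arrives as text, not a number.
            sample[name] = None
    return sample


def read_sample():
    """Query the first GPU once and return the parsed readings."""
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=" + ",".join(FIELDS),
         "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    )
    return parse_sample(result.stdout.splitlines()[0])


def log_forever(path="gpu_log.csv", interval_s=1.0):
    """Append one timestamped row per interval until interrupted."""
    with open(path, "a", newline="") as f:
        writer = csv.writer(f)
        while True:
            sample = read_sample()
            writer.writerow([datetime.now().isoformat()] + [sample[k] for k in FIELDS])
            # Push the row to disk right away; a hard reset discards buffered writes.
            f.flush()
            os.fsync(f.fileno())
            time.sleep(interval_s)
```

Running `log_forever()` in a console window before starting a game, then opening `gpu_log.csv` after the next restart, shows what the card was doing in its final second or so.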
 
So, if this is your card, then I am surprised you are getting a GPU clock of 1328. That seems to be beyond the specs. That could easily cause a crash without overheating the GPU.

The GPU clock is normally controlled through firmware on the card (sometimes called VGA BIOS). If you have ever messed with this, there is the potential you have loaded an incompatible one. Secondarily, the clock can be controlled by software (like the Precision software you installed).

I think it is more likely a software issue, so I would first look for and remove any overclocking software. Are you getting drivers from EVGA or NVIDIA directly? If not, I would download and install the newest drivers. Lastly, you can check your card specs and VGA BIOS version with something like GPU-Z. Here is a list of BIOS ... your version should match the one for your card. If something seems wrong, proceed with caution. Loading the wrong BIOS can brick your card.
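
For anyone who would rather check from a command prompt than from GPU-Z, here is a minimal sketch of reading the same identifiers with `nvidia-smi`. The `name`, `vbios_version`, and `driver_version` query fields are real `nvidia-smi` fields; the helper names `parse_identity` and `read_identity` are made up for illustration. It assumes a single GPU and that `nvidia-smi` is on the PATH (on older Windows drivers it may live under the NVIDIA install folder instead).

```python
import subprocess

# Real nvidia-smi --query-gpu fields; the function names below are illustrative.
QUERY = "name,vbios_version,driver_version"


def parse_identity(line):
    """Split one 'name, vbios, driver' CSV line into a labelled dict."""
    name, vbios, driver = [part.strip() for part in line.split(",")]
    return {"name": name, "vbios_version": vbios, "driver_version": driver}


def read_identity():
    """Ask nvidia-smi for the first GPU's name, VBIOS version, and driver version."""
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=" + QUERY, "--format=csv,noheader"],
        capture_output=True, text=True, check=True,
    )
    return parse_identity(result.stdout.splitlines()[0])
```

`read_identity()` only reads information. It does not flash or change anything on the card, so it is safe to run before deciding whether the BIOS needs attention.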
 

knickle

Distinguished
Jan 25, 2008
227
12
18,695
^^^THIS^^^

1328 MHz is a 20% overclock from factory, in which case you don't need an underclock. You need to return to the factory settings.

Are you sure you quoted the correct numbers?
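
The 20% figure can be checked with a little arithmetic. This is a sketch under stated assumptions: the 1328 MHz reading comes from the thread, but the 1102 MHz base and 1190 MHz boost clocks are assumed spec-sheet values for the EVGA GTX 980 Ti SC and should be verified against the product page for the exact card. Note also that NVIDIA's rated boost clock is a typical figure rather than a hard ceiling, so some excess over it is expected from GPU Boost.

```python
def percent_over(observed_mhz, rated_mhz):
    """Percentage by which an observed clock exceeds a rated clock."""
    return (observed_mhz - rated_mhz) / rated_mhz * 100.0


OBSERVED_MHZ = 1328       # reading reported in the thread
ASSUMED_BASE_MHZ = 1102   # assumption: spec-sheet base clock, verify for your card
ASSUMED_BOOST_MHZ = 1190  # assumption: spec-sheet boost clock, verify for your card

print(f"over base:  {percent_over(OBSERVED_MHZ, ASSUMED_BASE_MHZ):.1f}%")   # 20.5%
print(f"over boost: {percent_over(OBSERVED_MHZ, ASSUMED_BOOST_MHZ):.1f}%")  # 11.6%
```

Under those assumptions, the "20%" matches a comparison against the base clock. Measured against the rated boost clock, the excess is closer to 12%.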
 

pmacarthur

I initially get the drivers from EVGA, but then I install GeForce Experience and get new releases directly from NVIDIA. I have the most current driver installed and have just loaded GPU-Z. I also uninstalled Precision, but the clock is still over the stock OC. How would I go about lowering the clock speed back to factory if Precision already has the offset at 0?

I haven't done anything to the firmware on the card. I do seem to have a version listed on the site you linked. Is this something I should try flashing to the card to correct the issue? Are these BIOS updates typically something that gets published by EVGA/NVIDIA?

Screens:
GPU 1 & 2 are the settings before I uninstalled Precision. GPU 3 & 4 are after I uninstalled Precision.
wkrb0h.png

2i12l91.png


11u7ptw.png
nofvrn.png
 

pmacarthur

That's the thing... it's at the default settings in Precision. The clock still goes past the listed specs. I know Pascal cards can do this, but I didn't think the 980 could. What would be the best way to underclock it? I can move the clock offset slider to negative, but that only goes to -90 MHz, which still isn't within spec.
 

knickle

Is it possible that you have a second overclocking utility that you forgot about, one that might be overriding the EVGA Precision settings? I would start by looking through your installed programs for tools you forgot about. MSI Afterburner, for example.
 

pmacarthur

Unfortunately no, I just clean-installed Windows to eliminate anything like that. The only thing that was installed was Precision. It's also important to note that I have never overclocked this card. The only thing I used Precision for was fan speed.
 

knickle

In the short term I would try lowering your temperature target to something around 70°C and hitting apply. That might help limit the clock speed and make the card more stable. I would then hop on EVGA's support forum and post your issue. There's probably more than one expert there who can help you with this. https://forums.evga.com/
 

knickle

I assumed you turned on K-Boost in order to show what the frequencies are when you were taking screenshots in Windows. As far as I know, K-Boost will run the card at MAX 24/7, wasting more energy.