Random BSOD after OCing a CPU

Venen

Distinguished
Nov 13, 2010
18
0
18,510
I've recently changed cooling of my CPU (4770k) from radiator + fan to Thermaltake water loop 3.0 extreme, primarily to reduce the noise.

It works well, and even during stress tests the temperature stayed under 50c. So I've switched the profile in BIOS from balanced to performance, which clocked the cpu from 3.9 to 4.3
I've let it ride for 2 hours on a stress test in Intel Extreme Tuning with no issues - usage was stable at 100% and temperature at 63c.

But then while playing a game, the PC froze and from minidump I've gathered:


On Fri 05/05/2017 14:50:02 your computer crashed
crash dump file: C:\Windows\Minidump\050517-15734-01.dmp
This was probably caused by the following module: hal.dll (hal+0x3627F)
Bugcheck code: 0x124 (0x0, 0xFFFFB70007D39028, 0xBF800000, 0x124)
Error: WHEA_UNCORRECTABLE_ERROR
file path: C:\Windows\system32\hal.dll
product: Microsoft® Windows® Operating System
company: Microsoft Corporation
description: Hardware Abstraction Layer DLL

So I unrolled it, let the PC ride on default for just over a day - no crashes.

Figured I will retry to reproduce it, so switched the setting in bios to performance again and let 4 hours cpu stress test run through. No problems detected, all good, so I started working on some stuff and... then it crashed when cpu was mostly idle.

Here is what came from the minidump:

On Fri 05/05/2017 18:42:10 your computer crashed
crash dump file: C:\Windows\memory.dmp
This was probably caused by the following module: hal.dll (hal!HalBugCheckSystem+0xCF)
Bugcheck code: 0x124 (0x0, 0xFFFFE08846F67028, 0xBF800000, 0x124)
Error: WHEA_UNCORRECTABLE_ERROR
file path: C:\Windows\system32\hal.dll
product: Microsoft® Windows® Operating System
company: Microsoft Corporation
description: Hardware Abstraction Layer DLL

And the minidump itself: https://jumpshare.com/v/KGjVhVN1d8fiq0bnRiX7

Any help appreciated, and hints at what may be the issue and how to resolve it.
 
Solution
So I would try one of two things. The easiest thing is to dial back the core multiplier to 42x perhaps. Optionally, if you want to keep the 43x multiplier, then your chip might need more than the 1.225v the utility suggests. You could try bumping this up to 1.25v. If you can stress test for hours and only get to 63c, then you have some room to play with the voltage to stabilize the system. 1.3v would be the extreme end for 4.3ghz and I really doubt that would be necessary.

So try 1.25v if you want to keep your multiplier and see if it helps stabilize.
This is almost certainly because of the overclocking. Something with the multiplier or voltage, but the 0x124 error is almost exclusive to an overclocked system. Did you increase the core voltage?
 

I've only switched the preset in BIOS and that did increase the voltage. The values it set were:

Max Turbo cpu speed: 4.307,,
core multiplier 41x
cache multiplier 39x
core voltage 1.225V

(picture version with more details: https://i.imgur.com/hgzhdgW.png)

 
So I would try one of two things. The easiest thing is to dial back the core multiplier to 42x perhaps. Optionally, if you want to keep the 43x multiplier, then your chip might need more than the 1.225v the utility suggests. You could try bumping this up to 1.25v. If you can stress test for hours and only get to 63c, then you have some room to play with the voltage to stabilize the system. 1.3v would be the extreme end for 4.3ghz and I really doubt that would be necessary.

So try 1.25v if you want to keep your multiplier and see if it helps stabilize.
 
Solution

TRENDING THREADS