Crash once or twice a week

ShockSA

Honorable
Jan 8, 2014
5
0
10,510
Hello, I have a problem and it's driving me insane, I keep getting a random BSOD once or twice a week and I didn't find any solution for this, I tried memtest+ and prime95 my CPU, stress test my GPU and all is fine, the BSOD is really random sometimes when I'm watching a stream or playing a game or simply browsing, any help is appreciated.

p.s. : nothing is overclocked, only GPU is factory clocked.

My System specs:
CPU : i5-4670k
GPU : GeForce 680 GTX
RAM : 16GB 1600hz Kingston
MOBO : Gigabyte Z87x D3H
PSU : Antec 750W

I uploaded the memory dumps to Mediafire :
http://www.mediafire.com/download/p0xwh0eh86ocphc/mymemd.zip

==================================================
Dump File : 010814-18484-01.dmp
Crash Time : 08/01/2014 09:08:38 AM
Bug Check String :
Bug Check Code : 0x00000124
Parameter 1 : 00000000`00000000
Parameter 2 : ffffe000`040da028
Parameter 3 : 00000000`be000000
Parameter 4 : 00000000`0100110a
Caused By Driver : hal.dll
Caused By Address : hal.dll+35cdf
File Description : Hardware Abstraction Layer DLL
Product Name : Microsoft® Windows® Operating System
Company : Microsoft Corporation
File Version : 6.3.9600.16408 (winblue_gdr.130920-1803)
Processor : x64
Crash Address : ntoskrnl.exe+14dca0
Stack Address 1 :
Stack Address 2 :
Stack Address 3 :
Computer Name :
Full Path : C:\Windows\Minidump\010814-18484-01.dmp
Processors Count : 4
Major Version : 15
Minor Version : 9600
Dump File Size : 302,128
Dump File Time : 08/01/2014 09:31:44 AM
==================================================

 
Solution
edit: 1.2v vcore should be fine for i5-4670K, give it a shot for several days and see if you get a crash. is so post the crash again.
edit #2 the power supply basically will supply, 3.3v, 5v and 12v the lower voltage would be regulated by the motherboard
and I would expect that a power feed would have to fail or realy fluctuate to cause a problem.

The voltage reported in the crash dump may not be correct, I would not get a new PSU, but would increase the power to the CPU slightly just like you were doing a slight overclock, but don't actually overclock the CPU, just increase the voltage and run the system for a few days and see if you get the same crash.
What I saw in the debugger was voltage 1.1v, I can not see what you see...
edit #2 BIOS reports that your CPU is at 1.1V this might be too low (BIOS update or set the value directly)
[Processor Information (Type 4) - Length 42 - Handle 0041h]
Socket Designation SOCKET 0
Processor Type Central Processor
Processor Family c6h - Specification Reserved
Processor Manufacturer Intel
Processor ID c3060300fffbebbf
Processor Version Intel(R) Core(TM) i5-4670K CPU @ 3.40GHz
Processor Voltage 8bh - 1.1V
External Clock 100MHz
Max Speed 7000MHz
Current Speed 3800MHz
Status Enabled Populated




edit: forgot to mention: you could also get this if the voltage to the processor is too low. (power issues)
each bugcheck showed cache errors on different cache memory banks:
GCACHEL2_ERR_ERR (Proc 1 Bank 5)
GCACHEL2_ERR_ERR (Proc 1 Bank 7)
GCACHEL2_ERR_ERR (Proc 3 Bank 5)

this is why i suggest that your cpu may have a low voltage. if they were all the same cache bank it would be a cpu cache memory error.

I would do a BIOS upgrade, the BIOS will control the voltage to the CPU



CPU reported a error in it cache windows produced a bugcheck in response:
CPU generated cache error is hard to help. I would look for heat related issues
maybe:
- underclock cpu
- check for overheating, dust on fan/cpu cooler
- update BIOS to get CPU microcode fixes
- you might be able to disable the cache but CPU would run slowly
- you might be able to disable a core, or tell the OS to run on one cpu by setting the affinity

all three bugchecks had the same root cause

debug info:
WHEA_UNCORRECTABLE_ERROR (124)
A fatal hardware error has occurred. Parameter 1 identifies the type of error
source that reported the error. Parameter 2 holds the address of the
WHEA_ERROR_RECORD structure that describes the error conditon.
Arguments:
Arg1: 0000000000000000, Machine Check Exception
Arg2: ffffe0000469b028, Address of the WHEA_ERROR_RECORD structure.
Arg3: 00000000be000000, High order 32-bits of the MCi_STATUS value.
Arg4: 000000000100110a, Low order 32-bits of the MCi_STATUS value.

---------
Machine ID Information [From Smbios 2.7, DMIVersion 39, Size=3056]
BiosMajorRelease = 4
BiosMinorRelease = 6
BiosVendor = American Megatrends Inc.
BiosVersion = F7
BiosReleaseDate = 08/02/2013
SystemManufacturer = Gigabyte Technology Co., Ltd.
SystemProductName = Z87X-D3H
SystemFamily = To be filled by O.E.M.
SystemVersion = To be filled by O.E.M.
SystemSKU = To be filled by O.E.M.
BaseBoardManufacturer = Gigabyte Technology Co., Ltd.
BaseBoardProduct = Z87X-D3H-CF
BaseBoardVersion = x.x
1: kd> !sysinfo cpuspeed
CPUID: "Intel(R) Core(TM) i5-4670K CPU @ 3.40GHz"
MaxSpeed: 3400
CurrentSpeed: 3400
------------------

===============================================================================
Common Platform Error Record @ ffffe0000469b028
-------------------------------------------------------------------------------
Record Id : 01cefe0a7b8dab1a
Severity : Fatal (1)
Length : 928
Creator : Microsoft
Notify Type : Machine Check Exception
Timestamp : 12/21/2013 6:12:22 (UTC)
Flags : 0x00000000

===============================================================================
Section 0 : Processor Generic
-------------------------------------------------------------------------------
Descriptor @ ffffe0000469b0a8
Section @ ffffe0000469b180
Offset : 344
Length : 192
Flags : 0x00000001 Primary
Severity : Fatal

Proc. Type : x86/x64
Instr. Set : x64
Error Type : Cache error
Operation : Generic
Flags : 0x00
Level : 2
CPU Version : 0x00000000000306c3
Processor ID : 0x0000000000000002

===============================================================================
Section 1 : x86/x64 Processor Specific
-------------------------------------------------------------------------------
Descriptor @ ffffe0000469b0f0
Section @ ffffe0000469b240
Offset : 536
Length : 128
Flags : 0x00000000
Severity : Fatal

Local APIC Id : 0x0000000000000002
CPU Id : c3 06 03 00 00 08 10 02 - bf fb da 7f ff fb eb bf
00 00 00 00 00 00 00 00 - 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 - 00 00 00 00 00 00 00 00

Proc. Info 0 @ ffffe0000469b240

===============================================================================
Section 2 : x86/x64 MCA
-------------------------------------------------------------------------------
Descriptor @ ffffe0000469b138
Section @ ffffe0000469b2c0
Offset : 664
Length : 264
Flags : 0x00000000
Severity : Fatal

Error : GCACHEL2_ERR_ERR (Proc 1 Bank 7)
Status : 0xbe0000000100110a
Address : 0x00000003d8f1bac0
Misc. : 0x0000015086000086

 
edit: 1.2v vcore should be fine for i5-4670K, give it a shot for several days and see if you get a crash. is so post the crash again.
edit #2 the power supply basically will supply, 3.3v, 5v and 12v the lower voltage would be regulated by the motherboard
and I would expect that a power feed would have to fail or realy fluctuate to cause a problem.

The voltage reported in the crash dump may not be correct, I would not get a new PSU, but would increase the power to the CPU slightly just like you were doing a slight overclock, but don't actually overclock the CPU, just increase the voltage and run the system for a few days and see if you get the same crash.
What I saw in the debugger was voltage 1.1v, I can not see what you see in your BIOS, they could be adjusting the numbers or telling the OS a hardcoded number.
So find your BIOS setting and increase it by a small amount, something like .05 volts and retest.
You might also look up what people that overclock actually set their VCORE to. (just to find a safe range of values)
(It has been years since I have done any real overclocking, so I don't feel I can give good advice on the voltage)
I do however feel that your problem is well explained by overheating or a low core voltage. (cpu cache failures all in different memory banks in the cpu cache) if they were all the same cache bank then I would think it was a fried memory location in the cache (not the case this time)

 
Solution


new crash after bumping the cpu voltage to 1.25
https://www.mediafire.com/?jm3dp95f0kta3k4
also my cpu temp never went above 62c on prime95
also i found F8d beta bios update in the notes it says :
Fixes:
- Improve system compatibility
should i use beta bios?
 
memory dump file:011014-16390-01.dmp
System Uptime: 1 days 2:45:15.326
Built by: 9600.16452.amd64fre.winblue_gdr.131030-1505
WHEA_UNCORRECTABLE_ERROR (124)

Error : GCACHEL2_ERR_ERR (Proc 3 Bank 5)
Status : 0xbe0000000100110a
Address : 0x00000002dcde9200
Misc. : 0x000001d084000086

CPUID: "Intel(R) Core(TM) i5-4670K CPU @ 3.40GHz"
CurrentSpeed: 3400
Processor Voltage 8ch - 1.2V
Manufacturer Gigabyte Technology Co., Ltd.
Product Name Z87X-D3H
Product Z87X-D3H-CF

BIOS Version F7
BIOS Major Revision 4
BIOS Minor Revision 6

BIOS Release Date 08/02/2013


error looks like the last but cpu 3 cache bank 5 had the error.

your OS binaries in memory did not show any corruption.

I would try the beta BIOS and see if that helps, be sure to grab any intel chipset updates
if neither of those fix the issue, it is looking like a cache error inside the cpu.



 

TRENDING THREADS