X99 Taichi PCIe WHEA 17 + BSOD124

midnight410

Commendable
Dec 9, 2016
4
0
1,510
Hi guys
My system
Asrock x99 Taichi
i7 6800k
Asus gtx1070 strix

I have is the same timing problem like the x79 chipset had.
On pcie gen3 windows gets flooted with whea 17 errors until bsod 124.
If i change the pcie to gen1 all working fine.

Does someone know the problem yet?
Or is there an existing fix?
 

midnight410

Commendable
Dec 9, 2016
4
0
1,510


i flashed the newest p1.4 its the same problem.
I know the problem from x79 extreme 6 board.
its a timing problem between windows and the pcie config, answering the windows alive question. the pcie answers too late and windows reduce the communication speed with the driver, this brings the whea errors. after enough errors windows shuts down with bsod124.
for the extreme 6 there was a beta bios 1.0b with a fix for this problem.
 

midnight410

Commendable
Dec 9, 2016
4
0
1,510


the dump files only told that the problem is between hardware id from the pcie root port of the graphics card and the process of the hal.dll.
as you can read on the first post if i put the pcie manually to gen1 the errors and the bsod are gone.
But thats not a solution only a workaround
 

midnight410

Commendable
Dec 9, 2016
4
0
1,510

hi Paul

today i did further testing and going to the dump files.
every time ist the same Errors.

1. its on the pcie root port of the graficcard
"WHEA_UNCORRECTABLE_ERROR (124)
A fatal hardware error has occurred. Parameter 1 identifies the type of error
source that reported the error. Parameter 2 holds the address of the
WHEA_ERROR_RECORD structure that describes the error conditon.
Arguments:
Arg1: 0000000000000004, PCI Express Error
Arg2: ffffb0875f1a6038, Address of the WHEA_ERROR_RECORD structure.
Arg3: 0000000000000000
Arg4: 0000000000000000"

2. its a Timing Problem, the error codes Show it definetly, as you can see the "UC" and "CTO" is set
"0: kd> !errrec ffffb0875f1a6038
================================================== =============================
Common Platform Error Record @ ffffb0875f1a6038
-------------------------------------------------------------------------------
Record Id : 01d25dacbb613311
Severity : Fatal (1)
Length : 672
Creator : Microsoft
Notify Type : PCI Express Error
Timestamp : 12/24/2016 6:46:35 (UTC)
Flags : 0x00000000
================================================== =============================
Section 0 : PCI Express
-------------------------------------------------------------------------------
Descriptor @ ffffb0875f1a60b8
Section @ ffffb0875f1a6148
Offset : 272
Length : 208
Flags : 0x00000001 Primary
Severity : Recoverable
Port Type : Root Port
Version : 1.1
Command/Status: 0x0010/0x0407
Device Id :
VenId:DevId : 8086:6f08
Class code : 030400
Function No : 0x00
Device No : 0x03
Segment : 0x0000
Primary Bus : 0x00
Second. Bus : 0x00
Slot : 0x0000
Dev. Serial # : 0000000000000000
Express Capability Information @ ffffb0875f1a617c
Device Caps : 00008001 Role-Based Error Reporting: 1
Device Ctl : 0027 ur FE NF CE
Dev Status : 0003 ur fe NF CE
Root Ctl : 0008 fs nfs cs
AER Information @ ffffb0875f1a61b8
Uncorrectable Error Status : 00014000 ur ecrc mtlp rof UC ca CTO fcp ptlp sd dlp und
Uncorrectable Error Mask : 00000000 ur ecrc mtlp rof uc ca cto fcp ptlp sd dlp und
Uncorrectable Error Severity : 00062010 ur ecrc MTLP ROF uc ca cto FCP ptlp sd DLP und
Correctable Error Status : 00002000 ADV rtto rnro dllp tlp re
Correctable Error Mask : 00000000 adv rtto rnro dllp tlp re
Caps & Control : 000000ae ecrcchken ECRCCHKCAP ecrcgenen ECRCGENCAP FEP
Header Log : 4a000001 03000004 00180040 00000000
Root Error Command : 00000000 fen nfen cen
Root Error Status : 00000000 MSG# 00 fer nfer fuf mur ur mcr cer
Correctable Error Source ID : 00,00,00
Correctable Error Source ID : 00,00,00"


yesterday i xchanged the graficcard but still the same Problem
ist only working on gen1