Question WHEA Logger 18 crashes

ZsigiPajka · Nov 11, 2022

Computer Type: Desktop
GPU: AMD RX 5700
CPU: RYZEN 5 3600 6 CORE 12 THREADS
Motherboard: MSI B450 TOMAHAWK MAX (MS-7C02)
BIOS Version: 3.F0 07/23/2022
RAM: CORSAIR VENGEANCE® RGB PRO 16GB (2 x 8GB) DDR4 DRAM 3200MHz C16
PSU: SilverStone Essential Gold ET650-HG 650W
Operating System & Version: Windows 11 Pro 22H2 Build 22621.819
GPU Drivers: 21.50.21.11-220428a-382767C-AMD-Software-Adrenalin-Edition
Chipset Drivers: AMD B450 CHIPSET DRIVERS VERSION 4.09.23.507
Background Applications: LGHUB, iCUE AMD Adrenaline, HWINFO64, Malwarebytes
Description of Original Problem: Every time I try to play any game, it randomly crashes the whole system. It can happen after one hour of playing, or after a few days. Every time it's WHEA Logger 18 with random APIC ID (it basically crashed on every core ID ). The PC was build in 2019, these errors started at the beginning of this year.
Troubleshooting: I have tried every software fix like sfc. I also turned off cpu overclock and pbo. Temps are always stable, around 65 °C. Gpu is overclocked to 1850mhz with 1100mv max, vram is on 1860mhz. Gpu temps are also stable, around 75°C. I did all OCCT tests without errors. I also did MEMTEST86 for multiple hours without errors. I also tried changing RAM timing and voltage. RAM is currently running default at 2666mhz, no XMP. At this point I am trying to determine which HW component is responsible for the errors. I have a suspicion it's the RAM or CPU, but i am not 100% sure. So any suggestions what should I try next ?

white.a.drew · Nov 11, 2022

ZsigiPajka said:
Computer Type: Desktop
GPU: AMD RX 5700
CPU: RYZEN 5 3600 6 CORE 12 THREADS
Motherboard: MSI B450 TOMAHAWK MAX (MS-7C02)
BIOS Version: 3.F0 07/23/2022
RAM: CORSAIR VENGEANCE® RGB PRO 16GB (2 x 8GB) DDR4 DRAM 3200MHz C16
PSU: SilverStone Essential Gold ET650-HG 650W
Operating System & Version: Windows 11 Pro 22H2 Build 22621.819
GPU Drivers: 21.50.21.11-220428a-382767C-AMD-Software-Adrenalin-Edition
Chipset Drivers: AMD B450 CHIPSET DRIVERS VERSION 4.09.23.507
Background Applications: LGHUB, iCUE AMD Adrenaline, HWINFO64, Malwarebytes
Description of Original Problem: Every time I try to play any game, it randomly crashes the whole system. It can happen after one hour of playing, or after a few days. Every time it's WHEA Logger 18 with random APIC ID (it basically crashed on every core ID ). The PC was build in 2019, these errors started at the beginning of this year.
Troubleshooting: I have tried every software fix like sfc. I also turned off cpu overclock and pbo. Temps are always stable, around 65 °C. Gpu is overclocked to 1850mhz with 1100mv max, vram is on 1860mhz. Gpu temps are also stable, around 75°C. I did all OCCT tests without errors. I also did MEMTEST86 for multiple hours without errors. I also tried changing RAM timing and voltage. RAM is currently running default at 2666mhz, no XMP. At this point I am trying to determine which HW component is responsible for the errors. I have a suspicion it's the RAM or CPU, but i am not 100% sure. So any suggestions what should I try next ?

How long has the gpu been oc'd and what volts

ZsigiPajka · Nov 11, 2022

white.a.drew said:
How long has the gpu been oc'd and what volts

Pretty much since the start. The OC is:
1850mhz clock speed at 1100 mV (It was also set at 1200 mV in the past - same behavior). Vram at 1860mhz. Power limit +20%. Temperature was never above 80°C, most of the time it's around 70°C. Do you think the Whea crashes are from GPU OC ?

scout_03 · Nov 11, 2022

what happens if you put back gpu to default settings .

white.a.drew · Nov 11, 2022

ZsigiPajka said:
Pretty much since the start. The OC is:
1850mhz clock speed at 1100 mV (It was also set at 1200 mV in the past - same behavior). Vram at 1860mhz. Power limit +20%. Temperature was never above 80°C, most of the time it's around 70°C. Do you think the Whea crashes are from GPU OC ?

The gpu itself was maxing at 80c your may have fried the vram

ZsigiPajka · Nov 11, 2022

scout_03 said:
what happens if you put back gpu to default settings .

I will try it tomorrow. But it would still be weird, that normal GPU OC is causing system crash. It also crashes randomly, most of the time the GPU is not at full load. I have also done multiple benchmarks after the OC without issue.]

ZsigiPajka · Nov 11, 2022

white.a.drew said:
The gpu itself was maxing at 80c your may have fried the vram

Fried at 80°C? Google says max junction temp is 110°C, and it was never at 80 for a long time.

white.a.drew · Nov 11, 2022

ZsigiPajka said:
Fried at 80°C? Google says max junction temp is 110°C, and it was never at 80 for a long time.

The temp you are reading isnt the ram thats the CPU temp of the card. The vram usually is 15 hotter then that no oc but yours is oced which adds more volts which adds more heat you may have fried the vram not the cpu

ZsigiPajka · Nov 11, 2022

white.a.drew said:
The temp you are reading isnt the ram thats the CPU temp of the card. The vram usually is 15 hotter then that no oc but yours is oced which adds more volts which adds more heat you may have fried the vram not the cpu

Okay, so fired Vram can cause WHEA crashes? And how do I verified, that it's fired?

ZsigiPajka · Nov 11, 2022

ZsigiPajka said:
Okay, so fired Vram can cause WHEA crashes? And how do I verified, that it's fired?

Also, is vram hotter than hot spot?

white.a.drew · Nov 11, 2022

ZsigiPajka said:
Also, is vram hotter than hot spot?

Not always but it is possible for the vram to be hotter then the hot spot... It depends on where the hotspot sensors is on this specific model... The best way I know to test if the vram is fried is a super strong bench test that will do a focused tested on eack part of the gpu not just stress the gpu but will stress the CPU vram fans it will stress the whole gpu and give feed back I think occt might have one that can it but I'm not sure anymore it's been a while since I have looked into this type of testing

ZsigiPajka · Nov 11, 2022

white.a.drew said:
Not always but it is possible for the vram to be hotter then the hot spot... It depends on where the hotspot sensors is on this specific model... The best way I know to test if the vram is fried is a super strong bench test that will do a focused tested on eack part of the gpu not just stress the gpu but will stress the CPU vram fans it will stress the whole gpu and give feed back I think occt might have one that can it but I'm not sure anymore it's been a while since I have looked into this type of testing

I have done OCCT for one hour (limit for free version) without problem. I can try it again, maybe for more than once.

white.a.drew · Nov 11, 2022

ZsigiPajka said:
I have done OCCT for one hour (limit for free version) without problem. I can try it again, maybe for more than once.

[SOLVED] - Whats a good stress test for VRAM?

I've been using furmark for stress tests during my ocing, but according to HWMonitor, my system's VRAM utilization is at like 20%. Any idea of a benchmark that will butcher that as well?

forums.tomshardware.com

This is the only thing I'm finding on stressing your vram right now

ZsigiPajka · Nov 12, 2022

white.a.drew said:
[SOLVED] - Whats a good stress test for VRAM?

I've been using furmark for stress tests during my ocing, but according to HWMonitor, my system's VRAM utilization is at like 20%. Any idea of a benchmark that will butcher that as well?

forums.tomshardware.com

This is the only thing I'm finding on stressing your vram right now

This software is only utilizing 1GB of vram for me (It's from 2009). I have done another OCCT Vram test without issue.

scout_03 · Nov 12, 2022

see this could help https://www.makeuseof.com/gpu-overheating-causes-symptoms/

ZsigiPajka · Nov 12, 2022

I don't think the GPU is overheating. It's mostly around 65°C and 75°C for a brief moments(I have radeon chill turned on) in GPU intensive games. 80°C was reached only during OCCT benchmark. I also did some more research and I suspect it could be also faulty PSU that causes random crashes.

white.a.drew · Nov 12, 2022

ZsigiPajka said:
I don't think the GPU is overheating. It's mostly around 65°C and 75°C for a brief moments(I have radeon chill turned on) in GPU intensive games. 80°C was reached only during OCCT benchmark. I also did some more research and I suspect it could be also faulty PSU that causes random crashes.

I was thinking it could be psu related in the begining however I highly don't it. Your psu is a highly qualified psu... That doesn't mean it can't be faulty but generally less likely to be faulty

ZsigiPajka · Nov 12, 2022

I will try different PSU for now and see, if it really is bad PSU. Actually, bad PSU is the only thing making sense to me, no other component is really showing any type of problem.

Search

Question WHEA Logger 18 crashes

ZsigiPajka

white.a.drew

Dignified

ZsigiPajka

scout_03

Titan

white.a.drew

Dignified

ZsigiPajka

ZsigiPajka

white.a.drew

Dignified

ZsigiPajka

ZsigiPajka

white.a.drew

Dignified

ZsigiPajka

white.a.drew

Dignified

[SOLVED] - Whats a good stress test for VRAM?

ZsigiPajka

[SOLVED] - Whats a good stress test for VRAM?

scout_03

Titan

ZsigiPajka

white.a.drew

Dignified

ZsigiPajka

TRENDING THREADS

Latest posts

Moderators online

Share this page