Looking for any sort of suggestion or guidance on this thus far.
Initial Setup
CPU - Ryzen 5 3600x
Cooler - Noctua U9S push+pull
Motherboard - MSI B450i Gaming Plus AC (latest beta bios, issue occur on both latest non-beta and beta bios. Contacting AMD they told me to update to beta...)
Ram - Corsair 32GB 3200 C16 (CMK32GX4M2B3200C16)
PSU - Corsair SF600 Platinum
GFX - Gigabyte 1080 Turbo OC
*never OC'd, only XMP turned on
Issue
Started observing random BSOD and hard reboot (almost always overnight when I leave it on to do low load work such as download and/or automation task)
I then turned off XMP, and observed same result
I then did a clean Windows 10 Pro install and observed same result
Diagnostic
Memtest86 pass repeatedly (both XMP on and off)
OCCT pass for free 1 hour test (both XMP on and off)
Linpack Xtreme found no issue 1+ hour (XMP off, haven't tried XMP on)
Prime95. This is where I can get a consistent failure.
Ran PSU and GFX on another old Intel/DDR3 system on a Ubuntu USB boot, no issue with Prime95 over 8+hours on blend
Ran the original system on same Ubuntu USB boot, same failure with Prime95 as observed in Windows
Prime95 Failure
Blend will always produce failure on Worker #3 or #4. Sometime after long enough, it will fail both and Ryzen Master will show Core 2 idle.
Large FFT test also produces the same result consistently.
Other FFT settings seems to be less consistent so far, sometime it'll be fine for a long time, sometimes not and always fail on the same Core.
A strange thing is, I noticed sometime it'll fail Worker #3 immediately on stress start and almost always it'll fail consistently around 1hour 40min into the test when it's doing "Self-test 896k"
Fix Attempt
Bump SoC voltage in 0.0125v increment offset. All same result. I stopped at 0.05v
Bump VRAM voltage in 0.01v increment from 1.35v. All same result. I stopped at 1.38v
Contact with AMD/Retailer
I contacted AMD describing my issue. They told me to update BIOs to beta and give that a try, this resulted in same outcome. They then told me it'll be easier to contact retailer for warranty otherwise I have to pay shipping to Singapore... (I'm located in Australia)
I contacted my retailer (Computer Alliance from QLD, Australia) and they told me to ship CPU/RAM/Mobo to them so they can test it out. I did so and after a week they told me they couldn't reproduce the error and said it might be my PSU or GFX... (I'm a bit surprised they said it might be GFX when I described my consistent failure in Prime95... They used a generic 550w PSU and a GTX 1660), so they sent it back. I immediately was able to reproduce the issue again...
I've now purchased a new SF750 and will be trying that out when it arrives (although I'm skeptical since PSU and GFX is rock solid with an old Intel/DDR3).
Not sure what else I can try... I don't have a spare CPU/Ram/Mobo... getting really depressed about this whole thing, especially when retailer came back with no issue...
Initial Setup
CPU - Ryzen 5 3600x
Cooler - Noctua U9S push+pull
Motherboard - MSI B450i Gaming Plus AC (latest beta bios, issue occur on both latest non-beta and beta bios. Contacting AMD they told me to update to beta...)
Ram - Corsair 32GB 3200 C16 (CMK32GX4M2B3200C16)
PSU - Corsair SF600 Platinum
GFX - Gigabyte 1080 Turbo OC
*never OC'd, only XMP turned on
Issue
Started observing random BSOD and hard reboot (almost always overnight when I leave it on to do low load work such as download and/or automation task)
I then turned off XMP, and observed same result
I then did a clean Windows 10 Pro install and observed same result
Diagnostic
Memtest86 pass repeatedly (both XMP on and off)
OCCT pass for free 1 hour test (both XMP on and off)
Linpack Xtreme found no issue 1+ hour (XMP off, haven't tried XMP on)
Prime95. This is where I can get a consistent failure.
Ran PSU and GFX on another old Intel/DDR3 system on a Ubuntu USB boot, no issue with Prime95 over 8+hours on blend
Ran the original system on same Ubuntu USB boot, same failure with Prime95 as observed in Windows
Prime95 Failure
Blend will always produce failure on Worker #3 or #4. Sometime after long enough, it will fail both and Ryzen Master will show Core 2 idle.
Large FFT test also produces the same result consistently.
Other FFT settings seems to be less consistent so far, sometime it'll be fine for a long time, sometimes not and always fail on the same Core.
A strange thing is, I noticed sometime it'll fail Worker #3 immediately on stress start and almost always it'll fail consistently around 1hour 40min into the test when it's doing "Self-test 896k"
Fix Attempt
Bump SoC voltage in 0.0125v increment offset. All same result. I stopped at 0.05v
Bump VRAM voltage in 0.01v increment from 1.35v. All same result. I stopped at 1.38v
Contact with AMD/Retailer
I contacted AMD describing my issue. They told me to update BIOs to beta and give that a try, this resulted in same outcome. They then told me it'll be easier to contact retailer for warranty otherwise I have to pay shipping to Singapore... (I'm located in Australia)
I contacted my retailer (Computer Alliance from QLD, Australia) and they told me to ship CPU/RAM/Mobo to them so they can test it out. I did so and after a week they told me they couldn't reproduce the error and said it might be my PSU or GFX... (I'm a bit surprised they said it might be GFX when I described my consistent failure in Prime95... They used a generic 550w PSU and a GTX 1660), so they sent it back. I immediately was able to reproduce the issue again...
I've now purchased a new SF750 and will be trying that out when it arrives (although I'm skeptical since PSU and GFX is rock solid with an old Intel/DDR3).
Not sure what else I can try... I don't have a spare CPU/Ram/Mobo... getting really depressed about this whole thing, especially when retailer came back with no issue...