O Gods and Gurus of Tom's Hardware. I humbly supplicate your help with my current hardware issues :
This computer is now freezing up. Sometimes (very rarely) a BSOD happens. I have managed to get the minidump for 3 of these events (Win10) and have included the links.
Specs
MBD: MSI Godlike X570
CPU: AMD Ryzen 5950X
GPU: ASUS TUF 4090
RAM: 64GB (G.Skill F4-4000C18-32GTZR x 2)
PSU: Superflower 1600w 80+ Titanium Leadex
OS: Win 10/Win 11 Dual Boot
The CPU is cooled with an Arctic Freezer 2 480mm radiator and 2 fans. I have not changed the thermal paste in 4 years, but there do not seem to be any thermal issues with the CPU as far as I can see.
I am currently running with an open case. There are plenty of fans and airflow. Case is a Fractal Design Meshify 2(?). One of the Meshifys.
Windows 10(old 20H2 I think?) + Windows 11(latest updates/patches as of 02/09/2024 and drivers)
I do not know the BIOS version, but I have not had any need to update the BIOS in many years now. So expect a very old version, definitely from 2021. Given what is happening I am loath to attempt a BIOS upgrade in case a fault happens mid-update and bricks the board.
I have run memtest86 twice and it did not show any faults after 4 tests (which takes about 8 hours).
I have the latest Windows 11, fully updated with all updates, patches and the latest version of drivers for all hardware. I also have an old version of Windows 10 on dual boot. They are on separate disks and the fault occurs on both.
For a long time the motherboard has been complaining "Over Current have been detected on USB device, Reboot on 15 seconds to protect your mainboard.". Normally I keep the computer powered for days (weeks, months?) on end as I host a web site and for various other reasons. After a few tries the error clears and the computer boots and, up until now, has worked normally. This has been the case for maybe 18 months or so now. I do not know if this fault is related to the freezing or is a separate, parallel issue. It is a hassle but not a major problem, and not worth spending hundreds on a new board to fix if I can possibly avoid it (unless it is the cause of this problem).
This problem seemed to start after I installed Windows 11 on dual boot, although I suspect this is a coincidence and not the cause, or that the several reboots have exacerbated an existing problem into a crisis.
Most of the time there is no BSOD log but I have managed to capture the minidump on 3 occasions :
https://www.dropbox.com/scl/fi/svnl...6-01.dmp?rlkey=5latzaqpx7mes43hvn26gyhh9&dl=1
https://www.dropbox.com/scl/fi/y0a1...3-01.dmp?rlkey=u9atnwvqfh79zy1lvgdpwb8cv&dl=1
https://www.dropbox.com/scl/fi/a94n...5-01.dmp?rlkey=9y27zxyjal47u7nhe1ipfenlw&dl=1
I have run HWiNFO64; the sensor readings from when I started logging up to the crash/freeze are here (CSV file): https://www.dropbox.com/scl/fi/ocqz..._log.CSV?rlkey=f3zuv610vngflfgk1zsf5scmo&dl=1
I also ran the other HWiNFO program which creates the graphs and did actually manage to catch a freeze event mid-capture; you can see some of the bitmaps are all zero. I include 2 captures, one with the 2 minutes before the freeze and the next for the 2 minutes of the freeze:
https://www.dropbox.com/scl/fi/7387...3h46.zip?rlkey=5gtevy6uiv1zyorub9sv4npan&dl=1
In all cases the logs look OK, with no usage or temperature spikes. However, I am not a hardware expert, just a boring software guy, and cannot tell if there are abnormal voltages or such, as I do not know what the correct baseline is.
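In case it saves anyone opening the whole CSV, here is a minimal Python sketch of how the log could be boiled down to a min/max/last reading per sensor, so out-of-range voltages stand out. It is only a sketch: the function name `summarize_columns` is my own, and it assumes the log is a plain comma-separated file with one header row of sensor names. I have not checked it against every HWiNFO export layout.

```python
import csv
import io


def summarize_columns(csv_text):
    """Return {column_name: (min, max, last)} for every column holding numbers.

    Assumes a single header row followed by one row per sample.
    Non-numeric cells (timestamps, "Yes"/"No" flags, blanks) are skipped.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    stats = {}
    for row in reader:
        for name, raw in row.items():
            # DictReader uses a None key for surplus cells and a None
            # value for missing ones; ignore both.
            if name is None or raw is None:
                continue
            try:
                value = float(raw)
            except ValueError:
                continue
            lo, hi, _ = stats.get(name, (value, value, value))
            stats[name] = (min(lo, value), max(hi, value), value)
    return stats
```

To use it, read the log file into a string (the encoding may need to be something other than UTF-8, depending on how the file was exported), pass it to `summarize_columns`, and print the entries whose names contain "[V]". The "last" value is the final reading before the freeze, which is probably the most interesting one to compare against the min and max.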
I do NOT want to have to buy a new board/CPU and upgrade this now. I absolutely do not fancy the 9950X; I really want this to last another year until AMD can come up with something a bit more sexy, so the cheapest possible hardware replacement to get this fixed and going would be ideal. I spend all my time on this machine and it has not been off since late 2020. I was lucky enough to be one of the first to get my hands on a 5950X, so that is the timeframe. So having this not working is a major disruption to me and to others who rely on my website (I fell out with my ISP, which is why I am self-hosting, and I am NOT going back!).
Any help or good ideas would be greatly appreciated. It lasts only a few minutes now, if I can even boot, but if there is something I can do to get more diagnostic info, or anything else that might help, I will make it happen if I can.
Thank you so much for any help or insight you can give for this problem.