Question: System freezing up and I'm not sure how to troubleshoot

Apr 6, 2024
Hello,

I recently built a Threadripper rig for work-related tasks, but over the last several months it has had a weird problem that causes the system to freeze up. I describe the problem below and list my system components. Any help troubleshooting this issue would be fantastic. I also don't know if this is the right section of the forum for this, so if it is not, I apologize.

Normal Computer Usage:
I should first describe how the system is used, in case that helps diagnose the issue.
The system is primarily used for CAM software and some CAD: Autodesk PowerMill, Autodesk Fusion, and Geomagic DesignX.
The software shows no apparent bottlenecks. In fact, this system runs all of these programs much faster than my previous system did, which makes sense for a Threadripper rig.

Problem at hand:
The system essentially 'freezes' up. 'Freezing' may not be the right word, but it is the best one I can think of to describe it.

Essentially, the system becomes unresponsive.
Say the system 'freezes' while I am using it. I can still move the mouse cursor around the screen with no problem. I can minimize windows and sometimes bring them back up. But I cannot open any new program, cannot save in any program, and cannot really do anything in any program except move the mouse. The display still shows whatever was on screen before the freeze. The sure way to know the system has 'frozen' is to turn my keyboard off and back on: the light that is normally solid white when connected blinks instead.

Now say I leave the system overnight. I usually leave it on and running overnight so that programs like PowerMill can run calculations or other long actions. When I leave, I turn off the monitor, keyboard, and mouse. When I return hours later, the monitor powers on but shows only a black screen instead of what was open the day before. The keyboard behaves as described in the previous paragraph: the connection light blinks instead of staying lit, indicating the keyboard is not connected.

I do not believe the system is overheating. I have since replaced the Noctua NH-U9 TR4-SP3 air cooler I originally ran on the processor with a water-cooled radiator. I have also monitored the CPU temperature with Core Temp while running the system, and temperatures are actually lower with the water cooler than with the Noctua, yet the problem persists. So I assume it is not an overheating problem.

The problem is also not consistent; at least, I have yet to discover anything that triggers it. Sometimes the system runs perfectly for a week or more before this happens. Today, as I write this, it ran for about 3 hours before it occurred. I know it is not tied to the system sitting idle for long periods, because there have been times when it 'froze' while I was using it, and I determined I had to restart by checking the keyboard connection light (described above).

I do not know where else to begin to troubleshoot this system to determine why this happens.

Maybe one of my RAM sticks is faulty and causes the system to freeze? I legitimately have no idea. I have enough computer knowledge to believe I put the right components together for this system, but not enough to determine the best way to solve this issue.

I have looked online at various resources to try and see if anyone else has had this problem, but nothing sounds like what I am experiencing.

Computer Hardware:
AMD Ryzen Threadripper PRO 5955WX - sWRX8 socket
ASUS Pro WS WRX80E-SAGE SE WIFI II (Workstation motherboard)
G.SKILL TridentZ RGB Series 128GB DDR4 3200 (PC4 25600)
WD Blue 3D NAND 500GB Internal SSD (this was actually a salvage from my old system. I did do a fresh Windows OS install when I moved this over to the new system).
WD Black 2TB Performance Desktop Hard Drive (2nd HDD)
Phanteks Glacier One 420D30 Premium (CPU radiator)
RM1000x PSU (1000 watt)
NVIDIA RTX A4000


If anyone is kind enough to help figure this out and needs more information, please do not hesitate to ask, and I will try to pull it from the computer. This is an incredibly annoying issue.

Thank you.
 
Welcome to the forums, newcomer!

How old is the PSU in your build? What BIOS version is your motherboard on at the moment? Which Windows are you running? What version (not edition) of the OS are you on?

G.SKILL TridentZ RGB Series 128GB DDR4 3200 (PC4 25600)
Got a link to this kit?
 
PSU is only a few months old. Bought it shortly after setting up this new build. Previous PSU was around 850W.

BIOS Version 1106 x64 (02/10/2023)

Windows 11 Home
Version: 10.0.22621 Build 22621
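
For reference, the build number above can be translated into the feature-release name that the "version, not edition" question is asking about. This is a minimal sketch, not an official tool: the table and the `release_for` helper are illustrative names, and the table lists only a few Windows 11 builds.

```python
# Windows 11 build numbers and the feature release each one corresponds to.
# Only a handful of builds are listed; extend the table as needed.
WINDOWS_11_RELEASES = {
    22000: "21H2",
    22621: "22H2",
    22631: "23H2",
    26100: "24H2",
}


def release_for(version_string: str) -> str:
    """Map a string such as '10.0.22621' to its Windows 11 release label."""
    build = int(version_string.strip().split(".")[2])
    # Builds missing from the table are reported as "unknown".
    return WINDOWS_11_RELEASES.get(build, "unknown")


print(release_for("10.0.22621"))  # prints "22H2"
```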

I forgot to mention: my primary keyboard is Bluetooth. When the computer has 'frozen' like this, I tried a wired keyboard to see if it would bring the computer 'back on.' It did not.

Link to RAM kit: https://www.newegg.com/g-skill-128gb-288-pin-ddr4-sdram/p/N82E16820232989?Item=N82E16820232989
 
Try turning off power management in Windows so that the system does not go to sleep.
There are good free memory diagnostic programs available. Ask the manufacturer which they suggest.
If that does not fix your issue, verify that your BIOS is up to date and that the firmware on your hardware components is up to date.
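
The first and last suggestions can be scripted. What follows is a rough sketch under stated assumptions, not a definitive fix: it uses the built-in Windows `powercfg /change` options to set the sleep, hibernate, and disk timeouts on AC power to 0 (never), and a PowerShell `Win32_BIOS` query to read the installed BIOS version. The helper names `apply_power_settings` and `bios_is_outdated` are made up for illustration, and the numeric comparison assumes plain numeric version strings like the "1106" reported above. By default the script only prints the commands; actually running them may need an elevated prompt.

```python
import platform
import subprocess

# powercfg invocations that set the AC-power sleep, hibernate, and disk
# timeouts to 0, which Windows treats as "never".
POWERCFG_COMMANDS = [
    ["powercfg", "/change", "standby-timeout-ac", "0"],
    ["powercfg", "/change", "hibernate-timeout-ac", "0"],
    ["powercfg", "/change", "disk-timeout-ac", "0"],
]

# PowerShell query for the BIOS version string the board reports.
BIOS_QUERY = [
    "powershell", "-NoProfile", "-Command",
    "(Get-CimInstance Win32_BIOS).SMBIOSBIOSVersion",
]


def bios_is_outdated(installed: str, latest: str) -> bool:
    """Compare purely numeric version strings (assumption: e.g. '1106')."""
    return int(installed) < int(latest)


def apply_power_settings(dry_run: bool = True) -> list[str]:
    """Return the commands as text; run them only on Windows when dry_run is False."""
    lines = [" ".join(cmd) for cmd in POWERCFG_COMMANDS]
    if not dry_run and platform.system() == "Windows":
        for cmd in POWERCFG_COMMANDS:
            subprocess.run(cmd, check=True)
    return lines


if __name__ == "__main__":
    # Dry run: show what would be executed without changing anything.
    for line in apply_power_settings(dry_run=True):
        print(line)
    print(" ".join(BIOS_QUERY))
```

The latest BIOS version still has to be read from the motherboard vendor's support page and passed to `bios_is_outdated` by hand; nothing here looks it up. For the memory suggestion, Windows also ships its own tester, Windows Memory Diagnostic (`mdsched.exe`), which runs at the next reboot.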
 
I will try updating drivers and turning off the power management system.

Thank you.

I mentioned the RAM point because I was not sure whether a bad stick could cause a system to essentially brick itself. Is that possible?