I assembled this system about a year ago, with a few parts carried over from my previous one:
CPU: AMD Ryzen 5 3600
MoBo: MSI B450 Gaming Pro Carbon AC
GPU: NVIDIA Gigabyte RTX 2070 Super
Hard disk: Intel 660p
RAM: 2 * 8 GB Corsair LPX 3000
PSU: Corsair VS650 (bought in December 2017)
About a month ago, I started getting random app crashes (mostly Chrome or Plex) and BSODs. The app crashes were more frequent, roughly one every 10 minutes, while BSODs happened 2 or 3 times a day.
Most of the errors were IRQL_NOT_LESS_OR_EQUAL, but I often saw others such as MEMORY_MANAGEMENT, KERNEL_SECURITY_CHECK_FAILURE and SYSTEM_THREAD_EXCEPTION_NOT_HANDLED. All of them pointed to ntoskrnl.exe when viewed in BlueScreenView.
I ran MemTest86 for 4 hours with no errors, and also tried running with one RAM stick at a time, but I kept getting BSODs in each case, so I'm confident RAM is not the problem (I don't think both sticks would go bad at the same time).
I also tried reinstalling Windows, including on a different disk, but still got the same errors. The chkdsk command reported no errors either, so I'm confident there is no issue with the disk. I also ran Driver Verifier (verifier) for 24 hours, and there were in fact no BSODs during that time (but some app crashes). I ran Prime95 as well, without any errors.
I then realized that while Prime95 is running, none of my apps crash and there are no blue screens. The issues start again as soon as I stop the test. Of course that is not how I want to run my system, so I tried to debug other CPU-related issues. I noticed that there were no crashes under high load, but when the system went idle, the CPU voltage would be high. I also changed the power plan from "Ryzen Balanced" to the Windows "Balanced" plan, and the frequency of crashes decreased. However, there were still crashes, and they correlated with high CPU voltage. Finally, I modified my power plan to use a maximum of 99% of my CPU instead of 100%, and I have had no app crashes or BSODs at all for 2 days (I verified this with Windows Reliability Monitor). That said, even though the current workaround looks stable, I would rather not trade performance for stability, and I fear the crashes may come back in the future.
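The 99% cap described above can also be set from the command line instead of the power plan dialog. The following is a minimal sketch, assuming Windows' powercfg tool and its built-in aliases (SCHEME_CURRENT, SUB_PROCESSOR, PROCTHROTTLEMAX). The function names and the --apply flag are made up for illustration, and by default the script only prints the commands rather than running them.

```python
import subprocess
import sys


def build_powercfg_commands(max_percent=99):
    """Return the powercfg invocations that cap 'Maximum processor state'.

    Assumption: the powercfg aliases SCHEME_CURRENT, SUB_PROCESSOR and
    PROCTHROTTLEMAX are available (they are listed by `powercfg /aliases`).
    """
    if not 1 <= max_percent <= 100:
        raise ValueError("max_percent must be between 1 and 100")
    value = str(max_percent)
    return [
        # Cap while on AC power.
        ["powercfg", "/setacvalueindex", "SCHEME_CURRENT",
         "SUB_PROCESSOR", "PROCTHROTTLEMAX", value],
        # Cap while on battery (harmless on a desktop).
        ["powercfg", "/setdcvalueindex", "SCHEME_CURRENT",
         "SUB_PROCESSOR", "PROCTHROTTLEMAX", value],
        # Re-activate the current scheme so the new values take effect.
        ["powercfg", "/setactive", "SCHEME_CURRENT"],
    ]


def apply_cap(max_percent=99, dry_run=True):
    """Print each command; run it only when dry_run is False (Windows only)."""
    for cmd in build_powercfg_commands(max_percent):
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical flag: pass --apply to actually change the power plan.
    apply_cap(99, dry_run="--apply" not in sys.argv)
```

Calling apply_cap(100, dry_run=False) from an elevated prompt would undo the cap, which makes it easy to switch back and forth while checking whether the crashes return.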
It could also be caused by overheating, but I have only ever had BSODs, never sudden shutdowns. My CPU temperature goes as high as 87°C during stress tests, but usually stays around 60-70°C. The crashes also don't seem to be directly related to stress testing, so I suspect my CPU is failing to handle high voltages rather than high temperatures.
I don't have any issues with the GPU, so I doubt it is a PSU fault, but I can't tell whether the faulty part is the CPU, the MoBo or the PSU. I don't think any other part could be at fault. Should I get my CPU replaced under warranty? And will the claim be accepted, given that I don't seem to have any "concrete proof" of a faulty CPU?