[SOLVED] Is my ryzen 5 3600 defective?

Herr B

Commendable
May 29, 2020
179
36
1,690
Dear Community,

I just bought components for a server and am encountering many crashes.
The components are:
  • Corsair AX1600i
  • ASUS ROG Strix B450-F Gaming
  • 500 GB Samsung SSD (used)
  • 8 GB RAM (different kinds, used)
  • Ryzen 5 3600 (bought refurbished)
  • Nvidia GT 1030 for display output
What are the issues?
During installation of Ubuntu Server, I encountered crash after crash and freeze after freeze. Sometimes it would print a message, often related to the CPU or its L2 cache, sometimes to the RAM, and often a mix of other errors.
I also encountered one freeze while changing BIOS settings.

I booted from a live USB stick and ran Memtest. No errors.

I swapped between three RAM sticks in different configurations and memory slots. With one stick in a specific slot, I was able to get through the Ubuntu installation. In the terminal, I encountered a memory error (which was corrected).
So far the machine runs for up to a few hours, after which I can no longer reach it via SSH and my network infrastructure cannot see the device. All fans are still spinning and the GPU is hot.

At this point my best guess is a dead CPU.
I swapped between multiple RAM sticks and tried installing on two separate SSDs. The mainboard is brand new; the CPU, however, was bought used from a large electronics dealer. It had a "tested" seal on it. I could not find any issue with the pins on visual inspection after taking it out of its packaging.

What are your thoughts on this (before I open an RMA on the CPU)?
 
Have you updated BIOS on the motherboard and reset CMOS? If updating BIOS I see no reason not to go with the latest for a 3rd gen CPU which would be v4602, with AMD AM4 AGESA V2 PI 1.2.0.3 Patch C.

Be absolutely certain to reset CMOS whether or not you update BIOS. Do it with a battery pull and leave it out a minute or so while shorting the pins before reassembling.
 

Herr B

  • 8 GB Ram (Different kinds, used)
Are you mixing memory? If so try it with one stick.
Tried with all reasonable configurations (1 x 4 GB, 2 x 4 GB set, 1 x 8 GB) in different slots.


Have you updated BIOS on the motherboard and reset CMOS? If updating BIOS I see no reason not to go with the latest for a 3rd gen CPU which would be v4602, with AMD AM4 AGESA V2 PI 1.2.0.3 Patch C.

Be absolutely certain to reset CMOS whether or not you update BIOS. Do it with a battery pull and leave it out a minute or so while shorting the pins before reassembling.
I can check the BIOS version after work today. It should be relatively recent (at least recent enough for a 3rd gen processor).
I have already performed a CMOS reset.
I will not update the BIOS with a potentially broken CPU. I have ordered a replacement that I can test the components with. I also ordered 2 x 8 GB ECC memory to rule out that issue (though I don't believe three RAM sticks in a row would be defective).
 
....
(though I dont believe 3 bars of memory in a row would be defective)
Have you checked one DIMM at a time in each socket? It could be that only one channel is defective. Then it would be a matter of determining whether the CPU or the motherboard is at fault, as either the CPU socket or a DIMM socket could have a problem.
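
To make sure no combination gets skipped, the one-DIMM-at-a-time check can be written out as a simple test matrix. This is only a rough sketch: the stick labels and the slot names (A1, A2, B1, B2) are assumptions for illustration, so substitute whatever is printed on your board and modules.

```python
from itertools import product

# Hypothetical labels for the sticks on hand; rename to match your modules.
DIMMS = ["4 GB stick 1", "4 GB stick 2", "8 GB stick"]
# Assumed slot names for a four-slot, dual-channel board.
SLOTS = ["A1", "A2", "B1", "B2"]


def single_dimm_plan(dimms, slots):
    """Return every (dimm, slot) pair so each stick is tried alone in each slot."""
    return list(product(dimms, slots))


for number, (dimm, slot) in enumerate(single_dimm_plan(DIMMS, SLOTS), start=1):
    print(f"Run {number:2d}: install only '{dimm}' in slot {slot}, then note uptime and errors")
```

If failures follow a slot regardless of the stick, that points toward the board or CPU socket; if they follow a stick, toward the memory.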

And actually, I would suggest going to the latest BIOS update for a 3rd gen CPU too. The reason is that later BIOS versions have incorporated changes for better Win11 compatibility. But as long as you're on a known-stable BIOS right now, that can be delayed, since you're worried about hardware reliability during the update.
 

Herr B

At first, I tested with 2 x 4 GB DIMMs in different sockets. I thought the RAM might be defective, so I switched to 1 x 4 GB DIMM (it was what I had available at the time).
I checked each socket without much success.
Then I took the other 4 GB DIMM, and eventually the installation ran through in socket B2. The system crashed after 4 hours of uptime. I brought 1 x 8 GB DIMM to the office and installed it in socket A1. Once Ubuntu was installed, it would boot with each DIMM in each socket but only stay up for a few hours.

I prepared the RMA documents and ordered a replacement CPU in the meantime. I dropped it in just 30 minutes ago. So far it's still running. We'll see tomorrow whether it's still up.
Unfortunately, I can't install the ECC DIMMs I ordered. They are registered ECC, and Ryzen only supports unregistered ECC. Well, another lesson learned.

Edit:
Found these in my syslogs:

Code:
kernel: [  316.915286] mce: [Hardware Error]: Machine check events logged
kernel: [  316.915288] [Hardware Error]: Corrected error, no action required.
kernel: [  316.915301] [Hardware Error]: CPU:8 (17:71:0) MC0_STATUS[-|CE|MiscV|AddrV|-|-|SyndV|UECC|-|-|-]: 0x9c202000000c0135
kernel: [  316.915312] [Hardware Error]: Error Addr: 0x000000010b081bf8
kernel: [  316.915317] [Hardware Error]: IPID: 0x000000b000000000, Syndrome: 0x0000003f1a1b2f00
kernel: [  316.915323] [Hardware Error]: Load Store Unit Ext. Error Code: 12, DC Data error type 1 and poison consumption.
kernel: [  316.915330] [Hardware Error]: cache level: L1, tx: DATA, mem-tx: DRD
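
For anyone who wants to read the raw MC0_STATUS value directly, here is a minimal sketch that turns the 64-bit number into flag names. The bit positions are assumptions based on how I understand the Linux kernel's MCE definitions for AMD; check them against AMD's documentation before relying on them.

```python
# Assumed bit positions (from my reading of the Linux kernel's x86 MCE
# definitions for AMD); verify against AMD's docs before trusting them.
STATUS_BITS = {
    "Val": 63, "Over": 62, "UC": 61, "En": 60, "MiscV": 59, "AddrV": 58,
    "PCC": 57, "TCC": 55, "SyndV": 53, "CECC": 46, "UECC": 45,
    "Deferred": 44, "Poison": 43,
}


def decode_mc_status(status: int) -> list[str]:
    """Return the names of all flags whose bit is set in the status value."""
    return [name for name, bit in STATUS_BITS.items() if (status >> bit) & 1]


# The two distinct status values that appear in the syslog excerpts.
for raw in ("0x9c202000000c0135", "0xdc202000000c0135"):
    print(raw, decode_mc_status(int(raw, 16)))
```

Under those assumptions, both values have the UC bit clear, which fits the "Corrected error" line, and the second value differs only in having Over set, meaning another error was logged before the previous one was read out.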


Code:
systemd[1]: Reloading.
kernel: [ 1562.128252] mce: [Hardware Error]: Machine check events logged
kernel: [ 1562.128254] [Hardware Error]: Corrected error, no action required.
kernel: [ 1562.128654] [Hardware Error]: CPU:2 (17:71:0) MC0_STATUS[-|CE|MiscV|AddrV|-|-|SyndV|UECC|-|-|-]: 0x9c202000000c0135
kernel: [ 1562.129043] [Hardware Error]: Error Addr: 0x000000013a896bc0
kernel: [ 1562.129420] [Hardware Error]: IPID: 0x000000b000000000, Syndrome: 0x0000003f1a1b2f00
kernel: [ 1562.129801] [Hardware Error]: Load Store Unit Ext. Error Code: 12, DC Data error type 1 and poison consumption.
kernel: [ 1562.130200] [Hardware Error]: cache level: L1, tx: DATA, mem-tx: DRD
kernel: [ 1562.130624] mce: [Hardware Error]: Machine check events logged
kernel: [ 1562.130625] [Hardware Error]: Corrected error, no action required.
kernel: [ 1562.131077] [Hardware Error]: CPU:8 (17:71:0) MC0_STATUS[Over|CE|MiscV|AddrV|-|-|SyndV|UECC|-|-|-]: 0xdc202000000c0135
kernel: [ 1562.131543] [Hardware Error]: Error Addr: 0x0000000112aeebdc
kernel: [ 1562.131985] [Hardware Error]: IPID: 0x000000b000000000, Syndrome: 0x0000003f1a1b2f03
kernel: [ 1562.132463] [Hardware Error]: Load Store Unit Ext. Error Code: 12, DC Data error type 1 and poison consumption.
kernel: [ 1562.132914] [Hardware Error]: cache level: L1, tx: DATA, mem-tx: DRD
systemd[1]: message repeated 3 times: [ Reloading.]

Code:
kernel: [ 3118.640366] mce: [Hardware Error]: Machine check events logged
kernel: [ 3118.640369] [Hardware Error]: Corrected error, no action required.
kernel: [ 3118.640748] [Hardware Error]: CPU:2 (17:71:0) MC0_STATUS[-|CE|MiscV|AddrV|-|-|SyndV|UECC|-|-|-]: 0x9c202000000c0135
kernel: [ 3118.641113] [Hardware Error]: Error Addr: 0x0000000138f8ebe0
kernel: [ 3118.641481] [Hardware Error]: IPID: 0x000000b000000000, Syndrome: 0x0000003f1a1b2f00
kernel: [ 3118.641958] [Hardware Error]: Load Store Unit Ext. Error Code: 12, DC Data error type 1 and poison consumption.
kernel: [ 3118.642462] [Hardware Error]: cache level: L1, tx: DATA, mem-tx: DRD

Code:
moneyheist kernel: [ 3429.942698] mce: [Hardware Error]: Machine check events logged
moneyheist kernel: [ 3429.942701] [Hardware Error]: Corrected error, no action required.
moneyheist kernel: [ 3429.943446] [Hardware Error]: CPU:2 (17:71:0) MC0_STATUS[-|CE|MiscV|AddrV|-|-|SyndV|UECC|-|-|-]: 0x9c202000000c0135
moneyheist kernel: [ 3429.944612] [Hardware Error]: Error Addr: 0x000000013e206bc0
moneyheist kernel: [ 3429.945782] [Hardware Error]: IPID: 0x000000b000000000, Syndrome: 0x0000003f1a1b2f03
moneyheist kernel: [ 3429.946971] [Hardware Error]: Load Store Unit Ext. Error Code: 12, DC Data error type 1 and poison consumption.
moneyheist kernel: [ 3429.948142] [Hardware Error]: cache level: L1, tx: DATA, mem-tx: DRD
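
To see at a glance which logical CPUs and MCA banks the errors come from, the status lines can be tallied with a short script. This is a sketch, not a full MCE parser: the regular expression is an assumption fitted to the lines pasted above, and SAMPLE holds only the five status lines from those excerpts.

```python
import re
from collections import Counter

# The five MC0_STATUS lines from the excerpts above, copied verbatim.
SAMPLE = """\
kernel: [  316.915301] [Hardware Error]: CPU:8 (17:71:0) MC0_STATUS[-|CE|MiscV|AddrV|-|-|SyndV|UECC|-|-|-]: 0x9c202000000c0135
kernel: [ 1562.128654] [Hardware Error]: CPU:2 (17:71:0) MC0_STATUS[-|CE|MiscV|AddrV|-|-|SyndV|UECC|-|-|-]: 0x9c202000000c0135
kernel: [ 1562.131077] [Hardware Error]: CPU:8 (17:71:0) MC0_STATUS[Over|CE|MiscV|AddrV|-|-|SyndV|UECC|-|-|-]: 0xdc202000000c0135
kernel: [ 3118.640748] [Hardware Error]: CPU:2 (17:71:0) MC0_STATUS[-|CE|MiscV|AddrV|-|-|SyndV|UECC|-|-|-]: 0x9c202000000c0135
moneyheist kernel: [ 3429.943446] [Hardware Error]: CPU:2 (17:71:0) MC0_STATUS[-|CE|MiscV|AddrV|-|-|SyndV|UECC|-|-|-]: 0x9c202000000c0135
"""

# Captures the logical CPU number, the MCA bank number and the raw status value.
PATTERN = re.compile(r"CPU:(\d+) \(.*?\) MC(\d+)_STATUS\[.*?\]: (0x[0-9a-f]+)")


def summarize(log_text):
    """Count machine-check status lines per logical CPU and per MCA bank."""
    per_cpu, per_bank = Counter(), Counter()
    for match in PATTERN.finditer(log_text):
        per_cpu[int(match.group(1))] += 1
        per_bank[int(match.group(2))] += 1
    return per_cpu, per_bank


per_cpu, per_bank = summarize(SAMPLE)
print("events per CPU:", dict(per_cpu))
print("events per bank:", dict(per_bank))
```

On the running system the same function could be fed the full kernel log, for example the output of `journalctl -k`. All five events here come from CPU 2 and CPU 8 in bank 0. If those two logical CPUs turn out to be SMT siblings of one physical core (which `/sys/devices/system/cpu/cpu2/topology/thread_siblings_list` should show), that would point at a single faulty core rather than the RAM.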

Edit 2:
The BIOS will be updated the day after tomorrow, when I have time to access the system physically again.
 