Question PC often hard-freezing shortly after reboot?

Jan 18, 2022
I have a PC, self-built from components, that hard-freezes, seemingly at random. Sometimes it reboots spontaneously, sometimes it doesn't. Sometimes the mouse pointer keeps working for a few seconds while everything else is unresponsive, then everything freezes. Other times everything freezes right away.
I tried Linux with the console on a serial port: nothing is printed on the serial line when the freeze occurs (I was hoping for a kernel panic message).

The strange thing is that it seems to occur mostly within a few hours after a reboot. If the PC survives the first few hours, it can go on for days and days, and even be suspended and resumed, until the next kernel upgrade that forces a full reboot.
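To put numbers on "a few hours after a reboot", one option would be a small heartbeat logger that survives a hard freeze. This is only a rough sketch: the function names, the 10-second interval and the log path are my own assumptions, not something I have validated on this machine. It appends a UTC timestamp to a file and forces it to disk each time, so the last line gives an upper bound on when the freeze happened.

```python
import os
import time
from datetime import datetime, timezone
from typing import Optional


def write_heartbeat(path: str) -> str:
    """Append one UTC timestamp to `path` and force it to disk."""
    stamp = datetime.now(timezone.utc).isoformat(timespec="seconds")
    with open(path, "a", encoding="utf-8") as f:
        f.write(stamp + "\n")
        f.flush()
        os.fsync(f.fileno())  # ask the OS to flush to disk before continuing
    return stamp


def run(path: str, interval_s: float = 10.0, beats: Optional[int] = None) -> None:
    """Write a heartbeat every `interval_s` seconds; beats=None runs forever."""
    count = 0
    while beats is None or count < beats:
        write_heartbeat(path)
        count += 1
        time.sleep(interval_s)


# Example (hypothetical path, runs until the machine freezes or is stopped):
# run("/var/tmp/heartbeat.log")
```

After a freeze and reboot, the last timestamp in the file compared with the previous boot time gives the approximate time-to-freeze, accurate to about one interval.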
I tried other OSes, namely the NomadBSD FreeBSD live distribution, and the problem exists there as well.

Any suggestions?
 
PSU: Cooler Master MasterWatt Lite 500
CPU: AMD Ryzen 7 1700
MB: Gigabyte GA-AB350-GAMING
Memory: Crucial DDR4 16 GB 2666 MHz
SSD: Transcend MTS800 256 GB (M.2 connection)
GPU: ASUS EX-RX570-O4G (Radeon RX 570)
 
I have to bump this thread: I replaced the PSU with a Seasonic G12 GC 650W, which has a higher power rating than the previous one and is listed as good quality at https://cultists.network/140/psu-tier-list/ , but the symptoms persist.

The only suspicious thing is a kernel warning (an unhandled-interrupt report with a stack trace, not an actual oops) that appears every time Linux boots:
Code:
[    1.729568] irq 7: nobody cared (try booting with the "irqpoll" option)
[    1.729671] CPU: 3 PID: 0 Comm: swapper/3 Not tainted 5.19.0-29-generic #30-Ubuntu
[    1.729674] Hardware name: Gigabyte Technology Co., Ltd. AB350-Gaming/AB350-Gaming-CF, BIOS F6 08/21/2017
[    1.729677] Call Trace:
[    1.729678]  <IRQ>
[    1.729680]  show_stack+0x4e/0x61
[    1.729685]  dump_stack_lvl+0x4a/0x6f
[    1.729690]  dump_stack+0x10/0x18
[    1.729693]  __report_bad_irq+0x3a/0xbb
[    1.729696]  note_interrupt.cold+0xb/0x5c
[    1.729698]  handle_irq_event+0x79/0x80
[    1.729703]  handle_fasteoi_irq+0x7d/0x1d0
[    1.729705]  __common_interrupt+0x56/0xf0
[    1.729709]  common_interrupt+0x9f/0xb0
[    1.729712]  </IRQ>
[    1.729713]  <TASK>
[    1.729714]  asm_common_interrupt+0x27/0x40
[    1.729717] RIP: 0010:native_safe_halt+0xb/0x10
[    1.729721] Code: 51 2d 00 4c 89 ee 48 c7 c7 40 50 a5 95 e8 ed 70 84 ff eb c4 cc cc cc cc cc cc cc cc cc cc cc eb 07 0f 00 2d 69 06 4e 00 fb f4 <e9> 60 51 2d 00 eb 07 0f 00 2d 59 06 4e 00 f4 e9 51 51 2d 00 cc 0f
[    1.729724] RSP: 0018:ffff97470015fdc0 EFLAGS: 00000246
[    1.729726] RAX: 0000000000004000 RBX: ffff894f0191bc64 RCX: 0000000000000000
[    1.729728] RDX: 0000000000000001 RSI: ffff894f0191bc00 RDI: 0000000000000001
[    1.729729] RBP: ffff97470015fdd0 R08: 0000000000000000 R09: 0000000000000000
[    1.729730] R10: 0000000000000000 R11: 0000000000000000 R12: ffff894f0191bc64
[    1.729732] R13: 0000000000000003 R14: ffffffff95cc54c0 R15: ffff89520ecc0000
[    1.729734]  ? acpi_idle_do_entry+0x82/0xc0
[    1.729737]  acpi_idle_enter+0xbb/0x180
[    1.729740]  cpuidle_enter_state+0x9a/0x650
[    1.729745]  cpuidle_enter+0x2e/0x50
[    1.729748]  call_cpuidle+0x23/0x60
[    1.729751]  cpuidle_idle_call+0x11d/0x190
[    1.729753]  do_idle+0x82/0x100
[    1.729755]  cpu_startup_entry+0x1d/0x20
[    1.729757]  start_secondary+0x122/0x160
[    1.729760]  secondary_startup_64_no_verify+0xe5/0xeb
[    1.729765]  </TASK>
[    1.729766] handlers:
[    1.729800] [<000000001baa5ca2>] amd_gpio_irq_handler
[    1.729877] Disabling IRQ #7
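
To check whether this report shows up on every boot, and whether it always involves the same IRQ and handler, the kernel log can be scanned automatically. The following is a minimal sketch, assuming the log has the same layout as the excerpt above; the function name `find_unhandled_irqs` and both regular expressions are my own, not part of any kernel tool.

```python
import re
from typing import Dict, List, Optional

# Matches e.g. "[    1.729568] irq 7: nobody cared (try booting ...)"
NOBODY_CARED = re.compile(r"irq (\d+): nobody cared")
# Matches handler lines such as "[<000000001baa5ca2>] amd_gpio_irq_handler"
HANDLER = re.compile(r"\[<[0-9a-f]+>\]\s+(\S+)")


def find_unhandled_irqs(log_text: str) -> Dict[int, List[str]]:
    """Map each IRQ reported as 'nobody cared' to its listed handlers."""
    result: Dict[int, List[str]] = {}
    current: Optional[int] = None
    in_handlers = False
    for line in log_text.splitlines():
        m = NOBODY_CARED.search(line)
        if m:
            current = int(m.group(1))
            result.setdefault(current, [])
            in_handlers = False
            continue
        if current is None:
            continue
        if "handlers:" in line:
            in_handlers = True
            continue
        if in_handlers:
            h = HANDLER.search(line)
            if h:
                result[current].append(h.group(1))
            else:
                # A non-handler line (e.g. "Disabling IRQ #7") ends the list.
                in_handlers = False
                current = None
    return result
```

Fed the text of the log above (for example the saved output of `dmesg`), it should report IRQ 7 with `amd_gpio_irq_handler` as the only registered handler.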