Question Fix RAID m2 Genie Error after mobo replacement z490 MSI CArbon WiFi

Mar 10, 2024
7
2
15
I had to replace my z490 Carbon Wifi mobo with another one (used/2nd hand, same model) after a power fluctuation event, but now I am continuously getting an error when trying to fix my RAID setup.

I have 2 RAID (mirror) volumes set up:
- Samsung 870 EVO (SATA), works fine
- Samsung 970 Evo Plus (m2), works fine with only one M2 plugged in.

If I plug in the 2nd drive in the m2 socket, the problems appear. I am getting an error at boot saying M2 Genie error, Press F1

What I tried so far w/o any success:
- Updated BIOS to the latest
- Unplugged all the SATA drives
- Tried to switch the SATA Mode to AHCI but after reboot they are switched back to [RAID/Optane Mode]
- Messing arround with the M2 RST PCIE Remapping but, again, the settings get reverted after reboot
- Intel RST shows my RAID volume as degraded (which is obvious) and 1 NON-RAID physical disk which is the new M2

I want to be able to add the new M2 in the RAID volume and then sync it but I'm not seeing any option, or I'm missing something in the manual

View: https://imgur.com/a/fet6uhc



Thanks for all your help.
 
Last edited:
A RAID 1 with a 870 EVO (SATA) and 970 EVO Plus (NVMe) is sooo much of a bad idea.

But anyway....
This wasn't the OS drive, hopefully?
The system 'works' with only one of the drives connected?
How was this RAID crated? Software, motherboard, other?
Hi,

I have two different RAID1 (mirror) volumes created via UEFI:
- 1st volume is 2x970 EVO Plus disks, booting Linux from here
- 2nd volume is 2x870 EVO disks

I managed to fix the booting process after the mobo replacement

Are the image links better now?
 
Given a proper backup routine, RAID 1 is rarely needed. It only protects against physical drive fail. Not actual data protection.

But given this is motherboard RAID, you'll likely have to rebuild the array with the new motherboard.
I understand the RAID theory. My problem right now is strictly related to RAID implementations on the z490 chipset or this particular mobo or family, and I was hoping that someone who knows more about this chipset or RAID implementations under this chipset knows something that I am missing.

I highly appreciate your help but I am not interested in how to implement backup/redundancy solutions, I just want to know why I am getting the m2 Genie error and how can I add the disk back into the volume w/o risking loss of the data. Rebuilding the array will result in loss of data, and I don't see any option to add it in the array. I had a disk failure in he past, replaced the failed disk and m2 Genie picked it up automagically. I had only to sync the array. Now I can't see any option that will allow me to add the 2nd M.2 disk in the array and I don't understand why as there aren't any logs or any information displayed describing the reason of the error
 
Last edited:
Is the replacement drive exactly the same capacity or larger than the old drive? If it's even a few tens of kilobytes smaller, you probably won't be able to rebuild the array.

https://www.gigabyte.com/WebPage/496/raid-2b-1.html

If the old and new drives are different sizes and you don't have a backup (why not?), copy all the data off the old drive, then wipe it and rebuild a new, slightly smaller array.

When the new array is configured, restore the data and keep the backup for the next failure.

The difference between the old and new arrays might only be a couple of MB, so you won't lose any appreciable space. The new array will take on the same capacity as the smallest drive, if you stick with default settings.

Never mess with a broken array unless you've got a backup of all data. You're just asking for trouble. RAID is not a backup.
 
I understand the RAID theory. My problem right now is strictly related to RAID implementations on the z490 chipset or this particular mobo or family, and I was hoping that someone who knows more about this chipset or RAID implementations under this chipset knows something that I am missing.

I highly appreciate your help but I am not interested in how to implement backup/redundancy solutions, I just want to know why I am getting the m2 Genie error and how can I add the disk back into the volume w/o risking loss of the data. Rebuilding the array will result in loss of data, and I don't see any option to add it in the array. I had a disk failure in he past, replaced the failed disk and m2 Genie picked it up automagically. I had only to sync the array. Now I can't see any option that will allow me to add the 2nd M.2 disk in the array and I don't understand why as there aren't any logs or any information displayed describing the reason of the error
Sorry, I should have been more verbose.

Since the single drive appears to work:
Copy ALL data off to some other device.
Slot in the second drive.
Rebuild the array completely.
Copy the data back into the RAID 1.

I am entirely unsure of a 100% safe way to cause it to recognize the second drive in situ, with 0% possibility of data loss.
 
Is the replacement drive exactly the same capacity or larger than the old drive? If it's even a few tens of kilobytes smaller, you probably won't be able to rebuild the array.

https://www.gigabyte.com/WebPage/496/raid-2b-1.html

If the old and new drives are different sizes and you don't have a backup (why not?), copy all the data off the old drive, then wipe it and rebuild a new, slightly smaller array.

When the new array is configured, restore the data and keep the backup for the next failure.

The difference between the old and new arrays might only be a couple of MB, so you won't lose any appreciable space. The new array will take on the same capacity as the smallest drive, if you stick with default settings.

Never mess with a broken array unless you've got a backup of all data. You're just asking for trouble. RAID is not a backup.
Hi,

I wish you would've read the thread & initial description. I might of pulled some extra hours this week and my description is perhaps poorly written but I fail to identify where I said that the 970 EVOs are different sized disks or that I have an Gigabyte mobo. All the hardware is the same before the power problems and it used to work fine with the previous mobo.

"If the old and new drives are different sizes and you don't have a backup (why not?), copy all the data off the old drive, then wipe it and rebuild a new, slightly smaller array."

As I previously said this is among other things a Kubernetes master (something that you can't "backup" really easy). The entire cluster is running some AI/ML workloads (mostly for learning purposes) taking advantage of the GPU, something that I can't afford running in the Cloud even if it made the operational/redundancy part easier.
 
Last edited:
my description is perhaps poorly written but I fail to identify where I said that the 970 EVOs are different sized disks or that I have an Gigabyte mobo
Have you actually checked both 970 EVOs and confirmed they are exactly the same size, down to the last byte? It's one reason why resilvering fails after replacing a drive in a failed array. SSD manufacturers sometimes keep the same model number, but fit different RAM chips and controllers which can affect the total number of bytes.

The only reason I linked to the Gigabyte web site was because they mention you need identically sized disks when replacing a failed drive. This applies regardless of who manufactured your motherboard. Sorry for the confusion it caused.

As I previously said this is among other things a Kubernetes master (something that you can't "backup" really easy).
In that case it might be wise to perform a "bit perfect" clone of the working drive to another drive, before messing around any further. That way if things go badly wrong and you wipe the working drive, you won't have lost everything. I'd be inclined to put the working drive to one side and try to rebuild the array with the clone.

Alternatively, contact a professional recovery agency, throw large quantities of money at them and let the experts recover your system. How much is recovery worth to you? Can you start from scratch instead if professional help is too expensive?

If you cannot backup a Kubernetes master easily, what will you do to stop this happening again? It seems bad practice to trust anything to a RAID system that cannot be duplicated or backed up somehow. A single point of failure, or too many eggs in one basket?

Best of luck!