Intel RAID

Issues related to hardware problems
Post Reply
furiousmurphy
Posts: 5
Joined: 2018/03/02 04:10:06

Intel RAID

Post by furiousmurphy » 2018/03/02 04:26:11

Hello all,

I have a CentOS 6 server with onboard Intel RAID 5 with 5 Seagate server grade 500GB hard drives. The first drive (drive0 of the array) has failed (SMART message on power up)and as-is the server will no longer boot up. I am assuming that the boot sector used in this configuration is located on the first drive of the array, thus causing this behavior. All other 4 drives report that they are in good condition.

I received an exact model this week for a replacement, I was wondering if swapping out the drives and booting to the CentOS Live disk would work or be the best option to rebuild the array with the new disk. I have found a bit of info using Google, but none that seems to apply to my specific situation, and I really don't want to lose the current array by making a bad decision. Anyone out there had this experience? Any advice is appreciated.

furiousmurphy
Posts: 5
Joined: 2018/03/02 04:10:06

Re: Intel RAID

Post by furiousmurphy » 2018/03/02 06:11:48

The motherboard is a SuperMicro MBD-X9SCL-F LGA 1151 Xeon-E3 with a Intel C202 / Nuvoton NCT6776F chipset if it helps anyone. Not sure if it really matters in this instance, but running CentOS 6 64 bit installed over 4 yrs ago - just before the release of CentOS 7. I do know that there is a correct driver on the Live disk, as the RAID was properly recognized when the OS was installed.

I forgot to mention that in my original post.

furiousmurphy
Posts: 5
Joined: 2018/03/02 04:10:06

Re: Intel RAID

Post by furiousmurphy » 2018/03/02 17:52:25

Information on Intel site concerning this kind of scenario, but on a Windows OS

Turn on the system.
Click Start.
Click All Programs.
Click Intel.
Click Intel® Rapid Storage Technology.
Click Rebuild to another disk.
Select the replacement hard drive and click Rebuild.

Is there something similar that is part of the Live CentOS 6 disk?

Help??

User avatar
TrevorH
Forum Moderator
Posts: 22762
Joined: 2009/09/24 10:40:56
Location: Brighton, UK

Re: Intel RAID

Post by TrevorH » 2018/03/02 18:00:12

Almost every linux user that I know of will run screaming from a FakeRAID setup.
CentOS 5 died in March 2017 - migrate NOW!
Full time Geek, part time moderator. Use the FAQ Luke

furiousmurphy
Posts: 5
Joined: 2018/03/02 04:10:06

Re: Intel RAID

Post by furiousmurphy » 2018/03/02 18:12:46

Thank you for your input, but that isn't really helpful. I was looking for information and personal experience, not opinions.

I am well aware of horror stories and nightmares associated with software, hardware, and 'firmware' implementations. It does not seem that there is a perfect foolproof implementation of RAID.

furiousmurphy
Posts: 5
Joined: 2018/03/02 04:10:06

Re: Intel RAID

Post by furiousmurphy » 2018/03/27 14:09:12

After research and experience, the following would have worked had the drives still been listed as 'online member' disks. As Intel RAID driver modules are included with the Live DVD, replacing the failed drive, booting the system with the Live disk, the drive would be rebuilt as the disk monitor aspect of the driver module would detect the new drive and start to repair the array automatically.

In my case, I had a mishap where I had no response from anyone here on the forum and I wasn't having any luck finding the information that applied to my situation. The drives ended up in a 'offline member' condition after I had booted the Live DVD on the system, and was poking around looking for a manual option to trigger the rebuild. The rebuild operation seemed to have started automatically without me realizing it, and I ended up shutting down the system before it had time to complete. This is what seemed to cause the 'offline member' condition on the RAID configuration boot screen.

So the procedure would be:
Remove the failed drive
Replace with the new drive (do nothing at the RAID boot config screen, just hit whatever key is promted to continue)
Boot the machine (to the installed OS if bootable, or use a Live DVD if it isn't)
Log in and walk away from the damn thing (make sure that power settings don't allow it to go to sleep)
Go home or watch TV if you are already home, drink a beer, do whatever, but leave the machine alone, then go to sleep, go back and make sure the operation was successful the next morning.

I hope this saves someone a headache down the road, and BTW this seems to apply to both CentOS 6 and 7.
Thanks for all the help everyone.

Post Reply