January 27, 2004
System drive lost
Tags: EYEWI

EYEWI suffered a system drive failure on Saturday morning, and Monday afternoon and evening I replaced the drive and rebuilt the system.

First off, make sure your volume manager really is mirroring all the partitions on the system.

Recovery proceeded fairly straightforwardly. Since only the root partition had actually been mirrored, the system would not boot with the second drive. I used one of the spare drives from ROJ's AMANDA installation to copy the root partition of the surviving drive and to test restoring the /home partition from the dump that happens with every successful NetBackup catalog backup.

I Jumpstarted the system with the surviving drive in its original location (slot 1) and the other of ROJ's spare AMANDA drives in slot 0. After Jumpstart, I restored the /home partition from the catalog tape, moved /home/opt to a temporary location, and installed NetBackup (in order to get the appropriate files in /etc). I moved the new /home/opt out of the way and moved the old /home/opt into position, and NetBackup seemed almost happy.

Details that needed to be corrected: the Dell PowerVault 110T and the Overland tape library had swapped device files - /dev/rmt/0* now points to the tape library wile /dev/rmt/1* points to the 110T. I needed to use the Java GUI on NetBackup to change the device files for the two storage units. I had also originally forgotten to increase the shared memory configuration in /etc/system, and thus backups failed with error code 11 (failed system call).

There are still a few other details to be worked out, but the Jumpstart and /home restore seem to have gotten the bulk of it.

Posted by Rowan Littell at January 27, 2004 11:17 AM, updated 08:38 AM November 03, 2005
Comments
Post a comment
Name:


Email Address:


URL:


Comments:


Remember info?