raid
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionLast revisionBoth sides next revision | ||
raid [2009/11/17 05:59] – 172.26.0.166 | raid [2010/09/15 18:04] – aorth | ||
---|---|---|---|
Line 2: | Line 2: | ||
We have two RAIDs on the HPC | We have two RAIDs on the HPC | ||
* Linux kernel software RAID | * Linux kernel software RAID | ||
- | * 3mware | + | * 3ware hardware RAID |
==== Drive numbering ==== | ==== Drive numbering ==== | ||
Line 50: | Line 50: | ||
| | ||
unused devices: < | unused devices: < | ||
- | |||
==== Repair RAID ==== | ==== Repair RAID ==== | ||
- | When a disk is failing | + | When a disk is failing you need to replace the drive. |
- | < | + | |
- | In that case you need to replace the drive. | + | |
< | < | ||
Personalities : [raid1] [raid0] | Personalities : [raid1] [raid0] | ||
Line 74: | Line 71: | ||
unused devices: < | unused devices: < | ||
- | Because it is ''/ | + | If ''/ |
< | < | ||
# mdadm /dev/md1 --fail /dev/hda3 --remove /dev/hda3 | # mdadm /dev/md1 --fail /dev/hda3 --remove /dev/hda3 | ||
# mdadm /dev/md3 --fail /dev/hda1 --remove /dev/hda1 | # mdadm /dev/md3 --fail /dev/hda1 --remove /dev/hda1 | ||
# mdadm /dev/md4 --fail /dev/hda6 --remove / | # mdadm /dev/md4 --fail /dev/hda6 --remove / | ||
+ | ''/ | ||
+ | < | ||
+ | # mdadm --stop / | ||
<note warning> You must Shutdown the server before you physically remove the drive! </ | <note warning> You must Shutdown the server before you physically remove the drive! </ | ||
Shut the server down and replace the faulty drive with a new one. After booting your drive letters may have shifted around, so just be sure to verify which is which before proceeding. | Shut the server down and replace the faulty drive with a new one. After booting your drive letters may have shifted around, so just be sure to verify which is which before proceeding. | ||
Line 91: | Line 91: | ||
/dev/sdc: msdos partitions 1 | /dev/sdc: msdos partitions 1 | ||
</ | </ | ||
- | You can now add the new partitions back to the arrays: | + | Re-create the scratch partition (RAID0): |
+ | < | ||
+ | # mkfs.ext3 /dev/md2 | ||
+ | # mount /dev/md2 / | ||
+ | You can now add the new partitions back to the RAID1 arrays: | ||
< | < | ||
# mdadm /dev/md1 --add /dev/hdd3 | # mdadm /dev/md1 --add /dev/hdd3 | ||
Line 116: | Line 120: | ||
| | ||
unused devices: < | unused devices: < | ||
- | Clearing any previous raid info on a disk (eg. reusing a disk from another decommissioned raid array) | ||
- | |||
- | # mdadm --zero-superblock /dev/hdc1 | ||
- | Adding a disk to an array | ||
- | |||
- | # mdadm --add /dev/md0 /dev/hdc1 | ||
- | |||
- | |||
- | === To Do list: === | ||
- | |||
- | |||
- | Prepare written instructions on how to repair disk arrays. | ||
- | |||
- | What disks to we have? | ||
- | |||
- | Add extra spare disks? | ||
- | |||
- | How do you know which physical disk is broken to replace it? | ||
- | |||
- | f | ||
- | |||
===== Hardware RAID ===== | ===== Hardware RAID ===== | ||
- | A 3ware 9500S SATA RAID card using the 3w-9xxx kernel module. | + | A 3ware 9500S-12 SATA RAID card using the 3w-9xxx kernel module. |
==== Physical Disk Layout ==== | ==== Physical Disk Layout ==== |
raid.txt · Last modified: 2010/09/19 23:58 by aorth