raid
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionNext revisionBoth sides next revision | ||
raid [2009/08/27 09:14] – 172.26.0.166 | raid [2009/09/29 15:02] – 172.26.0.166 | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | === HPC RAID array === | + | ===== RAID ===== |
- | The storage | + | We have two RAIDs on the HPC |
+ | * Linux kernel software | ||
+ | * 3mware hardware RAID | ||
+ | ==== Drive numbering ==== | ||
- | It is currently reporting a degraded array: | + | If you're looking at the front of the HPC you'll see four rows of drives. |
+ | * Rows 0 - 2 are SATA, connected to the hardware 3ware RAID card | ||
+ | * Row 3 are IDE | ||
- | < | + | ===== Software RAID ===== |
- | Personalities : [raid0] [raid1] | + | The Linux kernel has the '' |
- | md1 : active raid1 hda1[0] | + | |
- | | + | Here is information on their configuration: |
+ | |||
+ | < | ||
+ | /dev/md0 on / type ext3 (rw) | ||
+ | /dev/md3 on /boot type ext3 (rw) | ||
+ | /dev/md2 on /scratch type ext3 (rw) | ||
+ | /dev/md1 on /export type ext3 (rw) | ||
+ | # df -h | grep md | ||
+ | / | ||
+ | / | ||
+ | / | ||
+ | / | ||
+ | |||
+ | It should be noted that ''/ | ||
+ | < | ||
+ | Filename Type Size Used Priority | ||
+ | / | ||
+ | |||
+ | A snapshot of the software RAID's health: | ||
+ | |||
+ | < | ||
+ | Personalities : [raid1] [raid0] | ||
+ | md3 : active raid1 hdd1[1] | ||
+ | | ||
| | ||
- | md3 : active raid1 hdc3[1] hda3[0] | + | md1 : active raid1 hdd3[1] hda3[0] |
- | | + | |
| | ||
- | md2 : active | + | md2 : active |
- | | + | |
| | ||
- | md0 : active | + | md4 : active |
- | | + | |
| | ||
- | unused devices: < | + | md0 : active raid1 hdd2[1] hda2[0] |
+ | 30716160 blocks [2/2] [UU] | ||
+ | |||
+ | unused devices: < | ||
+ | |||
+ | === To Do list: === | ||
+ | |||
+ | |||
+ | Prepare written instructions on how to repair disk arrays. | ||
+ | |||
+ | What disks to we have? | ||
+ | |||
+ | Add extra spare disks? | ||
+ | |||
+ | How do you know which physical disk is broken to replace it? | ||
+ | |||
+ | |||
+ | ===== Hardware RAID ===== | ||
+ | |||
+ | There is a utility, tw_cli, which can be used to control the hardware raid. The hardware RAID has three arrays, all RAID 5. Each " | ||
+ | |||
+ | | 8 | 9 | 10 | 11 | | ||
+ | | 4 | 5 | 6 | 7 | | ||
+ | | 0 | 1 | 2 | 3 | | ||
+ | |||
+ | Study the output of '' | ||
+ | * Which controller is active? (c0, c1, etc) | ||
+ | * Which unit is degraded? (u0, u1, u2, etc) | ||
+ | * Which | ||
+ | Remove the faulty port: | ||
+ | < | ||
+ | Insert a new drive and rescan: | ||
+ | < | ||
+ | Rebuild the degraded array: | ||
+ | < | ||
+ | Check the status of the rebuild by monitoring ''/ |
raid.txt · Last modified: 2010/09/19 23:58 by aorth