LSI (Mylex) RAID Engines

Top  Previous  Next

If you are using an external fibre channel RAID subsystem that incorporates a Mylex family engine, the software can provide details on the health of the devices as well as return event log entries that are maintained by the RAID controller. The supported engines are members of the DAC960 family and include models FF, FF2, FFx, and FFx2.  These engines are also known as the SANArray Pro family. You must be running 7.0 firmware or higher.

 

If you are not sure what RAID engine you are using, you should ask your RAID vendor or try sending one of the -Z options to a logical disk in the RAID subsystem and see if you get any results.

 

When you supply any of the -Z command line options, you instruct SMARTMon-UX to send Mylex vendor-specific commands to query the RAID engine and report the desired information. If you send them to a non-Mylex controller, the commands will be rejected by the device and no RAID information will be returned. All of the commands can be sent at any time to the RAID engine and are non-destructive (see notes on the -ZL, -ZA and -ZM options). If you are running extremely heavy I/O, it may take several minutes for these options to complete.

 

Command Options

The -Z option (note the case.  -Z is for LSI/Mylex family, while -z is for LSI/Engenio family) displays a summary of all of the physical disks installed in the subsystem that are known to the RAID controller. In addition it will display information on all of the logical devices which are defined.

 

smartmon-ux -Z \\.\PHYSICALDRIVE5

SMARTMon-ux [Release 1.16, Build 27-DEC-2002] - Copyright 2002 SANtools, Inc. http://www.SANtools.com

Discovered MYLEX DACARMRB247240T5 S/N " " on \\.\PHYSICALDRIVE5 (SMART unsupported) [Adapter/ID.LUN=4/3.31](247239 MB)

This is a RAID Controller model "DAC960FFx" with 128 MB of RAM running firmware revision 7.70.

 Physical Device Dump:

  SEAGATE ST336605FC [0004] S/N=3FP00B1P 20:00:00:20:37:e6:0f:48 71132960 Blocks at 0:05h [ONLINE]

  SEAGATE ST336605FC [0004] S/N=3FP017BV 20:00:00:20:37:e6:95:b7 71687371 Blocks at 0:07h [ONLINE]

  SEAGATE ST336605FC [0003] S/N=3FP00BB7 20:00:00:20:37:e6:0a:38 71132960 Blocks at 0:09h [ONLINE]

  SEAGATE ST336605FC [0003] S/N=3FP00ARC 20:00:00:20:37:e6:0b:ef 71132960 Blocks at 0:0Bh [HOTSPARE]

  SEAGATE ST336605FC [0003] S/N=3FP017K6 20:00:00:20:37:e6:95:a5 71687371 Blocks at 0:0Dh [ONLINE]

  SEAGATE ST336605FC [0003] S/N=3FP00BJZ 20:00:00:20:37:e6:09:3a 71132960 Blocks at 0:0Fh [ONLINE]

  SEAGATE ST336605FC [0003] S/N=3FP0148W 20:00:00:20:37:e6:95:1a 71687371 Blocks at 0:11h [ONLINE]

  SEAGATE ST336605FC [0003] S/N=3FP011LD 20:00:00:20:37:e6:93:b2 71687371 Blocks at 0:13h [ONLINE]

  SEAGATE ST336605FC [0003] S/N=3FP009Z6 20:00:00:20:37:e6:06:31 71132960 Blocks at 1:04h [ONLINE]

  SEAGATE ST336605FC [0003] S/N=3FP008NA 20:00:00:20:37:e6:03:c3 71132960 Blocks at 1:06h [ONLINE]

  SEAGATE ST336605FC [0003] S/N=3FP009Y0 20:00:00:20:37:e6:0c:84 71132960 Blocks at 1:08h [ONLINE]

  SEAGATE ST336605FC [0003] S/N=3FP008FD 20:00:00:20:37:e6:03:80 71132960 Blocks at 1:0Ah [ONLINE]

  SEAGATE ST336605FC [0003] S/N=3FP00B4W 20:00:00:20:37:e6:09:be 71132960 Blocks at 1:0Ch [ONLINE]

  SEAGATE ST336605FC [0003] S/N=3FP00ANW 20:00:00:20:37:e6:07:3d 71132960 Blocks at 1:0Eh [ONLINE]

  SEAGATE ST336605FC [0004] S/N=3FP00B01 20:00:00:20:37:e6:08:7d 71132960 Blocks at 1:10h [ONLINE]

  SEAGATE ST336605FC [0004] S/N=3FP00Y3T 20:00:00:20:37:e6:9f:53 71687371 Blocks at 1:12h [ONLINE]

 RAID Controller Logical Device Dump:

  LUN[0] State=Optimal RAID_5 DeviceSize=20500480 Blocks

  LUN[1] State=Optimal RAID_5 DeviceSize=40972288 Blocks

  LUN[2] State=Optimal RAID_5 DeviceSize=102416384 Blocks

  LUN[3] State=Optimal RAID_5 DeviceSize=40972288 Blocks

  LUN[4] State=Optimal RAID_5 DeviceSize=40972288 Blocks

  LUN[5] State=Optimal RAID_0 DeviceSize=40980480 Blocks

  LUN[6] State=Optimal RAID_5 DeviceSize=122888192 Blocks

  LUN[7] State=Optimal RAID_5 DeviceSize=81944576 Blocks

  LUN[8] State=Optimal RAID_5 DeviceSize=506347520 Blocks

Terminating program.

 

In the example above, you can see the FFx RAID engine is running 7.70 firmware and has 128MB of RAM. It is attached to 16 Seagate disk drives. The first disk is a ST336605FC running firmware release 0004, and the serial numbers and world-wide names are also displayed. There are 71132960 usable blocks, and it is configured for channel 0 at hex ID #5.  State is ONLINE. The subsystem also defines a single disk as a hot spare.

 

There are 9 logical devices, all of which are defined as RAID5 except for a single striped RAID0 LUN.  All logical devices are "Optimal" which means they are online and operating properly.  If you had a drive failure, you might see status of Critical, Rebuilding, or Off line.

 

In the example below, we instructed the engine to return all known events in the controller's internal event log. The Mylex event log maintains the last 512 events and is volatile. That is, the log starts at event #0 at system power up time. Power cycles reset the log. Our example shows the power-on sequence for a controller through an exercise where we turned off each of the redundant power supplies to generate some events.

smartmon-ux -ZL \\.\PHYSICALDRIVE5

SMARTMon-ux [Release 1.16, Build 27-DEC-2002] - Copyright 2002 SANtools, Inc. http://www.SANtools.com

Discovered MYLEX DACARMRB247240T5 S/N " " on \\.\PHYSICALDRIVE5 (SMART unsupported) [Adapter/ID.LUN=4/3.31](247239 MB)

 Event log (Max of 512 events saved in controller):

  (0) [Severe] Ch:ID=0:0 "WARM BOOT failed. Memory error detected during WARM boot scan. Possible data loss."

  (1) [Warning] Ctl=0 "Dual controllers enabled."

  (2) [Info]  "Array management server software started successfully. The server system (or array management utility server) started."

  (3) [Info] Ctl=0 "Parameter type value is the reboot count. Automatic reboot count has changed. Controller has rebooted. Automatic reboot has rearmed itself or was reconfigured."

  (4) [Warning] Ctl=0 "Updated partner's status."

  (5) [Warning] Ctl=0 "Dual controllers entered nexus."

  (6) [Warning] Ctl=0 "Updated partner's status."

  (7) [Warning] Ctl=0 "Dual controllers enabled."

  (8) [Info]  "Array management server software started successfully. The server system (or array management utility server) started."

  (9) [Info] Ctl=0 "Parameter type value is the reboot count. Automatic reboot count has changed. Controller has rebooted. Automatic reboot has rearmed itself or was reconfigured."

  (10) [Warning] Ctl=0 "Updated partner's status."

  (11) [Warning] Ctl=0 "Updated partner's status."

  (12) [Info] Ch:ID=1:4 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (13) [Info] Ch:ID=0:5 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (14) [Info] Ch:ID=1:6 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (15) [Info] Ch:ID=0:7 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (16) [Info] Ch:ID=1:8 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (17) [Info] Ch:ID=0:9 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (18) [Info] Ch:ID=1:10 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (19) [Info] Ch:ID=0:11 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (20) [Info] Ch:ID=1:12 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (21) [Info] Ch:ID=0:13 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (22) [Info] Ch:ID=1:14 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (23) [Info] Ch:ID=0:15 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (24) [Info] Ch:ID=1:16 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (25) [Info] Ch:ID=0:17 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (26) [Info] Ch:ID=1:18 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (27) [Info] Ch:ID=0:19 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (28) [Info] Ctl=0 "Controller device start complete."

  (29) [Info] Ch:ID=1:4 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (30) [Info] Ch:ID=0:5 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (31) [Info] Ch:ID=1:6 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (32) [Info] Ch:ID=0:7 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (33) [Info] Ch:ID=1:8 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (34) [Info] Ch:ID=0:9 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (35) [Info] Ch:ID=1:10 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (36) [Info] Ch:ID=0:11 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (37) [Info] Ch:ID=1:12 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (38) [Info] Ch:ID=0:13 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (39) [Info] Ch:ID=1:14 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (40) [Info] Ch:ID=0:15 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (41) [Info] Ch:ID=1:16 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (42) [Info] Ch:ID=0:17 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (43) [Info] Ch:ID=1:18 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (44) [Info] Ch:ID=0:19 "A new hard disk has been found. A physical device has been powered on. A new physical device has been added. Controller was powered on. Controller was added. System has rebooted."

  (45) [Info] Ctl=0 "Controller device start complete."

  (46) 14:14:15 12/19/2002 [Info] Ctl:LD=0:0 "Logical drive has been placed online. Rebuild completed. User set the physical device online. New configuration was added."

  (47) 14:14:15 12/19/2002 [Info] Ctl:LD=0:1 "Logical drive has been placed online. Rebuild completed. User set the physical device online. New configuration was added."

  (48) 14:14:15 12/19/2002 [Info] Ctl:LD=0:2 "Logical drive has been placed online. Rebuild completed. User set the physical device online. New configuration was added."

  (49) 14:14:15 12/19/2002 [Info] Ctl:LD=0:3 "Logical drive has been placed online. Rebuild completed. User set the physical device online. New configuration was added."

  (50) 14:14:15 12/19/2002 [Info] Ctl:LD=0:4 "Logical drive has been placed online. Rebuild completed. User set the physical device online. New configuration was added."

  (51) 14:14:15 12/19/2002 [Info] Ctl:LD=0:5 "Logical drive has been placed online. Rebuild completed. User set the physical device online. New configuration was added."

  (52) 14:14:15 12/19/2002 [Info] Ctl:LD=0:6 "Logical drive has been placed online. Rebuild completed. User set the physical device online. New configuration was added."

  (53) 14:14:15 12/19/2002 [Info] Ctl:LD=0:7 "Logical drive has been placed online. Rebuild completed. User set the physical device online. New configuration was added."

  (54) 14:14:15 12/19/2002 [Info] Ctl:LD=0:8 "Logical drive has been placed online. Rebuild completed. User set the physical device online. New configuration was added."

  (55) 14:14:23 12/19/2002 [Info] Ctl=0 "BBU Present. Controller is dead. Controller has been removed. Controller has been powered off."

  (56) 14:14:26 12/19/2002 [Info] Ctl=0 "BBU Present. Controller is dead. Controller has been removed. Controller has been powered off."

  (57) 14:14:31 12/19/2002 [Severe] Ctl=0 "BBU recondition needed."

  (58) 14:14:31 12/19/2002 [Info] Ctl:Enc=0:0 "Enclosure services ready."

  (59) 15:27:11 12/19/2002 [Info] Ctl=0 "BBU Power OK. BBU has enough power to enable the write data cache."

  (60) 15:27:13 12/19/2002 [Warning] Ctl=0 "Controller entered normal cache mode."

  (61) 17:00:18 12/19/2002 [Info] Ctl=0 "BBU Power OK. BBU has enough power to enable the write data cache."

  (62) 17:00:21 12/19/2002 [Warning] Ctl=0 "Controller entered normal cache mode."

  (63) 14:13:57 12/20/2002 [Info] Ctl=0 "Parameter type value is the reboot count. Automatic reboot count has changed. Controller has rebooted. Automatic reboot has rearmed itself or was reconfigured."

  (64) 14:13:57 12/20/2002 [Info] Ctl=0 "Parameter type value is the reboot count. Automatic reboot count has changed. Controller has rebooted. Automatic reboot has rearmed itself or was reconfigured."

  (65) 09:34:37 12/22/2002 [Warning] Ctl=0 "UPS Battery Low - Controller entered Conservative Cache Mode."

  (66) 09:34:40 12/22/2002 [Severe] Enc:Unit=1:0 "Power supply failure. Cable connection is broken.  Bad power supply."

  (67) 09:34:40 12/22/2002 [Warning] Ctl=0 "UPS Battery Low - Controller entered Conservative Cache Mode."

  (68) 09:37:17 12/22/2002 [Warning] Ctl=0 "Controller entered normal cache mode."

  (69) 09:37:19 12/22/2002 [Info] Enc:Unit=1:0 "Power supply has been restored. Faulty power supply has been replaced."

  (70) 09:37:21 12/22/2002 [Warning] Ctl=0 "Controller entered normal cache mode."

  (71) 09:37:27 12/22/2002 [Warning] Ctl=0 "UPS Battery Low - Controller entered Conservative Cache Mode."

  (72) 09:37:30 12/22/2002 [Severe] Enc:Unit=1:1 "Power supply failure. Cable connection is broken.  Bad power supply."

  (73) 09:37:30 12/22/2002 [Warning] Ctl=0 "UPS Battery Low - Controller entered Conservative Cache Mode."

  (74) 09:42:25 12/22/2002 [Warning] Ctl=0 "Controller entered normal cache mode."

  (75) 09:42:27 12/22/2002 [Info] Enc:Unit=1:1 "Power supply has been restored. Faulty power supply has been replaced."

  (76) 09:42:29 12/22/2002 [Warning] Ctl=0 "Controller entered normal cache mode."

Terminating program.

 

Note that the software has entries up through and including firmware release 9.0, which totals to more than 250 events. If you do not understand what any of these events mean, or what you should do about them, please contact your disk subsystem provider for assistance.

 

Note also, that SMARTMon-UX does not launch alert emails or take any action on these events. The current release of the software only dumps them for you.  If you would like to have the system generate automated alerts based on the event log, you will need to incorporate the alerts into a shell script or external program of your design.

 

The -ZA option produces the same report as the -ZL option, only you specify the starting event number to begin reporting.

 

The -ZM option instructs the software to print a WWN-Mapping table that shows what WWNs are allocated to each logical unit.