At approx 9:37pm tonight one of our SAN devices in BOL23 (EQL-SAS-4) reported that its primary controller was not responding.
The controller triggered a kernel panic which caused the controller to reboot. The failover controller took over once it had confirmed the primary controller was not responding.
This may have caused some virtual machines to receive disk timeouts. These should be gracefully handled (depending on the drivers within the OS, and whether VMtools is installed) but may generate errors to event logs.
The timeouts are due to equallogic SANs working in active/passive failover mode which requires a few seconds to fully failover.
After the original primary controller recovered, it is now reporting a nvram cache battery failure so will need to have this replaced.
We will schedule this replacement. As it is now a backup module due to the failover event, this should not be disruptive – however it will remove the failover capability during the replacement period.