Tuesday, January 3, 2012

Equallogic PS6500 Kernel panic

I guess there is a first time for everything, but this is not something you want to experience. Today for the first time since I have started using Equallogic SAN's I had one reboot in the middle of the day - the logs indicated a CPU Kernel Panic and the modules did failover as expected. We are running firmware 5.1.1 and I have sent the diagnostics off to support to see what they can determine the root cause to be. These SAN's have been rock-solid for us up to this point and I am hoping that this is just a one-off incident. As I get more information back from support I will update this post. Hopefully I will get more information than just upgrade your firmware to 5.1.2 - which I plan to do this weekend.

Update: Dell got back in touch with me and wants to swap out the faulty controller and also a drive that showed errors in the diagnostics. I should have the parts within 4 hours and will see if this problem occurs again.

Got the new controller and drive delivered on time. Installed the new controller and approximately two hours later the SAN did a failover event to the new controller after complaining that communication was lost between the two controllers again. No problems have occurred since that event but I am still concerned about what may be going on.

1 comments:

Mark said...

Hi,

I have just tried updating an EqualLogic PS6000 to FW version 5.1.2, got a NetBSD Kernel Panic at which point the hosted iSCSI volumes went off line for about 3 minutes then came back on. I am now stuck with Active controller on FW 5.1.1 and Secondary on FW 5.1.2. Have generated logs and sent to Tech support... never had this issue before with EqualLogic and have performed a number of Firmware updates now. Very strange, watch this space !

http://www.interweb.org.uk