CFBMC-3048: ONTAP reboots to recover the BMC with "BMC heartbeat stopped"
Issue
- When the BMC on AFF A250, AFF C250, ASA A250, ASA C250 or FAS500f systems becomes unresponsive, ONTAP attempts to reboot the BMC to recover it.
- The BMC reboot operation can hang.
- If it hangs, ONTAP subsequently reboots itself to recover from this condition.
- The following are example events indicating the issue:
- 13:21:27 [node-01: spmgrd: sp.heartbeat.stopped:error]: Have not received a IPMI heartbeat from the Service Processor (SP) in last 600 seconds.
- 13:21:27 [node-01: spmgrd: callhome.sp.hbt.missed:notice]: Call home for SP HBT MISSED
- 13:31:46 [node-01: spmgrd: callhome.sp.hbt.stopped:alert]: Call home for SP HBT STOPPED
- 13:34:07 [node-01: env_mgr: sp.ipmi.lost.shutdown:EMERGENCY]: SP heartbeat stopped and cannot be recovered. To prevent hardware damage and data loss, the system will shut down in 10 minutes.
- 13:44:07 [node-01: env_mgr: monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (System reboot to recover the BMC)
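The event sequence above can be checked for mechanically when reviewing saved log text. The following is a minimal sketch, not a NetApp tool: it assumes the log lines have already been exported to plain text in the same `[node: source: event:severity]` layout as the examples, and the names `SIGNATURE`, `EVENT_RE`, and `find_signature` are hypothetical helpers introduced only for illustration. It uses only the event names that appear in the examples above.

```python
import re

# Event names taken from the example events above, in the order they occur.
# (Hypothetical helper data; not an official signature definition.)
SIGNATURE = [
    "sp.heartbeat.stopped",
    "callhome.sp.hbt.stopped",
    "sp.ipmi.lost.shutdown",
    "monitor.shutdown.emergency",
]

# Matches the "[node: source: event.name:severity]" header of a log line.
EVENT_RE = re.compile(
    r"\[(?P<node>[^:\]]+): (?P<source>[^:]+): (?P<event>[\w.]+):(?P<severity>\w+)\]"
)


def find_signature(lines):
    """Return the sorted node names whose events contain SIGNATURE in order."""
    progress = {}  # node name -> index of the next expected event
    for line in lines:
        match = EVENT_RE.search(line)
        if not match:
            continue  # not an event line in the assumed layout
        node, event = match["node"], match["event"]
        idx = progress.get(node, 0)
        if idx < len(SIGNATURE) and event == SIGNATURE[idx]:
            progress[node] = idx + 1
    return sorted(n for n, i in progress.items() if i == len(SIGNATURE))
```

Calling `find_signature` on a list of log lines returns the nodes that logged all four events in order; other events between them, such as `callhome.sp.hbt.missed`, are ignored. A match only suggests that the logs resemble the examples and does not by itself confirm the issue.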