Good day,
Hope someone can assist or point me in the right direction.
Our 7-node failover cluster went unstable on Thursday last week. Eventually we turned off all VMs and rebooted the hosts.
All working again :)
We then looked at the logs and saw that one of our hosts had a bugcheck and rebooted.
The logs we picked up are as follows:
It started with event ID 153: The IO operation at logical block address 0x1ac91a540 for Disk 19 (PDO name: \Device\MPIODisk19) was retried.
There were a lot of them, and then
event ID 1146: The cluster Resource Hosting Subsystem (RHS) process was terminated and will be restarted. This is typically associated with cluster health detection and recovery of a resource. Refer to the System event log to determine which resource and resource DLL is causing the issue.
followed by event ID 41: The system has rebooted without cleanly shutting down first. This error could be caused if the system stopped responding, crashed, or lost power unexpectedly.
I get the following from the memory dump but cannot make out what caused the server to crash:
Bug check code: 0x0000009e, parameter 1: ffff8d8f`15b58540, parameter 2: 00000000`000004b0, parameter 3: 00000000`000000c9, parameter 4: 00000000`00000000
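To make the raw values easier to read, here is a minimal Python sketch that splits the bug check line into plain integers. The function name parse_bugcheck is just made up for illustration, and the script only converts hex to decimal; it does not inspect the dump itself. As far as I can tell, Microsoft's bug check reference lists 0x9E as USER_MODE_HEALTH_MONITOR with parameter 2 as the health monitoring timeout in seconds, but please treat that reading as an assumption to verify against the documentation.

```python
import re

# Bug check line copied verbatim from the memory dump output above.
line = ("bug check code 0x0000009e parameter 1:ffff8d8f`15b58540 "
        "parameter 2:00000000`000004b0 parameter 3: 00000000`000000c9 "
        "parameter 4: 00000000`00000000")


def parse_bugcheck(text):
    """Hypothetical helper: return (code, [p1, p2, p3, p4]) as integers."""
    code_match = re.search(r"bug check code\s+(0x[0-9a-fA-F]+)", text)
    code = int(code_match.group(1), 16)
    # The debugger prints 64-bit values as high`low, so drop the backtick.
    params = [int(value.replace("`", ""), 16)
              for value in re.findall(r"parameter \d:\s*([0-9a-fA-F`]+)", text)]
    return code, params


code, params = parse_bugcheck(line)
print(f"bug check code: 0x{code:X}")
for index, value in enumerate(params, start=1):
    print(f"parameter {index}: 0x{value:X} = {value}")
```

If the assumption about parameter 2 holds, 0x4b0 is 1200 decimal, which would be a timeout of 1200 seconds (20 minutes).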
Can anyone decipher these bug check codes?