powerpc/eeh: Force reset on fenced PHB
author Gavin Shan <gwshan@linux.vnet.ibm.com>
Thu, 8 Oct 2015 03:58:54 +0000 (14:58 +1100)
committer Michael Ellerman <mpe@ellerman.id.au>
Wed, 21 Oct 2015 09:41:43 +0000 (20:41 +1100)
On a fenced PHB, the error handlers in the drivers of its subordinate
devices can return PCI_ERS_RESULT_CAN_RECOVER, indicating that no reset
will be issued during recovery. That conflicts with the fact that a
fenced PHB cannot be recovered without a reset.

This limits the result collected from the error handlers in the drivers
of the fenced PHB's subordinate devices to PCI_ERS_RESULT_NONE or
PCI_ERS_RESULT_NEED_RESET, to ensure that a reset is issued during
recovery.
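
The override can be read in isolation as a small pure function. The
sketch below is illustrative only: eeh_clamp_phb_result() is a
hypothetical helper that does not exist in the kernel, and the enum and
flag definitions are local stand-ins for the ones in include/linux/pci.h
and arch/powerpc/include/asm/eeh.h (the numeric values here are
assumptions made so the example compiles on its own).

```c
/*
 * Stand-in definitions so the sketch is self-contained. The real ones
 * come from include/linux/pci.h and arch/powerpc/include/asm/eeh.h;
 * the numeric values below are assumed for illustration.
 */
enum pci_ers_result {
	PCI_ERS_RESULT_NONE = 1,
	PCI_ERS_RESULT_CAN_RECOVER,
	PCI_ERS_RESULT_NEED_RESET,
	PCI_ERS_RESULT_DISCONNECT,
	PCI_ERS_RESULT_RECOVERED,
};

#define EEH_PE_PHB	(1 << 0)	/* stand-in: PE represents a PHB */
#define EEH_PE_DEVICE	(1 << 1)	/* stand-in: PE represents a device */

/*
 * Hypothetical helper mirroring the condition added by this patch:
 * for a PHB PE, any aggregated result other than NONE or NEED_RESET
 * is replaced by NEED_RESET. Other PE types are left untouched.
 */
static enum pci_ers_result
eeh_clamp_phb_result(int pe_type, enum pci_ers_result result)
{
	if ((pe_type & EEH_PE_PHB) &&
	    result != PCI_ERS_RESULT_NONE &&
	    result != PCI_ERS_RESULT_NEED_RESET)
		return PCI_ERS_RESULT_NEED_RESET;

	return result;
}
```

With these stand-ins, eeh_clamp_phb_result(EEH_PE_PHB,
PCI_ERS_RESULT_CAN_RECOVER) yields PCI_ERS_RESULT_NEED_RESET, while the
same result on a non-PHB PE is returned unchanged. As written, the
condition also turns PCI_ERS_RESULT_DISCONNECT into
PCI_ERS_RESULT_NEED_RESET for a PHB PE.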

Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com>
Reviewed-by: Daniel Axtens <dja@axtens.net>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
arch/powerpc/kernel/eeh_driver.c

index 32178a43138ff3242fe2227c355c9eec069d140c..80dfe8965df9f7d49fc57a1f1d6773f0c5ffd736 100644
@@ -664,9 +664,17 @@ static void eeh_handle_normal_event(struct eeh_pe *pe)
         * to accomplish the reset.  Each child gets a report of the
         * status ... if any child can't handle the reset, then the entire
         * slot is dlpar removed and added.
+        *
+        * When the PHB is fenced, we have to issue a reset to recover from
+        * the error. Override the result if necessary so that a partial
+        * hotplug is done in this case.
         */
        pr_info("EEH: Notify device drivers to shutdown\n");
        eeh_pe_dev_traverse(pe, eeh_report_error, &result);
+       if ((pe->type & EEH_PE_PHB) &&
+           result != PCI_ERS_RESULT_NONE &&
+           result != PCI_ERS_RESULT_NEED_RESET)
+               result = PCI_ERS_RESULT_NEED_RESET;
 
        /* Get the current PCI slot state. This can take a long time,
         * sometimes over 300 seconds for certain systems.