Skip to main content

8.5 RMA Processing & Field Failure Analysis

Field returns (RMA) are the only source of truth in manufacturing. While internalInternal yield metrics only measure your ability to build the product,product; RMA data measures the product's ability to survive reality. A robust Field Failure Analysis (FFA) process is not a "returns department" function; it is a forensic engineering discipline. The objective is not to issue a credit note, but to extract the failure mode, replicate it, and feed the corrective action back into the Design (NPI) or Manufacturing process to prevent recurrence.

Intake & Safety Protocols (The Air Gap)

Returned units are unknown variables. They may have been installed in hazardous environments (hospitals, oil rigs) or subjected to bio-contamination.

Decision Logic for Intake:

  • IFIf Unitunit arrives from Medical/Industrial field:field →
    • THENThen Quarantine immediately. Do not open without a Decontamination Certificate.
      • Risk: Exposure to pathogens or chemicals.
    • IFIf Li-Ion Battery is presentswollen andor swollen/damaged:damaged →
      • THENThen Classifyclassify as HAZMAT. Store in fireproof cabinet. Do not charge.
    • Action: Record serial number and photograph external condition before any testing. This distinguishesdifferentiates "shipping damage" from "customer abuse."

    The Verification Gate (Confirming the Failure)

    The most expensive outcome in RMA is NTF (No Trouble Found). NTF is not a customer error; it is often a gap in your test coverage.

    Testing Hierarchy:

    1. Visual Inspection: Look for burn marks (EOS), corrosion (liquid ingress), or impact damage (cracked housing).
      • Decision: If physical damage is evident, stop electrical testing. The root cause is likely "Misuse."
    2. Functional Verification: Attempt to replicate the failure using the customer's reported scenario.
    3. NTF Protocol: If the unit passes all tests:
      • Action: Perform thermal cycling (0˚C to 50˚C) or vibration. Intermittent failures (cold solder joints) often hide at room temperature.

    Pro-Tip: Customers often report "Dead Unit" when the issue is actually a specific sleep-mode bug. Do not just run the standard "Pass/Fail" script; try to break the unit.

  • NTF Protocol: If the unit passes all tests:
    • Action: Perform thermal cycling (0˚C to 50˚C) or vibration. Intermittent failures (cold solder joints) often hide at room temperature.
  • Root Cause Analysis (The "Smoking Gun")

    When a defect is confirmed, categorize it into one of three buckets to assign ownership.

    Bucket A: Electrical Overstress (EOS)

    • Signs: Burnt components, carbonized PCB, melted traces.
    • Physics: Massive energy surge (Lightning, wrong power supply, short circuit).
    • Owner: Typically Customer (Application) or Design (Inadequate Protection).

    Bucket B: Electrostatic Discharge (ESD)

    • Signs: SEM analysis shows gate oxide puncture. No external burns.
    • Physics: Latent defect from assembly or improper handling.
    • Owner: Manufacturing Process (EPA Violation).

    Bucket C: Workmanship / Component

    • Signs: Cold solder, missing component, wrong value, dead silicon.
    • Owner: Manufacturing or Supplier.

    The Feedback Loop (CAPA)

    RMA data is useless if it sits in a spreadsheet. It must trigger the Corrective Action (CAPA) system.

    Trigger Thresholds:

    • IFIf Safety Incident (Fire/Shock): → THENThen Immediate Stop Ship + Recall Analysis.
    • IFIf New Failure Mode detected:detected → THENThen Issue CAR (Corrective Action Request) to Engineering.
    • IFIf Repeat Failure Rate > 1%: → THENThen Initiate Process Audit.

    Final Checklist

    Control Step

    ParameterPoint

    Critical LimitRequirement

    Risk / RuleAvoided

    Safety

    Decontamination

    Zero Handling without PPE/CertDecontamination Cert.

    Biohazard / Chemical Exposure

    Verification

    Replicate Failureusing

    Use Customer Scenario, not Factory TestTest.

    False NTF Results

    NTF Rate

    "No Trouble Found"

    Target < 10%. (If higher, review Test Specs).

    Unexplained Field Risk

    Analysis

    Differentiate EOS vs.(External) vs ESD (Internal).

    DifferentiateMisassigning Energy Source (External vs. Internal)Liability

    Disposition

    DemolishScrapping scrapped units (Crush/Shred).

    DemolishGray (DoMarket not allow gray market resale)Resale

    DataFeedback

    Feedback

    RMA Metrics must drive FMEA updates.

    Repeating Design Errors