Google

System Hardware Reliability Engineer, TPU Systems

Job Description

Posted on: 
2026-01-22

Responsibilities

  • Lead system hardware reliability efforts with partner organizations to identify failure-inducing factors for TPU systems.
  • Define and manage system hardware reliability needs for TPU system deployments, including monitoring installation issues and repair times.
  • Extract and analyze system field reliability data to drive failure analysis and improve customer experience.
  • Manage collaborations with external partners, testing labs, and cross-functional internal groups.
  • Develop in-house test and qualification capabilities as necessary.
  • Monitor random failure events and early detection of wearout or unexpected system failure trends.
  • Provide actionable insights based on reliability principles to enhance product quality.

Job Requirements

  • Bachelor's degree in Electrical, Mechanical, Industrial, Materials, or a related engineering field.
  • 8 years of experience in manufacturing.
  • Preferred: Master's degree or PhD in a related engineering field.
  • Experience in setting up manufacturing processes and managing product launches.
  • Strong technical leadership and project management skills.
  • Expertise in statistical analysis and reliability statistics.
  • Experience with failure analysis of hardware systems, especially related to solder joint reliability and PCB assembly.
Apply now

More job openings