NVIDIA

Senior System Reliability Engineer

Job Description

Posted on: 
2025-06-19

The Senior System Reliability Engineer at NVIDIA will focus on hardware reliability engineering for electronics and server systems, specifically in graphics and high-performance computing. The role involves establishing reliability standards, participating in design reviews, and conducting reliability testing.

Responsibilities

  • Provide expertise in Hardware Reliability Engineering for Electronics/Server Systems.
  • Establish and maintain product reliability standards and metrics.
  • Participate in product and engineering design reviews to assess reliability.
  • Interface with engineering groups, suppliers, and partners to achieve desired reliability.
  • Define and implement Reliability Plans & Specifications.
  • Perform testing and lead failure analysis with recommendations for improvements.
  • Develop methods to correlate reliability test results with actual field performance.

Job Requirements

  • BS in Engineering, Material Science, Physics, or related field; MS or PhD preferred.
  • 6+ years in a hardware validation/reliability environment.
  • Understanding of power supply, memory, high-speed I/O, PCI express, Ethernet, and I2C.
  • Hands-on experience in reliability concepts for electronic products.
  • Strong command of statistical concepts/models/analysis related to product reliability.
  • Good verbal and writing skills for high-level communication.
  • Project management skills with the ability to handle multiple projects.
Apply now

More job openings