OpenAI

Reliability/DFX Engineer

Job Description

Posted on: 
2025-11-08

Responsibilities

  • Oversee DFX architecture, implementation, and execution in silicon from concept to deployment.
  • Build system-level reliability models to guide DFX and reliability strategy.
  • Collaborate with design teams to implement DFX features.
  • Partner with hardware health and platform design teams to improve reliability.
  • Serve as the DFX/reliability champion within the industry ecosystem.
  • Propose high-ROI features to enhance reliability and fault tolerance.
  • Analyze data to drive continuous improvements across the stack.

Job Requirements

  • BS with 15+ years, MS with 10+ years, or PhD with 3+ years of relevant experience.
  • Hands-on experience with RTL design and DFT.
  • Detailed understanding of ML chip and platform architecture.
  • Strong fundamentals in reliability modeling and empirical data analysis.
  • Experience with physical implementation and/or silicon ATE is preferred.
  • Ability to work collaboratively across teams.
  • Strong communication and alignment skills within the broader ecosystem.
Apply now

More job openings