Hardware Operations Technical Program Manager

This is a fully remote job. The offer is available from: United States

About the Team

The Stargate team is responsible for building the physical infrastructure that powers OpenAI’s largest-scale AI systems. We design, deploy, and operate next-generation data center infrastructure across a rapidly expanding footprint, bringing together hardware, networking, facilities, supply chain, and deployment execution.

This work sits at the intersection of advanced AI hardware and real-world infrastructure delivery. Our team turns compute requirements into deployable, reliable, and scalable systems that support frontier AI workloads.

About the Role

We are seeking a Hardware Operations Technical Program Manager to drive execution across the lifecycle of AI infrastructure hardware programs.

In this role, you will own cross-functional program execution across hardware readiness, supplier coordination, deployment planning, rack-level integration, manufacturing operations, logistics, field deployment, and operational handoff. You will partner closely with hardware engineering, data center engineering, networking, supply chain, manufacturing, deployment, and operations teams to ensure critical infrastructure programs move from design intent to production readiness.

This role is ideal for someone who can operate at both the technical and programmatic level: understanding hardware systems, identifying operational blockers, driving accountability across teams, and creating scalable processes for high-volume infrastructure deployment.

Key Responsibilities

  • Drive end-to-end Hardware Operations readiness programs across AI infrastructure systems, including servers, racks, networking hardware, power and cooling interfaces, and related data center infrastructure.

  • Develop and operationalize scalable hardware operations processes, workflows, and support models spanning deployment, repair operations, diagnostics, break/fix, escalation management, and sustaining operations.

  • Lead cross-functional execution of Hardware Operations readiness initiatives, ensuring operational capabilities, tooling, documentation, staffing models, and workflows are established prior to production deployment and operational handoff.

  • Partner across Hardware Engineering, Manufacturing, Supply Chain, Data Center Operations, Network Operations, Deployment, Reliability Engineering, and external suppliers to ensure alignment on operational requirements, supportability, and readiness milestones.

  • Develop operational scorecards, reporting frameworks, and metric algorithms to measure hardware operational health, repair performance, deployment quality, readiness status, and execution efficiency.

  • Identify operational, technical, supplier, tooling, and process risks early; drive mitigation plans, cross-functional alignment, and executive-level communication.

  • Lead cross-functional issue resolution efforts during hardware deployment, validation, operational ramp, and sustaining operations, ensuring rapid containment, corrective action development, and long-term process improvement.

  • Create and mature operational governance models, including standardized readiness reviews, action tracking, escalation management, performance reviews, and operational business rhythms.

  • Ensure operational knowledge sharing and alignment across internal teams, external suppliers, and infrastructure partners to improve execution consistency, issue resolution efficiency, and operational maturity.

You Might Thrive in This Role If You

  • Have experience driving complex hardware or infrastructure programs from development through production and deployment.

  • Are comfortable operating across engineering, manufacturing, supply chain, deployment, and operations teams.

  • Can understand technical system dependencies without needing to be the deepest engineer in every domain.

  • Know how to create structure in ambiguous, fast-moving environments.

  • Are effective at driving accountability across teams and vendors without direct authority.

  • Can move between tactical execution details and executive-level communication.

  • Have strong judgment around when to escalate, when to unblock directly, and when to create a repeatable process.

Qualifications

  • 7+ years of experience in technical program management, hardware operations, manufacturing operations, infrastructure deployment, or related technical execution roles.

  • Experience supporting hardware systems at scale, ideally including servers, racks, networking hardware, data center infrastructure, or high-performance compute environments.

  • Strong understanding of the hardware development and deployment lifecycle, including new product introduction (NPI), qualification, manufacturing ramp, logistics, installation, validation, and operational support.

  • Demonstrated ability to manage complex cross-functional schedules, dependencies, risks, and executive communications.

  • Strong technical fluency across hardware systems, rack integration, manufacturing readiness, and infrastructure deployment.

  • Proven ability to operate in ambiguous environments and create scalable execution mechanisms.

  • Excellent written and verbal communication skills, with the ability to influence technical and non-technical stakeholders.

  • Experience with AI infrastructure, hyperscale data centers, cloud infrastructure, or high-density compute systems is a plus.

  • Bachelor’s degree in engineering, computer science, operations, supply chain, or equivalent practical experience.

Preferred Skills

  • Experience with GPU, accelerator, server, rack, or cluster-scale infrastructure programs.

  • Background in hardware operations, manufacturing program management, supply chain operations, data center deployment, or technical infrastructure TPM.

  • Familiarity with rack integration, power/cooling constraints, cabling, networking, serviceability, and deployment readiness.

  • Experience building operating rhythms, readiness reviews, risk registers, launch dashboards, or executive program reviews.

  • Experience scaling programs from prototype/NPI into repeatable production deployment.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. 

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.

Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

This offer from "Rockset" has been enriched by Jobgether.com and received an 84% flex score.
