Top Benefits
Competitive PTO policies
Remote or hybrid work
Generous parental leave
About the role
Who you are
- Does the idea of turning thousands of blank, powered‑off servers into fully configured, production‑ready machines with a single, deterministic pipeline sound like your kind of challenge?
- 10+ years hands‑on systems or infrastructure engineering (Go, Python, Rust, or similar) building automation for bare‑metal provisioning, OS imaging, and configuration management at fleet scale
- 5+ years in engineering leadership roles including managing other managers or tech leads
- Demonstrated ability to scale complex provisioning / automation platforms across thousands of servers or devices
- Deep expertise in boot & hardware pipelines (PXE/UEFI HTTP, BMC / Redfish, secure boot, firmware flashing) and tools like Chef or Ansible, plus experience making high-leverage architectural decisions at org scale
- Proven technical leadership of teams and mission‑critical projects (mentoring, driving design reviews, owning roadmaps), not just delivery execution
- Strong cross‑functional collaboration and executive communication skills, able to align long-term technical strategy with business goals
- Experience leading teams that engineered UEFI / USB boot-shims that stream a Live OS over an out-of-band channel (Redfish virtual media, HTTPBoot, iPXE) to provide zero-touch diagnostics, patching, or full re-image, turning fleet-wide emergency recovery into a single pipeline run
- A track record of leveraging that expertise to set architectural direction at scale
What the job involves
- We’re looking for a Senior Engineering Manager to lead our Infrastructure Automation team, owning everything from “cold silicon” power‑on to Chef convergence at fleet scale
- You’ll shape the vision, mentor a team of sharp developers, and scale the organization
- You’ll set a deeply technical direction with a focus on building the tooling, APIs, and observability that make site bring‑up as effortless and repeatable as a git push, unlocking rapid deployments at scale
- Define and drive the multi-year architecture roadmap for fully automated bring-up of new hardware platforms and PoPs, delivering deterministic, end‑to‑end provisioning automation across all existing and new hardware platforms
- Lead and mentor senior engineers and managers, creating an org that excels in systems design, operational reliability, and rapid iteration
- Deliver and operate scalable tooling, APIs, and dashboards that make bulk site bring‑ups self‑service, idempotent, and fully traceable
- Collaborate cross‑functionally with Operations (DC & Net), Hardware, and Security to align firmware, OS, and config‑management workflows with fleet‑wide SLAs while iterating on PoP design, and use that expertise to influence organizational strategy and architecture at scale
- Own success metrics (e.g., first‑pass yield & error‑recovery time), drive continuous improvement through data‑driven retrospectives, and translate the findings into roadmap updates
- This position will require you to be available during core business hours. You will also be on-call during evenings and weekends as an escalation point in the team's on-call rotation
The application process
- Anticipated posting close date: Sept. 23, 2025
- Job posting may close early due to the volume of applicants
Benefits
- Competitive PTO Policies
- Remote or Hybrid Work
- Generous time off for parental leave
- Full medical, dental, and vision coverage
- Short- and long-term disability insurance
- Mental health resources
- 401(k)/retirement plans
- Employee stock purchasing plans (ESPP)
- Reimbursements for learning and development programs