Principal, Operational Excellence & Resilience (Remote) at CrowdStrike
This role is designed for a seasoned Disaster Recovery or Site Reliability Engineering (SRE) professional with at least 10 years of experience in enterprise-sca
Work type: remote
Location: USA - Remote
Type: Full-time
This role is designed for a seasoned Disaster Recovery or Site Reliability Engineering (SRE) professional with at least 10 years of experience in enterprise-scale, cloud-native environments. You are the ideal candidate if you possess a deep understanding of infrastructure redundancy, chaos engineering, and application resilience. As a senior individual contributor, you will bridge the gap between business units and engineering teams to ensure global service reliability.
As a fully remote "Principal" position, this role offers significant autonomy and the opportunity to build a technology resilience function from the ground up within a mission-driven cybersecurity leader. You’ll act as a central hub for strategy, enjoying a "Great Place to Work" culture that emphasizes equity awards, professional development, and comprehensive wellness benefits.
**You might be a good fit if you:**
* Have a background in scaling resilience programs for high-growth, hybrid cloud architectures (AWS/Azure/GCP).
* Are an expert in disaster recovery metrics like RTO/RPO and modern patterns like circuit breakers and progressive delivery.
* Enjoy "chaos engineering" and proactively testing systems to find and fix vulnerabilities before they cause outages.
* Can provide technical leadership and strategic briefings to executive stakeholders during high-pressure crisis events.
View this job on nocollar jobs