Principal, Operational Excellence & Resilience (Remote) at CrowdStrike

This role is designed for a seasoned Disaster Recovery or Site Reliability Engineering (SRE) professional with at least 10 years of experience in enterprise-sca

Work type: remote

Location: USA - Remote

Type: Full-time

This role is designed for a seasoned Disaster Recovery or Site Reliability Engineering (SRE) professional with at least 10 years of experience in enterprise-scale, cloud-native environments. You are the ideal candidate if you possess a deep understanding of infrastructure redundancy, chaos engineering, and application resilience. As a senior individual contributor, you will bridge the gap between business units and engineering teams to ensure global service reliability. As a fully remote "Principal" position, this role offers significant autonomy and the opportunity to build a technology resilience function from the ground up within a mission-driven cybersecurity leader. You’ll act as a central hub for strategy, enjoying a "Great Place to Work" culture that emphasizes equity awards, professional development, and comprehensive wellness benefits. **You might be a good fit if you:** * Have a background in scaling resilience programs for high-growth, hybrid cloud architectures (AWS/Azure/GCP). * Are an expert in disaster recovery metrics like RTO/RPO and modern patterns like circuit breakers and progressive delivery. * Enjoy "chaos engineering" and proactively testing systems to find and fix vulnerabilities before they cause outages. * Can provide technical leadership and strategic briefings to executive stakeholders during high-pressure crisis events.

View this job on nocollar jobs