
Sr Manager, Site Reliability Engineering
- Lehi, UT
- Permanent
- Full-time
- Leadership & Strategy
- Lead and grow a team of SREs, fostering a culture of ownership, innovation, and accountability.
- Define and drive the SRE roadmap in alignment with business goals and engineering priorities.
- Partner with engineering, product, and infrastructure teams to ensure reliability is built into every layer of the stack.
- Reliability Engineering
- Own the availability, latency, performance, and capacity of services across production environments.
- Implement and evolve SRE best practices including SLIs/SLOs, error budgets, incident response, and postmortems.
- Drive automation of operational tasks and improve system observability.
- Incident Management & Response
- Lead major incident response efforts, ensuring timely resolution and clear communication.
- Establish and refine incident management processes, including root cause analysis and follow-up actions.
- Operational Excellence
- Monitor and report on system health, reliability metrics, and operational KPIs.
- Champion continuous improvement through blameless postmortems and reliability reviews.
- Ensure compliance with security, privacy, and regulatory standards.
- 8+ years of experience in software engineering, infrastructure, or SRE roles, with 3+ years in a leadership capacity.
- Proven experience managing distributed systems at scale in cloud-native environments (AWS, GCP, Azure).
- Strong understanding of observability tools (e.g., Prometheus, Grafana, Datadog), CI/CD pipelines, and infrastructure-as-code.
- Excellent communication and stakeholder management skills.
- Experience with agile methodologies and DevOps practices.
- Experience with Python, Powershell, and other similar languages
- Experience with Kubernetes, service meshes, and microservices architecture.
- Familiarity with chaos engineering and resilience testing.
- Background in performance engineering or capacity planning.
- Competitive total rewards (base salary + bonus, if applicable)
- Customizable benefits package (3 medical plans with Health Saving Account company match)
- We offer generous paid time off for our non-exempt team members, starting with 3 weeks + 13 paid holidays, including 2 personal floating holidays. We also offer flexible time off for our exempt team members + 13 paid holidays
- Paid parental leave (including maternity + paternity leave)
- Education assistance opportunities and free LinkedIn Learning access
- Free mental health and family planning programs, including adoption assistance and fertility support
- 401(K) program with company match
- Pet insurance
- Employee resource groups