Engineering Manager - DevOps
Loop
- Columbus, OH; Austin, TX
- $152,000-228,000 per year
- Permanent
- Full-time
- Drive self-service and automation at Loop by designing golden-path workflows so product teams can provision infrastructure, integrate monitoring, and release safely on their own.
- Lead the strategy and execution for scaling to multi-region by implementing active-active/active-standby architectures, cross-region data replication, and global traffic management.
- Champion the evolution of deployment patterns, including blue/green, canary, feature-flag, and immutable-infra releases that minimize risk and Mean Time To Recovery (MTTR).
- Implement Service Level Objectives (SLOs), error budgets, chaos testing, and auto-remediation playbooks to raise the reliability bar, and own the culture of the infrastructure on-call rotation.
- Mentor and develop a diverse team of DevOps engineers, DBAs, and MLOps engineers with a wide variety of technical skill sets, cultivating the next wave of engineering leaders.
- Partner hand in hand with Product Engineering Teams, our Data Team, and other stakeholders to align roadmaps and unlock velocity.
- Contribute hands-on by writing Terraform modules, optimizing Helm charts, reviewing merge requests, helping support the team with reactive work, and joining high-severity incident calls when needed.
- 4+ years of proven leadership managing DevOps or Site Reliability Engineering (SRE) teams, along with 7+ years in hands-on platform or infrastructure roles.
- You have a strong self-service track record, having delivered internal platforms or portals that empowered hundreds of engineers to ship autonomously.
- You bring large-scale AWS expertise, demonstrated by designing and operating multi-region, high-throughput systems that support over $100 million in annual Gross Merchandise Value (GMV).
- You are an expert in advanced CI/CD pipelines and Infrastructure as Code (IaC), proficient with tools like GitLab CI (or similar), Terraform, and Kubernetes, and comfortable introducing progressive delivery, policy-as-code, and secrets management at scale.
- You have an observability mindset, with a deep understanding of metrics, tracing, logging, and alerting, and experience using Datadog or comparable stacks.
- You bring security and compliance awareness, including familiarity with cloud security best practices, least-privilege access, and regulated-data environments subject to PCI, SOC 2, and GDPR.
- You exhibit empathetic leadership, with demonstrated success fostering psychological safety, inclusion, and continuous feedback while driving accountability and high performance.