
Senior Site Reliability Engineer
- Eden Prairie, MN
- $89,900-160,600 per year
- Permanent
- Full-time
- Design and maintain Azure-based infrastructure using Terraform, including AKS, Azure Container Apps, VNETs, NSGs, Key Vault, and Storage Accounts
- Support the ongoing application migration effort across multiple AWS accounts, ensuring consistency, security, and automation best practices
- Build and optimize CI/CD pipelines using GitHub Actions, enabling frequent and reliable deployments
- Deploy and manage containerized applications with Docker and orchestrate using Azure Kubernetes Service (AKS)
- Collaborate with Security and Engineering to remediate vulnerabilities in infrastructure and pipelines
- Monitor, troubleshoot, and enhance platform reliability, performance, and observability using tools like Azure Monitor, Log Analytics, and Splunk
- Develop and maintain Ansible playbooks
- Automate repetitive tasks and improve operational efficiency through scripts and infrastructure tooling
- Maintain and continuously improve documentation of deployment practices, cloud configurations, and operational runbooks
- Participate in 24x7 on-call rotation to support production environments, troubleshoot incidents, and perform root cause analysis
- Paid Time Off which you start to accrue with your first pay period plus 8 Paid Holidays
- Medical Plan options along with participation in a Health Spending Account or a Health Saving account
- Dental, Vision, Life& AD&D Insurance along with Short-term disability and Long-Term Disability coverage
- 401(k) Savings Plan, Employee Stock Purchase Plan
- Education Reimbursement
- Employee Discounts
- Employee Assistance Program
- Employee Referral Bonus Program
- Voluntary Benefits (pet insurance, legal insurance, LTC Insurance, etc.)
- More information can be downloaded at:
- High School Diploma/GED (or higher)
- 5+ years of experience in production DevOps, SRE, or Cloud Engineer role
- 3+ years of hands-on experience with Azure Cloud, including AKS, Container Apps, Azure Pipelines, and IAM
- 3+ years of experience with AWS services such as EC2, S3, IAM, VPC, CloudWatch, and EKS in a production
- 3+ years of experience in deep knowledge of Infrastructure as Code (IaC) using Terraform (HashiCorp)
- Proven experience managing containerized workloads using Docker and Kubernetes (AKS preferred)
- Experience with source control tools such as Git, GitHub, or GitLab
- Strong background in building and maintaining CI/CD pipelines using GitHub Actions (or similar like Azure DevOps/Jenkins)
- Understanding of cloud networking (TCP/IP, DNS, Load Balancers, VNETs, NSGs)
- Background with Linux Administration experience
- Familiarity with scripting languages (Bash, Python, or PowerShell)
- Knowledge of observability/monitoring tools (Azure Monitor, Application Insights, Splunk, Dynatrace, etc.)
- Excellent communication and collaboration skills in agile team environments