
Site Reliability Engineer I
- Boston, MA
- Permanent
- Full-time
- Design and build Kubernetes platforms and tools that empower engineering teams to provision services quickly, securely, and consistently.
- Develop clear, maintainable, and performant code to support Kubernetes infrastructure and platform automation..
- Tackle complex challenges in cloud-native distributed systems with strong problem-solving and debugging skills.
- Drive continuous improvement of platform services
- Accelerate AI adoption across the organization.
- Create clear and comprehensive documentation to promote platform self-service and onboarding.
- This position involves handling of classified federal data; under federal regulations, it is open to US Citizens only
- Bachelor's degree in Computer Science, Engineering, or a related technical field.
- 1+ years of hands-on experience (including internships or equivalent exposure).
- Experience with Kubernetes environments (e.g., AKS, EKS, or similar) in production
- Proficiency in a managed programming language such as Python, Go, C#, or Java.
- Exposure to operating and maintaining high-availability, high-volume Kubernetes deployments with zero-downtime requirements.
- Proven track record of reducing toil and developing automation
- Familiarity with configuration management and GitOps tools (e.g., ArgoCD, GitHub actions).
- Demonstrated ability to debug and resolve issues in complex, distributed systems.
- Interest or experience in leveraging AI tools and services for development and platform optimization.
- Competitive salary and 401k with employer match
- Discretionary time off
- Paid parental leave for all
- Medical, Dental, Vision plans
- Fitness Programs
- Emotional & Development Programs
- And yes, we have snacks in our offices