
Senior Site Reliability Engineer
- Atlanta, GA
- $143,000-179,000 per year
- Permanent
- Full-time
- Collaborate with other teams to define and implement system requirements.
- Design, build, and maintain cloud-based microservices infrastructure.
- Automate routine operational tasks and remediation processes to improve efficiency and reliability.
- Proactively fix and resolve issues, collaborating with support teams, other engineering teams, and using monitoring tools to ensure system health.
- Ensure that datastores operate efficiently and meet performance and availability goals.
- Contribute to the team's growth by mentoring junior engineers and sharing standard methodologies.
- Plan and execute strategies for scaling systems and infrastructure as needs grow.
- Strong background in infrastructure, operations, or software engineering with a focus on reliability.
- Extensive experience working with cloud platforms such as Google Cloud Platform (GCP) or Amazon Web Services (AWS).
- Proficiency in using configuration management tools like Terraform and Ansible to manage infrastructure.
- Hands-on experience with modern monitoring and observability tools such as Prometheus, Grafana, and similar technologies.
- Proven experience with distributed databases (e.g. Cassandra, Elasticsearch) and maintaining their health at scale.
- Familiarity with distributed event stores and stream-processing platforms.
- Strong coding skills in at least one modern programming language (Python, Go, etc.).
- Expertise in running and maintaining production systems in a Linux environment and public cloud infrastructure.
- Demonstrated expertise in architecting solutions for complex technical challenges, and the ability to lead initiatives from conception through to execution.
- Strong interpersonal and communication skills, with a history of building effective relationships with cross-functional teams.
- Ability to mentor and guide junior engineers, fostering a collaborative and inclusive team culture.
- Experience with container orchestration platforms.
- Expertise in CI/CD pipeline automation and infrastructure as code practices.
- Knowledge of network architecture and security best practices in cloud environments.
- Experience with containerization and microservices architectures.
- Advanced problem solving skills, particularly in highly sophisticated and distributed systems.
- STAY HEALTHY: We offer comprehensive market competitive medical, dental, and vision plans. A variety of supplemental plans are also provided to meet your individual needs including access to telehealth for all participants.
- CARE FOR YOURSELF: Take advantage of our free virtual counseling resources through our global Employee Assistance Program. Your mental health is as important as your physical health.
- SECURE YOUR FUTURE: Plan for your future with our Roth and Pre-tax 401(k) options including an employer match for all participants.
- TAKE A BREAK: Enjoy a generous paid time off program. We value balance and understand that performance at work requires time to rest at home and/or rejuvenate on vacation.
- PUT FAMILY FIRST: We know that families can be built in a variety of ways; therefore, we offer paid parental leave and family planning support.
- WORK WHEREVER: Our flexible remote work offerings allow you to work wherever you are the most productive and successful. It is what you do, not where you work, that matters.
- MAKE AN IMPACT: Support betterment in your community and beyond by taking paid time off to support a volunteer program of your choice.