
Software Engineer II
- Boston, MA
- Permanent
- Full-time
- Design and implement simple, scalable, maintainable, and repeatable solutions in AWS
- Develop self-service solutions to empower internal teams to efficiently build on Chewy's platform
- Define Infrastructure as Code (IaC) standards and collaborate with software development leadership to enhance reliability and scalability
- Work with software engineering teams to build infrastructure automation and services that simplify large-scale distributed system operations
- Participate in cloud readiness assessments, workshops, roadmaps, and application modernization to drive cloud-native architecture adoption
- Automate monitoring and support solutions for public cloud platforms and streamline software development and release processes
- Drive innovation through rapid prototyping and iterative development
- Debug production issues across services and technology stack
- Act as a technical thought leader and Subject Matter Expert for AWS adoption
- Continuously measure and optimize the performance, efficiency, and uptime of systems
- Apply algorithms, data structures, and design patterns to build scalable and performant code
- Collaborate with team members and developers to drive solutions to standardize methodologies, workflows, reusable modules, and tooling
- Participate in design and code reviews
- Participate in on-call rotations and 24x7 production support rotations
- Bachelor's degree in Computer Science, Information Systems, Telecommunication Systems Management, or a related field.
- AWS Solution Architect Associate Certification or equivalent experience
- 3+ years of hands-on experience with public cloud infrastructure (AWS)
- Experience in managing large single AWS accounts, with multiple VPCs as well as multiple smaller accounts
- Experience working across fully automated stacks in a DevOps, CI/CD ecosystem
- Experience with Docker, Terraform/IaC, IAM, Networking, Secrets Management, Jenkins
- Experience in programming in Python (preferred)
- Expertise in designing scalable, fault-tolerant, highly available, and globally distributed systems
- Knowledge of on-premise and cloud networking (VPC, TGW, network routing, firewalls)
- Strong verbal and written communication skills with a proven ability to lead technical initiatives
- Experience with Scrum
- Experience in designing systems in Java or other object-oriented programming languages
- Strong background with modern tools and environments (e.g., Chaos Engineering, web services, version control systems, bug/issue tracking tools)
- Experience with data analysis/mining in large scale database environments a plus
- Linux System Administration experience
- Aviatrix administration skills
- Experience with Wiz