
Site Reliability Engineer, Network and Traffic - USDS
- New York City, NY
- Permanent
- Full-time
- Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global infrastructure.
- Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues.
- Help improve the whole lifecycle of infrastructure services, from inception and design through development, deployment, user support, and refinement.
- Master's degree (or Bachelor's degree with 3+ years of experience) in Computer Engineering, Electrical Engineering, Computer Science, or a related major.
- 3+ years of experience working with Unix/Linux systems from kernel to shell and beyond, including system libraries, file systems, and client-server protocols.
- 3+ years of experience in one or more programming languages such as Java, C++, or Go, or scripting experience in Shell and Python.
- Self-driven, capable of coping with ambiguity and moving projects from concept to delivery.
- Strong analytical skills and the ability to solve real-world problems in a fast-moving environment.
- Experience in designing, analyzing, and building automation and tools for large-scale systems.
- Experience in building solutions with AWS, GCP, Azure, and other cloud services.
- Experience in networking technologies such as TCP/IP, BGP, DNS, etc. in a carrier-grade environment.
- Experience in developing and operating one or more of the following systems: OpenStack, Kubernetes, Nginx, ipvs, ELK stack, Hadoop, etc.