
Software Engineering Manager, Workload Benchmarking, Google Cloud Platform Compute
- Kirkland, WA
- Permanent
- Full-time
- Bachelor’s degree, or equivalent practical experience.
- 8 years of experience in software development.
- 3 years of experience with developing large-scale infrastructure, distributed systems or networks, or with compute technologies, storage or hardware architecture.
- 3 years of experience in a technical leadership role overseeing projects.
- 2 years of experience in a people management, supervision/team leadership role.
- Master's degree or PhD in Computer Science or related technical field.
- 3 years of experience working in a structured organization.
- Experience developing accessible technologies.
- Experience with internal quality and repro testing to cover critical user journeys (CUJs).
- Ability to collaborate with internal infrastructure teams to identify bottlenecks and expand capacity as needed.
- Ability to drive continuous product improvement through bug fixes and short-term feature enhancements.
- Manage a team of 6-8 engineers, supporting the new model benchmarking needs for customers. Identify and resolve technical bottlenecks to drive customer success.
- Understand the state of art models and contribute to the tooling for TPU/GPU Inference/training. Partner with customers to optimize AI/ML model performance on Google Cloud infrastructure.
- Collaborate with internal infrastructure teams to enhance support for demanding AI workloads. Develop and deliver high-quality training materials and demos for customers and internal teams.
- Conduct design and code reviews to ensure adherence to best practices across technologies. Maintain and update documentation and educational content based on product changes and user feedback.
- Triage, debug, and resolve system issues by analyzing root causes and operational impact. Design and implement specialized machine learning (ML) solutions, effectively leveraging advanced ML infrastructure.