
Technical Program Manager III, Generative AI Serving Efficiency, Google Cloud
- Sunnyvale, CA
- Permanent
- Full-time
- Bachelor's degree in a technical field, or equivalent practical experience.
- 5 years of experience in program management.
- Experience in machine learning and infrastructure efficiency.
- Experience with accelerators (e.g., GPUs), serving optimizations (e.g., caching, quantization, batching), or generative AI.
- 5 years of experience managing cross-functional or cross-team projects.
- Experience building products and services that empower AI developers.
- Experience optimizing AI infrastructure for performance, scalability, and efficiency.
- Experience working collaboratively while being customer-focused.
- Passion for AI and its potential to transform the world.
- Develop and manage the overall program plan for Generative AI Serving Efficiencies, including communication and planning for migrations, application of efficiency initiatives, and resource allocation.
- Work with Technical Program Managers, Serving Engineers, and product area PoCs to define and prioritize feature gaps to enable product areas to adopt the latest efficiency recommendations.
- Track and manage the progress of efficiency rollouts.
- Communicate with stakeholders to keep them informed of the program's progress and to obtain their buy-in on key decisions.
- Facilitate collaboration and coordination between the different teams involved in the LLM deployment process.