
Principal Capacity Engineer, Compute
- Seattle, WA; San Francisco, CA
- Permanent
- Full-time
- Design, develop, and deliver capacity management systems for AI workloads on heterogeneous infrastructure
- Build and maintain robust attribution of usage and enable in-depth data-driven insights
- Oversee design and implementation of planning tools and systems-level guardrails for capacity planning and quota management
- Build a deep understanding of research and training workloads to accurately model cost-to-serve and cost-to-train
- Proactively identify efficiency opportunities and collaborate with teams across the org to increase total effective compute for Anthropic
- Partner closely with Finance and leadership, providing detailed and clear capacity inputs for financial planning and strategic decision making
- Have experience working on capacity at a major cloud provider or hyperscaler company
- Have experience driving cross-functional projects and interfacing with technical and non-technical stakeholders
- Have experience working with LLMs and/or a deep interest in learning about model training and serving efficiency
- Are comfortable leveraging data and have experience building observability for complex systems
- Have strong interpersonal skills that enable you to influence without authority and build cross-organizational support for capacity initiatives
- Past experience as a lead capacity engineer
- Past experience partnering with senior leadership
- Past experience working on model training or model inference
- Building a system for capacity planning and optimizing resource allocation for model training, inference, and research