
Sr. Manager, Software Engineering, ML Platform - Game Tech Group
- Los Angeles, CA
- Permanent
- Full-time
- Build and lead a new ML Platform team from the ground up-recruiting, mentoring, and growing both ICs and future leads.
- Set the cultural and operational foundations for a sustainable, inclusive, and high-performing engineering team.
- Collaborate technical leadership (including Riot, AI Foundations, and the ML Platform P5 Principal Engineer) to ensure architectural decisions are grounded in Riot's long-term needs.
- Drive the team's roadmap and execution-balancing foundational investments with fast, iterative delivery of user-visible platform capabilities.
- Align closely with data scientists, ML engineers, and game product teams to deeply understand workflows, pain points, and infrastructure needs.
- Champion developer experience, designing for usability and self-service adoption from day one.
- Own the delivery of the ML Platform's early feature set: scalable inference serving, model artifact CI/CD, versioning, testing environments (A/B, shadow), and observability.
- Coordinate cross-team dependencies and build strong partnerships with platform engineering, SRE, security, and product stakeholders.
- Ensure the platform meets critical non-functional goals such as cost-efficiency, operational reliability, and regional availability.
- Represent the ML Platform team in broader AI Foundations and Riot-wide planning forums-connecting strategy to execution and ensuring alignment with Riot's broader technical ecosystem.
- 10+ years of experience in software engineering
- 8+ years of management experience, including managing engineers and team leads
- Experience founding or scaling global infrastructure/platform teams from the ground up
- Demonstrated ability to lead delivery of technical products used by other developers, engineers, or researchers
- Working knowledge of production ML systems, including model inference, CI/CD for ML artifacts, observability, and cost optimization
- Experience with cloud-native orchestration architectures (e.g., Kubernetes, GPU scheduling, container orchestration)
- Strong execution skills-ability to translate long-term vision into well-scoped milestones, track progress, and unblock teams
- Comfortable making trade-offs between velocity, cost, and maintainability while managing stakeholder expectations
- Experience launching internal platforms or ML infrastructure products at scale
- Familiarity with MLOps and model serving tools (e.g., MLflow, KServe, BentoML, TorchServe, Seldon Core, DVC, LakeFS, etc)
- Exposure to A/B testing infrastructure, especially in online or latency-sensitive environments
- Prior experience with budgeting, CPU & GPU utilization tracking, and platform efficiency work
- Experience working in cross-functional settings with research, infra, and game teams
- Familiarity with global infrastructure requirements, including delivery to China or other regional deployments
- Safeguarding confidential and sensitive Company data
- Communication with others, including Rioters and third parties such as vendors, and/or players, including minors
- Accessing Company assets, secure digital systems, and networks
- Ensuring a safe interactive environment for players and other Rioters