Robotics Reliability Engineering Lead
Path Robotics
- Columbus, OH
- Permanent
- Full-time
- Act as the L2 support layer, handling escalated issues from L1 (Robot Fleet Operations, also known as “Mission Control”), directly resolving issues where able, and collaborating with L3 (feature developers) as needed.
- Perform root cause analysis to ensure thorough understanding and resolution of recurring issues.
- Develop proficiency and eventually expertise with all sub-systems that comprise our product.
- Identify opportunities for improvement and influence the technical roadmap.
- Develop tooling, processes, documentation, and SOPs to minimize support escalations.
- Build playbooks and solutions that enable Mission Control to resolve issues independently.
- Create automated solutions, bug fixes, and workarounds that proactively prevent support issues and reduce escalations.
- Create and maintain a comprehensive database of documentation and playbooks for common technical issues.
- Work to make our code more supportable: drive software best practices including instrumentation, traceability, repeatability, testing, and software QA/QC.
- Work closely with Mission Control & Operations to establish and refine SOPs for ticket triage, handling, and communication.
- Track and analyze support metrics and SLOs for response and resolution times.
- Report frequent and resource-intensive support cases to developers, providing actionable insights.
- Collaborate with developers and test engineers to ensure adequate test coverage for recurring issues.
- Communicate constantly with a wide array of teams spanning Engineering, Operations, and Customer Success.
- Help establish and grow the Support Engineering team, mentoring more junior incoming members.
- Define team SOPs and best practices.
- Depending on background and skillset, develop over time into a full-time Manager role as the team grows.
- Bachelor's or Master's degree in Computer Science, Software Engineering, Robotics Engineering, or a related field, or equivalent experience.
- Documented ability to execute immediate fixes in production systems AND to perform in-depth root cause analysis and produce high-quality, long-term system improvements.
- Experience with robotics, path planning, and point cloud processing.
- Experience with computer vision methods, vision sensors, and data analysis (2D and 3D).
- Exposure to machine learning concepts and applications.
- Experience with Docker.
- Strong proficiency in Python and C++.
- Proven ability to develop top-notch documentation, including SOPs and playbooks for operational efficiency.
- Strong cross-functional collaboration skills with technical and non-technical teams.
- Expertise in triaging and prioritizing issues based on impact and urgency.
- Exceptional time management and a self-directed approach to finding high-value work.
- Excellent communication skills: able to communicate equally well with technical and non-technical audiences.
- Strong commitment to maintaining and enhancing documentation.
- Experience leading teams is a big plus.
- Free lunch every day
- Flexible PTO
- Medical, Dental, and Vision insurance
- 6 weeks 100% paid parental leave plus an additional 6-8 weeks maternity leave for the birthing parent (12-14 weeks total)
- 401(k) through Empower
- Paid Referral Bonus