
Lead Software Engineer - SRE
- Columbus, OH
- Permanent
- Full-time
- Partners and collaborates with product, architecture, security and operations teams prioritize reliability and embed best practices across the software development lifecycle to improve operations across the Loan Origination environment.
- Lead and develop resilient software solutions that improve reliability, scalability and system performance of the Loan Origination System (LOS).
- Leads evaluation sessions with external vendors, startups, and internal teams to outcomes-oriented probing of architectural designs, technical credentials, and applicability for use within existing systems and information architecture.
- Design, Develop and Automate operational workflows to help improve Mean time to resolve (MTTR) and Mean time to detect (MTTD).
- Enhance observability by identifying gaps in the user journey and by building monitoring, logging and alerting to ensure service health are within thresholds and SLAs across internal systems and third party integrations.
- Lead and participate in on-call rotations, incident response and post mortems.
- Proactively identifies hidden problems and patterns in data and uses these insights to drive operational improvements to coding hygiene and system architecture.
- Executes software solutions, design, development, and technical troubleshooting with ability to think beyond routine or conventional approaches to build solutions or break down technical problems.
- Creates secure and high-quality production code and maintains algorithms that run synchronously with appropriate systems.
- Produces architecture and design artifacts for complex applications while being accountable for ensuring design constraints are met by software code development.
- Contributes to software engineering communities of practice and events that explore new and emerging technologies.
- Formal training or certification on software engineering and SRE concepts and 5+ years applied experience.
- 5+ years of experience working as a software developer/engineer, SRE/DevOps, platform engineer, or similar.
- Hands-on practical experience in designing resilient system design, application development, testing, that improve operational stability and reliability.
- Experienced in working and integrating internal and third-party systems developing APIs.
- Advanced in one or more programming language(s) (e.g. Python) and practical cloud native experience.
- Experience in developing, debugging, and maintaining code in a large corporate environment with one or more modern programming languages and database querying languages.
- Proficient in all aspects of the Software Development Life Cycle.
- Advanced understanding of agile methodologies such as CI/CD, Application Resiliency, and Security.
- Demonstrated proficiency in software applications and technical processes within a technical discipline (e.g., cloud, artificial intelligence, machine learning, mobile, etc.).
- In-depth knowledge of the financial services industry and their IT systems.
- Strong communicator and effective collaborator across technical and non-technical audiences.
- Experienced in SRE/DevOps practices (SLA/SLOs, error budgets, MTTR, MTTD).
- Problem solver that uses code to solve problems and enhance the operational experience.
- Exposure to AI/Automation technologies that improve operations.
- Knowledge of ITIL, and Agile methodologies.
- Experience with containers and orchestration (Dockers, Kubernetes).
- Exposure to monitoring tools like Thousand Eyes, Grafana, Dynatrace, Prometheus, Data Dog, Splunk etc.
- AWS Certifications (e.g. Solutions Architect Associate).