
Senior Software Developer in Test, Foundation - Seattle
- Seattle, WA
- Permanent
- Full-time
- Drive the quality assurance strategy for agentic features, ensuring standardization of testing approaches from a customer-centric perspective
- Develop comprehensive evaluation tools and frameworks that enable automated testing of AI agent workflows
- Define model launch readiness criteria for agentic features that align with both customer needs and regulatory requirements
- Establish model risk management protocols that document AI system capabilities and limitations
- Create knowledge transfer processes to disseminate best practices in AI testing across Qualtrics teams
- Work in a collaborative environment with ML Engineers, Research Scientists, Software Engineers, and Product Managers
- Become an expert in the emerging field of AI agent testing and evaluation
- Lead the development of innovative testing methodologies for complex AI systems
- Mentor other quality engineers on AI-specific testing approaches and tools
- Build upon our existing LLM evaluation infrastructure to create next-generation testing capabilities
- Expand your technical leadership through cross-functional collaboration and knowledge sharing
- Stay at the forefront of AI testing practices by researching and implementing new techniques
- Design and implement automated evaluation frameworks for testing AI agent behaviors and outputs
- Build upon our existing AI test strategy and LLM-as-a-Judge infrastructure to create comprehensive testing solutions
- Develop metrics and benchmarks to quantify AI system quality, reliability, and safety
- Create reproducible test scenarios that simulate real-world user interactions with AI agents
- Collaborate with product teams to integrate quality processes into their AI development workflows
- Identify and mitigate potential risks in AI agent behaviors, including hallucination detection and security guardrails
- Present testing results and quality metrics to stakeholders across engineering and product teams
- Bachelor's degree in Computer Science or related field
- 5+ years of professional coding experience with strong Python skills
- 2+ years experience building test automation frameworks and tools from scratch
- 5+ years of relevant experience as part of an automated test team, with 1+ years as a senior contributor
- Understanding of Machine Learning concepts (deep learning, categorization, classification, data mining)
- Experience with Machine Learning evaluation frameworks
- Experience with CI/CD deployment automation (Jenkins, etc.)
- Proficiency with testing distributed systems and SOA (Service Oriented Architecture)
- Experience applying various software testing techniques (equivalence class partitioning, boundary value testing, pairwise testing)
- Strong critical thinking and root cause analysis skills
- The team works on cutting-edge AI technologies that power many of Qualtrics' product experiences.
- Our team is pioneering quality engineering practices for AI systems and building the infrastructure that enables reliable, safe, and effective AI features.
- We're actively working on several innovative AI projects, including security guardrails for AI models, LLM-based evaluation systems, and agentic workflows across various Qualtrics products.
- The team has a strong culture of collaboration, innovation, and quality, and we're looking for someone who can help us elevate our testing practices for these complex systems.
- Wellness Reimbursement: $300 per quarter for wellness activities including gym memberships, spa massages, workout equipment, meditation apps, and much more.
- Experience Bonus: $1800 to be used for an “Experience” of your choosing.
- Amazing QGroup Communities: MOSAIQ, Green Team, Qualtrics Pride, Q&Able, Qualtrics Salute, and Women’s Leadership Development, which exist as places for support, allyship, and advocacy.