
Principal AI Knowledge AI Architect
- Bellevue, WA
- Permanent
- Full-time
- Architect and implement multi-layer RAG pipelines leveraging ontologies, semantic graphs, embeddings, and hybrid retrieval strategies.
- Design agentic RAG workflows where autonomous agents reason about query decomposition, multi-hop retrieval, and context stitching for better factual accuracy.
- Build hierarchical and ontology-based knowledge graphs to improve entity resolution, semantic search, and contextual reasoning.
- Optimize retrieval for domain-specific knowledge using structured + unstructured data fusion.
- Lead development of content ingestion pipelines for enterprise sources (Confluence, SharePoint, Google Drive, Salesforce KB, ServiceNow KB, etc.)
- Design real-time data sync connectors and ETL frameworks to keep knowledge sources fresh and in sync with external systems.
- Implement document parsing, enrichment, chunking, metadata tagging, and semantic indexing pipelines at scale.
- Architect agentic knowledge workflows where agents autonomously evaluate, retrieve, and cross-reference multi-source knowledge.
- Enable agents to invoke external APIs/tools dynamically to complement RAG with transactional or dynamic information retrieval.
- Integrate multi-modal RAG (text, images, tables, PDFs) into reasoning loops for richer AI responses.
- Develop knowledge health check pipelines to automatically validate knowledge freshness, detect stale or redundant articles, and recommend updates.
- Implement automated knowledge evaluation using LLMs (hallucination detection, coverage analysis, answer accuracy).
- Define governance policies for knowledge versioning, lifecycle management, and auditing.
- Architect multi-tenant, enterprise-ready knowledge systems with strict access controls, encryption, and compliance (SOC2, HIPPA, GDPR).
- Ensure cost-efficient vector database and embedding management strategies (e.g., partitioning, caching, tiered storage).
- Mentor engineers on best practices for RAG pipelines, knowledge representation, and semantic search.
- Work with product leadership to define long-term knowledge strategy for powering enterprise-grade agentic AI assistants.
- Collaborate closely with LLM engineers on optimizing retrieval-planning-generation loops for factual accuracy and latency.
- 10+ years in software architecture, with at least 3+ years in AI-driven knowledge systems, RAG pipelines, or semantic search
- Deep expertise in retrieval techniques (vector search, hybrid search, ontology-based retrieval) and knowledge graph design
- Experience with ontology design and reasoning (OWL, SPARQL, etc.) for enterprise knowledge modeling
- Proven experience building RAG pipelines with LLMs (OpenAI, Anthropic, LLaMA, etc.) integrated into production systems
- Strong proficiency in Java & Python and AI/ML frameworks (LangChain, LangGraph, etc.)
- Knowledge of vector DBs (Pinecone, ElasticSearch, etc.) and graph DBs (Neo4j, etc.)
- Experience building enterprise knowledge ingestion frameworks from CMS/CRM/ITSM platforms (e.g, Salesforce, ServiceNow)
- Background in document parsing (OCR, PDFs, HTML), metadata enrichment, and semantic embeddings
- Expertise in scalable cloud-native architecture (Kubernetes, event-driven microservices, streaming pipelines)
- Understanding of agentic AI frameworks (LangChain, LangGraph) and their integration with retrieval for reasoning
- Familiarity with self-healing knowledge pipelines (auto-detection and repair of broken links, stale knowledge)
- Strong grounding in AI safety and governance for enterprise knowledge systems
- Contributes to open-source RAG or knowledge graph frameworks are a plus
- Familiarity with multi-modal knowledge retrieval (image/document embeddings and cross-modal search)