AI/ML Architect
Be The Match
- Minneapolis, MN
- Permanent
- Full-time
This role works across a diverse ecosystem including AWS, Salesforce, Snowflake, and Oracle to support prioritized enterprise initiatives.ACCOUNTABILITIES: (The primary functions, scope and responsibilities of the role)Engineering and Architecture:
- Lead the hands-on architecture, development, and deployment of production-grade AI/ML systems, ensuring scalability, reliability, performance, and cost-efficiency.
- Architect traditional ML solutions (e.g., classification, regression, recommendation systems) and advanced GenAI systems including Retrieval-Augmented Generation (RAG) and Agentic AI.
- Design and implement cloud-native AI/ML pipelines using cloud platforms.
- Evaluate, prototype, and build PoCs regularly to test architectural decisions, validate feasibility, and accelerate solution delivery.
- Integrate and deploy multiple LLMs (e.g., from OpenAI, Claude, Gemini, LLaMA, Hugging Face) and vector databases (e.g., Pinecone, Qdrant, pgvector, Milvus, Weaviate).
- Create reusable frameworks and solution templates that drive consistency, speed, and quality across AI initiatives.
- Ensure all solutions are aligned with responsible AI standards, security best practices, and enterprise governance.
- Explore and implement emerging AI paradigms, including agentic AI, Model Context Protocol (MCP), and Google's Agent-to-Agent (A2A) protocol.
- Drive innovation by recommending and evaluating GenAI tools, third-party orchestration frameworks, and SaaS integrations.
- Develop a forward-looking technical roadmap that balances short-term deliverables with long-term innovation.
- Stay current with cutting-edge trends in LLMOps, AI observability, and intelligent automation platforms.
- Leverage AI coding assistants (e.g., Claude Code, AWS Bedrock, GitHub Copilot) to boost productivity.
- Collaborate with data scientists, ML engineers, software engineers, and enterprise architects to translate business needs into scalable AI solutions.
- Provide technical guidance, mentorship, and architectural direction to teams working across the AI/ML lifecycle.
- Work in agile teams, contributing hands-on while also shaping backlog priorities and solution design.
- Partner cross-functionally to integrate AI into enterprise systems (e.g., Salesforce, Snowflake, Oracle).
- Scalable architecture patterns for traditional ML and GenAI.
- Multi-cloud AI/ML services including AWS (SageMaker, Bedrock etc) and at least one of Azure (ML, OpenAI) or GCP (Vertex AI).
- Strong familiarity with multiple LLMs and embedding models (e.g., OpenAI, Anthropic, Meta, Google, Hugging Face).
- Proficiency in contextual memory and multiple vector databases for semantic search.
- MLOps and LLMOps practices, including CI/CD, model monitoring, versioning, drift detection, and governance.
- Prompt engineering and management practices, including prompt versioning, A/B testing of prompts, and experience with prompt management tools
- AI/ML observability stacks such as Weights & Biases, Langsmith or similar tools.
- Hands-on experience designing and building AI/ML solutions from prototype to production.
- Strong Python development skills, including frameworks and libraries for ML, GenAI, and software engineering best practices.
- Experience with TensorFlow and/or PyTorch for training and deploying models.
- Deep understanding of software engineering, including modular design, testing, version control (Git), and CI/CD pipelines.
- Proven track record of building and running PoCs to validate architecture and feasibility.
- Experience working in agile environments, participating in sprints and cross-functional delivery.
- Ability to communicate technical concepts clearly to a wide range of stakeholders.
- Eagerness and ability to quickly learn and apply new AI/ML and automation technologies.
- Bachelor's degree in computer science, Engineering, or a related field (Master's preferred).
- 8+ years of experience in AI/ML engineering or architecture roles.
- Strong portfolio of real-world deployments in both traditional ML and GenAI.
- Master's or PhD in a technical field.
- Experience architecting agentic AI systems and multi-agent orchestration workflows.
- Experience in regulated industries, especially healthcare or finance.
- AI/ML certifications from AWS, Azure, or GCP.
- Contributions to open-source AI/ML projects or published research.
This role works across a diverse ecosystem including AWS, Salesforce, Snowflake, and Oracle to support prioritized enterprise initiatives.ACCOUNTABILITIES: (The primary functions, scope and responsibilities of the role)Engineering and Architecture:
- Lead the hands-on architecture, development, and deployment of production-grade AI/ML systems, ensuring scalability, reliability, performance, and cost-efficiency.
- Architect traditional ML solutions (e.g., classification, regression, recommendation systems) and advanced GenAI systems including Retrieval-Augmented Generation (RAG) and Agentic AI.
- Design and implement cloud-native AI/ML pipelines using cloud platforms.
- Evaluate, prototype, and build PoCs regularly to test architectural decisions, validate feasibility, and accelerate solution delivery.
- Integrate and deploy multiple LLMs (e.g., from OpenAI, Claude, Gemini, LLaMA, Hugging Face) and vector databases (e.g., Pinecone, Qdrant, pgvector, Milvus, Weaviate).
- Create reusable frameworks and solution templates that drive consistency, speed, and quality across AI initiatives.
- Ensure all solutions are aligned with responsible AI standards, security best practices, and enterprise governance.
- Explore and implement emerging AI paradigms, including agentic AI, Model Context Protocol (MCP), and Google's Agent-to-Agent (A2A) protocol.
- Drive innovation by recommending and evaluating GenAI tools, third-party orchestration frameworks, and SaaS integrations.
- Develop a forward-looking technical roadmap that balances short-term deliverables with long-term innovation.
- Stay current with cutting-edge trends in LLMOps, AI observability, and intelligent automation platforms.
- Leverage AI coding assistants (e.g., Claude Code, AWS Bedrock, GitHub Copilot) to boost productivity.
- Collaborate with data scientists, ML engineers, software engineers, and enterprise architects to translate business needs into scalable AI solutions.
- Provide technical guidance, mentorship, and architectural direction to teams working across the AI/ML lifecycle.
- Work in agile teams, contributing hands-on while also shaping backlog priorities and solution design.
- Partner cross-functionally to integrate AI into enterprise systems (e.g., Salesforce, Snowflake, Oracle).
- Scalable architecture patterns for traditional ML and GenAI.
- Multi-cloud AI/ML services including AWS (SageMaker, Bedrock etc) and at least one of Azure (ML, OpenAI) or GCP (Vertex AI).
- Strong familiarity with multiple LLMs and embedding models (e.g., OpenAI, Anthropic, Meta, Google, Hugging Face).
- Proficiency in contextual memory and multiple vector databases for semantic search.
- MLOps and LLMOps practices, including CI/CD, model monitoring, versioning, drift detection, and governance.
- Prompt engineering and management practices, including prompt versioning, A/B testing of prompts, and experience with prompt management tools
- AI/ML observability stacks such as Weights & Biases, Langsmith or similar tools.
- Hands-on experience designing and building AI/ML solutions from prototype to production.
- Strong Python development skills, including frameworks and libraries for ML, GenAI, and software engineering best practices.
- Experience with TensorFlow and/or PyTorch for training and deploying models.
- Deep understanding of software engineering, including modular design, testing, version control (Git), and CI/CD pipelines.
- Proven track record of building and running PoCs to validate architecture and feasibility.
- Experience working in agile environments, participating in sprints and cross-functional delivery.
- Ability to communicate technical concepts clearly to a wide range of stakeholders.
- Eagerness and ability to quickly learn and apply new AI/ML and automation technologies.
- Bachelor's degree in computer science, Engineering, or a related field (Master's preferred).
- 8+ years of experience in AI/ML engineering or architecture roles.
- Strong portfolio of real-world deployments in both traditional ML and GenAI.
- Master's or PhD in a technical field.
- Experience architecting agentic AI systems and multi-agent orchestration workflows.
- Experience in regulated industries, especially healthcare or finance.
- AI/ML certifications from AWS, Azure, or GCP.
- Contributions to open-source AI/ML projects or published research.
eQuest