
Senior Scientist, Generative AI & Agentic Modeling
- Boston, MA
- Permanent
- Full-time
- Design and implement next-generation generative AI models for biologics discovery, including LLMs, protein language models, diffusion models, and multi-modal architectures.
- Build and refine agentic AI frameworks (e.g., tool-using agents, retrieval-augmented generation (RAG), memory-augmented LLMs) to autonomously plan and optimize design–make–test–analyze (DMTA) cycles.
- Collaborate with bioinformatics, automation, and wet-lab teams to build lab-in-the-loop systems, enabling AI agents to initiate hypotheses, request experiments, and iterate based on outcomes.
- Develop reusable AI pipelines that integrate multi-objective optimization (e.g., binding affinity, immunogenicity, developability, manufacturability) from in vitro and in silico data sources.
- Lead or contribute to the development of internal foundation models for biological design, including fine-tuning of LLMs on proprietary sequence, omics, and experimental datasets.
- Prototype and deploy agentic AI solutions using industry-standard frameworks such as LangChain, OpenAI API, Hugging Face Transformers, or similar.
- Stay current with advancements in transformer architectures, RLHF, memory systems, cognitive planning agents, and apply these innovations to real-world therapeutic discovery challenges.
- Clearly communicate ideas to cross-functional stakeholders and contribute to internal and external scientific knowledge-sharing.
- PhD degree in Computer Science, Computational Biology, Machine Learning, Bioinformatics, or related field (or equivalent) with 2+ years relevant experience, or MS with 8+ years relevant experience, or BS with 10+ years relevant experience Demonstrated expertise in developing large-scale deep learning models, including LLMs, generative models (e.g., diffusion, VAEs, autoregressive transformers), or multi-modal architectures.
- Proficiency in Python and modern ML frameworks (e.g., PyTorch, Hugging Face, JAX, LangChain); strong software engineering and reproducibility skills.
- Experience applying machine learning to biological sequences, experimental data, or computational biology pipelines (e.g., protein or antibody engineering, `omics analysis, etc.).
- Experience working with large datasets (NGS, high-throughput screens, bioassays) for supervised or unsupervised model training and evaluation.
- Deep familiarity with agentic AI design: planning agents, tool-using agents, memory-augmented LLMs, or lab automation interfaces (e.g., lab robots, simulation frameworks, or autonomous workflows).
- Proven ability to integrate AI models into production environments and iterative workflows, preferably within a pharmaceutical or biotech context.
- Excellent interpersonal and written communication skills; thrives in collaborative, interdisciplinary environments.
- Prior experience fine-tuning or training foundation models (e.g., ESM, ProGen, ProtGPT2, OpenFold, GPT-style LLMs) on biological or scientific data.
- Familiarity with retrieval-augmented generation (RAG), tool-use planning, graph-enhanced reasoning, or long-context transformers for scientific applications.
- Knowledge of wet-lab feedback integration for DMTA cycles or closed-loop experimental platforms.
- Experience with cloud-based model deployment, distributed training, and model evaluation pipelines.
- Understanding of key drug-like properties (e.g., immunogenicity, aggregation risk, developability) and how to model them computationally.
- Demonstrated thought leadership (e.g., peer-reviewed publications, open-source contributions, patents) in GenAI or AI for biology.