Senior Data Engineer
Anaconda
- Anaconda, MT
- Permanent
- Full-time
- Create and manage tooling and infrastructure for Anaconda's data platform.
- Identify and implement process improvements: designing infrastructure that scales, automating manual processes, etc.
- Drive database design and the underlying information architecture, transformation logic, and efficient query development to support our growing data needs.
- Implement testing and observability across the data infrastructure to ensure data quality from raw sources to downstream models.
- Write documentation that supports code maintainability.
- Take ownership of the various tasks that will allow us to maintain high-quality data; ingestion, validation, transformation, enrichment, mapping, storage, etc
- Work closely with Product teams to anticipate and support changes to the data.
- Work with Strategic Operations and Platform teams to build reliable, scalable tooling for analysis and experimentation.
- Values collaboration and is very comfortable with pair programming
- 6+ years of relevant experience as a data engineer or significantly related work
- Foundation & proficiency in Python
- Experience in building, optimizing, and maintaining data architectures
- Experience building ELT pipelines
- Experience with Airflow, Prefect, or other orchestration tools
- Cloud experience, i.e. AWS, Azure, GCP
- Experience with Infrastructure as code, Terraform or CloudFormation, Ansible
- Database experience with relational and non-relational data stores
- Experience working with large data sets, and an understanding of how to write code that leverages the parallel capabilities of Python and database platforms
- Strong knowledge of database performance concepts like indices, segmentation, projections, and partitions
- Experience leading projects with Engineering and Product teams from start to finish
- Team attitude: “I am not done, until WE are done”
- Embody our core values:
- Ability & Humility
- Innovation & Action
- Empathy & Connection
- Care deeply about fostering an environment where people of all backgrounds and experiences can flourish
- Experience with Kafka or other event-streaming technologies
- Experience with Snowflake
- Experience working in a fast-paced startup environment
- Experience working in an open-source or data science-oriented company
- Unique opportunity to translate strong open-source adoption and user enthusiasm into commercial product growth
- Dynamic company that rewards high performers
- On the cutting edge of enterprise application of data science, machine learning and AI
- Collaborative team environment that values multiple perspectives and clear thinking
- Employees-first culture
- Flexible working hours
- Medical*, Dental*, Vision*, HSA*, Life* and 401K*
- Paid parental leave - both parents
- Monthly productivity stipend
- Pre-IPO stock options
- Open vacation policy*
- Quarterly Snake days (company-wide bonus day off)
- 100% remote