
Senior Software Engineer, Data Acquisition & Curation (Python)
- New York City, NY
- Permanent
- Full-time
- Build, scale, and maintain the core data model and data products that power personalization and analytics across product engineering and data teams
- Design and deploy efficient, reliable pipelines for data ingestion and transformation
- Deploy comprehensive data quality checks to ensure the quality and timeliness of data, and diagnose and fix issues to meet SLAs
- Work with other engineers to improve reliability, documentation and automation to simplify self-service support for data products
- Implement data governance principles, including data classification, data ownership, and data stewardship as part of our core data pipelines
- Drive platform tooling toward self-service, improving usability and accelerating data product development
- Collaborate with product managers, product engineers, data owners, data stewards, and customers to implement and maintain data products that serve analytics, ML, and reporting needs
- 5+ years of experience leading design, implementation, and monitoring of large-scale data warehouses and data marts
- 5+ years of experience building data products that are well-modeled, documented, and easy to understand and maintain
- 5+ years of experience building robust data pipelines to ingest, process, and ensure data quality from disparate sources
- Strong expertise in SQL and performance tuning
- Strong familiarity with dbt and orchestration tools like Airflow
- Programming experience in Python, Scala, or other languages
- Strong understanding of data governance principles and their integration into data pipeline design
- Experience working with modern data storage (BigQuery, Postgres) and services in cloud environments like GCP
- Experience leading small project teams, mentoring more junior engineers, and scoping work
- Experience designing and developing robust APIs for data integration and accessibility
- Experience building real-time streaming data pipelines using frameworks like Apache Spark
- Familiarity with asynchronous eventing systems such as Kafka
- A choice between medical plans with an option for 100% covered premiums
- Fertility and adoption benefits
- Access to supplemental insurance plans for additional coverage
- Headspace mindfulness app subscription
- Global Employee Assistance Program
- Retirement benefits with employer match
- Flexible paid time off
- 20 weeks of parental leave and up to 12 weeks to care for an ill family member
- Pretax commuter benefit
- Education reimbursement
- Employee donation match to community organizations
- 8 Global Employee Resource Groups (ERGs)
- Dog-friendly workplace
- Free lunch and snacks
- Private rooftop
- Hack week twice per year