
Lead Data Developer
- Atlanta, GA
- Permanent
- Full-time
- Lead the end-to-end development of data pipelines and ETL/ELT workflows using Azure Data Factory, Databricks (PySpark/Scala), and SQL.
- Architect and maintain efficient, reusable, and reliable data systems within the Azure ecosystem (e.g., Azure Synapse, Azure Data Lake, Azure SQL DB).
- Design data models and data warehousing solutions that support analytics and reporting across the business.
- Collaborate with data scientists, analysts, and business stakeholders to understand data needs and deliver high-quality, trusted data products.
- Optimize performance of data processes, including monitoring and troubleshooting jobs, managing resource consumption, and tuning Spark clusters.
- Enforce governance, security, and compliance standards, including data quality, lineage, and cataloging using tools like Azure Purview.
- Provide technical leadership and mentorship to junior developers, including code reviews and guidance on architectural decisions.
- Support CI/CD automation, version control, and DevOps practices in the data engineering workflow.
- Stay current with evolving data technologies and recommend improvements to existing infrastructure and architecture.
- Bachelor's degree in Computer Science, Engineering, Information Systems, or related field preferred.
- 7+ years of experience in data engineering or software development roles required.
- 3+ years of hands-on experience with Azure Data Services (e.g., Azure Data Factory, Azure Data Lake, Synapse, Azure SQL) required.
- Strong proficiency in Databricks, particularly with PySpark and/or Scala required.
- Deep understanding of distributed data processing and optimization in cloud environments required.
- Advanced SQL skills and familiarity with structured/unstructured data formats (Parquet, Avro, JSON) required.
- Experience with CI/CD pipelines, Git, and Infrastructure as Code (e.g., Terraform, ARM templates) preferred.