We are seeking a Cloud Data Engineer to implement scalable, modern data engineering solutions on some of the most mission-driven projects in the health industry. As a Cloud Data Engineer, you will develop and deploy the pipelines and platforms that organize disparate data and make it meaningful.
Responsibilities Include:
- Working with a multi-disciplinary team of scientists, data engineers, developers, and data consumers in a fast-paced, Agile environment.
- Monitoring and optimizing data pipelines for performance, scalability, and cost-effectiveness.
- Sharpening skills in analytical exploration and data examination while supporting the assessment, design, development, and maintenance of scalable platforms for clients.
Basic Qualifications:
- 3+ years of experience with extract, transform, load (ETL) operations, focusing on Azure technologies.
- 2+ years of experience with source control and collaboration software, such as Git or Atlassian tools.
- Knowledge of Azure Batch and its application in processing large data sets.
- Experience with SQL and relational databases (e.g., Azure SQL Database, SQL Server).
- Experience with Python or R, including data manipulation libraries (e.g., Pandas, NumPy, Polars, Tidyverse).
- Strong problem-solving skills with the ability to work independently and in a team environment.
- Proficiency in Azure Data Factory and its components.
Nice If You Have:
- Experience developing pipelines using Azure Batch and Azure Data Factory.
- Familiarity with Apache Airflow or similar workflow orchestration tools.
- Experience with Azure Synapse Analytics, Azure Databricks, or Azure Blob Storage.
- Familiarity with cloud security best practices and data governance.
- Ability to quickly learn technical concepts and communicate with multiple functional groups.