- Azure
- ETL
- Spark
- Kafka
- Agile
- Hadoop
- Ci/Cd
- SQL
- Scrum
- Python
- Leadership
- Power Bi
- Data Modelling
- Terraform
- Data Governance
- Architected Azure data platform serving 25+ analytics consumers with daily refresh cycles.
- Implemented CI/CD pipelines reducing deployment lead time from 10 days to 4 hours.
- Optimised Spark jobs, cutting ETL run times by 40% and simplifying downstream consumption.
- Mentored junior engineers on best practices for data governance and quality assurance.
- Designed data models that supported real-time dashboards without compromising SLAs.
- Built and maintained ETL pipelines that processed 10TB+ monthly from varied source systems.
- Introduced automated testing for data quality, reducing production defects by 70%.
- Collaborated with data science teams to productionise models with self-service feature stores.
- Standardised documentation around data lineage and transformation logic for governance.
- Optimised SQL queries for consumption by BI tools, improving dashboard load times by 35%.
- Led migration of hybrid data warehouse to Azure, improving processing capacity by 3x.
- Implemented role-based access controls that strengthened compliance across analytics dashboards.
- Drove adoption of Airflow for orchestration, giving teams clear visibility into pipeline health.
- Partnered with enterprise architects to align data strategy with broader digital transformation goals.
- Presented quarterly data health reviews to executive stakeholders, shaping prioritisation.
- Azure Data Engineer Associate
- Databricks Data Engineer