Senior Data Engineer - Cape Town

Hrvisory Pty
Full-time
On-site
Pretoria, South Africa

Introduction

We are seeking a highly skilled Senior Data Engineer to join our client and play a critical role in designing, building, and maintaining modern, scalable, and secure data solutions in the cloud. The ideal candidate will have extensive hands-on experience with AWS, Databricks, and DBT Core, with a proven ability to deliver high-quality data pipelines, models, and workflows that enable advanced analytics and reporting.

This role requires strong technical expertise, problem-solving abilities, and the ability to collaborate with cross-functional teams to deliver reliable, optimized, and secure data solutions.

Duties & Responsibilities

  • Design & Develop Data Pipelines: Build and optimize scalable, reliable, and automated ETL/ELT pipelines using AWS services (e.g., AWS Glue, AWS Lambda, Redshift, S3) and Databricks.
  • Cloud Data Architecture: Design, implement, and help maintain data infrastructure in AWS, ensuring high availability, security, and scalability. Work with lakehouses, data lakes, data warehouses, and distributed computing.
  • DBT Core Implementation: Lead the implementation of DBT Core to automate data transformations, develop reusable models, and maintain efficient ELT processes.
  • Data Modelling: Build efficient data models to support required analytics/reporting.
  • Optimize Data Workflows: Monitor, troubleshoot, and optimize data pipelines for performance and cost-efficiency in cloud environments. Utilize Databricks for processing large-scale data sets and streamlining data workflows.
  • Data Quality & Monitoring: Ensure high-quality data by implementing data validation and monitoring systems. Troubleshoot data issues and create solutions to ensure data reliability.
  • Automation & CI/CD: Implement CI/CD practices for data pipeline deployment and maintain automation for monitoring and scaling data infrastructure in AWS and Databricks.
  • Documentation & Best Practices: Maintain comprehensive documentation for data pipelines, architectures, and best practices in AWS, Databricks, and DBT Core. Ensure knowledge sharing across teams.

Desired Experience & Qualification

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
  • 8+ years of experience as a Data Engineer or in a similar role.
  • Extensive hands-on experience with AWS services (S3, Redshift, Glue, Lambda, Kinesis, etc.) for building scalable and reliable data solutions.
  • Advanced expertise in Databricks, including the creation and optimization of data pipelines and notebooks, and integration with AWS services.
  • Strong experience with DBT Core for data transformation and modelling, including writing, testing, and maintaining DBT models.
  • Proficiency in SQL and experience designing and optimizing complex queries for large datasets.
  • Strong programming skills in Python/PySpark, with the ability to develop custom data processing logic and automate tasks.
  • Experience with Data Warehousing and knowledge of concepts related to OLAP and OLTP systems.
  • Expertise in building and managing ETL/ELT pipelines, automating data workflows, and performing data validation.
  • Familiarity with CI/CD concepts, version control (e.g., Git), and deployment automation.
  • Experience working in an Agile project environment.

Preferred:

  • Experience with Apache Spark and distributed data processing in Databricks.
  • Familiarity with streaming data solutions (e.g., AWS Kinesis, Apache Kafka).
  • Knowledge of Data Governance, data security, and privacy best practices.