We are seeking a skilled Data Engineer to design, implement, and manage scalable data pipelines, ensuring high data quality, performance, and integration across systems. The ideal candidate will have a strong background in database management, ETL processes, and cloud platforms, with a passion for optimizing data workflows and enabling data-driven decision-making
Responsibilities:
- Data Pipeline Development:
- Design, implement, and manage scalable data pipelines for efficient data collection, processing, and storage.
- Data Integration:
- Collaborate with data scientists, analysts, and stakeholders to integrate and streamline data across various systems.
- Database Management:
- Optimize and maintain databases to ensure efficient storage, querying, and retrieval of data.
- Data Quality Assurance:
- Regularly monitor and improve data quality through validation, cleaning, and consistency checks.
- Performance Optimization:
- Identify and resolve performance bottlenecks in data workflows to enhance system efficiency.
- Documentation:
- Create and maintain clear, detailed documentation for data architectures, pipelines, and processes to support internal use, auditing, and compliance.
Qualifications:
- Education:
- Bachelor’s degree in Computer Science, Information Technology, or a related field.
- Experience:
- 3–5 years of experience in data engineering or a related technical role.
Technical Skills:
- Strong experience with SQL and NoSQL databases (e.g., MySQL, PostgreSQL, MongoDB).
- Expertise in ETL (Extract, Transform, Load) processes and data warehousing solutions.
- Knowledge of big data technologies such as Hadoop and Apache Spark is a plus.
- Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform (GCP) is advantageous.
Additional Skills:
- Strong analytical and problem-solving abilities with high attention to detail.
- Excellent verbal and written communication skills.
- Ability to collaborate effectively within cross-functional teams.
- Strong organizational skills with the ability to manage multiple priorities.