I

Architect – Data Engineering Governance & Data Security

INDIA CAI Info India Private
Full-time
Remote
India
Architect – Data Engineering Governance & Data Security

Req number:

R5966

Employment type:

Full time

Worksite flexibility:

Remote

Who we are

CAI is a global technology services firm with over 8,500 associates worldwide and a yearly revenue of $1 billion+. We have over 40 years of excellence in uniting talent and technology to power the possible for our clients, colleagues, and communities. As a privately held company, we have the freedom and focus to do what is right—whatever it takes. Our tailor-made solutions create lasting results across the public and commercial sectors, and we are trailblazers in bringing neurodiversity to the enterprise.

Job Summary

As an Architect – Data Engineering Governance and Security, you will be instrumental in establishing and maintaining robust data engineering governance frameworks, ensuring best practices and standards are followed, and implementing advanced data security across the enterprise and self-service. You will be responsible for safeguarding data across its entire lifecycle, from ingestion and pipeline management to analytics and AI applications, while also ensuring compliance with regulatory standards.

Job Description

We are looking for an Architect – Data Engineering Governance & Data Security to develop and maintain documentation for data engineering processes. This position will be full-time and remote.

The Architect – Data Engineering Governance and Security will be instrumental in establishing and maintaining robust data engineering governance frameworks, ensuring best practices and standards are followed, and implementing advanced data security across the enterprise and self-service. This position will be responsible for safeguarding data across its entire lifecycle, from ingestion and pipeline management to analytics and AI applications, while also ensuring compliance with regulatory standards.

What You’ll Do

  • Establish, enforce, and monitor Data Engineering governance standards, best practices, and guidelines across enterprise and self-service environments

  • Develop and maintain documentation for data engineering processes

  • Define and implement data security policies and role-based access controls (RBAC/ABAC) across all data engineering processes

  • Oversee data classification, lifecycle management, and comprehensive metadata management to ensure transparency, traceability, and compliance

  • Implement and manage robust change control processes for all data engineering activities

  • Monitor, maintain, and optimize data pipelines and workflows, ensuring reliability, scalability, and efficiency

  • Continuously monitor and optimize the performance of data engineering processes and resource utilization

  • Promote a culture of data engineering governance, data security awareness, and operational excellence within the team and across the organization

What You'll Need

Required:

Experience: 3+ years in architecture

  • Data Security: defining and implementing data security policies; security & role-based access control (RBAC/ABAC)

  • Data Engineering Governance: establishing and enforcing governance standards, best practices, and guidelines; data classification and lifecycle management

  • Documentation and Metadata Management: maintaining comprehensive documentation and metadata for data engineering processes to track data lineage, understand data transformations, and ensure transparency

  • Change Control Process: implementing and maintaining effective change control for data engineering activities

  • Performance Monitoring and Optimization: continuously monitoring and optimizing data engineering process performance and resource utilization

  • Data Pipeline Management: ensuring data pipelines are reliable, efficient, and scalable, including monitoring, maintenance, and workflow optimization 

  • Databricks Lakehouse Platform: Medallion Architecture; Unity Catalog (data governance, lineage); Delta Lake & DLT Pipelines; PySpark Workbooks; Spark SQL & SQL Warehouse

  • Programming: Python, SQL, PySpark

  • AWS Cloud Services: IAM, S3, Lambda, EMR, Redshift, Bedrock

  • Familiarity with DevOps and CI/CD processes

  • Experience with any Data Security tool

Preferred:

  • Certifications in Databricks, AWS, or Data Security platforms

  • Experience working in regulated industries (e.g., finance, healthcare, manufacturing)

  • Strong communication and documentation skills

  • Experience with dynamic data masking, data privacy, and compliance frameworks

Physical Demands

  • Ability to safely and successfully perform the essential job functions

  • Sedentary work that involves sitting or remaining stationary most of the time with occasional need to move around the office to attend meetings, etc.

  • Ability to conduct repetitive tasks on a computer, utilizing a mouse, keyboard, and monitor

Reasonable accommodation statement

If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employment selection process, please direct your inquiries to application.accommodations@cai.io or (888) 824 – 8111.