V

Data Engineer

Vayu Health
Full-time
Remote
United States
$100,000 - $135,000 USD yearly

Who we are

Vayu Health is an equity-focused non-profit healthcare organization based in California, pioneering a novel care model tailored for low-income individuals managing uncontrolled chronic conditions, initially focusing on diabetes, alongside significant social and behavioral needs. We seek individuals who share our passion for equity, person-centered care, and eliminating barriers to accessing exceptional healthcare. We believe a transformative approach to care delivery is essential.

 

We are and believe in:

Acting with passion and creativity

Leading with integrity

Committing to being better

Achieving strength through teams

Inclusivity, where all individuals should be treated with grace and dignity

 

The Opportunity:

We are seeking an experienced data engineer to join our dynamic, multidisciplinary team focused on innovating and enhancing programs in primary care. This role requires the ability to drive the programmatic curation, cleaning, and generation of healthcare data. As a Data Engineer, you will design, build, and maintain the data infrastructure that empowers clinical teams and business leaders to deliver better outcomes through data-driven insights. Additionally, you will play a critical role in managing the pipelines and systems that enable secure, reliable access to healthcare and operational data across the organization.  In addition, the Vayu team is looking for data engineers who have experience with data visualization platforms(such as PowerBi, Tableau etc.) to bring together many data sets in support of determining Vayu’s direct impact on healthcare outcomes.  The connection of these data sets in collaboration with multidisciplinary teams will support complex, high-risk patients and enable data driven decision making with leadership, customers, and stakeholders. 

 

Key Responsibilities:

  • Architect and maintain robust, scalable data pipelines and storage systems using modern cloud platforms and ETL frameworks.

  • Ensure efficient ingestion, transformation, and delivery of healthcare and operational data, supporting both batch and real-time processing needs.

  • Support Vayu’s data infrastructure for analytics, reporting, and clinical applications.

  • Ensure the quality and integrity of health information databases.

  • Design and implement data collection systems and other strategies to optimize statistical efficiency and data quality.

  • Identify, analyze, and interpret trends or patterns in complex data sets.

  • Work with management to prioritize business and information needs.

  • Prepare and present detailed reports based on data analysis.

  • Collaborate with IT and operational staff to support data management needs.

  • Ensure compliance with healthcare data privacy laws and policies.

  • Maintain and update health information records accurately and securely.

  • Identify and resolve discrepancies in the data.

  • Identify incomplete or missing data sets to the operations team.  

  • Design and maintain scalable and secure data pipelines using modern ETL frameworks (e.g., dbt, Fivetran) on cloud platforms, including AWS and GCP.

  • Build and manage cloud-based data warehouses (e.g., BigQuery, Snowflake) and data marts.

  • Perform data quality checks, validations, and mapping for health plan data feeds (standard and custom formats).

  • Translate health plan specifications and map differences across varying data structures.

  • Collaborate with analytics, reporting, and clinical teams to ensure data is accessible and reliable.

  • Optimize data systems and troubleshoot data issues across the pipeline and storage layers.

  • Implement and enforce data governance, security, and privacy standards aligned with HIPAA and other regulatory requirements.

  • Manage data access controls within and outside the organization.  

Education and Licenses Required: 

 

  • Bachelor's degree in Computer Science, Information Systems, Engineering, or related field.

  • 6+ years of professional experience in data engineering, with 3+ years in the healthcare industry

  • Strong experience with SQL and relational databases (PostgreSQL, SQL Server, etc.)

  • Professional working experience with dbt

  • Proficiency in Python, Spark, or other data processing frameworks

  • Experience with data modeling and ETL frameworks like Fivetran

  • Hands-on experience with cloud data platforms (e.g., AWS, GCP, Azure)

  • Experience with data warehousing (e.g., BigQuery, Snowflake, Redshift)

  • Experience working with healthcare claims or clinical data

  • Familiar with HIPAA compliance, security, and privacy best practices

  • Experience working with TUVA strongly preferred

Qualifications: 

Required: 

 

  • Expertise in building distributed data systems and applying data engineering best practices to healthcare data integrations is essential

  • Strong understanding of healthcare data privacy regulations (e.g., HIPAA).

  • Ability to work collaboratively with a multidisciplinary team.

  • Proficiency in data analysis tools and software (e.g., SQL, SAS, R). 

  • Proficiency in data visualization tools and software (e.g., PowerBI, Tableau, Looker)

  • Advanced analytical and problem-solving skills.

  • Excellent communication and presentation skills.

  • Experience in a clinical or hospital setting.

  • Willingness to travel for in-person meetings approximately every three months.

Preferred: 

  • Knowledge of pros and cons of architectural technology stack decisions 

  • Knowledge of pros and cons of multiple data programming languages 

  • Knowledge of pros and cons of data visualization tools 

This is a remote position with PST hours/location desired.