I

Curated Data Integration Engineer - MPLS, MN - Hybrid, local preferred

Iceberg Technology Group
Contract
On-site
Minneapolis, Minnesota, United States
Curated Data Integration Engineer - MPLS, MN - Hybrid, local preferred

Iceberg's direct seeks a Curated Data Integration Engineer who will focus on turning cleansed data into business-ready, Curated (Gold) layer datasets for analytics and reporting. This role centers on combining and refining data from the Enriched (Silver) zone – which may include applying master data management rules and integrating geospatial context – to produce trusted, analytics-friendly tables and views. The engineer will work closely with our Master Data Management (MDM) system (Profisee) and the Enterprise GIS team to ensure that Curated data aligns with master data definitions and supports situational awareness use cases (such as airport operational dashboards and KPI reports). In this position, you will build Delta Lake tables and SQL views that serve as the single source of truth for various business domains, accessible via Synapse serverless SQL and Power BI. You will also implement data quality checks, complex joins, and data harmonization logic to guarantee that the Curated zone data is consistent, accurate, and ready for consumption.
  • Combine data from multiple Enriched zone tables to create business-ready Curated datasets. Design and build denormalized tables, star-schema models, or consolidated views that meet specific reporting and analytics requirements. Ensure these Curated Delta tables are optimized for query performance via Synapse serverless SQL pools and Power BI.
  • Apply master data management and data quality rules during transformation. Use Profisee MDM information to merge and reconcile records from different sources, achieving a single version of truth. Implement data cleansing, deduplication, and consistency checks so that Curated data is reliable and free of significant errors or mismatches.
  • Work with the Enterprise GIS team to integrate geospatial attributes or ensure that Curated datasets align with spatial data (e.g., location IDs, facility maps) for situational awareness.
  • Engage with business stakeholders to understand the information needs for dashboards and reports. Translate those needs into Curated Client IT-aligned data models
  • Develop and maintain the data pipelines (using Synapse Spark notebooks, SQL scripts, or Synapse data flows)
  • Implement validation checks and document solutions.


Requirements

  • 4+ years of experience in data engineering or BI development, with a focus on building data transformations and integrations for analytics. Solid understanding of data warehouse/lakehouse concepts and hands-on experience crafting fact and dimension tables, aggregate tables, etc.
  • Proficient in using Azure Synapse Analytics for data pipeline development – including Spark notebooks for complex processing and serverless SQL pools for creating views or querying data. Experience with Delta Lake format on Azure Data Lake Storage – able to read/write Delta tables, handle schema evolution, and use Delta time-travel or ACID features.
  • Strong SQL skills for data manipulation, analysis, and performance tuning (e.g., writing efficient joins, window functions). Experience with PySpark or Spark SQL in a data lake environment to perform transformations that go beyond SQL capabilities when needed (using Python)
  • Familiarity with Master Data Management principles and experience integrating master data or reference data into data pipelines. Able to implement data quality checks, identify anomalies, and reconcile data from different sources.
  • Ability to understand business context and requirements and write clear documentation.
  • Lakehouse/Medallion Experience: Prior experience working in a medallion architecture (Raw/Enriched/Curated) or lakehouse environment is highly preferred. Understanding of metadata-driven pipelines and how each zone (Raw, Enriched, Curated) is used in such frameworks.
  • MDM Tools & GIS Exposure: Experience with Profisee MDM or other MDM tools (e.g., Microsoft MDS, Informatica MDM) in a data engineering context. Exposure to GIS data or spatial analytics (ArcGIS, PostGIS, etc.)
  • Azure BI Stack: Knowledge of Power BI report development or optimization, and familiarity with how Power BI interacts with Azure Synapse (via serverless SQL or DirectQuery to Delta). While this is not a Power BI developer role, understanding the consumption layer helps in designing better Curated data models.
  • Certifications & Domain Experience: Azure Data Engineer Associate certification or similar. Experience in government, airport operations, transportation, or other regulated industries