1648 Group logo

Senior Data Engineer

1648 Group
Part-time
Remote

About us:

At 1648 Factory, we help create the tech solution the world needs. Working at 1648 Factory means taking ownership and making a meaningful impact. We are a remote-first company connecting people with diverse skills and backgrounds, united by a love for building great products in a fun environment. We believe that our products will only be as good as the people who make them, so we aim to find the best talent who are excited about the ideas. 
Following the same philosophy that applies to our ventures: we invest in good people to achieve the best results. 
We develop innovative ideas and build successful digital products with our partners. With start-ups, established companies, and consulting firms. We support all growth phases from day zero with experience, technology, highly qualified software teams, and our investor network. 


Your role: Senior Data Engineer

Your role will be to build the data foundation and pipelines for a smart AI-driven agent that collects, structures, and analyzes operational and business data. 


Requirements: 

- 5+ years of experience in data engineering

Strong hands-on knowledge of:

Azure Cloud, Databricks (Delta Lake, PySpark, performance tuning), Azure security practices (RBAC, Key Vault, encryption) 

- Proven experience with data migrations, especially from on-prem to Azure 

- Familiarity with hybrid data architecture 

- Understanding of cost and performance optimization techniques in big data workflows 

- CI/CD for data pipelines (Azure DevOps, GitHub Actions or similar)

- English Upper-Intermediate+ 


Responsibilities: 

- Design scalable data architectures and ETL/ELT pipelines using Azure and Databricks 

- Build hybrid pipelines integrating on-prem and cloud data sources (SQL Server, MongoDB, CosmosDB) 

- Apply CDC, Delta Lake, and medallion architecture principles 

- Optimize performance and costs: auto-scaling, caching, partitioning, Z-ordering, instance planning 

- Ensure data security and governance, including encryption, access control, and monitoring 

- Collaborate closely with AI/ML engineers for downstream LLM applications 

- Participate in CI/CD and infrastructure automation with DevOps 

- Strong communication and documentation skills 

- Ability to justify architectural decisions

Nice to have: 

- Experience with vector databases or graph databases 

- Knowledge of LLM data preparation: chunking, embedding strategies, RAG architecture 

- Familiarity with Unity Catalog, Azure Purview, and governance tools


We offer: 
- A working environment with plenty of scope for creativity. 
- Independent work with a well-rehearsed team in the background. 
- Remote work. 
- Super-experienced international team. 
- Trust and support from the management team. 
- High degrees of responsibility and autonomy. 
- Agile teams where your ideas and solutions are valued.