Position Title: Senior Data Engineer
Location: Remote (US Only) Experience Level: 5+ years
Type: Full-time
Salary: $160,000-$190,000 Equity: Competitive Token Grants
Our client is seeking a highly skilled Data Engineer with a strong background in developing algorithms for ETL (Extract, Transfer, Load), data parsing, categorization, and labeling/tagging across multiple data types—including structured and unstructured sources like images, audio, video, and natural language. This role is ideal for someone with a deep interest in data infrastructure, federated and edge computing, and ontology architecture, and who thrives in a fast-paced, collaborative environment.
Design, build, and maintain robust ETL pipelines that support high-volume, multi-type data ingestion and transformation.
Develop and optimize parsing and tagging algorithms—both rule-based and AI-driven—for numeric, categorical, tabular, visual, and natural language data.
Collaborate on the design and implementation of data ontologies and metadata structures to enable scalable and intelligent data systems.
Work closely with cross-functional teams to support federated and edge computing data strategies.
Maintain clean, well-documented, and efficient code and contribute to internal repositories and documentation.
Communicate technical decisions and trade-offs effectively with both technical and non-technical stakeholders.
5+ years of experience as a Data Engineer with a focus on ETL and parsing systems.
Bachelor’s degree in Data Engineering, Data Science, Computer Science, or equivalent hands-on experience (certifications, open-source contributions, GitHub projects).
Experience working with diverse data types (e.g., structured tables, time series, images, audio, video, and text).
Excellent technical communication skills, both written and verbal.
Strong problem-solving skills and the ability to contribute to a high-velocity team environment.
Prior experience with Databricks or a similar cloud-based data platform.
Master’s degree in Data Science, Computer Science, or a related field.
Proven innovation in ETL workflows, parsing methods, or data ontology frameworks.
Demonstrated work in federated/edge computing scenarios or semantic web technologies.