<\/span><\/span>Mid/Senior
Data Engineer<\/span><\/b>
<\/p>Location: Lisbon, Portugal<\/span>
<\/p> <\/span><\/b>
<\/span><\/p>About Our Client<\/span><\/b>
Our Client offers a platform with multiple data delivery options that leverages
machine learning technology and human intelligence to deliver
quality -guaranteed training data for AI systems. The platform offers
self -service and fully customizable solutions that deliver high -quality
project -specific training data, enabling AI products reach market quicker. It
is this business model that has allowed Our Client to raise a total of $63.6M
in funding over 4 rounds. Their value proposition is quality, privacy, speed
and scale, covering more than 50 different languages. With strong expertise in
speech and natural language processing technologies, they have been serving AI
companies and Fortune 500 companies since day one. Our Client was founded in
Seattle and has an office in Lisbon.<\/span><\/p>
What will you do?<\/b><\/span><\/div>- Design and implement scalable PySpark -based data pipelines to
process (clean, validate, package and deliver) multimodal AI
training datasets (e.g., text, audio, video, images, etc.);<\/span>
<\/li>- Develop ETL pipelines to fuel the Operations areas with data
for their analytical dashboards;<\/span>
<\/li>- Set software engineering tools, platforms, and best practices
while performing trade -off analysis to best match engineering, product,
and project constraints and expectations;<\/span>
<\/li>- Operate data pipelines to ingest data from multiple
sources, and deliver it to different destinations.<\/span>
<\/li>- Help the Product Manager and stakeholders in structuring,
breaking down, and prioritizing the product roadmap into backlog work
items;<\/span>
<\/li>- Collaborate with other software engineering teams such as SREs
and DevOps to achieve your team’s goals;<\/span>
<\/li>- Work together with Software Engineering teams so as to
integrate the Data Platform with other tools and platforms.<\/span>
<\/li><\/ul><\/div><\/span>
Requirements<\/h3>
Who are we looking for?<\/span><\/b>
Do you have the drive to work in an innovative and ambitious environment?
We’re looking for someone with a determined and proactive mindset, someone
inspired and passionate to help us achieve our goals. Our successful candidate
is a strong critical thinker, reliable and transparent, with an ability to
learn and communicate. We are looking for someone special to contribute to our
unique culture.
<\/span><\/p>- BSc or MSc in Computer Science or similar background;<\/span>
<\/li>- Minimum of 3 years of experience;<\/span>
<\/li>- Experience in PySpark -based data pipelines and software quality
best practices;<\/span>
<\/li>- Worked with Azure services such as Synapse Analytics (mainly
PySpark Jobs, Pipelines, and Notebooks), ADLS, Power BI, DevOps, and SQL
and NoSQL databases;<\/span>
<\/li>- Solid understanding of data -related architectures, concepts,
technologies, and processes (e.g., Medallion, Data Lake, Data
Lakehouse, Data Warehouse, ETL, etc.);<\/span>
<\/li>- Comfortable with evaluating and applying software design
and architectural patterns/principles;<\/span>
<\/li>- Knowledge of RESTful APIs based on FastAPI, from the provider
as well as the consumer point of view;<\/span>
<\/li>- Problem Solving skills;<\/span>
<\/li>- Proficient in both written and spoken English.<\/span>
<\/li><\/ul><\/div><\/span>
Benefits<\/h3>
Benefits<\/span><\/b>
You spend a lot of your time at work, so it should be challenging, fun and
interesting. At Our Client it will be all of those things and more. Here’s what
we offer:<\/span><\/p>- Flexible working schedule and hybrid model.<\/span><\/b> We know comfort can boost creativity and performance, so
you can manage your schedule and work both from one of our modern office
spaces or home.<\/span>
<\/li>- Excellent career development opportunities in a high growth
company.<\/span><\/b> With us, you can accomplish your
career goals and follow a well -described career path with the support of
your supervisor.<\/span>
<\/li>- Culture of feedback and continuous improvement.<\/span><\/b> AI is a fast -paced area, so we keep track of tech trends,
and we always ask for feedback.<\/span>
<\/li>- An international and diverse team. <\/span><\/b>We have more than 30 nationalities at our 3 locations, and we
provide language classes.<\/span>
<\/li>- Continuous training opportunities.<\/span><\/b> You
can choose from many options: leveraging hand -on workshops, unlimited
access to Udemy and formal development opportunities.<\/span>
<\/li>- We love to have fun together.<\/span><\/b> We
joke a lot, and we can't imagine work without fun activities – we already
surfed, raced carts and played soccer together.<\/span>
<\/li><\/ul>
<\/div><\/span>