our group logo

Data Engineering Intern (ETL & Python)

our group
Internship
On-site

About our group:

We are an IT MES (Manufacturing Execution System) team based in Woodlands, supporting Seagate’s global factory operations in Singapore, Malaysia, US, Thailand, and China. Our core mission is to design and implement scalable data integration solutions that power MES and Factory IT applications.

Our focus includes Database ETL processes, complex SQL development, and Python-based automation to optimize data flows and ensure system reliability. Beyond traditional data engineering, we are also exploring Generative AI and Agentic AI solutions to modernize data platforms and create new value for factory operations.

This internship is ideal for students who are passionate about ETL/Data Engineering with Oracle, eager to sharpen their Python skills, and curious about the application of LLMs and AI frameworks in enterprise IT.

About the role - you will:

As a Data Engineering Intern, you will:

  • Work with senior engineers on ETL processes in Postgres / Oracle, including writing and optimizing stored procedures, functions, and packages.
  • Develop and optimize complex SQL queries to support data extraction, transformation, and reporting needs.
  • Use Python for automation, data processing, and proof-of-concepts.
  • Collaborate with Application Architects and Business SMEs to deliver data integration solutions supporting MES and factory applications.
  • Contribute to projects involving LLMs, LangChain, LangGraph, and Marimo notebooks for GenAI-enabled data pipelines.
  • Support testing, troubleshooting, and documentation to ensure system reliability and performance.

About you:

  • Strong foundation in SQL and relational database concepts.
  • Hands-on skills in Database stored procedures, triggers, and performance tuning.
  • Comfortable coding in Python and eager to apply it for ETL automation and analytics.
  • Interested in emerging technologies like Generative AI, LLM frameworks (LangChain, LangGraph), and Marimo notebooks.
  • Detail-oriented, analytical, and self-motivated with strong problem-solving skills.
  • Good communication and teamwork abilities.

Your experience includes:

  • Pursuing a degree in Computer Science, Software Engineering, Information Systems, or related field.
  • Experience (academic or project-based) with ETL pipelines in Oracle/Postgres
  • Familiarity with Generative AI frameworks (LangChain, LangGraph, Chainlit, or similar).
  • Knowledge of version control (Git) and Agile practices.

Location:

Penang, Malaysia

 

Location: Penang Malaysia Suntech
Travel: None