ETL / Data Engineer(ND)

Seven Arc Info Systems
ContractSingaporeSGD 7,500 - 9,500/month

POSITION OVERVIEW : Software Development Analyst

POSITION GENERAL DUTIES AND TASKS :

Job Description

We are seeking a skilled Data Engineer to do the migration of our existing data warehouse and data model from MariaDB to the Hadoop Big Data Cloudera on-premise platform. The ideal candidate must be proficient in SQL, Hive SQL, Spark, and data modeling. Additionally, they should possess a strong understanding of the production deployment process, including design, development, testing, UAT, and production deployment. Experience of scheduling jobs using Autosys is essential for this role.

Key Responsibilities

• Migrate existing data warehouse and data model from MariaDB to Hadoop Big Data Cloudera on-premises platform.

• Develop and optimize SQL, Hive SQL, and Spark scripts to ensure efficient data processing.

• Design and implement data models to support business requirements and optimize performance.

• Collaborate with cross-functional teams to understand data requirements and ensure data integrity throughout the migration process.

• Develop and execute test plans to validate data accuracy and system performance.

• Coordinate with stakeholders (internal) to plan and execute production deployments.

• Schedule and monitor jobs using Autosys to ensure timely execution and minimize downtime.

• Provide technical expertise and guidance to team members throughout the migration project.

• Document processes, procedures, technical specifications, and best practices to facilitate knowledge sharing and ensure scalability.

• Create and maintain unit test case documents to ensure code quality and reliability.

Qualifications

• Bachelor’s (or Higher) degree in computer science, Engineering, or a related field.

• Proven experience of 5+ years in data engineering and migration projects for big data (Hortonworks / Cloudera)

• Strong hands-on experience on Cloudera and related ecosystem components.

• Strong experience in implementing ETL (Extract, Transform, Load) processes.

• Highly proficient in SQL, Hive SQL, Spark, and data modeling.

• Strong understanding of the production deployment process.

• Experience with scheduling jobs using Autosys or similar.

• Experience with version control systems such as Bitbucket, GIT etc.

• Ability to troubleshoot and resolve data related issues efficiently.

• Good communication and interpersonal skills

 Contract : One year renewable

Apply for this job

Resume/CV*

Click or drag file to this area to upload your Resume

Please make sure to upload a PDF

First Name*
Last Name*
Email*
Phone Number*