ETL / Data Engineer(ND)

Seven Arc Info Systems

Contract

SingaporeSGD 7,500 - 9,500/month

POSITION OVERVIEW : Software Development Analyst

POSITION GENERAL DUTIES AND TASKS :

Job Description

We are seeking a skilled Data Engineer to do the migration of our existing data warehouse and data model from MariaDB to the Hadoop Big Data Cloudera on-premise platform. The ideal candidate must be proficient in SQL, Hive SQL, Spark, and data modeling. Additionally, they should possess a strong understanding of the production deployment process, including design, development, testing, UAT, and production deployment. Experience of scheduling jobs using Autosys is essential for this role.

Key Responsibilities

• Migrate existing data warehouse and data model from MariaDB to Hadoop Big Data Cloudera on-premises platform.

• Develop and optimize SQL, Hive SQL, and Spark scripts to ensure efficient data processing.

• Design and implement data models to support business requirements and optimize performance.

• Collaborate with cross-functional teams to understand data requirements and ensure data integrity throughout the migration process.

• Develop and execute test plans to validate data accuracy and system performance.

• Coordinate with stakeholders (internal) to plan and execute production deployments.

• Schedule and monitor jobs using Autosys to ensure timely execution and minimize downtime.

• Provide technical expertise and guidance to team members throughout the migration project.

• Document processes, procedures, technical specifications, and best practices to facilitate knowledge sharing and ensure scalability.

• Create and maintain unit test case documents to ensure code quality and reliability.

Qualifications

• Bachelor’s (or Higher) degree in computer science, Engineering, or a related field.

• Proven experience of 5+ years in data engineering and migration projects for big data (Hortonworks / Cloudera)

• Strong hands-on experience on Cloudera and related ecosystem components.

• Strong experience in implementing ETL (Extract, Transform, Load) processes.

• Highly proficient in SQL, Hive SQL, Spark, and data modeling.

• Strong understanding of the production deployment process.

• Experience with scheduling jobs using Autosys or similar.

• Experience with version control systems such as Bitbucket, GIT etc.

• Ability to troubleshoot and resolve data related issues efficiently.

• Good communication and interpersonal skills

 Contract : One year renewable

Apply for this job

Resume/CV*

Click or drag file to this area to upload your Resume

Please make sure to upload a PDF

First Name*

Last Name*

Email*

Phone Number*

The hiring team may use this number to contact you about this job.

By clicking 'Submit Application', you agree to receive job application updates from Seven Arc Info Systems via text and/or WhatsApp. Message frequency may vary. Reply STOP to unsubscribe at any time. Message & data rates may apply.