{bc}

Senior Data Engineer (PySpark + ML Pipelines)

ValueLabsDubai, UAE1 weeks agoMid-Senior
Mid-Seniorfulltime

Skills

PythonSQLGitScalaMachine LearningAgile
Get My Free Tailored Resume
Via LinkedIn·

About This Role

Location: Dubai WFO

Notice: Immediate/Serving notice - 30 days only.

Skill: PySpark + ML Pipeline + Banking Domain

Experience: 7+ years

Job Description

  • 1.
  • The Data Engineer role involves acquiring requirements, conducting EDA, ingesting required datasets, and transforming data using Big-data technologies & Feature Engineering techniques for machine learning models.
  • 2.
  • Key tasks include building high-performance, secure, and scalable data pipelines, collaborating with Analytics Delivery Leads, Data scientists and other teams in an Agile environment, and communicating stakeholders effectively.
  • 3.
  • The position requires a degree in a relevant field, at least 8-10 years of experience in data engineering and ML pipelines, and proficiency in technologies like Python, Spark (including expertise in optimization techniques), Hadoop, SQL, and Git.
  • 4.
  • Should be good with Python as well.
  • Roles and Responsibilities:
  • Data Pipeline Development: Design, develop, and maintain highly scalable and optimized ETL pipelines using PySpark on the Cloudera Data Platform, ensuring data integrity and accuracy.
  • Data Ingestion: Implement and manage data ingestion processes from a variety of sources (e.g., relational databases, APIs, file systems) to the data lake or data warehouse on CDP.
  • Data Transformation and Processing: Use PySpark to process, cleanse, and transform large datasets into meaningful formats that support analytical needs and business requirements.
  • Performance Optimization: Conduct performance tuning of PySpark code and Cloudera components, optimizing resource utilization and reducing runtime of ETL processes.
  • Data Quality and Validation: Implement data quality checks, monitoring, and validation routines to ensure data accuracy and reliability throughout the pipeline.
  • Automation and Orchestration: Automate data workflows using tools like Apache Oozie, Airflow, or similar orchestration tools within the Cloudera ecosystem.
  • Monitoring and Maintenance: Monitor pipeline performance, troubleshoot issues, and perform routine maintenance on the Cloudera Data Platform and associated data processes.
  • Collaboration: Work closely with other data engineers, analysts, product managers, and other stakeholders to understand data requirements and support various data-driven initiatives.
  • Documentation: Maintain thorough documentation of data engineering processes, code, and pipeline configurations.

Education

  • and Experience
  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, Information Systems, or a related field.
  • 7+ years of experience as a Data Engineer, with a strong focus on PySpark and the Cloudera Data Platform. Technical Skills
  • PySpark: Advanced proficiency in PySpark, including working with RDDs, DataFrames, and optimization techniques.
  • Cloudera Data Platform: Strong experience with Cloudera Data Platform (CDP) components, including Cloudera Manager, Hive, Impala, HDFS, and HBase.
  • Data Warehousing: Knowledge of data warehousing concepts, ETL best practices, and experience with SQL-based tools (e.g., Hive, Impala).
  • Big Data Technologies: Familiarity with Hadoop, Kafka, and other distributed computing tools.
  • Orchestration and Scheduling: Experience with Apache Oozie, Airflow, or similar orchestration frameworks.
  • Scripting and Automation: Strong scripting skills in Linux.
  • Good at Data Modelling as well. Soft Skills
  • Strong analytical and problem-solving skills.
  • Excellent verbal and written communication abilities.
  • Ability to work independently and collaboratively in a team environment. Attention to detail and commitment to data quality.
  • Note: Looking for immediate to 30 days’ official Notice period candidates only.
  • Interested candidates please share your CV to
  • sonam.singh@valuelabs.com
  • with below details:

Expected CTC

Current location:

Preferred location:

Notice period:

Your resume, rewritten for this exact role.

Sign up free — Base Career tailors your CV to this job description in 60 seconds.

01 / 05

Resume Tailored to This Job

Resume Tailored to This Job

Your keywords, structure, and story — rewritten to match this exact role and pass ATS filters.

Get My Free Resume

Free · No card · 60 seconds

02 / 05

Cover Letter for This Role, Done

Cover Letter for This Role, Done

Job-specific cover letters written in Gulf professional tone — ready in seconds, not hours.

Get My Cover Letter

Free · No card · 60 seconds

03 / 05

See How Well You Fit This Role

See How Well You Fit This Role

AI match score with clear reasons — know your fit before investing time in the application.

Check My Fit Score

Free · No card · 60 seconds

04 / 05

Apply in One Click

Apply in One Click

Autofill any application form on Workday, LinkedIn, Bayt, Greenhouse — with your tailored content.

Start Applying Faster

Free · No card · 60 seconds

05 / 05

Track It. Follow Up at the Right Time.

Track It. Follow Up at the Right Time.

Visual pipeline for every application with AI-timed follow-up reminders so nothing slips.

Track My Applications

Free · No card · 60 seconds

Similar Jobs

Senior Data Intelligence Manager

Dyson · Dubai

Directorfulltime

About Us At Dyson, we’re driven by a relentless pursuit of innovation—pushing boundaries in engineering, AI, and robotics. Our new Data Intelligence team sits at the heart of this mission: shaping Dyson’s future through

Skills

LeadershipStrategic PlanningBudgeting

Senior Data Scientist

Presight · Abu Dhabi

Mid-Seniorfulltime

Overview Responsibilities: Experimentation and Prototype Development: Design, develop, and assess data-driven algorithms for various tasks (regression, classification, segmentation, etc.) using cutting-edge AI technique

Skills

Machine LearningDeep LearningPython

Senior Data Engineer

QuantumGate · Abu Dhabi Emirate

Mid-Seniorfulltime

QuantumGate is dedicated to developing and commercializing cutting-edge post-quantum cryptographic solutions. Our mission is to safeguard enterprise digital environments through innovative protocols and applications that

Skills

Big DataETLData Warehousing

Senior Database Admin Engineer

Dautom · Sharjah

Mid-Seniorfulltime

Role: Senior Database Admin Engineer Location: Sharjah, UAE Duration: 24 months extendable Client: Government Entity Payroll: Dautom Information Technology 🔹 Role Overview The ideal candidate will be responsible for man

Skills

engineeringdesignproject management

Senior Data Engineer

GSSTech Group · Dubai

Senior

Gather and analyze requirements, design scalable data pipelines using PySpark and Python, and collaborate with teams for data-driven solutions.

Skills

Big DataETLData Warehousing

Senior Data Engineer

Keystone Consulting · Dubai

Senior

Design and optimize data systems, develop data pipelines, and support AI/ML analysis while ensuring data governance and collaborating with cross-functional teams.

Skills

Big DataETLData Warehousing

Senior Data Engineer (Data Platform Databricks)

Luxoft · Abu Dhabi Emirate

Mid-Seniorfulltime

Project Description: The Senior Engineer - Data Governance & Platform Lead (IB&M) is a hands-on senior technical role responsible for defining, implementing, and governing the IB&M Data Platform with a strong focus on da

Skills

Big DataETLData Warehousing

Senior Data Scientist – AI & Decision Intelligence

Confidential · Abu Dhabi

Mid-Seniorfulltime

We are seeking an experienced and highly skilled and forward-thinking Senior Data Scientist with strong expertise in Artificial Intelligence (AI), advanced analytics, and data-driven decision support. The successful cand

Skills

Machine LearningDeep LearningPython

Senior Data Engineer

Parser · Dubai

Mid-Seniorfulltime

Senior Data Engineer We are seeking strong Data Engineers to help design and build next-generation AI-driven data products and investment intelligence platforms. This role sits at the foundation of new AI initiatives, wh

Skills

Big DataETLData Warehousing

Professionals hired via Base Career

I kept getting rejections from London. Base Career rewrote my CV for Dubai, and I landed Emirates in 3 weeks.

Sarah M.

Sarah M. · Marketing Manager

🇬🇧 UK → 🇦🇪 Dubai

50 applications in Canada, zero replies. Base Career tailored my resume for Riyadh and I got 4 interviews within a month.

James T.

James T. · Software Engineer

🇨🇦 Canada → 🇸🇦 Riyadh

The cover letters matched Gulf tone immediately. I got hired by a semi-government team in Doha on my first round.

Maya R.

Maya R. · Product Manager

🇺🇸 USA → 🇶🇦 Doha

As an expat I had no idea how Gulf CVs work. Base Career nailed it. Offer from a Big 4 in Abu Dhabi in 6 weeks.

PK

Priya K. · Finance Analyst

🇮🇳 India → 🇦🇪 Abu Dhabi

2.2K+

Cover Letters & Follow-ups

1.8K+

Resumes Tailored

190.5K+

Jobs Tracked

Trusted by professionals at

PwC//
Emaar//
KPMG//
Noon//
Amazon AWS//
Talabat//
Deloitte//
Emirates//
Careem//
Aramex//
McKinsey//
Property Finder//
Majid Al Futtaim//
Chalhoub Group//
PwC//
Emaar//
KPMG//
Noon//
Amazon AWS//
Talabat//
Deloitte//
Emirates//
Careem//
Aramex//
McKinsey//
Property Finder//
Majid Al Futtaim//
Chalhoub Group//
AI Job Platform

Stop applying blindly. Start getting hired.

Base Career automates the hardest parts of job searching — apply smarter, not harder.

AI Resume in 60s

Your resume rewritten for this exact role using the job description as the brief.

ATS-Optimized

Get past automated screening filters with the right keywords matched to each job.

Application Tracker

Track every job, follow-up, and interview in one visual kanban board.

Get My Free Resume for This Job

Free plan · No credit card required