Senior Data Engineer (PySpark + ML Pipelines)
Skills
About This Role
Location: Dubai WFO
Notice: Immediate/Serving notice - 30 days only.
Skill: PySpark + ML Pipeline + Banking Domain
Experience: 7+ years
Job Description
- 1.
- The Data Engineer role involves acquiring requirements, conducting EDA, ingesting required datasets, and transforming data using Big-data technologies & Feature Engineering techniques for machine learning models.
- 2.
- Key tasks include building high-performance, secure, and scalable data pipelines, collaborating with Analytics Delivery Leads, Data scientists and other teams in an Agile environment, and communicating stakeholders effectively.
- 3.
- The position requires a degree in a relevant field, at least 8-10 years of experience in data engineering and ML pipelines, and proficiency in technologies like Python, Spark (including expertise in optimization techniques), Hadoop, SQL, and Git.
- 4.
- Should be good with Python as well.
- Roles and Responsibilities:
- Data Pipeline Development: Design, develop, and maintain highly scalable and optimized ETL pipelines using PySpark on the Cloudera Data Platform, ensuring data integrity and accuracy.
- Data Ingestion: Implement and manage data ingestion processes from a variety of sources (e.g., relational databases, APIs, file systems) to the data lake or data warehouse on CDP.
- Data Transformation and Processing: Use PySpark to process, cleanse, and transform large datasets into meaningful formats that support analytical needs and business requirements.
- Performance Optimization: Conduct performance tuning of PySpark code and Cloudera components, optimizing resource utilization and reducing runtime of ETL processes.
- Data Quality and Validation: Implement data quality checks, monitoring, and validation routines to ensure data accuracy and reliability throughout the pipeline.
- Automation and Orchestration: Automate data workflows using tools like Apache Oozie, Airflow, or similar orchestration tools within the Cloudera ecosystem.
- Monitoring and Maintenance: Monitor pipeline performance, troubleshoot issues, and perform routine maintenance on the Cloudera Data Platform and associated data processes.
- Collaboration: Work closely with other data engineers, analysts, product managers, and other stakeholders to understand data requirements and support various data-driven initiatives.
- Documentation: Maintain thorough documentation of data engineering processes, code, and pipeline configurations.
Education
- and Experience
- Bachelor’s or Master’s degree in Computer Science, Data Engineering, Information Systems, or a related field.
- 7+ years of experience as a Data Engineer, with a strong focus on PySpark and the Cloudera Data Platform. Technical Skills
- PySpark: Advanced proficiency in PySpark, including working with RDDs, DataFrames, and optimization techniques.
- Cloudera Data Platform: Strong experience with Cloudera Data Platform (CDP) components, including Cloudera Manager, Hive, Impala, HDFS, and HBase.
- Data Warehousing: Knowledge of data warehousing concepts, ETL best practices, and experience with SQL-based tools (e.g., Hive, Impala).
- Big Data Technologies: Familiarity with Hadoop, Kafka, and other distributed computing tools.
- Orchestration and Scheduling: Experience with Apache Oozie, Airflow, or similar orchestration frameworks.
- Scripting and Automation: Strong scripting skills in Linux.
- Good at Data Modelling as well. Soft Skills
- Strong analytical and problem-solving skills.
- Excellent verbal and written communication abilities.
- Ability to work independently and collaboratively in a team environment. Attention to detail and commitment to data quality.
- Note: Looking for immediate to 30 days’ official Notice period candidates only.
- Interested candidates please share your CV to
- sonam.singh@valuelabs.com
- with below details:
Expected CTC
Current location:
Preferred location:
Notice period:
Your resume, rewritten
for this exact role.
Sign up free — Base Career tailors your CV to this job description in 60 seconds.
01 / 05
Resume Tailored to This Job

Your keywords, structure, and story — rewritten to match this exact role and pass ATS filters.
Free · No card · 60 seconds
02 / 05
Cover Letter for This Role, Done

Job-specific cover letters written in Gulf professional tone — ready in seconds, not hours.
Free · No card · 60 seconds
03 / 05
See How Well You Fit This Role

AI match score with clear reasons — know your fit before investing time in the application.
Free · No card · 60 seconds
04 / 05
Apply in One Click

Autofill any application form on Workday, LinkedIn, Bayt, Greenhouse — with your tailored content.
Free · No card · 60 seconds
05 / 05
Track It. Follow Up at the Right Time.

Visual pipeline for every application with AI-timed follow-up reminders so nothing slips.
Free · No card · 60 seconds
Similar Jobs
Senior Data Intelligence Manager
Dyson · Dubai
About Us At Dyson, we’re driven by a relentless pursuit of innovation—pushing boundaries in engineering, AI, and robotics. Our new Data Intelligence team sits at the heart of this mission: shaping Dyson’s future through
Skills
Senior Data Scientist
Presight · Abu Dhabi
Overview Responsibilities: Experimentation and Prototype Development: Design, develop, and assess data-driven algorithms for various tasks (regression, classification, segmentation, etc.) using cutting-edge AI technique
Skills
Senior Data Engineer
QuantumGate · Abu Dhabi Emirate
QuantumGate is dedicated to developing and commercializing cutting-edge post-quantum cryptographic solutions. Our mission is to safeguard enterprise digital environments through innovative protocols and applications that
Skills
Senior Database Admin Engineer
Dautom · Sharjah
Role: Senior Database Admin Engineer Location: Sharjah, UAE Duration: 24 months extendable Client: Government Entity Payroll: Dautom Information Technology 🔹 Role Overview The ideal candidate will be responsible for man
Skills
Senior Data Engineer
GSSTech Group · Dubai
Gather and analyze requirements, design scalable data pipelines using PySpark and Python, and collaborate with teams for data-driven solutions.
Skills
Senior Data Engineer
Keystone Consulting · Dubai
Design and optimize data systems, develop data pipelines, and support AI/ML analysis while ensuring data governance and collaborating with cross-functional teams.
Skills
Senior Data Engineer (Data Platform Databricks)
Luxoft · Abu Dhabi Emirate
Project Description: The Senior Engineer - Data Governance & Platform Lead (IB&M) is a hands-on senior technical role responsible for defining, implementing, and governing the IB&M Data Platform with a strong focus on da
Skills
Senior Data Scientist – AI & Decision Intelligence
Confidential · Abu Dhabi
We are seeking an experienced and highly skilled and forward-thinking Senior Data Scientist with strong expertise in Artificial Intelligence (AI), advanced analytics, and data-driven decision support. The successful cand
Skills
Senior Data Engineer
Parser · Dubai
Senior Data Engineer We are seeking strong Data Engineers to help design and build next-generation AI-driven data products and investment intelligence platforms. This role sits at the foundation of new AI initiatives, wh
Skills
Professionals hired via Base Career
“I kept getting rejections from London. Base Career rewrote my CV for Dubai, and I landed Emirates in 3 weeks.”
Sarah M. · Marketing Manager
🇬🇧 UK → 🇦🇪 Dubai
“50 applications in Canada, zero replies. Base Career tailored my resume for Riyadh and I got 4 interviews within a month.”
James T. · Software Engineer
🇨🇦 Canada → 🇸🇦 Riyadh
“The cover letters matched Gulf tone immediately. I got hired by a semi-government team in Doha on my first round.”
Maya R. · Product Manager
🇺🇸 USA → 🇶🇦 Doha
“As an expat I had no idea how Gulf CVs work. Base Career nailed it. Offer from a Big 4 in Abu Dhabi in 6 weeks.”
Priya K. · Finance Analyst
🇮🇳 India → 🇦🇪 Abu Dhabi
2.2K+
Cover Letters & Follow-ups
1.8K+
Resumes Tailored
190.5K+
Jobs Tracked
Trusted by professionals at
Stop applying blindly.
Start getting hired.
Base Career automates the hardest parts of job searching — apply smarter, not harder.
AI Resume in 60s
Your resume rewritten for this exact role using the job description as the brief.
ATS-Optimized
Get past automated screening filters with the right keywords matched to each job.
Application Tracker
Track every job, follow-up, and interview in one visual kanban board.
Free plan · No credit card required