{bc}

Senior Data Engineer Pyspark with data modelling

VirtusaDubai, UAE1 months agoEntryfulltime
Scala
Generate Resume for this Job
Via LinkedIn·

About This Role

About The Role We are seeking a highly skilled Data Engineer with deep expertise in PySpark and the Cloudera Data Platform (CDP) to join our data engineering team. As a Data Engineer, you will be responsible for designing, developing, and maintaining scalable data pipelines that ensure high data quality and availability across the organization. This role requires a strong background in big data ecosystems, cloud-native tools, and advanced data processing techniques.

The ideal candidate has hands-on experience with data ingestion, transformation, and optimization on the Cloudera Data Platform, along with a proven track record of implementing data engineering best practices. You will work closely with other data engineers to build solutions that drive impactful business insights.

Responsibilities

  • Data Pipeline Development: Design, develop, and maintain highly scalable and optimized ETL pipelines using PySpark on the Cloudera Data Platform, ensuring data integrity and accuracy.
  • Data Ingestion: Implement and manage data ingestion processes from a variety of sources (e.g., relational databases, APIs, file systems) to the data lake or data warehouse on CDP.
  • Data Transformation and Processing: Use PySpark to process, cleanse, and transform large datasets into meaningful formats that support analytical needs and business requirements.
  • Performance Optimization: Conduct performance tuning of PySpark code and Cloudera components, optimizing resource utilization and reducing runtime of ETL processes.
  • Data Quality and Validation: Implement data quality checks, monitoring, and validation routines to ensure data accuracy and reliability throughout the pipeline.
  • Automation and Orchestration: Automate data workflows using tools like Apache Oozie, Airflow, or similar orchestration tools within the Cloudera ecosystem.
  • Monitoring and Maintenance: Monitor pipeline performance, troubleshoot issue.

Similar Jobs

Senior Database Platform Engineer

Kraken ·

Mid-Senior

**Building the Future of Crypto** Our Krakenites are a world\-class team with crypto conviction, united by our desire to discover and unlock the potential of crypto and blockchain technology. **What makes us different?**

ElasticsearchKubernetesLinux

Senior Data Engineer

Presight · Abu Dhabi

Mid-Senior

**Overview** **About Presight** Presight is an ADX\-listed public company with Abu Dhabi based G42 as its majority shareholder and is the region’s leading big data analytics company powered by GenAI. It combines big data

PythonExcelVAT

Senior Data Scientist

Dataiku · Dubai

Mid-Senior

Dataiku is the Platform for AI Success, the enterprise orchestration layer for building, deploying, and governing AI. In a single environment, teams design and operate analytics, machine learning, and AI agents with the

PythonJavaScriptSQL

Data Analyst/ Senior Data Analyst (Statistics/Python/BI) (Bangkok-based, relocation provided)

Agoda · Abu Dhabi

Entry

**About Agoda** At Agoda, we bridge the world through travel. Our story began in 2005, when two lifelong friends and entrepreneurs, driven by their passion for travel, launched Agoda to make it easier for everyone to exp

PythonVAT

Senior Data Analyst, Partner Development - (Statistics/ML/BI) (Bangkok-based, relocation provided)

Agoda · Dubai

Entry

**About Agoda** At Agoda, we bridge the world through travel. Our story began in 2005, when two lifelong friends and entrepreneurs, driven by their passion for travel, launched Agoda to make it easier for everyone to exp

VAT

Senior Data Engineer

Emaratech · Dubai

Senior

Manage and optimize data lake platforms, implement cloud-native solutions, and ensure data governance while utilizing tools like Apache NiFi and Spark.

Senior Data Engineer

Senior Data Analytics Manager UAEN

Confidential · Abu Dhabi Emirate

Mid-Senior

A leading organization is seeking a **Senior Manager – Data Analytics** to lead and transform its enterprise data and analytics capabilities. This is a strategic leadership role focused on driving a **data\-driven cultur

Scala

Senior Data Analyst, Partner Development - (Statistics/ML/BI) (Bangkok-based, relocation provided)

Agoda · Sharjah

Entry

**About Agoda** At Agoda, we bridge the world through travel. Our story began in 2005, when two lifelong friends and entrepreneurs, driven by their passion for travel, launched Agoda to make it easier for everyone to exp

VAT

Senior Data Engineer

emaratech · Dubai

Senior

About the Role We are seeking a highly skilled and experienced Data Lake Cloud Engineer with a proven track record of designing, implementing, and maintaining large\-scale cloud\-based data lake platforms. This role requ

Scala
AI Job Platform

Stop applying blindly. Start getting hired.

Base Career automates the hardest parts of job searching — apply smarter, not harder.

AI Resume in 60s

Your resume rewritten for this exact role using the job description as the brief.

ATS-Optimized

Get past automated screening filters with the right keywords matched to each job.

Application Tracker

Track every job, follow-up, and interview in one visual kanban board.

Start Today for Free

Free plan · No credit card required