{bc}

Data Lakehouse Architect

The GloveDubai, UAE2 days agoMid-Senior
Mid-Seniorfulltime

Skills

Architectural DesignAutoCADRevit

About This Role

Experience

11 years - 16 years

Notice period :

immediate to 15 days max.

Note: Candidate should be from the Retail or Ecommerce domain only

Role Overview

We are looking for a hands-on

Lead Data Engineer

with 11+ years experience who has built and operated production-grade data lakehouse platforms.

This person will work directly under the Data Lakehouse Architect and own the end-to-end engineering execution across ingestion, transformation, orchestration, governance, and consumption layers.

This is a technical delivery role requiring deep engineering skills and the ability to work across all layers of the lakehouse.

Key Responsibilities

  • Build and operate end-to-end ingestion pipelines from 20+ heterogeneous source systems including Oracle Retail, WMS, TMS, Loyalty platforms, and third-party APIs into the Bronze layer
  • Implement CDC pipelines for real-time and near-real-time data capture across relational and NoSQL sources
  • Design and build Silver and Gold transformation layers including data cleansing, enrichment, SCD Type 1 and 2, and complex business rule application
  • Develop and maintain orchestration workflows with automated retry, failure alerting, and SLA tracking
  • Implement data quality checks, validation rules, and reprocessing/backfill capabilities
  • Enforce security policies, PII masking, row-level and column-level access control as defined by the Architect
  • Enable consumption layers for Tableau, direct SQL users, and downstream API integrations
  • Support historical data migration from legacy cloud data warehouse into the lakehouse
  • Maintain pipeline documentation, source-to-target mappings, and data dictionaries
  • Build clean, well-structured data pipelines that support AI/ML feature engineering and model training workflows

Required Skills And Experience

  • 12+ years in data engineering with hands-on experience across ingestion, transformation, orchestration, and governance layers in production lakehouse environments
  • Proficient in Python and PySpark for pipeline development and custom connector building
  • Hands-on experience with cloud data services (e.g. S3/Blob Storage, Glue, DMS, Kinesis/Event Hubs, EMR or equivalent)
  • Strong CDC implementation experience using DMS, Debezium, or equivalent across Oracle, RDS, and NoSQL sources
  • Experienced with Delta Lake/Apache Iceberg ACID transactions, merge/upsert operations, and partition management at scale
  • Strong hands-on experience with dbt for SQL-based transformation layers
  • Proficient with orchestration tools such as Apache Airflow (MWAA), Dagster, or Step Functions
  • Experienced with data quality frameworks such as Great Expectations, Deequ, or dbt tests
  • Hands-on with security implementation: IAM policies, PII masking, column-level and row-level access control
  • Strong SQL skills and dimensional modeling (star schema, snowflake schema) for BI consumption layers
  • Retail or e-commerce domain experience (Oracle Retail, Magento, Shopify, WMS, TMS) is a strong advantage
  • Familiarity with AI/ML pipeline requirements including feature store design, data preparation for model training, and vector database integration

Preferred / Nice-To-Have Platforms

  • AWS (Lake Formation, Glue, Redshift, EMR, SageMaker)
  • Databricks
  • Snowflake
  • Microsoft Fabric
  • Google BigQuery
  • Dataiku

Education

  • Bachelor’s or master’s degree in computer science, Information Systems, or a related field
  • Cloud Data Analytics or Solutions Architect certification from a major cloud provider (AWS, Azure, or GCP) is a strong advantage
  • Databricks Certified Data Engineer or Snowflake SnowPro certifications are a plus
  • Interested candidates share their resume at uma.jangra@glovetalent.com !!

Your resume, rewritten for this exact role.

Sign up free — Base Career tailors your CV to this job description in 60 seconds.

01 / 05

Resume Tailored to This Job

Resume Tailored to This Job

Your keywords, structure, and story — rewritten to match this exact role and pass ATS filters.

Get My Free Resume

Free · No card · 60 seconds

02 / 05

Cover Letter for This Role, Done

Cover Letter for This Role, Done

Job-specific cover letters written in Gulf professional tone — ready in seconds, not hours.

Get My Cover Letter

Free · No card · 60 seconds

03 / 05

See How Well You Fit This Role

See How Well You Fit This Role

AI match score with clear reasons — know your fit before investing time in the application.

Check My Fit Score

Free · No card · 60 seconds

04 / 05

Apply in One Click

Apply in One Click

Autofill any application form on Workday, LinkedIn, Bayt, Greenhouse — with your tailored content.

Start Applying Faster

Free · No card · 60 seconds

05 / 05

Track It. Follow Up at the Right Time.

Track It. Follow Up at the Right Time.

Visual pipeline for every application with AI-timed follow-up reminders so nothing slips.

Track My Applications

Free · No card · 60 seconds

Similar Jobs

Data Lakehouse Architect / Lead Data Engineer (Retail Domain)

The Glove · Dubai

Mid-Seniorfulltime

Job Title: Data Lakehouse Architect / Lead Data Engineer (Retail Domain) Location: UAE (Dubai Preferred) – On-site Experience: 11+ Years Employment Type: Full-time Immediate Joiners Preferred We are hiring an experienced

Skills

ETLData WarehousingSQL

Lead Specialist, Data Lakehouse II

Maaden · Riyadh

Mid-Seniorfulltime

About Maaden JOB DESCRIPTION Maaden, established in 1997, is one of the fastest-growing mining companies in the world and the largest multi-commodity mining and metals company in the Middle East. We are leading the devel

Skills

ScalaExcel

Lead Specialist, Data Lakehouse II

Ma'aden Aluminium Company (MAC) · Riyadh

Senior

The role involves optimizing lakehouse architecture, ensuring data governance, and providing technical leadership in data engineering and platform operations.

Skills

Lead SpecialistData Lakehouse II

2.2K+

Cover Letters & Follow-ups

1.8K+

Resumes Tailored

190.5K+

Jobs Tracked

Trusted by professionals at

PwC//
Emaar//
KPMG//
Noon//
Amazon AWS//
Talabat//
Deloitte//
Emirates//
Careem//
Aramex//
McKinsey//
Property Finder//
Majid Al Futtaim//
Chalhoub Group//
PwC//
Emaar//
KPMG//
Noon//
Amazon AWS//
Talabat//
Deloitte//
Emirates//
Careem//
Aramex//
McKinsey//
Property Finder//
Majid Al Futtaim//
Chalhoub Group//
AI Job Platform

Stop applying blindly. Start getting hired.

Base Career automates the hardest parts of job searching — apply smarter, not harder.

AI Resume in 60s

Your resume rewritten for this exact role using the job description as the brief.

ATS-Optimized

Get past automated screening filters with the right keywords matched to each job.

Application Tracker

Track every job, follow-up, and interview in one visual kanban board.

Free plan · No credit card required