naukri

Senior Data Engineer

Mirai Arabian International Company Limited, KSA
8-13 years
Today
Big Data, ETL, Data Warehousing, Cloud Computing (AWS, Azure, GCP)

Full Job Posting

Overview

Our Generative AI products are only as good as the data behind them.

This role owns that data layer from end to end: the pipelines that bring data in, the transformations that shape it, and the way it reaches retrieval systems, agents, and analytics.

The work runs on AWS, and the aim is a single governed source that every consumer can rely on.

We want someone who has already built data pipelines for AI systems, not only for reporting.

Preparing data for an LLM or an agent brings its own work around chunking, embeddings, indexing, and keeping content current, and you have done it before.

The team is small and spans several languages, so you will own your pipelines and help set the standards the rest of us follow.

What You Will Do

  • Build and run the batch and streaming pipelines that move data from source systems into the lake and through to the warehouse, owning the layers in between from raw to curated, along with their schema, quality, and lineage.
  • Build the data layer behind retrieval: source connectors, document parsing, chunking, embedding generation, and vector indexing, including re-embedding when content changes.
  • Model curated, query-ready datasets and metrics so AI and analytics consumers work from one definition instead of each rebuilding the logic.
  • Add quality checks, validation, and monitoring so problems surface before they reach a model or a user.
  • Apply access control where it belongs: row and column level rules, PII handling, and entitlement-aware datasets, enforced as close to query time as the stack allows.
  • Work with the platform and DevOps engineers to expose data and retrieval as documented, dependable services.
  • Keep storage, compute, and query costs in check, with particular attention to the cost of embedding and vector workloads.
  • Review code, write the documentation, and help shape how the team builds its data layer.
