Senior Data Engineer
Job Fit Check
Base Career helps you apply smarter for this job.
Key skills for this role
About the Role
Our Generative AI products are only as good as the data behind them. This role owns that data layer from end to end: the pipelines that bring data in, the transformations that shape it, and the way it reaches retrieval systems, agents, and analytics.
Key Skills for This Role
Full Job Posting
Overview
Our Generative AI products are only as good as the data behind them.
This role owns that data layer from end to end: the pipelines that bring data in, the transformations that shape it, and the way it reaches retrieval systems, agents, and analytics.
The work runs on AWS, and the aim is a single governed source that every consumer can rely on.
We want someone who has already built data pipelines for AI systems, not only for reporting.
Preparing data for an LLM or an agent brings its own work around chunking, embeddings, indexing, and keeping content current, and you have done it before.
The team is small and spans several languages, so you will own your pipelines and help set the standards the rest of us follow.
What You Will Do
- Build and run the batch and streaming pipelines that move data from source systems into the lake and through to the warehouse, owning the layers in between from raw to curated, along with their schema, quality, and lineage
- Build the data layer behind retrieval: source connectors, document parsing, chunking, embedding generation, and vector indexing, including re-embedding when content changes
- Model curated, query-ready datasets and metrics so AI and analytics consumers work from one definition instead of each rebuilding the logic
- Add quality checks, validation, and monitoring so problems surface before they reach a model or a user
- Apply access control where it belongs: row and column level rules, PII handling, and entitlement-aware datasets, enforced as close to query time as the stack allows
- Work with the platform and DevOps engineers to expose data and retrieval as documented, dependable services
- Keep storage, compute, and query costs in check, with particular attention to the cost of embedding and vector workloads
- Review code, write the documentation, and help shape how the team builds its data layer
Requirements
- Eight or more years in data engineering overall. That includes hands-on work building data for AI or ML systems such as retrieval, embeddings, or feature data, which can be a more recent part of your background
- Strong SQL and strong Python, including PySpark or similar distributed processing
- Production experience across the AWS data stack: S3 for the lake, Glue for ETL and the Data Catalog, Athena for serverless query, and Redshift as the warehouse
- Hands-on experience with a layered data architecture, whether you call it medallion (bronze, silver, gold), a data lake feeding a warehouse, or a lakehouse, including building the transformation stages that move data from raw to curated
- Experience with an ELT or integration tool such as Airbyte, Fivetran, or Meltano, including building or maintaining connectors
- Experience with event-driven pipelines using SQS and SNS, and with at least one streaming or change-data-capture technology such as Kinesis, Amazon MSK, or Debezium
- Hands-on experience with a semantic or metrics layer over the warehouse, such as Cube or the dbt Semantic Layer
- Hands-on experience with at least one vector store and embedding workflow: pgvector, Amazon OpenSearch, Pinecone, Weaviate, or Milvus
- Comfort with columnar and open table formats: Parquet together with Apache Iceberg, Delta Lake, or Hudi
- Working knowledge of an orchestrator such as Amazon MWAA, Step Functions, Dagster, or Prefect, and enough infrastructure as code to work closely with DevOps
Apply for this job in 1 click
Skip the repetitive application forms
Install the Base Career Chrome Extension and autofill job applications across major job boards with your profile.
Trusted by over 500,000 job seekers on Base Career
More from this employer
More jobs at Mirai, a Scopely company
Director of AI
Riyadh, KSA
Company Overview Mirai, a Scopely company, is a video game support studio based in Riyadh, Saudi Arabia, focused on building industry-defining interactive entertainment experiences. As part of Savvy Games Group, Mirai is
Senior DevOps Engineer
Riyadh, KSA
This role builds and runs the infrastructure our Generative AI products depend on: the pipelines that ship code, the platforms that run services and models, and the controls that keep all of it secure and reliable. AI wo
Senior IT Support Specialist - Google Experience
Riyadh, KSA
We are looking for a Senior IT Specialist to take ownership of our IT operations and endpoint management environment. This role will be instrumental in managing, improving, and scaling our device management, collaboratio
Workplace Manager
Riyadh, KSA
About Mirai Mirai is a Scopely company, part of Savvy Games Group, backed by the Public Investment Fund (PIF) of Saudi Arabia. Based in Riyadh, Mirai focuses on building talent, enabling technology, and supporting the gr
Learning and Development Lead
Riyadh, KSA
About Mirai Mirai, a Scopely company based in Riyadh, Mirai is a subsidiary of Scopely, a global interactive entertainment and mobile-first video game company, home to many top, award-winning experiences such as "MONOPOL
Marketing Community Lead
Riyadh, KSA
Role Overview Mirai is looking for a passionate and strategic Arabic Community Manager to help us grow in the MENA region. In this key role, you'll build our local community from scratch and lead a team of junior moderat
Marketing Creative Lead
Riyadh, KSA
The Role We're on the hunt for a Creative Lead who blends artistic flair with a passion for results. In this role, you won't just manage—you'll roll up your sleeves and create standout ad campaigns for our mobile games.
People Business Partner
Riyadh, KSA
The People Business Partner (PBP) is a strategic and hands-on partner to Mirai's leaders across QA, Product, and Operations. The role is focused on building a high-quality employee journey, strengthening manager capabili
Director of AI
Riyadh, KSA
Senior DevOps Engineer
Riyadh, KSA
Senior IT Support Specialist - Google Experience
Riyadh, KSA
Workplace Manager
Riyadh, KSA
Learning and Development Lead
Riyadh, KSA
Marketing Community Lead
Riyadh, KSA
Marketing Creative Lead
Riyadh, KSA
People Business Partner
Riyadh, KSA