{bc}

Bioinformatics Data Engineer

GenBio AIAbu Dhabi, UAE2 months agoSenior
Senior

Skills

ETLData WarehousingSQL

About This Role

Overview

As our data ingestion needs grow, we are looking for a Bioinformatics Data Engineer to act as the crucial bridge between raw biological data and our scalable infrastructure.

Reporting to the Data Engineering Lead, you will leverage your deep biological domain expertise to build the initial scripts and processing logic for complex datasets, ensuring they are primed for large-scale foundation model training.

• Source & Acquire Biological Data -

Identify, evaluate, and obtain high-quality bioinformatics datasets from public and partner sources (e.g., NCBI, PubChem, ENCODE,UniProt etc.) to support research and model development initiatives.

• Deeply Understand Complex Datasets -

  • Develop a comprehensive understanding of biological datasets, including data structures, schemas, metadata standards, entity relationships, and underlying biological context to ensure accurate interpretation and usage.
  • Design & Implement Data Processing Pipelines -
  • Develop robust preprocessing scripts and scalable data transformation workflows using Python, R, and relevant Tools.
  • Leverage AI-assisted tools where appropriate to process, clean, normalize, and integrate complex biological data for foundation model training.

• Structure & Standardize Biological Data -

Organize heterogeneous datasets into well-defined, interoperable formats aligned with internal infrastructure requirements and downstream AI training pipelines.

• Bioinformatics Data Analysis -

Perform exploratory and statistical analysis of genomic, transcriptomic, proteomic, and other multi-omics datasets to assess data quality, uncover biological patterns, and generate insights that inform model development.

Apply appropriate computational and statistical methods to validate assumptions and support downstream AI training and evaluation.

• Build Data Products -

  • Create production-ready data assets, including standardized datasets, curated releases, dashboards, analytical reports, and technical documentation to enable efficient research and model evaluation.
  • Ensure Data Quality & FAIR Compliance -
  • Curate, annotate, validate, and standardize public and partner datasets in alignment with FAIR (Findable, Accessible, Interoperable, Reusable) principles, ensuring long-term usability and reproducibility.

• Collaborate Cross-Functionally -

Partner closely with research scientists and ML engineers to translate biological research needs into scalable data engineering solutions that support AI model training and evaluation.

• Knowledge Sharing & Documentation -

Contribute domain expertise by documenting data methodologies, maintaining clear technical documentation, and sharing biological data insights across teams.

Your resume, rewritten for this exact role.

Sign up free — Base Career tailors your CV to this job description in 60 seconds.

01 / 05

Resume Tailored to This Job

Resume Tailored to This Job

Your keywords, structure, and story — rewritten to match this exact role and pass ATS filters.

Get My Free Resume

Free · No card · 60 seconds

02 / 05

Cover Letter for This Role, Done

Cover Letter for This Role, Done

Job-specific cover letters written in Gulf professional tone — ready in seconds, not hours.

Get My Cover Letter

Free · No card · 60 seconds

03 / 05

See How Well You Fit This Role

See How Well You Fit This Role

AI match score with clear reasons — know your fit before investing time in the application.

Check My Fit Score

Free · No card · 60 seconds

04 / 05

Apply in One Click

Apply in One Click

Autofill any application form on Workday, LinkedIn, Bayt, Greenhouse — with your tailored content.

Start Applying Faster

Free · No card · 60 seconds

05 / 05

Track It. Follow Up at the Right Time.

Track It. Follow Up at the Right Time.

Visual pipeline for every application with AI-timed follow-up reminders so nothing slips.

Track My Applications

Free · No card · 60 seconds

Similar Jobs

Bioinformatics Data Engineer

GenBio AI · Abu Dhabi

Mid-Seniorfulltime

GenBio AI develops multiscale foundation models to decode and simulate human biology. Our team is accelerating towards an ambitious future where scientists can unlock humanity's biggest challenges in drug discovery, heal

Skills

ETLData WarehousingSQL

2.2K+

Cover Letters & Follow-ups

1.8K+

Resumes Tailored

190.5K+

Jobs Tracked

Trusted by professionals at

PwC//
Emaar//
KPMG//
Noon//
Amazon AWS//
Talabat//
Deloitte//
Emirates//
Careem//
Aramex//
McKinsey//
Property Finder//
Majid Al Futtaim//
Chalhoub Group//
PwC//
Emaar//
KPMG//
Noon//
Amazon AWS//
Talabat//
Deloitte//
Emirates//
Careem//
Aramex//
McKinsey//
Property Finder//
Majid Al Futtaim//
Chalhoub Group//
AI Job Platform

Stop applying blindly. Start getting hired.

Base Career automates the hardest parts of job searching — apply smarter, not harder.

AI Resume in 60s

Your resume rewritten for this exact role using the job description as the brief.

ATS-Optimized

Get past automated screening filters with the right keywords matched to each job.

Application Tracker

Track every job, follow-up, and interview in one visual kanban board.

Free plan · No credit card required