AI Research Engineer (Multi-Modal Reinforcement Learning)

Jobgether · Abu Dhabi, UAE · 2 days ago · Mid-Senior · Full-time

Skills

engineering, design, project management

About This Role

Overview

This position is posted by Jobgether on behalf of a partner company.

We are currently looking for an AI Research Engineer (Multi-Modal Reinforcement Learning) in the United Arab Emirates.

This role sits at the intersection of cutting-edge AI research and large-scale system engineering, focusing on advancing multi-modal reinforcement learning across text, image, audio, and complex simulated environments.

You will contribute to the design of next-generation intelligent systems capable of adaptive decision-making in real-world scenarios.

Working in a highly research-driven, globally distributed environment, you will help build and scale reinforcement learning frameworks that power advanced multimodal models.

Your work will directly influence model performance, training stability, and reward optimization strategies at scale.

You will collaborate with top-tier researchers and engineers to push the boundaries of AI capabilities.

The role combines deep theoretical research with hands-on system development and experimentation.

It is ideal for someone passionate about foundational AI breakthroughs and real-world deployment impact.

Accountabilities

In this role, you will lead research and engineering efforts across multi-modal reinforcement learning systems while contributing to scalable AI infrastructure and experimentation frameworks. You will be responsible for advancing model performance and robustness through innovative algorithm design and rigorous evaluation practices.

  • Conduct research on reinforcement learning methods for multi-modal systems, including diffusion-based and autoregressive model approaches.
  • Design and build scalable RL infrastructure supporting distributed training and evaluation across complex multi-modal environments.
  • Develop reward modeling strategies that improve alignment and training stability and mitigate failure modes such as reward hacking.
  • Create and curate simulation environments and datasets for training, benchmarking, and validating multi-modal RL models.
  • Design and execute evaluation protocols to measure performance improvements and ensure reproducibility across experiments.
  • Analyze model behavior across modalities, identifying bottlenecks in optimization, exploration, and cross-modal alignment.
  • Explore and develop next-generation RL paradigms to enhance learning from environment feedback and improve SOTA performance.
  • Publish research in leading AI conferences such as NeurIPS, ICML, ICLR, CVPR, and related venues.

Requirements

The ideal candidate has a strong academic and practical background in machine learning, reinforcement learning, and multi-modal AI systems, with a proven record of research excellence and scalable system development. You are comfortable working at the frontier of AI research while building production-grade experimentation pipelines.

  • Master’s degree in Computer Science or a related field required; PhD in ML, CV, NLP, or an AI-related discipline preferred.
  • Strong publication record in top-tier AI conferences (NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV).
  • Proven experience in large-scale reinforcement learning experiments, particularly in multi-modal or vision-centric systems.
  • Deep understanding of reinforcement learning theory, optimization, and policy learning in high-dimensional environments.
  • Strong hands-on experience with PyTorch and deep learning frameworks for multimodal AI systems.
  • Experience building end-to-end RL pipelines including simulation, training, evaluation, and deployment.
  • Ability to address core RL challenges such as sample efficiency, exploration-exploitation trade-offs, and training stability.
  • Strong analytical and problem-solving skills with a research-driven, experimental mindset.

Benefits

  • Competitive compensation package aligned with top-tier AI research talent
  • Fully remote, global-first work environment
  • Opportunity to work on frontier AI research problems at scale
  • High-impact role influencing next-generation multimodal intelligence systems
  • Collaboration with leading researchers and engineers in AI and reinforcement learning
  • Access to large-scale experimentation infrastructure and research resources
  • Strong culture of innovation, autonomy, and research publication support

How Jobgether Works

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements.

Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.

The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Data Privacy Notice

By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer.

This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR).

You may exercise your rights (access, rectification, erasure, objection) at any time.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses.

These tools assist our recruitment team but do not replace human judgment.

Final hiring decisions are ultimately made by humans.

If you would like more information about how your data is processed, please contact us.
