AI Research Engineer (Multi-Modal Reinforcement Learning)

Jobgether · Abu Dhabi, UAE · 2 days ago

Mid-Senior · Full-time

Skills

engineering, design, project management

About This Role

Overview

This position is posted by Jobgether on behalf of a partner company.

We are currently looking for an AI Research Engineer (Multi-Modal Reinforcement Learning) in the United Arab Emirates.

This role sits at the intersection of cutting-edge AI research and large-scale system engineering, focusing on advancing multi-modal reinforcement learning across text, image, audio, and complex simulated environments.

You will contribute to the design of next-generation intelligent systems capable of adaptive decision-making in real-world scenarios.

Working in a highly research-driven, globally distributed environment, you will help build and scale reinforcement learning frameworks that power advanced multimodal models.

Your work will directly influence model performance, training stability, and reward optimization strategies at scale.

You will collaborate with top-tier researchers and engineers to push the boundaries of AI capabilities.

The role combines deep theoretical research with hands-on system development and experimentation.

It is ideal for someone passionate about foundational AI breakthroughs and real-world deployment impact.

Accountabilities

In this role, you will lead research and engineering efforts across multi-modal reinforcement learning systems while contributing to scalable AI infrastructure and experimentation frameworks.
You will be responsible for advancing model performance and robustness through innovative algorithm design and rigorous evaluation practices.
Conduct research on reinforcement learning methods for multi-modal systems, including diffusion-based and autoregressive model approaches.
Design and build scalable RL infrastructure supporting distributed training and evaluation across complex multi-modal environments.
Develop reward modeling strategies that improve alignment and training stability and mitigate failure modes such as reward hacking.
Create and curate simulation environments and datasets for training, benchmarking, and validating multi-modal RL models.
Design and execute evaluation protocols to measure performance improvements and ensure reproducibility across experiments.
Analyze model behavior across modalities, identifying bottlenecks in optimization, exploration, and cross-modal alignment.
Explore and develop next-generation RL paradigms that enhance learning from environment feedback and advance state-of-the-art performance.
Publish research in leading AI conferences such as NeurIPS, ICML, ICLR, CVPR, and related venues.

Requirements

The ideal candidate has a strong academic and practical background in machine learning, reinforcement learning, and multi-modal AI systems, with a proven record of research excellence and scalable system development.
You are comfortable working at the frontier of AI research while building production-grade experimentation pipelines.
Master’s degree in Computer Science or a related field required; PhD in ML, CV, NLP, or another AI-related discipline preferred.
Strong publication record in top-tier AI conferences (NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV).
Proven experience in large-scale reinforcement learning experiments, particularly in multi-modal or vision-centric systems.
Deep understanding of reinforcement learning theory, optimization, and policy learning in high-dimensional environments.
Strong hands-on experience with PyTorch and deep learning frameworks for multimodal AI systems.
Experience building end-to-end RL pipelines including simulation, training, evaluation, and deployment.
Ability to address core RL challenges such as sample efficiency, exploration-exploitation trade-offs, and training stability.
Strong analytical and problem-solving skills with a research-driven, experimental mindset.

Benefits

Competitive compensation package aligned with top-tier AI research talent
Fully remote, global-first work environment
Opportunity to work on frontier AI research problems at scale
High-impact role influencing next-generation multimodal intelligence systems
Collaboration with leading researchers and engineers in AI and reinforcement learning
Access to large-scale experimentation infrastructure and research resources
Strong culture of innovation, autonomy, and research publication support

How Jobgether Works

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements.

Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.

The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Data Privacy Notice

By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer.

This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR).

You may exercise your rights (access, rectification, erasure, objection) at any time.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses.

These tools assist our recruitment team but do not replace human judgment.

Final hiring decisions are ultimately made by humans.

If you would like more information about how your data is processed, please contact us.
