AI/ML Engineer - 6 Months Contract
Job Fit Check
Base Career helps you apply smarter for this job.
Key skills for this role
About the Role
We are looking for a hands‑on AI/ML Engineer to own and execute MLOps, evaluation, and deployment practices for a production AI platform built on LLMs, agentic workflows, vision, and voice AI.
Key Skills for This Role
Full Job Posting
Overview
We are looking for a hands‑on
Ai And Ml Engineer
to own and execute
MLOps, evaluation, and deployment practices
for a production AI platform built on
LLMs, agentic workflows, vision, and voice AI
.
This role is strongly execution‑focused.
You will work across the
entire AI lifecycle
—from evaluation and observability to RLHF, deployment in constrained environments, and production readiness sign‑off—while collaborating with internal teams and directing external vendors.
Mlops & Deployment Ownership
- Define and oversee
- MLOps practices
- including:
- Agent and model versioning
- Evaluation tracking
- Deployment gating and promotion workflows
- Rollback and recovery procedures
- Collaborate with internal stakeholders and
- external delivery teams
- to ensure reliable production deployments.
Evaluation, Monitoring & Observability
- Own the
- evaluation framework
- for:
- LLM‑based agents
- RAG pipelines
• Vision Language Models (VLMs)
- Voice AI models (OpenAI Whisper, Chatterbox, Vibe Voice, or equivalent)
- Define and maintain:
- Offline evaluation methodologies
- Online monitoring and regression detection thresholds
- Human‑in‑the‑loop review processes
- Set up and manage
- AI observability tooling
- (e.g., Langfuse or equivalent) across all environments.
Performance Reporting & Insights
- Build and maintain
- product performance reporting
- , covering:
- Model accuracy and agent effectiveness
- Latency and cost‑per‑interaction
- Bias, quality trends, and stability across markets
- Provide clear technical insights to
- non‑technical stakeholders
- .
Rlhf & Continuous Improvement
- Design and oversee
Rlhf (Reinforcement Learning From Human Feedback)
- pipelines:
- Data collection and feedback ingestion
- Annotation guidelines and reward criteria
- Feedback loops for continuous improvement
- Direct implementation by external teams and
- monitor quality improvements over time
- .
Agent Memory Systems
- Own the design and validation of
- agent memory architectures
- , including:
- Short‑term context windows
- Long‑term retrieval
- Episodic memory across sessions
- Memory lifecycle policies (retention, expiry, cost control)
- Define test criteria to ensure consistency across deployment environments.
Model Benchmarking & Optimization
- Evaluate and benchmark
- VLMs and voice models
- under constrained infrastructure.
- Recommend optimization strategies:
- Quantization
- Distillation
- Runtime and model selection per jurisdiction
- Validate production readiness in
- on‑prem or sovereign environments
- .
Production Readiness & Rollouts
- Oversee production deployments executed by vendor teams.
- Run final validation checks and
- sign off on production readiness
- .
- Document deployment patterns, baselines, and environment‑specific configurations to accelerate future market rollouts.
Privacy & Data Residency
- Evaluate and recommend
- privacy‑preserving deployment patterns
- , including:
- On‑device inference
- Data isolation
- Local or sovereign model hosting
- Ensure compliance with jurisdictional data residency requirements.
Technical Requirements
- **3–5 years**
- of experience in applied AI, LLMOps, MLOps, or similar technical AI roles.
- Strong
Python
- expertise:
- Type hints, async programming, FastAPI
- Code reviews, evaluation scripts, prototyping pipelines
- Experience with LLM application patterns:
- RAG pipelines
- Prompt engineering
- Multi‑agent orchestration
- Solid background in
- supervised ML
- (scikit‑learn, XGBoost, LightGBM, or equivalent).
- Strong understanding of
- MLOps fundamentals
- :
- Model versioning
- Experiment tracking
- CI/CD deployment pipelines
- Monitoring and rollback strategies
- Hands‑on experience with:
- RLHF or human‑feedback‑driven improvement loops
- LLM/VLM/voice AI evaluation frameworks
- Agent memory architectures
- Working knowledge of:
• Vision Language Models (VLMs)
- Voice AI systems across latency, language, and hosting constraints
- Understanding of
- model optimisation techniques
- (quantization, distillation, ONNX).
- Experience using
- AI observability tools
- (Langfuse, LangSmith, or equivalent).
- Comfortable directing or overseeing
- external/vendor engineering teams
- .
- Ability to work independently in ambiguous and non‑standard infrastructure environments.
- Good to Have
- Experience with
- sovereign cloud
- or government‑regulated infrastructure.
- Familiarity with
- agentic AI frameworks
- (LangChain/LangGraph, CrewAI, PydanticAI).
- Exposure to
- federated learning
- or privacy‑preserving inference.
- Background in
- healthcare, insurance, or regulated domains
- .
- Experience building
- performance dashboards
- for non‑technical audiences.
Apply for this job in 1 click
Skip the repetitive application forms
Install the Base Career Chrome Extension and autofill job applications across major job boards with your profile.
Trusted by over 500,000 job seekers on Base Career
More from this employer
More jobs at Hays
AI Agent Engineer - 12 Months Contract
Dubai, UAE
The job holder is responsible for designing, building, and operating production-grade AI agents on Microsoft Foundry — LLM-powered systems that reason, use tools, and orchestrate multi-step workflows against enterprise d
Senior Computational Biologist
Dubai, UAE
Senior Computational Biologist United Arab Emirates Health-Tech / Bio-Tech *\ This role is a permanent position, requiring you to live and work in the UAE \ We are currently looking for a Senior Computational Biologist t
Advanced analytics lead
Dubai, UAE
We’re partnering with one of the major banks in the UAE to hire a cross functional technical leader to drive AI and advanced analytics across the organisation. This is a standout opportunity to shape how cutting-edge AI
Odoo Developer | Product Company
Riyadh, KSA
Descrizione del ruolo Stiamo cercando uno Sviluppatore Senior Odoo con comprovata esperienza nello sviluppo e nella personalizzazione della piattaforma Odoo. La risorsa sarà responsabile dell’analisi, progettazione, svil
Chartered Engineer - Marine Infrastructure Design
Newcastle Upon Tyne, QAT
Are you a Chartered Civil Engineer looking for your next career step? Do you have experience of marine infrastructure design? If so, this could be an opportunity for you. We're hiring a Chartered Civil Engineer to join a
Mechanical Design Engineer
Antrim, UAE
Hays are working with a leading global engineering and technology organisation seeking a Mechanical Design Engineer to join its high-performing product development team in County Antrim. This company designs and manufact
Senior Security Engineer - IAM (Broadcom IGA)
Dubai, UAE
Your new company You will be joining a Cyber Security advisory which delivers strategic consultancy and tailored information security solutions to leading enterprise businesses across the region. Known for its vendor-agn
Vice President of Artificial Intelligence
Abu Dhabi, UAE
Vice President of AI Smart Manufacturing Abu Dhabi, United Arab Emirates Permanent position, on-site We are looking for a VP of AI to lead the global enterprise AI agenda, for a company specialising in smart solutions fo
AI Agent Engineer - 12 Months Contract
Dubai, UAE
Senior Computational Biologist
Dubai, UAE
Advanced analytics lead
Dubai, UAE
Odoo Developer | Product Company
Riyadh, KSA
Chartered Engineer - Marine Infrastructure Design
Newcastle Upon Tyne, QAT
Mechanical Design Engineer
Antrim, UAE
Senior Security Engineer - IAM (Broadcom IGA)
Dubai, UAE
Vice President of Artificial Intelligence
Abu Dhabi, UAE