Platform Site Reliability Engineer
About This Role
Role Overview
We are seeking a Site Reliability Engineer to own the "Production Readiness" of our cloud-based AI solutions.
This hybrid role combines automated software testing and Site Reliability Engineering (SRE).
You will build the automated frameworks that validate our AI outputs and ensure the underlying Azure/AWS infrastructure is resilient, performant, and compliant with banking standards.
Key Responsibilities
- Resiliency Engineering (SRE): Implement "Chaos Engineering" and load testing to ensure web/mobile backends can handle banking-scale traffic. Maintain high availability through automated recovery scripts.
- Automated Regression: Build CI/CD-integrated test suites using Python that validate both the application logic and the infrastructure state (IaC validation).
- Observability & SLIs: Define and monitor Service Level Indicators (SLIs) and Service Level Objectives (SLOs). Set up advanced alerting in Azure Monitor or AWS CloudWatch to catch performance degradation before users notice it.
- Security & Compliance Testing: Automate security scans and compliance checks to ensure all AI data handling meets strict banking data residency and privacy protocols.
Technical & Professional Requirements
- Automation Stack: High proficiency in Python (for AI testing) and test automation frameworks (pytest, Selenium, or Robot Framework).
- Cloud Infrastructure: Strong hands-on experience with Azure or AWS, specifically regarding networking, scaling, and serverless reliability.
- AI/ML Understanding: Familiarity with prompt engineering and with evaluating AI model outputs (RAG evaluation, ROUGE/BLEU scores, or custom LLM benchmarks).
- Monitoring Tools: Experience with Grafana, Prometheus, or native cloud monitoring tools to build real-time reliability dashboards.
- FinOps Awareness: Ability to identify "expensive" failing tests or inefficient cloud resource usage during the testing phase.
Recommended Skillset & Tools
- Languages: Python (Mandatory), Bash scripting.
- Tools: GitHub Actions (CI/CD), Terraform (reading/validating), k6 or JMeter (performance testing).
- AI Frameworks: DeepEval, Ragas, or LangSmith (for automated AI evaluation).
Similar Jobs
Platform Site Reliability Engineer
Dicetek LLC · Abu Dhabi
Implement Chaos Engineering and automated testing using Python, while ensuring cloud infrastructure reliability and compliance in banking environments.
1 week ago