Site Reliability Engineer-AI production-automated testing ,Observability
Job Fit Check
Base Career helps you apply smarter for this job.
Key skills for this role
About the Role
**Urgent requirement for Site Reliability Engineer( AI production readiness automated testing ,Observability, SLIs, resilience) in banking domain required for our banking clients in Abu Dhabi ,UAE** **Hybrid role combines SRE and automated testing to ensure AI-driven cloud applications are production-ready, resilient, and compliant with banking standards.-** **-Must** **Strong expertise in Python-based testing frameworks (PyTest, Robot, or similar) & experience with Azure / A
Key Skills for This Role
Full Job Posting
Overview
Urgent requirement for Site Reliability Engineer( AI production readiness automated testing ,Observability, SLIs, resilience) in banking domain required for our banking clients in Abu Dhabi ,UAE
Hybrid role combines SRE and automated testing to ensure AI-driven cloud applications are production-ready, resilient, and compliant with banking standards.-
-Must
Strong expertise in Python-based testing frameworks (PyTest, Robot, or similar) & experience with Azure / AWS cloud platforms.--
Must
Hands-on observability tools (Prometheus, Grafana, ELK, Datadog) & experience defining and implementing SLIs/SLOs for distributed systems.-
-Must
Practical exposure to chaos engineering and load testing frameworks (Gremlin, Locust, Jmeter) & Familiarity with AI/ML evaluation tools for production readiness.--
Must
Strong background in security and compliance automation within regulated industries (banking/finance )--
Role Overview
We are seeking a Site Reliability Engineer (AI Production Readiness) to ensure our AI-driven cloud applications are production-ready, resilient, and compliant with banking standards.
This hybrid role combines SRE practices with automated testing expertise, focusing on reliability, observability, and proactive validation of both application logic and infrastructure.
Key Responsibilities
- Automated Validation Frameworks Design and implement Python-based automated testing frameworks to validate AI application logic, APIs, and cloud infrastructure.
- Resilience Engineering Conduct chaos testing, load testing, and fault injection to ensure systems withstand failures and maintain service continuity.
- SLIs/SLOs Definition Establish clear Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for AI workloads, ensuring measurable reliability targets.
- Observability & Monitoring Build proactive monitoring, alerting, and logging pipelines across Azure and AWS environments to detect anomalies before they impact users.
- Security & Compliance Implement automated compliance checks aligned with banking regulations, ensuring secure deployment pipelines and audit readiness.
- AI Evaluation Tools Integrate AI-specific evaluation frameworks to continuously assess model performance, fairness, and reliability in production.
- Skills: reliability,ai,automated testing
Apply for this job in 1 click
Skip the repetitive application forms
Install the Base Career Chrome Extension and autofill job applications across major job boards with your profile.
Trusted by over 500,000 job seekers on Base Career
More from this employer
More jobs at TAT IT Technolgies
Full stack .Net Developer-Microservice, RestfulAPI, Kafaka, ASP,MVC and SQL
Abu Dhabi, UAE
We have an urgent requirement for Full stack .Net Developer (Microservice, RestfulAPI, Kafaka, ASP,Netcore MVC and SQL) for one of our banking client in Abu Dhabi, UAE Design, develop, maintain, and support Dotnet back-e
Business analyst with experience in T24 core banking , digital banking
Doha, QAT
We have an urgent requirement Business analyst with experience in T24 core banking modules, digital banking, channels, payment, AML & Banking Domain with our client based in Doha Qatar Hands on Business Analysis experien
Senior Software Engineer / SME – Oracle BRM,PDC,OCOMC & BI Publisher
Doha, QAT
We have an urgent requirement for Senior Software Engineer / SME – Oracle BRM (Billing and Revenue Management)PDC (Pricing Design Center), OCOMC (Oracle Communications Offline Mediation Controller) & BI Publisher with ex
Systems Engineer -Window Server aligned with Wintel Squad in banking domain
Abu Dhabi, UAE
We have an urgent requirement for – Systems Engineer -Window Server aligned with Wintel Squad in banking domain is required for our banking clients in Abu Dhabi ,UAE Hands on Windows Server Operations and management, hig
Assistant Warehouse Manager in Construct Domain(must)
Sharjah, UAE
We have an urgent requirement for Assistant Warehouse Manager in Construct Domain(must) is required for one of our clients in Sharjah Hand on warehouse management experience in the construction industry.--Must Strong h
Vulnerability Management Specialist (Using Qualys & CVSSv3.1)
Abu Dhabi, UAE
We have an urgent requirement for Vulnerability Management Specialist (Using Qualys & CVSSv3.1) with experience in banking domain is required for our banking clients in Abu Dhabi ,UAE Conduct enterprise-wide vulnerabilit
Data Privacy Lawyer --IT Projects
Dubai, UAE
We have an urgent requirement for Data Privacy Lawyer --IT Projects for the client in Dubai, UAE Experience in data protection, privacy law, compliance, or technology law and familiarity with GDPR and comparable privacy
Systems Engineer - Messaging & Groupware ((Vulnerability Management))
Abu Dhabi, UAE
We have an urgent requirement for – Systems Engineer - Messaging & Groupware ((Vulnerability Management)) with experience in banking domain is required for our banking clients in Abu Dhabi ,UAE strong experience in vulne
Full stack .Net Developer-Microservice, RestfulAPI, Kafaka, ASP,MVC and SQL
Abu Dhabi, UAE
Business analyst with experience in T24 core banking , digital banking
Doha, QAT
Senior Software Engineer / SME – Oracle BRM,PDC,OCOMC & BI Publisher
Doha, QAT
Systems Engineer -Window Server aligned with Wintel Squad in banking domain
Abu Dhabi, UAE
Assistant Warehouse Manager in Construct Domain(must)
Sharjah, UAE
Vulnerability Management Specialist (Using Qualys & CVSSv3.1)
Abu Dhabi, UAE
Data Privacy Lawyer --IT Projects
Dubai, UAE
Systems Engineer - Messaging & Groupware ((Vulnerability Management))
Abu Dhabi, UAE