Site Reliability Engineer
Overview
Khazna was founded in 2012 and has grown rapidly to become the leading, trusted wholesale data center provider in the Middle East and North Africa region.
Through our data centers, we provide industry-benchmark levels of power supply and cooling services to serve the growing need for data center operations in the UAE and the wider region.
We are seeking a Site Reliability Engineer to support the reliability engineering program across multiple data centers in our fleet.
Reporting to the Reliability Manager, you will be responsible for monitoring system performance, driving preventative and predictive maintenance initiatives, leading root cause analysis efforts, and collaborating with cross-functional teams to minimize downtime and enhance infrastructure resilience.
Key Accountabilities
- Monitor real-time and historical performance metrics for critical power, cooling, and IT systems.
- Analyse system data to identify trends, failure modes, and reliability risks.
- Execute Root Cause Analyses (RCA) and Failure Mode & Effects Analyses (FMEA), then drive corrective and preventive actions.
- Develop and maintain condition-based and predictive maintenance routines, leveraging IoT, data analytics, and machine learning tools.
- Support preventive maintenance programs: schedule, document, and validate maintenance activities.
- Assist in asset lifecycle planning, including upgrades, decommissioning, and end-of-life strategies.
- Contribute to capacity runway assessments to forecast infrastructure needs.
- Implement and enforce availability management plans, risk assessments, and mitigation strategies.
- Ensure data collection and reporting processes for reliability KPIs (e.g., MTBF, MTTR, availability) are standardized and accurate.
- Prepare reliability reports and dashboards; present findings and recommendations to site leadership.
- Respond to and lead failure-response efforts during site incidents, ensuring rapid recovery and root-cause follow-through.
- Maintain compliance with industry standards and regulations (Uptime Institute, ISO, ASHRAE).
- Collaborate with Operations, Engineering, Facilities, and Vendors to integrate reliability best practices into day-to-day workflows.
- Propose continuous-improvement initiatives and pilot emerging reliability technologies.
- The job holder may be required to undertake additional duties that may reasonably be expected and that form part of the function of the job.
Minimum Qualifications
- Bachelor’s degree in Mechanical, Electrical, Reliability, or a related Engineering discipline.
Minimum Experience
- 3+ years of experience in reliability engineering, maintenance engineering, or a data center operations environment.
- Hands-on experience with RCA, FMEA, and predictive maintenance methodologies.
- Proficiency with monitoring platforms, data-analytics tools, and scripting (e.g., Python, R).
- Familiarity with IoT sensors, machine-learning frameworks, and condition-based monitoring systems.
- Knowledge of industry reliability standards and regulations (ISO, ASHRAE, Uptime Institute).
Job-Specific Skills (Generic And Technical)
- Strong analytical and problem-solving skills, with acute attention to detail.
- Effective communicator, able to present technical findings to diverse audiences.
- Project coordination skills and the ability to manage multiple reliability initiatives.
- Collaborative mindset, comfortable working in cross-functional teams.
- Self-starter with a continuous-improvement attitude and commitment to resilience.
