Site Reliability Engineer
Role overview
Design, build, and operate reliable, scalable production systems by applying software engineering to operations: automate toil, improve observability, manage incidents, and enable fast, safe delivery of features.
Key responsibilities
- Build and maintain robust production infrastructure and automation for deployment, scaling, and recovery.
- Implement and operate CI/CD pipelines, release orchestration, and infrastructure-as-code (Terraform/CloudFormation).
- Develop monitoring, alerting, tracing, and dashboards (Prometheus, Grafana, ELK, OpenTelemetry) to maintain SLIs/SLOs and error budgets.
- Lead incident response, runbooks, post-incident reviews, and implement corrective actions to reduce MTTR.
- Automate operational tasks and capacity management with tooling and scripts (Python, Go, Bash).
- Improve system reliability via chaos engineering, load testing, and performance tuning.
- Manage container orchestration platforms (Kubernetes/EKS/GKE/AKS) and platform components.
- Collaborate with development teams to improve service observability, deployment safety, and scalability.
- Enforce security best practices in runtime environments: secrets management, access controls, and vulnerability remediation.
- Mentor engineers on reliability patterns, runbook creation, and on-call practices.
Required skills & qualifications
- Degree in Computer Science or related experience; 3–7+ years in SRE/DevOps/Platform engineering (mid–senior).
- Hands-on experience with cloud platforms (AWS/Azure/GCP), IaC (Terraform), and CI/CD tooling.
- Strong Linux systems, networking, and container orchestration (Kubernetes) knowledge.
- Proficiency in scripting/programming (Python, Go, or Bash) and automation.
- Experience with observability stacks, incident management, and capacity planning.
- Familiarity with security controls, secrets management, and compliance considerations.
- Excellent troubleshooting, system design, and cross-team communication skills.
Desirable
- Experience with service meshes, policy-as-code, chaos engineering tools, and platform engineering frameworks.
- Certifications (CKA, cloud certs) and exposure to large-scale distributed systems.
Typical metrics / KPIs
- Service availability / SLO compliance and MTTR.
- Change failure rate and deployment lead time.
- Automation coverage (reduction in manual incidents) and operational cost efficiency.
- Mean time between failures and number of post-incident action items closed.
- Pay: QAR 45.00 per hour
- Work location: In person
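For candidates who want a concrete picture of the figures listed under Typical metrics / KPIs, here is a minimal Python sketch of how they are commonly computed. The `Incident` record, the function names, and the sample numbers are illustrative assumptions, not part of this posting or of any particular tool, and teams differ on exact definitions (for example, whether MTBF counts total time or uptime only).

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Incident:
    """Hypothetical incident record: when an outage started and when it was resolved."""
    started: datetime
    resolved: datetime


def total_downtime(incidents):
    """Sum of all incident durations."""
    return sum((i.resolved - i.started for i in incidents), timedelta())


def mttr(incidents):
    """Mean time to recovery: average incident duration (needs at least one incident)."""
    return total_downtime(incidents) / len(incidents)


def mtbf(incidents, window):
    """Mean time between failures: uptime in the window divided by the failure count."""
    return (window - total_downtime(incidents)) / len(incidents)


def availability(incidents, window):
    """Fraction of the window during which the service was up."""
    return 1 - total_downtime(incidents) / window


def error_budget_remaining(measured_availability, slo):
    """Share of the error budget left: 1.0 is untouched, 0 is spent, negative is overspent."""
    budget = 1 - slo                    # allowed unavailability
    used = 1 - measured_availability    # observed unavailability
    return 1 - used / budget


def change_failure_rate(deployments, failed_deployments):
    """Fraction of deployments that caused a failure in production."""
    return failed_deployments / deployments


# Sample data (made up): two outages of 10 and 20 minutes in a 30-day window.
start = datetime(2024, 1, 1)
incidents = [
    Incident(start + timedelta(days=3), start + timedelta(days=3, minutes=10)),
    Incident(start + timedelta(days=17), start + timedelta(days=17, minutes=20)),
]
window = timedelta(days=30)

avail = availability(incidents, window)
print("MTTR:", mttr(incidents))
print("MTBF:", mtbf(incidents, window))
print("Availability:", round(avail, 5))
print("Error budget remaining at a 99.9% SLO:", round(error_budget_remaining(avail, 0.999), 3))
print("Change failure rate:", change_failure_rate(40, 3))
```

In practice these numbers would come from an incident tracker and deployment logs rather than hand-written records; the sketch only shows how the metrics relate to one another.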