Site Reliability Engineer (SRE)
Site Reliability Engineer (SRE) — Job Description Overview Ensure reliability, scalability, and performance of production systems by applying software engineering to operations, building automation, and improving incident response and observability.
Skills
About This Role
Overview
- Ensure reliability, scalability, and performance of production systems by applying software engineering to operations, building automation, and improving incident response and observability.
Key Responsibilities
- Design, build, and maintain production infrastructure, automation, and platform tooling to reduce toil and improve reliability.
- Define and track SLOs/SLIs, measure error budgets, and take remediation actions to meet availability targets.
- Implement and maintain CI/CD pipelines, deployment automation, and release strategies (blue/green, canary).
- Build monitoring, logging, tracing, and alerting systems; create dashboards and runbooks for on-call teams.
- Lead incident response, coordinate post-incident reviews (RCA/blameless postmortems), and drive corrective actions.
- Perform capacity planning, performance tuning, and resource optimization for services and infrastructure.
- Manage and operate container orchestration platforms (Kubernetes/EKS/GKE/AKS) and supporting services.
- Automate provisioning and configuration using IaC (Terraform, CloudFormation, Ansible) and manage secrets/configuration securely.
- Implement fault-tolerant architectures, disaster recovery, backup strategies, and multi-region designs.
- Collaborate with developers to improve observability, reliability, and operational readiness of services.
- Harden systems for security and compliance; implement patching, vulnerability scanning, and access controls.
- Mentor engineering teams on reliability best practices and contribute to SRE culture and tooling.
Required Skills & Qualifications
- 3–6+ years experience in SRE, DevOps, or production operations engineering (adjust per level).
- Strong experience with cloud platforms (AWS, GCP, Azure) and managed services.
- Proficiency with containerization and orchestration (Docker, Kubernetes) and related tooling (Helm, Istio/Linkerd optional).
- Experience with infrastructure-as-code (Terraform, CloudFormation) and configuration management.
- Strong scripting/programming skills (Python, Go, Bash) for automation and tooling.
- Familiarity with observability stacks (Prometheus, Grafana, Datadog, ELK/Opensearch, Jaeger/Zipkin).
- Deep understanding of networking, load balancing, storage, and OS internals (Linux).
- Experience implementing CI/CD (GitHub Actions, Jenkins, GitLab CI) and release automation.
- Proven incident management experience and ability to work under pressure.
- Strong collaboration, communication, and documentation skills.
Preferred
- Experience defining SLO/SLA frameworks and driving organization-wide adoption.
- Background in distributed systems, large-scale production services, or platform engineering.
- Experience with chaos engineering, fault injection, or resilience testing.
- Familiarity with policy-as-code (OPA, Sentinel), service meshes, and GitOps workflows.
- Certifications (CKA, AWS/Azure/GCP certs) or contributions to open-source SRE tooling.
- Pay: QAR15,321.44 - QAR22,214.09 per month
Your resume, rewritten
for this exact role.
Sign up free — Base Career tailors your CV to this job description in 60 seconds.
01 / 05
Resume Tailored to This Job

Your keywords, structure, and story — rewritten to match this exact role and pass ATS filters.
Free · No card · 60 seconds
02 / 05
Cover Letter for This Role, Done

Job-specific cover letters written in Gulf professional tone — ready in seconds, not hours.
Free · No card · 60 seconds
03 / 05
See How Well You Fit This Role

AI match score with clear reasons — know your fit before investing time in the application.
Free · No card · 60 seconds
04 / 05
Use Autofill When You Apply

Autofill any application form on Workday, LinkedIn, Bayt, Greenhouse — with your tailored content.
Free · No card · 60 seconds
05 / 05
Track It. Follow Up at the Right Time.

Visual pipeline for every application with AI-timed follow-up reminders so nothing slips.
Free · No card · 60 seconds
Similar Jobs
Site Reliability Engineer (SRE)
Software Developer · Doha
Network Engineer — Job Description Overview A Network Engineer designs, implements, and maintains an organization’s network infrastructure to ensure secure, reliable, and high-performance connectivity for users and servi
Skills
1 weeks ago
Tailor Resume↗Tailor Resume ↗DevOps / Site Reliability Engineer (SRE)
Reevez Innovations · Doha
Job Description: DevOps / Site Reliability Engineer (SRE) Reevez Innovations is inviting applications for the position of DevOps / Site Reliability Engineer (SRE) to join our innovative team in Doha, Qatar. This role is
Skills
3 weeks ago
Tailor Resume↗Tailor Resume ↗DevOps / Site Reliability Engineer (SRE)
Jurident Legal Services · Doha
Job Description: Jurident Legal Services is seeking a skilled DevOps / Site Reliability Engineer (SRE) to join our dynamic team. This role is critical in ensuring the reliability, scalability, and efficiency of our syste
Skills
3 weeks ago
Tailor Resume↗Tailor Resume ↗Site Reliability Engineer
Noor Urban Estates Pty · Doha
Role overview Design, build, and operate reliable, scalable production systems by applying software engineering to operations: automate toil, improve observability, manage incidents, and enable fast, safe delivery of fea
Skills
4 weeks ago
Tailor Resume↗Tailor Resume ↗DevOps / Site Reliability Engineer (SRE)
Supplier Post Qatar · Doha
Job Description: DevOps / Site Reliability Engineer (SRE) at Supplier Post Qatar Supplier Post Qatar is seeking a knowledgeable DevOps / Site Reliability Engineer (SRE) to join our IT operations team. This role is crucia
Skills
1 months ago
Tailor Resume↗Tailor Resume ↗DevOps / Site Reliability Engineer (SRE)
Data Innovations Qatar · Doha
Job Description Data Innovations Qatar is seeking an experienced DevOps / Site Reliability Engineer (SRE) to join our talented team. The ideal candidate will be responsible for enhancing our systems' reliability, perform
Skills
1 months ago
Tailor Resume↗Tailor Resume ↗2.2K+
Cover Letters & Follow-ups
1.8K+
Resumes Tailored
190.5K+
Jobs Tracked
Trusted by professionals at
Stop applying blindly.
Start getting hired.
Base Career automates the hardest parts of job searching — apply smarter, not harder.
AI Resume in 60s
Your resume rewritten for this exact role using the job description as the brief.
ATS-Optimized
Get past automated screening filters with the right keywords matched to each job.
Application Tracker
Track every job, follow-up, and interview in one visual kanban board.
Free plan · No credit card required