Operation Architect
Job Fit Check
Base Career helps you apply smarter for this job.
Key skills for this role
About the Role
The Operations Architect defines and governs the operational model for enterprise platform capabilities delivered by multiple vendors, ensuring solutions are production-ready, observable, secure, and supportable at scale.
Key Skills for This Role
Full Job Posting
Operations Architect
defines and governs the operational model for enterprise platform capabilities delivered by multiple vendors, ensuring solutions are production-ready, observable, secure, and supportable at scale.
The role designs end-to-end service management practices (SLOs/SLAs, monitoring, incident/change/problem management, DR, and capacity/cost controls) and ensures operational requirements are embedded from design through delivery.
Working with platform/cloud, security, and solution architects, as well as vendor teams and operations teams, the architect drives operations readiness reviews, creates runbooks and support processes, and enables a consistent, efficient operating model across cloud-agnostic deployments.
Duties & Responsibilities
- Define operational architecture and service management model across capabilities (ITIL-aligned where applicable).
- Establish observability standards: metrics/logs/traces/audits, OpenTelemetry instrumentation, dashboarding, alerting, and anomaly detection.
- Define SLOs/SLAs/OLAs, error budgets, and operational KPIs; ensure vendors deliver evidence and meet acceptance gates.
- Design incident management workflows (triage, escalation, RCA), integrate with ITSM, and standardize runbooks/playbooks.
- Define change and release management practices (CAB inputs, deployment rings, canary/rollback, feature flags coordination).
- Establish resiliency and DR requirements: backup/restore patterns, RPO/RTO targets, DR testing cadence, and failover runbooks.
- Define capacity, performance, and availability engineering processes (load testing, scaling policies, GPU/TPU capacity planning).
- Implement security operations integration: SIEM/SOAR alignment, alert routing, vulnerability/patch management SLAs.
- Define FinOps operational controls: tagging standards, showback/chargeback, budgets, anomaly detection, cost optimization playbooks.
- Lead operational readiness and handover: L1/L2/L3 training, reverse-shadowing, SOPs, and post-go-live stabilization plans.
Skills & Abilities
- Strong expertise in operating cloud-native platforms: SRE/ITIL practices, reliability engineering, and service management.
- Ability to turn NFRs into measurable SLOs, monitoring, and operational acceptance criteria.
- Solid understanding of observability stacks and telemetry design (OTel, APM, SIEM integration).
- Experience designing DR/BCP, backup strategies, and operational test plans in regulated environments.
- Proven capability to drive operational standardization across multiple vendors and teams.
Education & Background
- Bachelor’s degree in
Computer Science, Information Technology, Cybersecurity
- , or related field; Master’s degree highly preferred.
- 8+ years in operations architecture, SRE, DevOps leadership, or service management for enterprise platforms.
- Experience running production systems on Azure plus exposure to at least one other cloud (GCP/AWS) and hybrid setups.
- Experience with ITSM tooling and processes (incident/change/problem, CMDB), including KPI/SLA reporting.
- Proven experience with monitoring/APM and security operations integration (SIEM, vulnerability management).
- Certifications desirable: ITIL, SRE-related training, Azure/AWS/GCP ops certs, Kubernetes CKA/CKS (optional).
Preferred Tools
- Observability/APM: OpenTelemetry, Dynatrace/Datadog, Prometheus/Grafana/Loki/Tempo (as applicable)
- ITSM & operations: ServiceNow (or equivalent), CMDB, PagerDuty/Opsgenie-style on-call tooling
- Security & cloud ops: Microsoft Sentinel, Defender for Cloud, Azure Monitor/Log Analytics, Kubernetes tooling
Soft Skills
- Calm, structured leadership during incidents and high-pressure escalations
- Strong facilitation skills for readiness reviews, RCAs, and cross-vendor alignment
- Clear documentation and operational discipline (runbooks, SOPs, checklists)
- Continuous improvement mindset and ability to drive measurable reliability gains
- Strong collaboration and influencing skills across engineering, security, and vendor teams
Apply for this job in 1 click
Skip the repetitive application forms
Install the Base Career Chrome Extension and autofill job applications across major job boards with your profile.
Trusted by over 500,000 job seekers on Base Career
More from this employer
More jobs at Starlink Qatar
Field Force Manager
Doha, QAT
Role Summary The Field Force Manager (FFM) is responsible for leading and managing Enterprise Field Operations within a 24x7 Telecom Managed Services environment. The role is accountable for the end-to-end delivery of fi
Service Delivery Manager
Doha, QAT
Lead end-to-end delivery of Enterprise ICT Managed Services, ensuring SLA/KPI achievement, customer satisfaction, and continual service improvement. Manage incident, problem, and change management processes. Requires 12+
Treasury Analyst
Doha, QAT
Responsible for accurate and timely reconciliation of financial transactions across all major payment channels, ensuring integrity of treasury operations and prompt resolution of discrepancies. Investigative and analytic
Cloud Architect
Doha, QAT
Job Purpose The Cloud Architect is responsible for designing, implementing, and managing secure, scalable, and highly available cloud and hybrid-cloud architectures across Azure, Google Cloud (GCP), and on-premises HCI p
Network Technician
Doha, QAT
We are seeking a highly skilled and experienced Network Technician to join our team. The successful candidate will be responsible for the expert installation, maintenance, and repair of our passive network infrastructure
Senior TIBCO Developer (Remote)
Doha, QAT
We are looking for an experienced Senior Tibco Developer to join our team in a fully remote role. You will be responsible for developing, supporting, and enhancing our Tibco integration landscape while delivering high-qu
Commercial Manager - Telecommunications
Doha, QAT
The Commercial Manager is responsible for leading the commercial, financial, and contractual management of large-scale Telecommunications and ICT projects. The role ensures commercial viability, profitability, compliance
Assistant Venue Technology Manager
Doha, QAT
The Assistant Venue Technology Manager (AVTM) supports the Venue Technology Manager (VTM) in managing all Venue Technology operations across the assigned venue cluster. The role is responsible for assisting with the plan
Field Force Manager
Doha, QAT
Service Delivery Manager
Doha, QAT
Treasury Analyst
Doha, QAT
Cloud Architect
Doha, QAT
Network Technician
Doha, QAT
Senior TIBCO Developer (Remote)
Doha, QAT
Commercial Manager - Telecommunications
Doha, QAT
Assistant Venue Technology Manager
Doha, QAT