Staff DevOps Engineer
About This Role
What You ll Do
-
Define and own the infrastructure architecture strategy across hybrid cloud (GCP, OCI, on-prem)
-
Design and evolve Kubernetes platforms for multi-region scalability, resilience, and cost efficiency
-
Establish and drive Infrastructure as Code and GitOps standards across all environments
-
Build and scale observability systems (metrics, logging, tracing) to support proactive reliability
-
Lead efforts to meet and exceed SLA, SLO, and error budget targets
-
Own and improve incident management processes, including postmortems and systemic fixes
-
Drive automation strategy across provisioning, scaling, patching, and operations
-
Champion security, compliance, and reliability best practices (SOC 2, ISO 27001, CIS benchmarks)
-
Partner with engineering teams to improve developer experience and platform usability
-
Mentor engineers and act as a technical leader across DevOps and infrastructure domains
Must-Have Technical Skills
-
8+ years of experience in DevOps / SRE / Platform Engineering
Deep expertise in:
-
Linux systems and internals
-
Networking (TCP/IP, DNS, VPNs, routing, firewalls)
-
Kubernetes (design, operations, and troubleshooting)
-
Terraform (or equivalent IaC tools)
-
Strong programming/scripting skills (Python, Go, or similar)
-
Experience with multi-cloud or hybrid cloud environments (GCP, OCI preferred)
Additional Technical Experience
-
CI/CD systems (GitHub Actions, Jenkins, ArgoCD)
-
Containerization and orchestration (Docker, Kubernetes)
-
Observability stacks (Prometheus, Grafana, ELK, OpenTelemetry)
-
Datastores: PostgreSQL, MongoDB, Redis
-
Security and compliance frameworks (SOC 2, ISO 27001, CIS)
-
Performance optimization in distributed systems
Stand out from 400+ applicants.
Base Career rewrites your resume for this exact role in under 60 seconds.
Generate Resume for this JobFree plan available · No credit card required