Cloud Ops Engineer
_Job Summary_
A Cloud Ops Engineer is responsible for architecting, implementing, and managing highly available, scalable, and secure infrastructure across cloud platforms.
A key focus of this role is infrastructure automation, ensuring consistent, repeatable, and efficient provisioning and configuration of environments using Infrastructure as Code (IaC) and other DevOps and AI/ML Ops best practices.
The engineer enables seamless continuous integration and delivery (CI/CD), leverages AI/ML-driven monitoring tools for predictive analytics and system health, and collaborates cross-functionally to align infrastructure with agile development needs.
This role plays a pivotal part in accelerating deployment velocity, improving system reliability, and maintaining a resilient, production-grade infrastructure across hybrid and multi-cloud platforms.
_Job Responsibilities_
Design, develop, and maintain automated infrastructure solutions across multi-cloud environments (Azure, GCP) and on-premises systems, ensuring high availability, scalability, and security.
Develop and integrate AI/ML-based automation and AI agents to support infrastructure operations, including real-time monitoring, anomaly detection, auto-remediation, and self-healing capabilities.
Leverage AI-driven agents for incident triage and resolution, automating common support tasks and enabling intelligent decision-making during outages and performance issues.
Automate routine operational tasks and infrastructure workflows using scripting languages (Bash, Python, PowerShell) to reduce manual overhead and improve response times.
Implement infrastructure as code using tools like Terraform, Bicep, Ansible, Puppet, and Chef to provision, configure, and manage infrastructure in a repeatable and efficient manner.
Build and manage CI/CD pipelines using Azure DevOps, GitLab CI/CD, or Jenkins to automate the end-to-end delivery lifecycle for infrastructure and application code.
Deploy and orchestrate containerized workloads using Docker, Kubernetes, and NKP, supporting microservices-based architectures and scalable infrastructure deployments.
Implement AI-based predictive analytics for infrastructure capacity planning, performance tuning, and preemptive fault detection.
Configure and manage cloud networking components including VPNs, firewalls, and load balancers, ensuring secure and optimized connectivity.
Administer identity and access controls (IAM, RBAC) and manage secrets and encryption using Azure Key Vault and GCP KMS, aligning with security and compliance standards.
Monitor infrastructure health and performance using Prometheus, Grafana, and the ELK Stack, and integrate AI-powered observability to enhance root cause analysis and alert accuracy.
Optimize cloud costs using automation and AI-powered insights, including resource tagging, automated scaling, budgeting, and rightsizing recommendations.
Collaborate cross-functionally with development, IT, and security teams to support automated infrastructure provisioning, pipeline integration, and deployment orchestration.
Troubleshoot infrastructure and deployment issues across all environments, utilizing AI agents where possible to automate diagnostics and resolution.
Keep all development and testing environments fully automated, secured, and aligned with production standards to ensure environment consistency.
Maintain detailed documentation of infrastructure designs, automation workflows, AI agent integrations, CI/CD processes, and operational procedures.
Continuously evaluate and integrate emerging tools and technologies in infrastructure automation, AIOps, and DevSecOps to improve performance, reliability, and operational efficiency.
_Job Knowledge & Skills_
A solid understanding of cloud platforms like Azure and GCP, including how to deploy, manage, and monitor resources.
Familiarity with automation tools and the ability to write scripts for automating repetitive testing tasks are increasingly important.
Awareness of AI/ML-based tools and AI agents used in infrastructure automation, monitoring, and self-healing operations.
Good knowledge of containerization using Docker and orchestration platforms like Kubernetes for managing microservices.
Familiarity with SecOps practices to ensure infrastructure is secure by design.
Understanding of cloud cost management, including tagging, budgeting, and resource rightsizing.
Strong documentation and collaboration skills to work effectively with development, IT, and security teams.
_Job Experience_
Minimum 5 years of working experience and 3 years of relevant working experience; 2 years of GCC experience is a plus.
_Competencies_
Agility
AI Fluency
Build High-Performing Teams
Cloud Specific Skills L3
Data Center Network Architecture L3
IT Infrastructure and Application Integration L3
LAN Network Security L3
Leadership
Network Security L3
Provide Direction
Quality
Resilience
_Education_
Bachelor's Degree in Information Technology or Computer Science
Employer: UrbaCon Contracting & Trading Company
