Senior Solutions Architect Cloud Infrastructure And DevOps
Job Fit Check
Base Career helps you apply smarter for this job.
Key skills for this role
About the Role
Job Requisition ID JR2016420 Job Category Professional Services Time Type Full time NVIDIA is looking for a Senior Cloud Infrastructure and DevOps Solutions Architect to join its NVIDIA Infrastructure Specialist Team.
Key Skills for This Role
Full Job Posting
Time Type
Full time
NVIDIA is looking for a Senior Cloud Infrastructure and DevOps Solutions Architect to join its NVIDIA Infrastructure Specialist Team.
Academic and commercial organizations around the world are using NVIDIA products to redefine deep learning and data analytics, and to power next-generation data centers.
Join the team building and advising on many of the largest and fastest AI/HPC systems in the world!
We are looking for someone who combines deep technical expertise with strong consulting and communication skills.
This role will engage directly with customers, partners, and multi-functional teams to assess, architect, and guide the implementation of large-scale infrastructure projects.
The scope spans system architecture, Kubernetes-based platforms, and automation—serving as both a trusted advisor and a hands-on technical leader.
What You’ll Be Doing
- Advise on and help maintain large-scale computational and AI infrastructure, including monitoring, logging, and workload orchestration (Kubernetes and Linux job schedulers).
- Provide consultative guidance and perform hands-on solving across the full stack—from bare metal and operating system, through the software stack, container platform, networking, and storage.
- Assess customer environments and recommend optimized, production-ready Kubernetes-based container platforms integrated with enterprise-grade networking and storage solutions.
- Serve as a key technical resource: develop, refine, and document standard methodologies and operational guidelines to be shared with internal teams and customer partners.
- Support Research & Development activities and engage in POCs/POVs to validate new features, architectures, and upgrade approaches.
- Create and deliver high-quality documentation, including runbooks, onboarding materials, and best-practice guides for customers and internal teams.
- Act as the technical leader for assigned customer accounts, providing strategic guidance on DevOps and platform architecture and influencing long-term infrastructure and operations decisions.
What We Need To See
- Education & Experience: BS/MS/PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or related fields (or equivalent experience), with 8+ years of professional experience in leading scalable cloud environments and automation engineering roles.
- Cloud & HPC Expertise: Shown understanding of networking fundamentals, data center architectures, and hands-on experience leading HPC/AI clusters, including deployment, optimization, and solving.
- NVIDIA GPU Expertise: Validated hands-on experience deploying, configuring, and optimizing NVIDIA GPU-accelerated infrastructure, including driver management, CUDA toolkit integration, and GPU workload profiling.
- Kubernetes & AI/ML Workloads: Extensive experience with Kubernetes for container orchestration, resource scheduling, scaling, and integration with GPU-accelerated and HPC environments.
- Hardware & Software Knowledge: Strong familiarity with HPC and AI technologies (CPUs, GPUs, high-speed interconnects) and supporting software stacks.
- Linux & Storage Systems: Deep knowledge of Linux (RedHat, Ubuntu), OS-level security, and protocols. Experience with storage solutions such as Lustre, GPFS, ZFS, XFS, and emerging Kubernetes storage technologies.
- Automation & Observability: Proficiency in Python and Bash scripting, configuration management, and Infrastructure-as-Code tools (e.g., Ansible, Terraform). Experience with observability stacks (Grafana, Loki, Prometheus) for monitoring, logging, and building fault-tolerant systems.
- Solution Architecture & Customer Engagement: Strong background in crafting scalable solutions and providing consultative support to customers, including leading architectural reviews and speaking publicly to executive partners.
Ways To Stand Out From The Crowd
- Knowledge of CI/CD pipelines for software deployment and automation.
- Experience working with NVIDIA GPU and Network Operators to manage automated resource lifecycle in Kubernetes environments.
- Solid hands-on knowledge of Kubernetes and container-based microservices architectures.
- Experience with NVIDIA GPU and Network Operator for automated GPU as well as network resources lifecycle management in Kubernetes environments.
- Experience with NVIDIA Base Command Manager (BCM) for provisioning, managing, and supervising GPU clusters at scale as well as background with RDMA-based fabrics (InfiniBand or RoCE) in HPC or AI environments.
Job Details
Role Level: Mid-Level Work Type: Full-Time Country: United Arab Emirates City: Dubai Company Website: http://nvda.ws/2nfcPK3 Job Function: Information Technology (IT) Company Industry/
About The Company
Searching, interviewing and hiring are all part of the professional life.
The TALENTMATE Portal idea is to fill and help professionals doing one of them by bringing together the requisites under One Roof.
Whether you're hunting for your Next Job Opportunity or Looking for Potential Employers, we're here to lend you a Helping Hand.
Report
Disclaimer: talentmate.com is only a platform to bring jobseekers & employers together.
Applicants are advised to research the bonafides of the prospective employer independently.
We do NOT endorse any requests for money payments and strictly advice against sharing personal or bank related information.
We also recommend you visit Security Advice for more information.
If you suspect any fraud or malpractice, email us at [email protected].
Apply for this job in 1 click
Skip the repetitive application forms
Install the Base Career Chrome Extension and autofill job applications across major job boards with your profile.
Trusted by over 500,000 job seekers on Base Career
More from this employer
More jobs at TALENTMATE
Design Coordinator - Underground Structures Rail
Abu Dhabi, UAE
Job Description Company Description Work with Us. Change the World. At AECOM, we're delivering a better world. Whether improving your commute, keeping the lights on, providing access to clean water, or transforming skyli
Principal Partner Manager - Channels EMEA GSI
Dubai, UAE
Job Description We are looking for an experienced Partner Sales Manager to ignite partnerships by identifying and recruiting new partners for long-term success and nurture ongoing relationships with key strategic Global
Human Resources Generalist
Abu Dhabi, UAE
Job Description RINA is currently recruiting for a Human Resources Generalist to join its office in Abu Dhabi within the Global Human Resources Division. Mission HR Administrator takes on a more advanced role with increa
UAE National Editorial And Design Project Coordinator
Dubai, UAE
Job Description Job Purpose The Creative & Editorial Project Coordinator drives operational excellence across the Creative & Editorial function. This role is responsible for managing project workflows, streamlining and i
Inspector I - Quality
Abu Dhabi, UAE
Job Description Petrofac is a leading international service provider to the energy industry, with a diverse client portfolio including many of the world’s leading energy companies. We design, build, manage and maintain i
Vice President - Compensation And Benefits PRandPR - Rewards And Recognition People And Intellectual Capital Group UAEN Only
Dubai, UAE
Job Description We are seeking a talented UAE National to join Mashreq UAE as a VP - Compensation & Benefits. In this role, you will play a crucial part in developing and implementing compensation and benefits strategies
法务合规专员
Dubai, UAE
Job Description VARASCAVASP VARASCA 1-3/ VARASCA Microsoft Office/Google Workspace/Lark * relocate Job Details Role Level: Associate Work Type: Full-Time Country: United Arab Emirates City: Dubai Company Website: htt
Senior Production Service Analyst
Dubai, UAE
Job Description 🚨 We're Hiring: Senior Production Service Analyst 🚨 Do you thrive on solving complex technical challenges, leading critical incidents, and ensuring customers receive exceptional service? We're looking f
Design Coordinator - Underground Structures Rail
Abu Dhabi, UAE
Principal Partner Manager - Channels EMEA GSI
Dubai, UAE
Human Resources Generalist
Abu Dhabi, UAE
UAE National Editorial And Design Project Coordinator
Dubai, UAE
Inspector I - Quality
Abu Dhabi, UAE
Vice President - Compensation And Benefits PRandPR - Rewards And Recognition People And Intellectual Capital Group UAEN Only
Dubai, UAE
法务合规专员
Dubai, UAE
Senior Production Service Analyst
Dubai, UAE