Home
Jobs

Lead Solutions Architect AI Infrastructure & Private Cloud

10 - 20 years

35 - 50 Lacs

Posted:1 day ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Job Title:


Job Summary:


Key Responsibilities:

1. Leadership & Strategy

  • Lead solution design and delivery of AI/HPC platforms (OpenShift, Rancher, etc.) aligned with business goals and NVIDIA AI Factory principles.
  • Ensure delivery assurance, stakeholder alignment, and risk management across the project lifecycle.

2. Architecture & Integration

  • Architect containerized and HPC workloads using platforms like Red Hat OpenShift, SUSE Rancher, Slurm, and PBS Pro.
  • Integrate NVIDIA AI Enterprise and open-source ML tools into broader software ecosystems.

3. Technical Engagements

  • Lead RFP/RFI responses, customer consultations, and PoCs to validate performance and integration.
  • Recommend optimal infrastructure configurations using reference architectures from clients and partners.

4. Innovation & Enablement

  • Stay updated on trends across HPC, Kubernetes, hybrid cloud, and security to drive innovation.
  • Mentor consultants and promote internal knowledge-sharing.

5. Customer-Centric Delivery

  • Act as a trusted advisor, translating complex technical concepts into business value.
  • Collaborate with infrastructure, DevOps, and data science teams for seamless delivery.

Required Skills & Experience:


AI Infrastructure & HPC:

  • Proficiency in workload schedulers (Slurm, PBS Pro), cluster management tools (HPCM, Base Command Manager), and high-speed networking (InfiniBand, Ethernet).

Containers & Orchestration:

  • Hands-on experience with Docker, Podman, Singularity; expertise in Kubernetes, OpenShift, Rancher.
  • Deep knowledge of NVIDIA GPU Operator, DCGM, and GPU optimization.

Operating Systems & Virtualization:

  • Advanced Linux admin skills (RHEL, SLES, Ubuntu) and experience with virtualization (KVM, OpenShift Virtualization).

Cloud, DevOps & MLOps:

  • Experience with hybrid cloud environments, CI/CD, IaC, and integrating ML frameworks into production workflows.
  • Understanding of cloud-native security, observability, and compliance.

Networking & Protocols:

  • Strong grasp of networking (TCP/IP, DNS, routing) and protocols like S3, NFS, SMB for hybrid data workflows.

Programming & Automation:

  • Scripting experience in Python, Bash, and automation of infrastructure/ML pipelines.

Soft Skills & Leadership:

  • Strong communication, project leadership, and stakeholder management skills.
  • Ability to align technical solutions with enterprise business objectives.

Qualifications:

  • Bachelor’s/Master’s in Computer Science or related field.
  • 8–10 years of experience in AI/ML, HPC, and container-based solutions.
  • Preferred certifications: RHCSA, RHCE, CKA, CKAD, CKS, NVIDIA Certified – AI Infrastructure & Ops.

Mock Interview

Practice Video Interview with JobPe AI

Start Ai/Ml Infrastructure Interview Now
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Linnk Outsource Solutions
Linnk Outsource Solutions

Business Process Outsourcing (BPO)

Business City

200-500 Employees

6 Jobs

    Key People

  • John Doe

    CEO
  • Jane Smith

    COO

RecommendedJobs for You

Hubli, Mangaluru, Mysuru, Bengaluru, Belgaum

Kolkata, Mumbai, New Delhi, Hyderabad, Pune, Chennai, Bengaluru

Pune, Chennai, Bengaluru