Senior AI Platform Engineer / Senior DevOps and AI Platform Engineer

9 - 14 years

25 - 40 Lacs

Posted:1 week ago| Platform: Naukri logo

Apply

Work Mode

Hybrid

Job Type

Full Time

Job Description

Senior AI Platform Engineer

Key Responsibilities

  • Platform as a Product:

    Define platform value propositions, roadmaps, SLAs, and feedback loops; treat internal developers as customers.
  • Golden Paths & Self-Service:

    Build reusable templates and paved paths for common developer tasks such as deploy, provision, observe, and rollback.
  • Toolchain Integration:

    Seamlessly integrate CI/CD, secrets management, policy, observability, and environment automation into the IDP.
  • Security by Default:

    Collaborate with security teams to embed guardrails and identity controls into the platform.
  • Reliability & Cost Ownership:

    Own platform SLOs, performance, and FinOps hygiene; continuously optimize stability and spend.
  • Adoption & Enablement:

    Lead onboarding, documentation, training workshops, and change management initiatives.

Required Experience

  • 7-10+ years in DevOps / Platform Engineering / SRE roles.
  • Minimum 3-4 years hands-on experience in building and operating IDPs or equivalent developer platforms.
  • Proven experience in building developer abstractions like templates, CLIs, APIs, or internal portals.
  • Experience leading cross-functional technical initiatives and mentoring junior engineers.
  • Experience in AWS or GCP

Technical Skills Must Have

  1. Cloud & Runtime

    : Strong hands-on experience with AWS or GCP, managing Kubernetes (multi-tenant clusters, autoscaling, admission controllers), Docker/OCI, GPU-aware environments (e.g., NVIDIA GPU Operator, CUDA drivers), and artifact registries.
  2. Infrastructure as Code & Configuration

    : Deep expertise in Terraform (modules, workspaces), Helm/Kustomize, and config tools like Ansible for reliable, scalable provisioning and environment automation.
  3. CI/CD, GitOps & Observability

    : Proven ability to build robust pipelines using GitHub Actions, GitLab CI, Jenkins, or Argo CD, with support for blue-green/canary deployments, rollbacks, and observability tools like Prometheus, Grafana, OpenTelemetry, and centralized logging (ELK/EFK).
  4. AI/ML Platform Enablement

    : Knowledge or experience supporting AI/ML workflows, including tools like MLflow, Vertex AI, Kubeflow, model registries, feature stores, and orchestration tools like Airflow or Argo Workflows. Familiarity with data versioning (DVC, lakeFS) and reproducible training pipelines.
  5. Security, FinOps & Developer Experience

    : Strong knowledge of OIDC/OAuth2, ABAC, secrets and policy integration, FinOps best practices (compute/GPU/storage optimization), and commitment to developer enablement (onboarding, documentation, feedback loops, secure-by-default patterns).

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Agco Corporation logo
Agco Corporation

Agriculture / Equipment Manufacturing

Duluth

RecommendedJobs for You