8 - 10 years
11 - 21 Lacs
Posted:21 hours ago|
Platform:
Hybrid
Full Time
Location Bangalore/Hyderabad/Mumbai
Exp 9yrs to 12yrsNotice period Immediate 30 daysJD as given below:
Primary skills: Kubernetes, AWS
Secondary skills: Grafana
Note- There would be Rotational shifts for this role
Key Responsibilities
Operations & AdministrationPerform daily health checks of Kubernetes clusters (AKS/EKS/GKE).Manage and troubleshoot pods, deployments, services, and namespaces.Apply upgrades, patches, and cluster maintenance activities as per SOPs.Handle incident tickets (P1/P2/P3), perform root cause analysis, and provide fixes or escalation.Monitoring & TroubleshootingMonitor cluster health and workloads using tools like Prometheus, Grafana, ELK/EFK, Azure Monitor, or CloudWatch.Resolve issues related to pod failures, node scaling, network policies, or storage volumes.Collaborate with application teams to resolve issues in containerized workloads.Security & ComplianceManage RBAC, secrets, and config maps as per enterprise governance policies.Perform image scanning, vulnerability patching, and apply compliance standards.Ensure clusters adhere to IT security and audit requirements.Automation & MaintenanceSupport CI/CD pipelines for deploying applications into Kubernetes.Use Helm/Kustomize for upgrades and configuration management.Automate repetitive operational tasks with scripts (Bash, Python, PowerShell).Collaboration & EscalationWork with Cloud Platform and Application teams on incident triage.Escalate complex design/architecture issues to the Cloud Engineering team.Provide on-call support and after-hours incident resolution when required.Required Skills & ExperienceHands-on experience with Kubernetes operations/support (24+ years).Strong knowledge of containers (Docker) and workload management.Experience with at least one cloud provider: Azure (AKS), AWS (EKS), or GCP (GKE).Familiarity with Helm, Kustomize, and CI/CD pipelines.Knowledge of monitoring tools (Prometheus, Grafana, ELK/EFK, Datadog).Good understanding of RBAC, networking basics (CNI, Ingress, DNS), and storage classes.Scripting knowledge (Bash, Python, PowerShell) for automating ops tasks.Strong troubleshooting skills for incidents in production environments.
Nice to Have
Exposure to GitOps tools (ArgoCD, Flux).Experience with logging/alerting integrations (PagerDuty, ServiceNow).Familiarity with FinOps practices in Kubernetes (cost monitoring, resource quotas).Basic knowledge of service mesh (Istio, Linkerd).
Soft Skills
Strong problem-solving and analytical thinking.Ability to work under pressure in P1/P2 incidents.Good communication skills for working with application and cloud teams.Willingness to work in 24x7 support model (rotational shifts).
If JD is matching to your profile kindly share details and updated resume on GAYATRI.RAJESH-GUPTA@CAPGEMINI.COM
Capgemini
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
hyderabad, bengaluru, mumbai (all areas)
11.0 - 21.0 Lacs P.A.
pune, bengaluru, delhi / ncr
16.0 - 25.0 Lacs P.A.
4.0 - 6.0 Lacs P.A.
chennai
3.0 - 8.0 Lacs P.A.
hyderabad, pune, gurugram
5.0 - 12.0 Lacs P.A.
kolkata, indore, mumbai (all areas)
5.0 - 15.0 Lacs P.A.
5.0 - 10.0 Lacs P.A.
4.8 - 6.0 Lacs P.A.
8.0 - 15.0 Lacs P.A.
bengaluru
0.5 - 0.7 Lacs P.A.