About Us
Empowering Businesses with the Right Technology Solutions
Are you ready to partner with Starlly for your projects?
- Streamlining post-sales service management with Servy
- Empowering businesses with seamless IoT integration through Spectra
- Moving from legacy systems to digitisation or modern technology stacks
- Expert consultation on solution design and architecture
The OpenShift Container Platform (OCP) Operations Team is responsible for the continuous availability, health, and performance of OpenShift clusters that support mission-critical workloads. The team operates under a tiered structure (L1, L2, L3) to manage day-to-day operations, incident management, automation, and lifecycle management of the container platform. This team is central to supporting stakeholders by ensuring the container orchestration layer is secure, resilient, scalable, and optimized.
L1 – OCP Monitoring & Support Operations (Platform Technician)
Role Focus: Daily Ops, Monitoring, 1st-Level Support
Experience: 1–3 years
Resources: 5
Key Responsibilities:
- Perform 24x7 monitoring of clusters, nodes, pods, and services via oc CLI and OpenShift Console.
- Execute SOPs and health checks for clusters and platform components.
- Handle incident alerts, perform basic triage, and escalate to L2.
- Support basic administrative tasks (RBAC, Projects, ConfigMaps).
- Perform scheduled maintenance verifications and backups.
- Generate daily/weekly platform health reports.
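As an illustration of the kind of health check an L1 technician might script, here is a minimal sketch that summarizes node readiness from the JSON printed by `oc get nodes -o json`. The function name and the sample data are hypothetical and are not part of any existing SOP; the sketch relies only on the standard Kubernetes NodeList layout, where each node reports a `Ready` condition.

```python
import json


def summarize_node_readiness(node_list_json: str) -> dict:
    """Group node names into 'ready' and 'not_ready' using the Ready condition."""
    nodes = json.loads(node_list_json).get("items", [])
    summary = {"ready": [], "not_ready": []}
    for node in nodes:
        name = node["metadata"]["name"]
        conditions = node.get("status", {}).get("conditions", [])
        # A node counts as ready only if its Ready condition has status "True".
        is_ready = any(
            c.get("type") == "Ready" and c.get("status") == "True"
            for c in conditions
        )
        summary["ready" if is_ready else "not_ready"].append(name)
    return summary


# Hypothetical sample in the shape of `oc get nodes -o json` output.
SAMPLE = json.dumps({
    "items": [
        {
            "metadata": {"name": "worker-0"},
            "status": {"conditions": [{"type": "Ready", "status": "True"}]},
        },
        {
            "metadata": {"name": "worker-1"},
            "status": {"conditions": [{"type": "Ready", "status": "Unknown"}]},
        },
    ]
})

print(summarize_node_readiness(SAMPLE))
```

In practice the input would come from the live cluster, for example by saving the command output to a file and passing its contents to the function; the result could then feed a daily health report.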
L2 – OCP Support & Platform Engineering (Platform Analyst)
Role Focus: Advanced Troubleshooting, Change Management, Automation
Experience: 3–6 years
Resources: 5
Key Responsibilities:
- Analyze and resolve platform issues related to workloads, PVCs, ingress, services, and image registries.
- Implement configuration changes via YAML/Helm/Kustomize.
- Maintain Operators, upgrade OpenShift clusters, and validate post-patching health.
- Work with CI/CD pipelines and DevOps teams for build & deploy troubleshooting.
- Manage and automate namespace provisioning, RBAC, NetworkPolicies.
- Maintain logging, monitoring, and alerting tools (Prometheus, EFK, Grafana).
- Participate in CR and patch planning cycles.
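To give a sense of the namespace-provisioning automation an L2 analyst might build, the sketch below generates a default-deny ingress NetworkPolicy manifest for a given namespace. The helper name and the namespace `team-a` are hypothetical; the manifest itself uses the standard `networking.k8s.io/v1` NetworkPolicy schema, where an empty `podSelector` selects every pod in the namespace and an `Ingress` policy type with no rules blocks all inbound traffic.

```python
import json


def default_deny_ingress(namespace: str) -> dict:
    """Build a NetworkPolicy manifest that denies all ingress in a namespace."""
    return {
        "apiVersion": "networking.k8s.io/v1",
        "kind": "NetworkPolicy",
        "metadata": {"name": "default-deny-ingress", "namespace": namespace},
        "spec": {
            # An empty selector matches every pod in the namespace.
            "podSelector": {},
            # Declaring Ingress with no ingress rules denies all inbound traffic.
            "policyTypes": ["Ingress"],
        },
    }


# "team-a" is a hypothetical namespace used only for illustration.
manifest = default_deny_ingress("team-a")
print(json.dumps(manifest, indent=2))
```

The printed JSON could be saved to a file and applied with `oc apply -f`, or committed to a Git repository as part of a provisioning workflow. Real environments would normally add allow rules on top of this baseline.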
L3 – OCP Platform Architect & Automation Lead (Platform SME)
Role Focus: Architecture, Lifecycle Management, Platform Governance
Experience: 6+ years
Resources: 2
Key Responsibilities:
- Own lifecycle management: upgrades, patching, cluster DR, backup strategy.
- Automate platform operations via GitOps, Ansible, Terraform.
- Lead SEV1 issue resolution, post-mortems, and RCA reviews.
- Define compliance standards: RBAC, SCCs, Network Segmentation, CIS hardening.
- Integrate OCP with internal developer platform (IDP) tooling: ArgoCD, Vault, Harbor, GitLab.
- Drive platform observability and performance tuning initiatives.
- Mentor L1/L2 team members and lead operational best practices.
Core Tools & Technology Stack
- Container Platform: OpenShift, Kubernetes
- CLI Tools: oc, kubectl, Helm, Kustomize
- Monitoring: Prometheus, Grafana, Thanos
- Logging: Fluentd, EFK Stack, Loki
- CI/CD: Jenkins, GitLab CI, ArgoCD, Tekton
- Automation: Ansible, Terraform
- Security: Vault, SCCs, RBAC, NetworkPolicies