Member Of Technical Staff- DevOps & Site Reliability Engineer

3 - 8 years

13 - 17 Lacs

Posted:1 day ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

As a DevOps and SRE engineer, you will work to improve the reliability and performance of Pure Storage Flash Blades critical test infrastructure applications. You will be part of a team designing and developing DevOps & SRE services, as well as operating and maintaining the test infrastructure. This means monitoring SLO goals for uptime and latency, as well as helping colleagues leverage the features and workflows available to them.
We are looking for engineers who are passionate about reliability, devops, performance, and efficiency.

Responsibilities

  • Maintain and monitor test infrastructure systems such as databases, message queues, APIs, and distributed applications through the use of data and metrics such as SLOs and error budgets
  • Troubleshoot enterprise virtualization technologies based on VMs and containers such as VMWare, OpenStack, and Kubernetes
  • Support Jenkin based CI pipelines and maintain jenkin nodes VMs
  • Manage and support open Kubernetes, Nomad, LDAP, Active Directory, DNS, DHCP, NIS and Kerberos services
  • Support engineering projects through activities such as resource management, capacity planning, and general application lifecycle management
  • Collaborate with team members, across business units, and across multiple time zones to create high quality customer outcomes
  • Coordinate with vendor partners for operational support

Minimum Qualifications

  • Demonstrated coding ability with one or similar of the following: C, C++, Java, Python, or Go
  • Able to work in a 24x7 oncall rotation using a follow the sun model (i.e. 8am to 8pm local time pager duty, approximately 1 week every 2-3 months)
  • Systematic problem-solving approach, strong communication skills, and a sense of ownership and drive
  • Experience in analyzing performance & debugging Enterprise Systems

Preferred Qualifications

  • 3+ years of experience in DevOps, SRE, or Systems Administration roles
  • Understanding of Unix/Linux, and optionally Windows operating systems
  • Experience working with Infrastructure as Code / Automation tools (Ansible, Terraform, CloudFormation)
  • Experience with containers and container orchestration systems such as Docker and/or Kubernetes
  • Experience with Continuous deployment with ArgoCD or similar tools.
  • Experience configuring Layer2/Layer3 hybrid networks, and managing network services such as DNS, DHCP, and NTP
  • Understanding of hardware management services such as CIMC, or UCS Manager
  • Experience with monitoring platforms such as Nagios, Prometheus, and Grafana to comprehend service health and create dashboards
  • Experience with storage administration such as Pure Storage, including troubleshooting
  • Well organized, with ability to prioritize tasks and see them to completion

Mock Interview

Practice Video Interview with JobPe AI

Start Java Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Golang Skills

Practice Golang coding challenges to boost your skills

Start Practicing Golang Now

RecommendedJobs for You