Home
Jobs

Senior Site Reliability Engineer

5 - 7 years

30 - 35 Lacs

Posted:23 hours ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

The Senior Site Reliability Engineer will be responsible for performance and availability of Compute and Network infrastructure consumed by all business segments. The Site Reliability teams are composed of highly talented individuals obsessively focused with availability through operational excellence. The ideal individual is relentlessly technical, passionate for automating everything and totally committed to delivering amazing customer experiences.

 

In this role, you will:

  • Establish performance baseline, capacity thresholds, correlate events, and define monitoring/alerting criteria
  • Develop automated solutions to address potential problems before they result in a service interruption
  • Provide impact assessment and mitigation plan for changes going into the production environment
  • Investigate root cause of severe and systemic outages, identify corrective actions and apply across the enterprise
  • Develop availability measures that align with consumer experience to accurately assess the usability of crucial services
  • Build capacity models to baseline transactional load compared to resource performance and leverage data to predict overall system capacity while automating load placement to avoid outages
  • Identify thresholds for all critical links in the data path to quickly isolate where imbalances may result in potential outages
  • Analyze failure points in services to model risk level and resolution steps if failure occurs.
  • Assist in driving architecture enhancements into system to mitigate potential failure points.
  • Programmatically monitor for and remediate configuration drift of critical devices
  • Develop response plans to potential failure points and evaluate effectiveness during planned tests
  • Perform comprehensive operational health checks of the entire services to identify areas of concern and track activities to drive improvements at all levels of the architecture
  • Provide technical coaching and direction to more junior teammates
Required Qualifications:
Bachelors Degree in Computer Science or STEM Majors (Science, Technology, Engineering and Math) with at least years of experience 5 years
 
Desired Qualifications:
  • Excellent knowledge of common operating systems (Unix/Linux, Windows)
  • Excellent knowledge of TCP/IP networking, and inter-networking technologies (routing/switching, proxy, firewall, load balancing etc.)
  • Demonstrated experience scripting or developing software and services for the cloud Ruby, Python, Go, Java, Node.js, .NET, etc.
  • Extensive Experience with Infrastructure Automation
  • Experience using an automated configuration management system (Terraform, Chef, Puppet, Ansible, Salt, etc.)
  • Experience deploying and managing infrastructure on public clouds such as AWS or Azure
  • Experience with configuring, customizing, and extending monitoring tools (Datadog, Sensu, Grafana, Splunk, etc.)

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You