Lead Site Reliability Engineer

5 - 10 years

15 - 30 Lacs

Posted:2 days ago| Platform: Naukri logo

Apply

Work Mode

Hybrid

Job Type

Full Time

Job Description

Lead Site Reliability Engineer

Lead Site Reliability Engineers at UKG are critical team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering and auto remediation.

Lead Site Reliability Engineers must be passionate about learning and evolving with current technology trends. They strive to innovate and are relentless in pursuing a flawless customer experience. They have an automate everything” mindset, helping us bring value to our customers by deploying services with incredible speed, consistency, and availability

Job Responsibilities:

  • Engage in and improve the lifecycle of services from conception to EOL, including system design

consulting, and capacity planning

  • Define and implement standards and best practices related to: System Architecture, Service

delivery, metrics and the automation of operational tasks

  • Support services, product & engineering teams by providing common tooling and frameworks to

deliver increased availability and improved incident response.

  • Improve system performance, application delivery and efficiency through automation, process

refinement, postmortem reviews, and in-depth configuration analysis

  • Collaborate closely with engineering professionals within the organization to deliver reliable

services

  • Increase operational efficiency, effectiveness, and quality of services by treating operational

challenges as a software engineering problem (reduce toil)

  • Guide junior team members and serve as a champion for Site Reliability Engineering
  • Actively participate in incident response, including on-call responsibilities
  • Partner with stakeholders to influence and help drive the best possible technical and business outcomes

Required Qualifications

  • Engineering degree, or a related technical discipline, or equivalent work experience
  • Experience coding in higher-level languages (e.g., Python, JavaScript, C++, or Java)
  • Knowledge of Cloud based applications & Containerization Technologies
  • Demonstrated understanding of best practices in metric generation and collection, log aggregation pipelines, time-series databases, and distributed tracing
  • Working experience with industry standards like Terraform, Ansible
  • Demonstrable fundamentals in 2 of the following: Computer Science, Cloud architecture, Security or Network Design fundamentals Demonstrable fundamentals in 2 of the following: Computer Science, Cloud architecture, Security, or Network Design fundamentals

(Experience, Education, Certification, License and Training)

  • Must have at least 5 years of hands-on experience working in Engineering or Cloud
  • Minimum 5 years' experience with public cloud platforms (e.g. GCP, AWS, Azure)
  • Minimum 3 years' Experience in configuration and maintenance of applications and/or

systems infrastructure for large scale customer facing company

  • Experience with distributed system design and architecture

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
UKG logo
UKG

Human Resources Software

Lowell

RecommendedJobs for You