Site Reliability Engineer

4 - 8 years

25 - 40 Lacs

Posted: 1 day ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

Site Reliability Engineer (SRE 2/3)

(*Note: This is a requirement for one of Uplers' clients)

What do you need for this opportunity?

Must have skills required:

About Role:

SRE 2/3: Site Reliability Engineering (SRE)

What will you do?

  • Lead, mentor, and grow a team of 2-5 Site Reliability Engineers.
  • Define, implement, and advocate SRE best practices like SLAs, SLOs, SLIs, error budgets, and chaos engineering.
  • Build and maintain automated CI/CD pipelines and infrastructure using tools like Terraform, Jenkins, or GitHub Actions.
  • Own the observability stack: monitoring, alerting, logging, and tracing across microservices and platforms.
  • Improve reliability and scalability of services by proactively identifying bottlenecks and automating manual ops tasks.
  • Drive incident response practices including on-call rotations, runbooks, and blameless postmortems.
  • Ensure high availability and uptime across distributed systems hosted on AWS.
  • Collaborate with cross-functional teams to ensure the architecture is cloud-native, secure, and fault-tolerant.
  • Implement and optimize systems for cost-efficiency, auto-scaling, and performance.
  • Contribute to open source or write technical blogs to share insights and practices with the broader tech community.
  • This is a startup, so expect rapid changes and plenty of opportunities to take ownership and drive new initiatives.

Some Specific Requirements

  • At least 4 years of experience leading SRE/DevOps/Infrastructure teams, with 5+ years overall in backend, systems, or infrastructure roles.
  • Strong experience managing distributed systems and microservices at scale.
  • Good understanding of Linux, Networking, Load Balancing, and Security concepts.
  • Hands-on experience with AWS services like EC2, ELB, AutoScaling, CloudFront, S3, CloudWatch.
  • Experience with container technologies and orchestration; Docker and Kubernetes are a must.
  • Strong proficiency with Infrastructure-as-Code tools like Terraform, CloudFormation, or Pulumi.
  • Familiarity with observability tools like Prometheus, Grafana, ELK, or Datadog.
  • Programming/scripting skills in Python, Go, Bash, or similar for automation and tooling.
  • Understanding of message queues and event-driven architectures using Kafka or RabbitMQ.
  • Ability to manage incidents, write detailed postmortems, and improve reliability across teams and services.
  • Comfortable working in a fast-paced environment with a strong culture of ownership and continuous improvement.

About Client:

Fynd

How to apply for this opportunity?

  • Step 1: Click on Apply, then register or log in on our portal.
  • Step 2: Complete the screening form and upload your updated resume.
  • Step 3: Increase your chances of getting shortlisted and meet the client for the interview!

Uplers

Digital Services

Ahmedabad
