Work from Office
Full Time
Position Overview :
Roles and Responsibilities :
- Automate health checks, incident remediation, and reliability guardrails.
- Manage on-call rotations, conduct root cause analysis, and implement postmortem action
plans.
- Define and track SLOs, SLIs, and error budgets.
- Use chaos engineering and resilience testing to ensure fault tolerance.
Must Have Skills :
- Proficiency in Linux system internals, containers, and networking.
- Scripting/automation expertise in Python/Go/Shell.
- Familiarity with incident management, runbooks, and observability standards.
- Exposure to service discovery, DNS routing, and load balancing is a bonus.
Qualification : BE/BTech/MCA/ME/MTech/MS in Computer Science or a related technical field or equivalent practical experience.
Nomiso
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Practice Python coding challenges to boost your skills
Start Practicing Python Nowhyderabad, telangana, india
Salary: Not disclosed
ahmedabad, gujarat
Salary: Not disclosed
bengaluru
7.0 - 11.0 Lacs P.A.
gurugram
8.0 - 12.0 Lacs P.A.
hyderabad, telangana, india
Salary: Not disclosed
gurugram, haryana, india
Salary: Not disclosed
pune, maharashtra, india
Experience: Not specified
Salary: Not disclosed
greater hyderabad area
Experience: Not specified
Salary: Not disclosed
hyderabad, chennai, bengaluru
17.0 - 27.5 Lacs P.A.
12.0 - 16.0 Lacs P.A.