Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in hyderabad
>
Management Health Solutions India
>
Sr Site Reliability Engineer

Sr Site Reliability Engineer

Management Health Solutions India

7 - 12 years

9 - 13 Lacs

hyderabad

Posted:2 days ago| Platform:

Apply

Skills Required

supply chain automation networking instrumentation healthcare oracle troubleshooting analytics monitoring sql

Work Mode

Work from Office

Job Type

Full Time

Job Description

Site Reliability Engineer (SRE)

Position Summary

The Site Reliability Engineer (SRE) will be a hands-on contributor within the Site Reliability Engineering Center of Excellence (CoE), responsible for building monitoring and observability solutions, troubleshooting production issues, and participating in 24x7 on-call operations.

This role focuses on the execution of reliability practices, implementing observability tooling, improving MTTR/MTTD through automation, and ensuring production systems are resilient, observable, and performant. The SRE will collaborate closely with Principal and Senior Staff SREs, adopting best practices and frameworks defined by the CoE while directly contributing to enterprise reliability goals. This role reports to the Sr. Manager, SRE.

Key Responsibilities

Execution & CoE Alignment

Implement SRE frameworks, best practices, and playbooks provided by the CoE.

Act as a hands-on engineer, contributing to observability, reliability, and incident response initiatives.

Partner with senior SREs and leadership to maintain consistency in monitoring and incident processes.

Contribute to automation projects that improve reliability and reduce manual work.

Observability & Monitoring

Build and maintain monitoring solutions with New Relic, Datadog, Prometheus, Grafana, CloudWatch, OpenTelemetry, Graylog.

Create and refine dashboards, metrics, and alerts for proactive anomaly detection.

Extend observability coverage across infrastructure, applications, APIs, and databases.

Reliability Engineering & Automation

Implement SLIs, SLOs, SLAs, and error budgets in partnership with product and platform teams.

Contribute to reducing MTTD and MTTR through improved instrumentation and automation.

Participate in capacity planning, resiliency testing, and scaling reviews.

Support chaos engineering and reliability validation activities.

Incident & Problem Management

Participate in incident response, including on-call rotations for 24x7 coverage.

Assist with root cause analysis (RCA) and implement corrective actions.

Ensure alignment with ITSM processes for incident, problem, and change management.

Contribute to playbooks and runbooks to strengthen on-call readiness.

Collaboration & Knowledge Sharing

Collaborate with Engineering, Product, Security, Cloud, and DevSecOps teams to embed reliability practices.

Provide input on instrumentation, monitoring hooks, and operational readiness for services.

Work with DBAs and platform teams on database observability and performance optimization.

Share knowledge within the SRE team and adopt practices from Staff and Principal SREs.

Qualifications & Experience

Required

7+ years in SRE, Operations, or Infrastructure Engineering.

Strong hands-on experience with monitoring and observability platforms.

Experience with tools such as New Relic, Datadog, Prometheus, Grafana, CloudWatch, OpenTelemetry, Graylog.

Proven experience in incident response, troubleshooting production issues, and improving MTTR/MTTD.

Good knowledge of SLIs, SLOs, SLAs, and error budgets.

Hands-on experience with AWS services (EC2, ECS, EKS, networking, scaling groups).

Proficiency in containers & Kubernetes (Docker, EKS).

Scripting/programming in Python, Go, or shell scripting.

Understanding of networking, distributed systems, and high-availability architectures.

Exposure to ITIL/ITSM processes.

Preferred

Experience in SaaS or healthcare environments.

Knowledge of databases (MongoDB, Elasticsearch, SQL Server, Oracle).

Familiarity with chaos engineering and resiliency testing.

Certifications: AWS Solutions Architect / DevOps Engineer, CKA/CKA

GHX: Its the way you do business in healthcare

GHX is a healthcare business and data automation company, empowering healthcare organizations to enable better patient care and maximize industry savings using our world class cloud-based supply chain technology exchange platform, solutions, analytics and services. We bring together healthcare providers and manufacturers and distributors in North America and Europe - who rely on smart, secure healthcare-focused technology and comprehensive data to automate their business processes and make more informed decisions.

It is our passion and vision for a more operationally efficient healthcare supply chain, helping organizations reduce - not shift - the cost of doing business, paving the way to delivering patient care more effectively. Together we take more than a billion dollars out of the cost of delivering healthcare every year. GHX is privately owned, operates in the United States, Canada and Europe, and employs more than 1000 people worldwide. Our corporate headquarters is in Colorado, with additional offices in Europe.

Disclaimer

Read our GHX Privacy Policy

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

Management Health Solutions India

Login to

Please Verify Your Phone or Email

Confirm Action

Sr Site Reliability Engineer