6 - 11 years

9 - 18 Lacs

Posted:None| Platform: Naukri logo

Apply

Work Mode

Hybrid

Job Type

Full Time

Job Description

Role & responsibilities

  • Continuously monitor enterprise tools (e.g., Dynatrace, SolarWinds, Splunk, Prometheus, ServiceNow Event Management, etc.).
  • Detect abnormal system behavior and service degradations through event correlation and anomaly detection.
  • Ensure monitoring thresholds and dashboards are accurately configured and optimized.
  • Analyze event patterns, reduce alert noise, and identify root symptoms from events.
  • Classify events based on severity and impact; escalate confirmed alerts as incidents when required.
  • Correlate events across infrastructure, application, and cloud layers to determine potential service impacts
  • Collaborate with L2/L3, Network, Cloud, and Application teams for real-time response.
  • Maintain event logs, shift handover reports, and status dashboards.
  • Provide inputs to weekly/monthly reporting on alerts, false positives, and event trends.
  • Support improvements in automation, noise reduction, and event workflow optimization.
  • Coordinate with the Observability and Automation teams to enhance proactive monitoring capabilities.
  • Participate in tool integration, testing, and continuous improvements in event management processes.
  • Support compliance audits by maintaining traceability of event detection to resolution.

Conduct periodic reviews of event rules and suppression filters to reduce false positives.

Preferred candidate profile

  • 7 to 9 years of experience in IT Operations, Event Monitoring, or Command Center/NOC.
  • Strong hands-on experience with event management tools like ServiceNow Event Management, Dynatrace, AppDynamics, Splunk, BMC TrueSight, or equivalent.
  • Knowledge of ITIL practices, especially Event Management, Incident Management, Problem Management.
  • Experience working in a 24x7 operations environment, including night/weekend shifts.
  • Understanding of cloud environments (AWS/Azure/GCP) and associated monitoring capabilities.
  • Ability to identify false positives and drive monitoring optimization efforts.
  • Good communication skills for shift handovers, war room coordination, and reporting.
  • Experience coordinating with multiple support teams under high-pressure scenarios.
  • Exposure to business-critical application support and service reliability practices.
  • Demonstrated ability to work independently and collaboratively in a distributed team model.

Preferred Qualifications

  • ITIL v4 Foundation certified.
  • Experience with observability platforms (Elastic, New Relic, Grafana, Prometheus, etc.).
  • Scripting skills (PowerShell, Python, etc.) for automation of alert validation or report generation.
  • Exposure to SRE practices or AIOps initiatives.

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You