0 - 2 years

3 - 7 Lacs

Posted:-1 days ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

QA Engineer - Job Description
Key Responsibilities


  • Test product-specific use cases and validate end-to-end alerting workflows across monitoring systems.



  • Simulate incidents and test scenarios that trigger alerts in tools like Datadog, Prometheus, or similar monitoring platforms.



  • Verify that alerts raised in monitoring tools are correctly consumed and acted upon by downstream systems or automated workflows.



  • Understand alert rules so test cases are easier to design, execute, debug, and maintain (alert configuration will be handled by Developers/SREs, but QA must understand them).



  • Collaborate closely with engineering teams (Developers, SREs/DevOps) to improve detection, investigation, and automated incident response.



  • Analyze alert behaviour, validate incident pipelines, and ensure seamless integration across all monitoring and automation tools.



  • Identify gaps in monitoring, logging, and alert workflows and provide clear, actionable QA feedback.



  • Document test scenarios, alert behaviour, and monitoring workflows in a clear and reproducible manner.



Mandatory Skills


  • Monitoring Tools Expertise: Hands-on experience with at least one major monitoring system (Datadog or Prometheus), including working with alerts, dashboards, and troubleshooting.



  • Alert Simulation & Validation: Ability to trigger, simulate, and validate alert events end-to-end.



  • Incident Workflow Understanding: Strong understanding of how alerts propagate through monitoring systems and how automated systems respond to them.



  • Automation Mindset: Ability to use or write simple scripts (Python, Shell, etc.) to simulate workloads or events that trigger alerts.



  • Communication & Problem Solving: Ability to collaborate effectively with Developers and SRE/DevOps teams to ensure monitoring accuracy.



Good to Have


  • Experience with automated incident investigation or remediation tools.



  • Familiarity with CI/CD pipelines and integrating monitoring validation into pipelines.



  • Understanding of observability fundamentals metrics, logs, traces.



  • Exposure to infrastructure or SRE environments.



  • Basic knowledge of Kubernetes, Docker, or cloud platforms (AWS/GCP/Azure).


Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Infracloud Technologies logo
Infracloud Technologies

Cloud Computing, Technology

N/A

RecommendedJobs for You