Observability Engineer

5 - 9 years

0 Lacs

Posted:2 weeks ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a Senior Observability Engineer, you will play a crucial role in leading the design, development, and maintenance of observability solutions across the infrastructure, applications, and services. You will collaborate with cross-functional teams to optimize system observability and enhance incident response capabilities. Your responsibilities will include: - Lead the Design & Implementation of observability solutions for both cloud and on-premises environments. - Drive the Development and maintenance of advanced monitoring tools like Prometheus, Grafana, Datadog, New Relic, and AppDynamics. - Implement Distributed Tracing frameworks such as OpenTelemetry, Jaeger, or Zipkin for application performance diagnostics. - Optimize Log Management and analysis strategies using tools like Elasticsearch, Splunk, Loki, and Fluentd. - Develop Advanced Alerting and anomaly detection strategies to proactively identify system issues. - Collaborate with Development & SRE Teams to enhance observability in CI/CD pipelines and microservices architectures. - Automate Observability Tasks using scripting languages like Python, Bash, or Golang. - Ensure Scalability & Efficiency of monitoring solutions to handle evolving business requirements. - Lead Incident Response by providing actionable insights through observability data. - Stay Abreast of Industry Trends in observability and Site Reliability Engineering (SRE). Qualifications Required: - 5+ years of experience in observability, SRE, DevOps, with a track record of managing large-scale distributed systems. - Expertise in observability tools like Prometheus, Grafana, Datadog, New Relic, AppDynamics. - Proficiency in log management platforms such as Elasticsearch, Splunk, Loki, and Fluentd. - Deep expertise in distributed tracing tools like OpenTelemetry, Jaeger, or Zipkin. - Experience with cloud environments (Azure, AWS, GCP) and Kubernetes. - Proficiency in scripting languages, Infrastructure as Code tools, and system architecture. - Strong problem-solving skills, ability to lead high-impact projects, and collaborate effectively. - Strong communication skills and the ability to work closely with engineering teams and stakeholders.,

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You

hyderabad, telangana, india