Site Reliability Engineer with Microsoft Azure

10 - 15 years

15 - 30 Lacs

Posted: 3 hours ago | Platform: Naukri

Work Mode

Remote

Job Type

Full Time

Job Description

The customer

The project

Responsibilities:

  • Ensuring high availability, performance, and scalability of cloud infrastructure through proactive monitoring, automation, and continuous improvement. 
  • Designing and maintaining resilient Azure-based infrastructure using IaC (Terraform). 
  • Implementing end-to-end observability with telemetry, Critical User Journey (CUJ) metrics, dashboards, alerts, and real-time performance insights.
  • Monitoring CUJs with product and business teams to maintain a reliable user experience.
  • Conducting load testing, capacity planning, and performance tuning to prepare systems for traffic growth and spikes. 
  • Managing SLIs, SLOs, SLAs, and error budgets across critical services. 
  • Implementing next-generation cloud reliability and fault-tolerance solutions, including disaster recovery improvements. 
  • Identifying risks and preventing service disruptions through proactive reliability engineering. 
  • Automating deployments, scaling, failover, and remediation to reduce manual toil and operational bottlenecks. 
  • Leading incident response, participating in on-call rotations, conducting root cause analysis, and delivering blameless post-mortems. 
  • Creating and maintaining runbooks, documentation, and operational guidelines. 
  • Collaborating with engineering and global teams on reliability best practices; mentoring junior SREs and supporting SRE hiring.

Must-haves:

  • 10+ years of experience as an SRE in cloud and infrastructure teams.
  • At least 5 years of extensive experience with Microsoft Azure cloud services and infrastructure management.
  • Strong technical background with solid knowledge of software development principles, application production support, SDLC best practices, and Agile methodology.
  • Hands-on SRE experience with a strong understanding of SLOs, SLIs, error budgets, incident management, and conducting blameless post-mortems.
  • Strong ability to analyze and understand application architectures and identify areas for improvement.
  • Experience working with monitoring, logging, and observability tools to assess and improve application performance.
  • Proficiency in scripting and automation tools, including Python, Bash, and Terraform, to reduce toil and enhance operational efficiency.
  • Strong incident response and troubleshooting skills with the ability to perform effective root cause analysis.
  • Excellent communication and collaboration skills for working with cross-functional teams and clearly explaining technical concepts.
  • Ability to coach and mentor team members in SRE practices and foster a culture of reliability.
  • Practical experience applying Agile development practices and working in Agile teams.
  • Proactive mindset focused on continuous improvement to increase system reliability and performance.
  • Microsoft Azure certifications such as Azure Administrator, Azure DevOps Engineer, or Azure Solutions Architect are preferred.
  • English level of Intermediate+ or above.

Nice-to-haves:

  • Additional certifications in cloud computing, DevOps, or SRE practices.

Reasons why this job would be interesting to you:

  • Experience working with leaders in FinTech, Healthcare, Retail, Telecom, and other industries. Andersen works with businesses such as Samsung, Siemens, Johnson & Johnson, BNP Paribas, Ryanair, Mercedes, TUI, Verivox, Allianz, T-Systems, and others.
  • The opportunity to change the project and/or develop expertise in an interesting business domain.
  • Flexible working conditions: you can work fully remotely, from the office, or in a hybrid arrangement.
  • Guarantee of professional, financial, and career growth! The company runs mentoring and onboarding programs for each new employee.
  • The opportunity to earn up to an additional 1,000 USD per month, depending on your level of expertise, by participating in the company's activities; this amount is included in the annual bonus.
  • Access to the corporate training portal, which holds the company's entire knowledge base and is constantly updated.
  • Bright corporate life (parties / pizza days / PlayStation / fruits / coffee / snacks / movies).
  • Certification compensation (AWS, PMP, etc.).
  • Referral program.
  • English courses.
  • Private health insurance and compensation for sports activities.

Your personal data is protected in accordance with GDPR regulations. Learn more: https://andersenlab.com/privacy-policy

Join us!
