Site Reliability Engineer, VP

12 years

0 Lacs

Posted:2 days ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Join our digital revolution in NatWest Digital XIn everything we do, we work to one aim. To make digital experiences which are effortless and secure.So we organise ourselves around three principles: engineer, protect, and operate. We engineer simple solutions, we protect our customers, and we operate smarter.Our people work differently depending on their jobs and needs. From hybrid working to flexible hours, we have plenty of options that help our people to thrive.This role is based in India and as such all normal working days must be carried out in India.Join us as a Site Reliability Engineer
  • In this key role, you’ll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services
  • You’ll enjoy significant stakeholder interaction, working in collaboration with engineers to ensure a principled approach to deliver change in a safe and secure way
  • This is a chance to join an inclusive team with a collaborative ethos and a commitment to innovation and professional development
  • We're offering this role at vice president level
What you'll do
As our Site Reliability Engineer, you’ll work closely with our feature team and other colleagues to meet defined service level objectives and continually improve systems and environments. You’ll define error budgets that support finding the right balance between risk and reliability.You’ll also provide structure and help to our release process, suggesting and making improvements where possible. You’ll scale systems sustainably through mechanisms like automation, evolving them by pushing for changes that improve reliability and velocity. We’ll also look to you to coach and provide guidance to colleagues and the wider team, leading where required.In addition to this, you’ll:
  • Proactively contribute new ideas and innovations to meet short term and longer-term goals
  • Continually balance and manage any potential risks
  • Be accountable for the day-to-day health of both production and non-production environments and respond to any incidents as required
  • Provide technical expertise and input to establish the risk tolerance of products and services
  • Communicate incident status updates clearly and frequently to other teams, customers and stakeholders
The skills you'll need
We’re looking for someone with strong knowledge of reliability systems thinking and experience of software engineering. You’ll need at least 12 years of experience of using a data driven and scientific approach to fact finding. We’ll also look for financial services knowledge, and the ability to identify wider business impact, risk and opportunity, and make connections across key outputs and processesWe’re also looking for:
  • Good knowledge and experience of programming languages
  • Experience in AWS, Git, GitLab, Terraform and Grafana along with understanding of Java and Microservices
  • Experience in deployment and release services, automation and troubleshooting
  • Experience of configuring and tuning standard observability tooling
  • Strong communication skills with the ability to proactively engage with a wide range of stakeholders

Mock Interview

Practice Video Interview with JobPe AI

Start Java Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Java Skills

Practice Java coding challenges to boost your skills

Start Practicing Java Now

RecommendedJobs for You