Lead Site Reliability Engineer

10 - 15 years

15 - 20 Lacs

Posted: 16 hours ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

YOUR IMPACT:

  • We are looking for a Lead Site Reliability Engineer to maintain the reliability and scalability of GitLab infrastructure. The role relies heavily on Terraform to provision, configure, and automate cloud infrastructure across environments, and often includes on-call responsibilities for responding to production incidents and optimizing system performance.

WHAT THE ROLE OFFERS:

  • Infrastructure as Code (IaC) with Terraform: Design, write, and maintain Terraform modules to provision and manage cloud infrastructure (servers, networks, databases) across different cloud providers, ensuring consistency and repeatability.
  • GitLab CI/CD Integration: Leverage GitLab CI/CD pipelines to automate infrastructure deployments, testing, and monitoring, ensuring smooth integration with Terraform workflows.
  • Monitoring and Alerting: Build and maintain robust monitoring systems using tools like Graylog, Prometheus and Grafana to detect potential issues early and trigger alerts for timely response.
  • Incident Response: Participate in on-call rotations, and rapidly diagnose and resolve production incidents, using GitLab tools and Terraform state to identify root causes quickly.
  • Capacity Planning: Analyze system usage trends and proactively scale infrastructure to meet growing demands.
  • Automation: Automate repetitive operational tasks using scripting languages and GitLab features to reduce manual intervention and improve efficiency.

WHAT YOU NEED TO SUCCEED:

  • Experience with Terraform syntax and best practices, and with managing complex infrastructure configurations
  • Strong understanding of GitLab features such as CI/CD pipelines, issue tracking, and code review
  • Strong expertise in at least one cloud provider (AWS, Azure, or GCP) and its services; GCP preferred
  • Experience managing Linux systems, with knowledge of networking concepts and security best practices
  • Proficiency with monitoring tools such as Graylog, Prometheus, and Grafana, and with logging solutions
  • Ability to write scripts in languages such as Python and Bash to automate tasks

Education: B.Tech, BE, BCA, MCA

Opentext

Software Development

Waterloo ON
