Site Reliability Engineer (AWS)

4 - 7 years

15 - 25 Lacs

Posted: 2 weeks ago | Platform: Naukri

Work Mode: Hybrid

Job Type: Full Time

Job Description

You work with the Site Reliability Engineering/Support and Development Teams to increase the reliability and quality of our solutions and pipelines.

  • You design, implement, and maintain automation frameworks for cloud and on-prem environments.
  • You work with AWS services to build and maintain scalable and secure cloud-based solutions.
  • You utilize your expertise in containerization and manage orchestration tools such as Kubernetes or AWS EKS to ensure smooth deployment and scalability.
  • You work with various Kubernetes cluster management products to optimize performance, reliability, and scalability for container-based environments.
  • You ensure continuous delivery through automated testing and monitoring, reducing time-to-market and minimizing deployment risks.
  • You utilize industry-standard infrastructure as code (IaC) tools (e.g., Terraform, Ansible) to provision, configure, and manage resources across private and public clouds.
  • You develop comprehensive monitoring and alerting solutions to ensure the health, performance, and uptime of systems with tools like Datadog, Grafana, LGTM, etc.
  • You develop, document, and enforce cloud operations and GitOps standards ensuring secure, efficient, and cost-effective cloud solutions.
  • You troubleshoot issues, perform root cause analysis, and implement fixes to improve the performance and reduce the latency of our systems and data pipelines.

Desired Qualifications:

  • Technical Expertise in Distributed Systems and Cloud
  • Proven track record in infrastructure automation using tools such as Terraform, Ansible, Chef, or Puppet.
  • Extensive experience designing and implementing CI/CD workflows with tools like Jenkins, GitHub Actions, GitLab CI, or similar.
  • Demonstrable expertise in DevOps patterns and best practices across full lifecycle deployments.
  • Hands-on experience with both virtualized and containerized environments.
  • Deep knowledge of Kubernetes and related tools (e.g., ArgoCD, Amazon EKS, Karpenter, Crossplane, Helm).
  • Cloud Platform Proficiency
  • Solid expertise in configuring and operating services on AWS Cloud.
  • Experience in architecting cloud-native solutions to meet business resilience, scalability, and security requirements.
  • Experience with monitoring and alerting systems (e.g., Datadog, Grafana, Coralogix, OpsGenie).

Additional Benefits:

  • Varied and challenging work content as well as good development opportunities
  • A role in which you can progress both professionally and personally
  • Attractive terms of employment and a lot of personal responsibility
  • A young and lively work environment as part of an international SAFe Agile Release Train with around 100 engineers

Nielseniq India

Information Services

Chicago, Illinois
