Sr Engineer DevOps with SRE

5 - 8 years

15 - 20 Lacs

Posted: 1 day ago | Platform: Naukri

Work Mode: Work from Office

Job Type: Full Time

Job Description

We are seeking a highly skilled and motivated DevOps and Site Reliability Engineer to join our team. As a DevOps/SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems, in troubleshooting and cloud infrastructure management, and in developing tools and applications. You will be responsible for incident management, release management, automation, infrastructure monitoring, POCs, writing new tools, and collaborating with cross-functional teams.

Requirements:

6 to 8+ years of Windows and Linux systems administration.

2+ years provisioning, operating, and managing AWS environments.

Proficient with AWS services: Compute and Network, Storage and CDN, Database, Analytics, Application Services, Deployment, and Management

Experience with multi-tier architectures: load balancers, caching, web servers, application servers, databases, and networking.

Familiarity with the AWS APIs.

AWS disaster recovery design and deployment across regions is a plus.

Experience in automation and testing via scripting & programming (TFS, PowerShell, Jenkins, Python, Ruby, Java)

Self-starter who is excited to solve a wide range of technical challenges.

Clear written and verbal communication.

Ability to manage your own time and work well both independently and as part of a team.

Required Skills:

Bachelor's degree in Computer Science, Engineering, or a related field.

Proven experience as a DevOps/Site Reliability Engineer or in a similar role, with a focus on high-availability production environments.

Strong understanding of cloud computing platforms, such as Amazon Web Services (AWS).

Experience with containerization technologies like Docker and orchestration frameworks like Kubernetes.

Proficiency in scripting or programming languages and frameworks, such as Python, Bash, Go, and Angular.

Solid understanding of Linux/Unix systems administration and troubleshooting.

Familiarity with monitoring and observability tools like Prometheus, Grafana, Elasticsearch, or Splunk.

Strong analytical and problem-solving skills, with the ability to diagnose and resolve complex technical issues.

Excellent communication and collaboration skills, with the ability to work effectively in cross-functional teams.

Knowledge of DevOps principles and practices, including CI/CD pipelines and version control systems (e.g., Git).

Must Have:

Certified in at least one of: AWS Certified DevOps Engineer – Professional, AWS Certified Developer – Associate.

Preferred Skills:

Familiarity with configuration management tools like Stash or SaltStack.

Understanding of networking concepts and protocols.

Knowledge of security best practices and experience with securing infrastructure and applications.

Certification in relevant technologies is a plus.

Roles and Responsibilities


Software, tools, and automation: Identify opportunities for automation and drive the development of tools and frameworks to improve system resiliency, efficiency, and performance. Collaborate with the development and operations teams to implement automation solutions. Implement systems that are highly available, scalable, and self-healing on the AWS platform. Design, manage, and maintain tools to automate operational processes on AWS. Build tools and processes to support the infrastructure. Automate security controls, governance processes, and compliance validation on AWS.

Collaboration: Work closely with developers to implement continuous delivery systems and methodologies on AWS.

Informally train and share AWS knowledge within the team. Provide training and documentation to enhance team knowledge and capabilities.

CI/CD: Implement CI/CD pipelines for automated software integration and deployment.

Infrastructure as code: Use Infrastructure as Code (IaC) to manage AWS resources programmatically. Experience with Terraform and CloudFormation is a plus.

Process and compliance: Ensure security and compliance standards are met within the AWS environment. Collaborate with cross-functional teams to streamline processes and optimize resources.

Cost optimization: Optimize AWS resource usage for cost-effectiveness and performance.

Incident Management: Respond to incidents and troubleshoot issues in AWS infrastructure and applications. Act as a key resource in incident management, responding promptly and effectively to incidents to minimize impact. Lead incident resolution efforts, working closely with stakeholders and subject matter experts.

Release Management: Manage the planning, coordination, and execution of releases across multiple environments. Ensure smooth release processes, including risk assessment, communication, and rollback strategies.

Candidates should be ready to work in a 24x7 shift environment.

Infrastructure Monitoring: Set up monitoring and logging solutions for application performance and security. Establish and maintain comprehensive monitoring systems to ensure high availability and performance of applications and services. Proactively identify potential issues and bottlenecks, and work towards their resolution. Define and deploy monitoring, metrics, and logging systems on AWS.

Collaboration: Work closely with cross-functional teams, including development, operations, and support, to understand requirements, address issues, and drive continuous improvement. Foster a collaborative and proactive culture within the organization.

Incident Post-Mortems: Conduct post-incident analysis and root cause investigations. Identify opportunities for process improvements and work with stakeholders to implement preventive measures.

Documentation: Maintain accurate documentation of system configurations, processes, and procedures. Contribute to the knowledge base and provide training and support to team members.

 
