Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
16.0 - 22.0 years
0 - 0 Lacs
bangalore, chennai, noida
On-site
Title: SRE Architect Exp: 16-22 years Location: Mumbai / Pune / Chennai / Hyderabad / Bangalore / Kolkata / Delhi / Noida/ Coimbatore The SRE Architect will play a critical role in designing and implementing Observable, Scalable, Reliable, and Resilient systems and applications that ensure the highest levels of availability and performance for the applications and services. This role requires a deep understanding of software engineering, system architecture, and operations, along with a passion to automate repetitive tasks with GenAI tools and scripts. Key Responsibilities: System Design and Architecture: Lead the design and architecture of scalable and reliable systems that meet the needs of our growing user base and business requirements. Automation and Tooling: Develop and maintain automation tools and frameworks that streamline operations and improve system reliability. Monitoring and Observability: Implement and enhance monitoring, logging, and alerting systems to ensure proactive detection and resolution of issues. Capacity Planning: Conduct capacity planning and performance tuning to ensure systems can handle current and future demands. Incident Management: Lead incident response efforts, perform root cause analysis, and implement corrective actions to prevent recurrence. Collaboration and Mentorship: Work closely with software engineers, DevOps, and other stakeholders to promote best practices in reliability engineering and provide mentorship to junior team members. Continuous Improvement: Identify areas for improvement in existing systems and processes, and drive initiatives to enhance system reliability and performance. Skillset: Experience: Overall 16-22 years of experience along with minimum of 9+ years of experience in site reliability engineering, DevOps, or a related field, with a proven track record of designing and implementing reliable systems at scale. Technical Skills: Strong programming skills in languages such as Python, Go, or Java/.Net. In-depth knowledge of cloud platforms (AWS, GCP, Azure) and container orchestration (Kubernetes, Docker). Experience with infrastructure as code (Terraform, Ansible, Puppet). Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk, AppDynamics, Dynatrace, ELK stack). Solid understanding of networking, security, and system performance tuning. Soft Skills: Strong problem-solving and analytical skills. Excellent communication and collaboration abilities. Ability to work in a fast-paced environment and manage multiple priorities. Passion for continuous learning and staying up-to-date with industry trends and technologies. Preferred Skillset: Experience with chaos engineering and resilience testing. Familiarity with service mesh architectures (Istio, Linkerd). Certifications in cloud platforms (Azure Certified Architect, AWS Certified Architect, Google Cloud Professional Architect, etc.).
Posted 1 week ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
31458 Jobs | Dublin
Wipro
16542 Jobs | Bengaluru
EY
10788 Jobs | London
Accenture in India
10711 Jobs | Dublin 2
Amazon
8660 Jobs | Seattle,WA
Uplers
8559 Jobs | Ahmedabad
IBM
7988 Jobs | Armonk
Oracle
7535 Jobs | Redwood City
Muthoot FinCorp (MFL)
6170 Jobs | New Delhi
Capgemini
6091 Jobs | Paris,France