Senior Manager, Infrastructure and SRE

8 years

0 Lacs

Posted:4 days ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

At SolarWinds, we’re a people-first company. Our purpose is to enrich the lives of the people we serve—including our employees, customers, shareholders, Partners, and communities. Join us in our mission to help customers accelerate business transformation with simple, powerful, and secure solutions.

The ideal candidate thrives in an innovative, fast-paced environment and is collaborative, accountable, ready, and empathetic. We’re looking for individuals who believe they can accomplish more as a team and create lasting growth for themselves and others. We hire based on attitude, competency, and commitment. Solarians are ready to advance our world-class solutions in a fast-paced environment and accept the challenge to lead with purpose. If you’re looking to build your career with an exceptional team, you’ve come to the right place. Join SolarWinds and grow with us!

Responsibilities:

  • Manage and lead a team of SREs responsible for ensuring the reliability and availability of our cloud infrastructure and services
  • Develop and implement site reliability engineering practices to improve service availability, performance, and scalability
  • Collaborate with cross-functional teams to design and implement new features and services that meet customer needs and business requirements
  • Develop and implement incident response plans and post-incident reviews to identify root causes and prevent future incidents
  • Monitor and analyze system performance metrics to identify and resolve performance bottlenecks
  • Build and maintain relationships with key stakeholders, including internal customers and vendors
  • Stay up-to-date with industry trends and emerging technologies related to site reliability engineering and cloud infrastructure

Qualifications:

  • Bachelor's degree in Computer Science or related field, or equivalent work experience
  • 8+ years of experience in site reliability engineering, infrastructure engineering, or a related field
  • 4+ years of experience managing and leading a global team of SREs or infrastructure engineers
  • Must have extensive experience with AWS, Kubernetes in a SaaS product with >$100M ARR
  • Experience with developing and implementing site reliability engineering practices and incident response plans
  • Programming skills with a high-level language like Python/Go and IaC tools like Terraform, Pulumi.
  • Expertise with distributed systems for large-scale data processing and stream processing.
  • Strong analytical and problem-solving skills
  • Excellent communication and interpersonal skills
  • Demonstrated ability to build and maintain relationships with internal and external stakeholders

SolarWinds is an Equal Employment Opportunity Employer. SolarWinds will consider all qualified applicants for employment without regard to race, color, religion, sex, age, national origin, sexual orientation, gender identity, marital status, disability, veteran status or any other characteristic protected by law.

All applications are treated in accordance with the SolarWinds Privacy Notice: https://www.solarwinds.com/applicant-privacy-notice

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You