Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in bengaluru
>
F5
>
Senior Principal Site Reliability Engineer

Senior Principal Site Reliability Engineer

15 - 20 years

17 - 22 Lacs

bengaluru

Posted:1 month ago| Platform:

Apply

Skills Required

kubernetes sre networking devops linux cluster management continuous integration python configuring indexing production golang ci/cd microsoft azure query optimization gcp grafana kafka debugging vector terraform shell scripting aws

Work Mode

Work from Office

Job Type

Full Time

Job Description

F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product.

Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products

Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration

Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) andparticipate in regular monitoring of infrastructure for stability.

Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems.

Scale & Resilient systems: Design & deploy systems/infra which ishighly available and resilient for the configured failure domains.

Design systems using strong security principles with security by default.

The Job Description is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change.

Knowledge, Skills and Abilities

Hands-on experience with the Cortex suite of observability tools, including Cortex, Loki, Tempo, and Prometheus integration for scalable, multi-tenant monitoring systems.

Proficient in deploying and managing Cortex in microservice environments, including configuration of distributors, ingesters, queriers, and store-gateways for high availability and performance.

Experienced with Grafana Mimir,including cluster setup, alerting, rule evaluation, and long-term metric storage at scale.

Skilled in optimizing Cortex/Mimir query performance, tuning compaction, and managing sharding/replication for massive telemetry workloads.

Familiar with integrating Cortex/Mimir with Grafana dashboards, Thanos, or Prometheus Remote Write to support observability-as-a-service use cases

Elasticsearch: Deep understanding of indexing strategies, query optimization, cluster management, and tuning for high-throughput use cases. Familiarity with slow query analysis, scaling, and shard management.

ClickHouse: Proven experience in designing and managing OLAP workloads, optimizing query performance, and implementing efficient table engines and materialized views.

Apache Kafka: Expertise in event streaming architecture, topic design, producer/consumer configuration, and handling high-volume, low-latency data pipelines. Experience with Kafka Connect and Schema Registry is a plus.

Vector (Datadog/Timber.io/Logs): Proficiency in configuring Vector for observability pipelines, including log transformation, enrichment, and routing to multiple sinks (e.g., Elasticsearch, S3, ClickHouse).

Hands-on programming experience in any one language python,golang + shell scripting.

Strong networking fundamentals and experience dealing with different layers of the networking stack.

SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues.

Experience in upgrading workloads for SaaS Services without downtime.

Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues.

GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD.

CI/CD: Experience working with/designing functional CI/CD systems.

Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure)

Qualifications

Typically, requires at least 15 years of related experience with a bachelors degree, 12+year and a masters degree, or a PhD with 10+ year of experience or equivalent experience.

Excellent organizational agility and communication skills throughout the organization.

Environment

Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged.

Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth.

Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

More Jobs at F5

Sr. Principal Site Reliability Engineer

Bengaluru

7 - 11 yrs

INR 17 - 22 Lacs

Sr Engineer, Software

Bengaluru

3 - 6 yrs

INR 6 - 11 Lacs

Engineer III, Software

Hyderabad

2 - 6 yrs

INR 8 - 12 Lacs

Associate Consultant

Bengaluru

1 - 4 yrs

INR 6 - 10 Lacs

Senior Rust Developer – Proxy Solution

Hyderabad

3 - 8 yrs

INR 6 - 9 Lacs

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.