Name: Jobpe
Address: T-Hub, Plot No 1/C, Sy No 83/1, Raidurgam panmaktha, Knowledge City Rd, Hyderabad, Telangana, 500081, IN
Telephone: +91-83339-09630
Price range: Free

Site Reliability Engineer, AVP Deutsche Bank

7.0 - 12.0 years

32 - 37 Lacs

Bengaluru

Work from Office

About The Role : Job TitleSite Reliability Engineer LocationBangalore, India Corporate TitleAVP Role Description You will work closely with application teams to ensure stable, well monitored applications that are resilient to faults. You will agree and review Service Level Objectives (SLOs) to achieve high availability for applications based on their criticality. You will maintain Error Budgets for the application teams and prevent releases in the event of production instability and reduced availability. You will focus on reducing manual toil, improving operational reliability and driving automation-first practices. This is a hands-on role with strong focus on implementing SRE practices and reducing toil for Developer Tools. What we'll offer you As part of our flexible scheme, here are just some of the benefits that youll enjoy Best in class leave policy Gender neutral parental leaves 100% reimbursement under childcare assistance benefit (gender neutral) Sponsorship for Industry relevant certifications and education Employee Assistance Program for you and your family members Comprehensive Hospitalization Insurance for you and your dependents Accident and Term life Insurance Complementary Health screening for 35 yrs. and above Your key responsibilities Drive stability, performance and reliability improvements for TDI Engineering applications. Build Monitoring and alerting solutions to alert in the event of failures/performance issues across TDI Engineering applications to help us providing the optimum service level to the users. Provide feedback loops to continually improve the application resilience across multiple application teams. Collaborate with product owners and engineering team to prioritize reliability and stability of these applications. Define, measure and maintain SLOs and Error Budgets to ensure availability for end users and to achieve appropriate levels of application stability. Identify opportunities for automation and self-service capabilities and implement them to eliminate toil for both the application teams and the SRE team to optimise effectiveness Manage outage resolution and agree actions to reduce the likelihood of failure happening in future by owning RCA and conducting blameless postmortems. Your skills and experience Bachelors degree from an accredited college or university with a concentration in Computer Science or IT-related discipline (or equivalent work experience or diploma). 8+ Years of Experience in IT in large corporate environments, specifically in controlled production environments. Demonstrable Site Reliability Engineering experience of at least 3+ Years. Excellent analytical and problem-solving skills Experience in implementing observability solution using any industry standard tools Scripting skills (Groovy, shell, Bash, Cron or any equivalent) Experience in mid-range technologies and platforms, i.e. UNIX/LINUX, ORACLE database and Nginx experience. Good to have Understanding and experience in Developer Tools (Jira, Confluence, Bitbucket, TeamCity, Artifactory, Udeploy) as an enterprise level Administrator experienced in managing applications with large user base. Knowledge and experience of observability tools like Grafana, Prometheus. How we'll support you Training and development to help you excel in your career Coaching and support from experts in your team A culture of continuous learning to aid progression A range of flexible benefits that you can tailor to suit your needs

Posted 1 month ago

Apply

Devops AWS DATA Engineeer|| Technical Analyst || Remote Engagement Vcloud Technologies Investment

7.0 - 11.0 years

0 - 1 Lacs

Hyderabad

Work from Office

We are seeking a highly skilled Devops Engineer to join our dynamic development team. In this role, you will be responsible for designing, developing, and maintaining both frontend and backend components of our applications using Devops and associated technologies. You will collaborate with cross-functional teams to deliver robust, scalable, and high-performing software solutions that meet our business needs. The ideal candidate will have a strong background in devops, experience with modern frontend frameworks, and a passion for full-stack development. Requirements : Bachelor's degree in Computer Science Engineering, or a related field. 7 to 10+ years of experience in full-stack development, with a strong focus on DevOps. DevOps with AWS Data Engineer - Roles & Responsibilities: Use AWS services like EC2, VPC, S3, IAM, RDS, and Route 53. Automate infrastructure using Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation . Build and maintain CI/CD pipelines using tools AWS CodePipeline, Jenkins,GitLab CI/CD. Cross-Functional Collaboration Automate build, test, and deployment processes for Java applications. Use Ansible , Chef , or AWS Systems Manager for managing configurations across environments. Containerize Java apps using Docker . Deploy and manage containers using Amazon ECS , EKS (Kubernetes) , or Fargate . Monitoring & Logging using Amazon CloudWatch,Prometheus + Grafana,E Stack (Elasticsearch, Logstash, Kibana),AWS X-Ray for distributed tracing manage access with IAM roles/policies . Use AWS Secrets Manager / Parameter Store for managing credentials. Enforce security best practices , encryption, and audits. Automate backups for databases and services using AWS Backup , RDS Snapshots , and S3 lifecycle rules . Implement Disaster Recovery (DR) strategies. Work closely with development teams to integrate DevOps practices. Document pipelines, architecture, and troubleshooting runbooks. Monitor and optimize AWS resource usage. Use AWS Cost Explorer , Budgets , and Savings Plans . Must-Have Skills: Experience working on Linux-based infrastructure. Excellent understanding of Ruby, Python, Perl, and Java . Configuration and managing databases such as MySQL, Mongo. Excellent troubleshooting. Selecting and deploying appropriate CI/CD tools Working knowledge of various tools, open-source technologies, and cloud services. Awareness of critical concepts in DevOps and Agile principles. Managing stakeholders and external interfaces. Setting up tools and required infrastructure. Defining and setting development, testing, release, update, and support processes for DevOps operation. Have the technical skills to review, verify, and validate the software code developed in the project. Interview Mode : F2F for who are residing in Hyderabad / Zoom for other states Location : 43/A, MLA Colony,Road no 12, Banjara Hills, 500034 Time : 2 - 4pm (Monday-26th May to Friday-30th May)

Posted 1 month ago

Apply

Data Engineer(DevOps) Kezan Consulting

4.0 - 9.0 years

3 - 8 Lacs

Noida, Gurugram, Delhi / NCR

Work from Office

Role & responsibilities Site Reliability Engineer Requirements: We are seeking a proactive and technically strong Site Reliability Engineer (SRE) to ensure the stability, performance, and scalability of our Data Engineering Platform. You will work on cutting-edge technologies including Cloudera Hadoop, Spark, Airflow, NiFi, and JOB DESCRIPTIONS 2 Kubernetesensuring high availability and driving automation to support massive-scale data workloads, especially in the telecom domain. Key Responsibilities • Ensure platform uptime and application health as per SLOs/KPIs • Monitor infrastructure and applications using ELK, Prometheus, Zabbix, etc. • Debug and resolve complex production issues, performing root cause analysis • Automate routine tasks and implement self-healing systems • Design and maintain dashboards, alerts, and operational playbooks • Participate in incident management, problem resolution, and RCA documentation • Own and update SOPs for repeatable processes • Collaborate with L3 and Product teams for deeper issue resolution • Support and guide L1 operations team • Conduct periodic system maintenance and performance tuning • Respond to user data requests and ensure timely resolution • Address and mitigate security vulnerabilities and compliance issues Technical Skillset • Hands-on with Spark, Hive, Cloudera Hadoop, Kafka, Ranger • Strong Linux fundamentals and scripting (Python, Shell) • Experience with Apache NiFi, Airflow, Yarn, and Zookeeper • Proficient in monitoring and observability tools: ELK Stack, Prometheus, Loki • Working knowledge of Kubernetes, Docker, Jenkins CI/CD pipelines • Strong SQL skills (Oracle/Exadata preferred) Job Description: • Familiarity with DataHub, DataMesh, and security best practices is a plus • Strong problem-solving and debugging mindset • Ability to work under pressure in a fast-paced environment. • Excellent communication and collaboration skills. • Ownership, customer orientation, and a bias for action Preferred candidate profile Immediate Joiner

Posted 1 month ago

Apply

Infrastructure Engineer Slide Craft Technologies

3.0 - 5.0 years

15 - 17 Lacs

Bengaluru

Work from Office

About the Role Own the deployment, scaling and hardening of our Kubernetes-based infrastructure. Automate end-to-end provisioning, ensure security and high availability, and troubleshoot production incidents. Key Responsibilities Kubernetes: Deploy, manage & optimize clusters (on-prem, EKS/GKE/AKS) IaC & GitOps: Automate with Terraform, Helm charts & Argo CD (or similar) CI/CD: Build/maintain pipelines (Jenkins, GitHub Actions, etc.) Monitoring: Implement Prometheus, Grafana & ELK for metrics, logs & alerts Troubleshooting: Diagnose container networking, storage & performance issues Security: Enforce RBAC, network policies & image-scanning best practices DR & Optimization: Define backup/restore strategies and cost-control measures Collaboration: Partner with dev teams on containerization and CI/CD workflows Required Qualifications 3-5 yrs in infrastructure, SRE or DevOps roles Hands-on Kubernetes (cluster lifecycle, Helm, CRDs) Linux administration & Bash scripting; networking tools (ip, netstat, tcpdump) IaC with Terraform/Ansible; deep Docker knowledge Monitoring with Prometheus/Grafana & ELK Automation scripting in Bash, Python or Go; Git proficiency; production debugging Preferred Skills Managed K8s services (EKS/GKE/AKS) Advanced IaC/GitOps (Argo CD, Terraform, Helm) Service mesh (Istio, Linkerd) Container security (Trivy, Clair) Custom tooling via Bash/Python automation

Posted 1 month ago

Apply

Staff Engineer (Marketplace & Incore) Innovaccer

8.0 - 13.0 years

50 - 85 Lacs

Noida

Work from Office

About the Role We are seeking a highly skilled Staff Engineer to lead the architecture, development, and scaling of our Marketplace platform including portals & core services such as Identity & Access Management (IAM), Audit, and Tenant Management services. This is a hands-on technical leadership role where you will drive engineering excellence, mentor teams, and ensure our platforms are secure, compliant, and built for scale. A Day in the Life Design and implement scalable, high-performance backend systems for all the platform capabilities Lead the development and integration of IAM, audit logging, and compliance frameworks, ensuring secure access, traceability, and regulatory adherence. Champion best practices for reliability, availability, and performance across all marketplace and core service components. Mentor engineers, conduct code/design reviews, and establish engineering standards and best practices. Work closely with product, security, compliance, and platform teams to translate business and regulatory requirements into technical solutions. Evaluate and integrate new technologies, tools, and processes to enhance platform efficiency, developer experience, and compliance posture. Take end-to-end responsibility for the full software development lifecycle, from requirements and design through deployment, monitoring, and operational health. What You Need 8+ years of experience in backend or infrastructure engineering, with a focus on distributed systems, cloud platforms, and security. Proven expertise in building and scaling marketplace platforms and developer/admin/API portals. Deep hands-on experience with IAM, audit logging, and compliance tooling. Strong programming skills in languages such as Python or Go. Experience with cloud infrastructure (AWS, Azure), containerization (Docker, Kubernetes), and service mesh architectures. Understanding of security protocols (OAuth, SAML, TLS), authentication/authorization, and regulatory compliance. Demonstrated ability to lead technical projects and mentor engineering teams & excellent problem-solving, communication, and collaboration skills. Proficiency in observability tools such as Prometheus, Grafana, OpenTelemetry. Prior experience with Marketplace & Portals Bachelor's or Masters degree in Computer Science, Engineering, or a related field

Posted 1 month ago

Apply

AWS PaaS Engineer (Java) Kansoft Solutions

8.0 - 13.0 years

10 - 20 Lacs

Hyderabad, Chennai, Bengaluru

Work from Office

Platforms: AWS PaaS, AWS DevOps Engineer Programming: Java, Monitoring Tools: Thousand Eyes, App Dynamics, CloudWatch, Grafana, Prometheus Java development (coding / scripting – 5-10 yrs) + AWS PaaS (min 3+ years) – SRE experience is advantage.

Posted 1 month ago

Apply

Senior Platform Engineer Torry Harris Business Solutions

5.0 - 8.0 years

12 - 18 Lacs

Bengaluru

Work from Office

Are you an experienced Platform Engineer looking for a new opportunity to showcase your skills and expertise? If so, then Torry Harris is looking for you! We are currently seeking a skilled and motivated individual to join our team and play a critical role in streamlining and automating our cloud infrastructure. As a Senior Platform Engineer at Torry Harris, you responsible to design, build, and maintain scalable infrastructure that supports software development and deployment. The ideal candidate will have expertise in cloud technologies, automation, and DevOps practices. Roles and Responsibilities • Design and maintain scalable, resilient any cloud infrastructure AWS is recommended. • Implement Infrastructure as Code (IaC) using Terraform, Ansible, or CloudFormation. • Automate provisioning, monitoring, and self-healing mechanisms. • Develop and enhance continuous integration & deployment pipelines. • Develop and maintain Helm charts, Kubernetes manifests, and custom operators. • Implement blue-green deployments, canary releases, and rollback mechanisms. • Ensure fast, reliable software delivery while minimizing downtime. • Integrate security scanning tools (SonarQube, Snyk) into CI/CD workflows. • Ensure secure configurations, RBAC policies, and compliance with industry standards. • Implement secrets management and identity access control in cloud environments. • Deploy monitoring tools (Prometheus, Grafana, Datadog) for real-time observability. • Lead root cause analysis,performance optimization for any platform releated issues. • Ensure system reliability using automated alerting and logging mechanisms • Implement monitoring, logging, and alerting solutions for Kubernetes workloads. • Troubleshoot and resolve issues related to container orchestration and networking. • Stay up to date with Kubernetes ecosystem developments and recommend improvements. • Mentor junior engineers and contribute to technical leadership within the DevOps team. • Work closely with developers, platform engineers, and SREs to optimize workflows. • Drive cross-functional collaboration to align DevOps strategies with business objectives.

Posted 1 month ago

Apply

Devops Support Engineer Acuity Knowledge Partners

3.0 - 6.0 years

15 - 20 Lacs

Pune, Gurugram, Bengaluru

Work from Office

Roles and Responsibilities Design and develop application health dashboards, alerting and notification delivery systems to help with observability of application stack in Azure cloud. Respond to incidents, perform root cause analysis, troubleshoot issues, and implement solutions to prevent recurrence. Act as gatekeeper for production deployments, participate in the application release cycles and perform production releases. Manage, and maintain environments hosting Credit, Swaps & FX FO IT microservices and data lake platform. Manage and maintain the lifecycle of core application suite that provide common capabilities such as continuous deployment, observability, and kafka streaming. Establish, deploy, and maintain CI/CD pipelines to automate the build, test, and deployment processes adhering to firms audit and compliance policies. Migration of on-prem build and deployment projects to adopt existing GitOps, cloud deployment pipeline pattern and branching policies. Assist the development teams in containerising, building, and migration of on-prem applications to Azure cloud. Setup, manage and maintain central observability solution for on-prem and cloud. Identify areas that benefit from automation and build automated processes wherever possible. Collaborate with infra teams to provision and manage infra resources required by FO IT development teams in Azure cloud. Implement backup and disaster recovery strategies and participate in annual DR tests and assist with executing the DR test plan. Create and maintain documentation related to common issues, fixes, deployment/release processes, transfer knowledge among DevOps and support team members to remove any key man dependencies Essential Criteria : 2 to 5 years of experience in a SRE/DevOps role preferably in Investment Banking with solid understanding of both. Strong knowledge of DevOps practices, tools, and technologies. Experience in working with, managing, and maintaining enterprise scale production application microservice environments, observability tools. Strong knowledge of containerization and orchestration of microservices. Experience with Docker/Podman, Helm, ArgoCD GitOps tool, Terraform. Experience with Azure Kubernetes Service, Azure Storage, and other Azure cloud related technologies. Experience with Prometheus, Grafana, Loki, Tempo, Grafana Agent, Azure Monitor logging and observability tools. Bamboo CI/CD tools, Bitbucket, GIT. Automation scripting (Bash, Powershell, Python). Be able to demonstrate a high level of professionalism, organisation, self-motivation, and a desire for self- improvement. Ability to plan, schedule and manage a demanding workload.

Posted 1 month ago

Apply

Lead Azure Cloud Operations Engineer || Immediate Joiner Compunnel

8 - 12 years

18 - 20 Lacs

Noida

Hybrid

Role Overview We are looking for a Lead Cloud Operations Engineer to join our growing team supporting key supply-side technology platforms, including Atlas Integration, GMX, Hotel APIs, and related microservices in Azure. This is a high-impact technical leadership role focused on Azure cloud operations, monitoring, performance, security, and incident resolution. You will be responsible for ensuring the availability, scalability, and reliability of cloud-hosted systems, mentoring a small operations team, and collaborating with developers, architects, and business stakeholders to drive continuous improvement. Key Responsibilities Own day-to-day operations and health of production and pre-prod environments hosted in Azure. Monitor infrastructure and applications using Azure Monitor, Application Insights, and Grafana. Lead the team in proactive incident detection, triage, resolution, and post-incident reviews (RCA, documentation). Implement and enhance automation for common operational tasks using PowerShell, Python, Azure CLI, and Terraform/Ansible. Act as escalation point for complex issues and high-severity incidents. Create, improve, and maintain runbooks, dashboards, alerts, and performance tuning metrics. Collaborate with development and DevOps teams to ensure operational readiness, deployment hygiene, and system resilience. Maintain strong governance around Azure resources, RBAC, policy enforcement, and tagging strategy. Lead disaster recovery planning, testing, and execution across critical systems. Drive cost optimization initiatives using Azure Cost Management and FinOps principles. Ensure compliance with security policies (ISO 27001, GDPR, SOC2) and assist in audits or security reviews. Support team mentoring, training, and promoting a strong culture of ownership and accountability. Required Skills & Experience Azure IaaS: Virtual Machines, Scale Sets, Load Balancer, Disks, Networking (VNETs, NSGs, UDRs, Private Links, Service Endpoints) Azure PaaS: App Services, Azure Functions, Logic Apps, Key Vault, Event Grid, Azure SQL, Application Gateway, Azure Front Door, Traffic Manager Azure Kubernetes Service (AKS) deployment, scaling, security & troubleshooting Azure Site Recovery (ASR), Azure Backup, and Disaster Recovery architecture Deep understanding of Azure Monitor, Application Insights, Log Analytics Ability to write and optimize KQL queries for diagnostics and dashboards Experience with Grafana, Prometheus, and alerting pipelines Hands-on experience with Terraform, Ansible, ARM templates Proficiency in scripting with PowerShell, Bash, and/or Python Experience with Azure DevOps Pipelines or similar CI/CD tooling is a plus RBAC, Managed Identities, Conditional Access, Key Vault integration Awareness of ISO 27001, SOC2, GDPR requirements in cloud environments Proven experience leading 24 engineers (including juniors/mid-levels) Strong verbal and written communication skills; able to interact with technical and non- technical stakeholders Experience participating in on-call rotations, owning major incidents, and delivering RCA reports Ability to train, mentor, and guide junior engineers Collaborative mindset with a strong sense of accountability and urgency Nice to Have Experience with multi-cloud (AWS or GCP) environments and hybrid cloud networking Experience working with microservices-based systems and APIs Exposure to FinOps practices and cloud cost management tools Certifications: AZ-305, AZ-104, AZ-500, AZ-700, AZ-400 preferred

Posted 1 month ago

Apply

Lead Site Reliability Engineer / SRE Augusta Infotech

10 - 13 years

18 - 25 Lacs

Bengaluru

Hybrid

Hiring, Lead Site Reliability Engineer with following skills and expertise. What will this person do? Provide leadership in designing and implementing reliable, scalable, and secure infrastructure solutions. Develop and maintain observability solutions, ensuring visibility into system performance using native Azure Cloud solutions. Define and track SLIs, ensuring compliance with SLOs and SLAs. Lead incident response efforts, conduct root cause analysis, and implement preventive measures to minimize downtime. Automate infrastructure provisioning, configuration and management using Terraform & Ansible. Build and maintain robust Observability pipelines to support automated deployments and continuous monitoring practices. Continuously analyze system health and optimize performance by identifying and resolving bottlenecks. Work with our BCDR team to minimize business impact during failures and measure the quality of services. Work with Cloud Governance team to monitor cloud infrastructure spending and implement cost-saving strategies. Implement centralized logging, metric collection, and distributed tracing for troubleshooting and debugging. Deploy, Manage and Monitor containerized workloads. Maintain configuration consistency and compliance across cloud environments using tools like Ansible. Partner with software development teams to integrate reliability best practices into the application development lifecycle. Conduct detailed post-mortems, document learnings, and drive improvements to reduce future incidents. Develop automation scripts in Python, Bash, or other languages to reduce manual efforts and improve efficiency. Provide mentorship to junior engineers, fostering a culture of learning and continuous technical growth. Research and evaluate new technologies, tools, and methodologies to improve system reliability and efficiency. Maintain detailed documentation on infrastructure, monitoring setups, incident responses, and best practices. Qualifications Bachelors degree in Computer Science, Engineering, or a related field. 10+ years in Observability, DevOps, and Site Reliability Engineering (SRE). At least 2 years of experience in defining Observability KPIs for both on-premises and cloud environments. Strong experience with cloud platforms (AWS, Azure, GCP) and cloud-native technologies. Passion for automation, reducing toil and implementing reliability-focused best practices. Deep knowledge of services/tools like Grafana, PowerBI, Prometheus, Azure Monitor, Application Insights & Azure Metrics. Expertise in Terraform, Ansible, Chef, and CI/CD pipeline tools like GitHub Actions, Jenkins, and GitOps methodologies. Working understanding of load balancing, authentication (AAA), encryption, and network parameters monitoring. Strong troubleshooting skills and experience handling on-call incidents and post-mortem analysis. Ability to work cross-functionally, drive technical discussions, and mentor junior engineers. Ability to work in a dynamic team environment and possess time management skills to meet deadlines. Sense of ownership and pride in your performance and its impact on the companys success. Critical thinker with problem-solving skills. Good interpersonal and communication skills.

Posted 1 month ago

Apply

Staff Site Reliability Engineer - Cloud Solutions Team Surveymonkey

21 - 31 years

50 - 70 Lacs

Bengaluru

Work from Office

What we’re looking for As a member of the infrastructure team at Survey Monkey, you will have a direct impact in designing, engineering and maintaining our Cloud, Messaging and Observability Platform. Solutioning with best practices, deployment processes, architecture, and support the ongoing operation of our multi-tenant AWS environments. This role presents a prime opportunity for building world-class infrastructure, solving complex problems at scale, learning new technologies and offering mentorship to other engineers. What you’ll be working on Architect, build, and operate AWS environments at scale with well-established industry best practices. Automating infrastructure provisioning, DevOps, and/or continuous integration/delivery. Provide Technical Leadership & Mentorship Mentor and guide senior engineers to build technical expertise and drive a culture of excellence in software development. Foster collaboration within the engineering team, ensuring the adoption of best practices in coding, testing, and deployment. Review code and provide constructive feedback to ensure code quality and adherence to architectural principles. Collaboration & Cross-Functional Leadership Collaborate with cross-functional teams (Product, Security, and other Engineering teams) to drive the roadmap and ensure alignment with business objectives. Provide technical leadership in meetings and discussions, influencing key decisions on architecture, design, and implementation. Innovation & Continuous Improvement Propose, evaluate, and integrate new tools and technologies to improve the performance, security, and scalability of the cloud platform. Drive initiatives for optimizing cloud resource usage and reducing operational costs without compromising performance. Write libraries and APIs that provide a simple, unified interface to other developers when they use our monitoring, logging, and event-processing systems. Participate in on-call rotation. Support and partner with other teams on improving our observability systems to monitor site stability and performance We’d love to hear from people with: 12+ years of relevant professional experience with cloud platforms such as AWS, Heroku. Extensive experience leading design sessions and evolving well-architected environments in AWS at scale. Extensive experience with Terraform, Docker, Kubernetes, scripting (Bash/Python/Yaml), and helm. Experience with Splunk, OpenTelemetry, CloudWatch, or tools like New Relic, Datadog, or Grafana/Prometheus, ELK (Elasticsearch/Logstash/Kibana). Experience with metrics and logging libraries and aggregators, data analysis and visualization tools – Specifically Splunk and Otel. Experience instrumenting PHP, Python, Java and Node.js applications to send metrics, traces, and logs to third-party Observability tooling. Experience with GitOps and tools like ArgoCD/fluxcd. Interest in Instrumentation and Optimization of Kubernetes Clusters. Ability to listen and partner to understand requirements, troubleshoot problems, or promote the adoption of platforms. Experience with GitHub/GitHub Actions/Jenkins/Gitlab in either a software engineering or DevOps environment. Familiarity with databases and caching technologies, including PostgreSQL, MongoDB, Elasticsearch, Memcached, Redis, Kafka and Debezium. Preferably experience with secrets management, for example Hashicorp Vault. Preferably experience in an agile environment and JIRA. SurveyMonkey believes in-person collaboration is valuable for building relationships, fostering community, and enhancing our speed and execution in problem-solving and decision-making. As such, this opportunity is hybrid and requires you to work from the SurveyMonkey office in Bengaluru 3 days per week. #LI - Hybrid

Posted 1 month ago

Apply

Senior Site Reliability Engineer II Surveymonkey

21 - 31 years

35 - 42 Lacs

Bengaluru

Work from Office

What we’re looking for As a member of the Infrastructure team at Survey Monkey, you will have a direct impact in designing, engineering and maintaining our Cloud, Messaging and Observability Platform. Solutioning with best practices, deployment processes, architecture, and support the ongoing operation of our multi-tenant AWS environments. This role presents a prime opportunity for building world-class infrastructure, solving complex problems at scale, learning new technologies and offering mentorship to other engineers. What you’ll be working on Architect, build, and operate AWS environments at scale with well-established industry best practices Automating infrastructure provisioning, DevOps, and/or continuous integration/delivery Support and maintain AWS services, such as EKS, Heroku Write libraries and APIs that provide a simple, unified interface to other developers when they use our monitoring, logging, and event-processing systems Support and partner with other teams on improving our observability systems to monitor site stability and performance Work closely with developers in supporting new features and services. Work in a highly collaborative team environment. Participate in on-call rotation We’d love to hear from people with 8+ years of relevant professional experience with cloud platforms such as AWS, Heroku. Extensive experience with Terraform, Docker, Kubernetes, scripting (Bash/Python/Yaml), and helm. Experience with Splunk, Open Telemetry, CloudWatch, or tools like New Relic, Datadog, or Grafana/Prometheus, ELK (Elasticsearch/Logstash/Kibana). Experience with metrics and logging libraries and aggregators, data analysis and visualization tools – Specifically Splunk and Otel. Experience instrumenting PHP, Python, Java and Node.js applications to send metrics, traces, and logs to third-party Observability tooling. Experience with GitOps and tools like ArgoCD/fluxcd. Interest in Instrumentation and Optimization of Kubernetes Clusters. Ability to listen and partner to understand requirements, troubleshoot problems, or promote the adoption of platforms. Experience with GitHub/GitHub Actions/Jenkins/Gitlab in either a software engineering or DevOps environment. Familiarity with databases and caching technologies, including PostgreSQL, MongoDB, Elasticsearch, Memcached, Redis, Kafka and Debezium. Preferably experience with secrets management, for example Hashicorp Vault. Preferably experience in an agile environment and JIRA. SurveyMonkey believes in-person collaboration is valuable for building relationships, fostering community, and enhancing our speed and execution in problem-solving and decision-making. As such, this opportunity is hybrid and requires you to work from the SurveyMonkey office in Bengaluru 3 days per week. #LI - Hybrid

Posted 1 month ago

Apply

Site Reliability Engineer Kezan Consulting

5 - 6 years

7 - 8 Lacs

Gurugram

Work from Office

Site Reliability Engineer Job Description: Requirements: We are seeking a proactive and technically strong Site Reliability Engineer (SRE) to ensure the stability, performance, and scalability of our Data Engineering Platform. You will work on cutting-edge technologies including Cloudera Hadoop, Spark, Airflow, NiFi, and Kubernetesensuring high availability and driving automation to support massive-scale data workloads, especially in the telecom domain. Key Responsibilities Ensure platform uptime and application health as per SLOs/KPIs Monitor infrastructure and applications using ELK, Prometheus, Zabbix, etc. Debug and resolve complex production issues, performing root cause analysis Automate routine tasks and implement self-healing systems Design and maintain dashboards, alerts, and operational playbooks Participate in incident management, problem resolution, and RCA documentation Own and update SOPs for repeatable processes Collaborate with L3 and Product teams for deeper issue resolution Support and guide L1 operations team Conduct periodic system maintenance and performance tuning Respond to user data requests and ensure timely resolution Address and mitigate security vulnerabilities and compliance issues Technical Skillset Hands-on with Spark, Hive, Cloudera Hadoop, Kafka, Ranger Strong Linux fundamentals and scripting (Python, Shell) Experience with Apache NiFi, Airflow, Yarn, and Zookeeper Proficient in monitoring and observability tools: ELK Stack, Prometheus, Loki Working knowledge of Kubernetes, Docker, Jenkins CI/CD pipelines Strong SQL skills (Oracle/Exadata preferred) Familiarity with DataHub, DataMesh, and security best practices is a plus Strong problem-solving and debugging mindset Ability to work under pressure in a fast-paced environment. Excellent communication and collaboration skills. Ownership, customer orientation, and a bias for action

Posted 1 month ago

Apply

Senior Devops Engineer Kansoft Solutions

4 - 8 years

7 - 17 Lacs

Udaipur

Work from Office

Kansoft Solutions Pvt. Ltd. is a fast-growing IT solutions provider delivering cutting-edge digital and cloud solutions to global clients. We are looking for a passionate and experienced Azure DevOps Technical Architect to join our team in Udaipur to lead cloud-based DevOps transformations. Key Responsibilities: Design and implement scalable, secure, and robust DevOps pipelines using Azure DevOps. Architect CI/CD solutions across multiple environments for both legacy and cloud-native applications. Collaborate with development, QA, and operations teams to streamline build, release, and deployment processes. Define and enforce DevOps best practices, infrastructure automation, and configuration management strategies. Evaluate and implement monitoring, alerting, and logging solutions to ensure system reliability and performance. Mentor development teams on DevOps culture, tools, and practices. Lead cloud migration and modernization initiatives for enterprise applications. Manage Infrastructure as Code (IaC) using ARM templates, Terraform, or Bicep. Required Skills & Qualifications: Strong hands-on experience with Azure DevOps , CI/CD Pipelines , and Git repositories . Expertise in Infrastructure as Code (Terraform/ARM/Bicep). Deep knowledge of Azure Cloud Services App Services, Functions, AKS, VNets, Key Vault, Storage, etc. Experience with Docker , Kubernetes , and container orchestration in cloud environments. Solid scripting knowledge using PowerShell , Bash , or Python . Proven experience in DevOps architecture design , cloud security, and system monitoring tools (like Prometheus, Grafana, App Insights). Familiarity with Agile/Scrum methodologies. Excellent problem-solving, leadership, and communication skills.

Posted 1 month ago

Apply

DevOps Lead - L1 Wipro

6 - 10 years

10 - 14 Lacs

Pune

Work from Office

About The Role Location – Pune only CBR 5-8 - 120K Job title Release Engineer LocationPune, India Experience 6-10 years Release Engineer Key Responsibilities: o Support CI/CD PipelinesAssist in managing and improving CI/CD pipelines for various applications, ensuring efficient and automated build, test, and deployment processes. o AutomationWork on scripting and automating routine tasks such as code deployments, server provisioning, and configuration management. o Monitoring & TroubleshootingMonitor build and deployment processes, identify and troubleshoot issues, and ensure the stability of the release process. o CollaborationCollaborate with development, QA, and operations teams to ensure smooth integration and deployment of new software releases. o DocumentationMaintain clear documentation for release processes, configurations, and deployment instructions. o Environment ManagementAssist in managing and maintaining development, staging, and production environments. o Configuration ManagementSupport the use of configuration management tools like Ansible, Puppet, or Chef to ensure consistency across environments. Must Haves : o Basic Knowledge of CI/CD ToolsFamiliarity with CI/CD tools like Azure-DevOps, GitLab CI, Jenkins, or similar. o Scripting Skills: Experience with scripting languages like Bash, Python, or PowerShell. o Version ControlUnderstanding of version control systems like Git. o Operating SystemsFamiliarity with Linux/Unix-based operating systems. Nice to Have: o Experience with containerization tools like Docker and orchestration tools like Kubernetes. o Exposure to Infrastructure as Code (IaC) tools like Terraform, Helm Charts o Basic understanding of networking, DNS, and load balancing. o Familiarity with monitoring tools like Prometheus, Grafana, or Nagios. ? Do Drive technical solution support to the team to align on continuous integration (CI) and continuous deployment (CD) of technology in applications Design and define the overall DevOps architecture/ framework to for a project/ module delivery as per the client requirement Decide on the DevOps tool & platform and which needs to be deployed aligned to the customer’s requirement Create a tool deployment model for validating, testing and monitoring performance and align or provision for resources accordingly Define & manage the IT infrastructure as per the requirement of the supported software code Manage and drive the DevOps pipeline that supports the application life cycle across the DevOps toolchain — from planning, coding and building, to testing, to staging, to release, configuration and monitoring Work with the team to tackle the coding and scripting needed to connect elements of the code that are required to run the software release with operating systems and production infrastructure with minimum disruptions Ensure on boarding application configuration from planning to release stage Integrate security in the entire dev-ops lifecycle to ensure no cyber risk and data privacy is maintained ? Provide customer support/ service on the DevOps tools Timely support internal & external customers escalations on multiple platforms Troubleshoot the various problems that arise in implementation of DevOps tools across the project/ module Perform root cause analysis of major incidents/ critical issues which may hamper project timeliness, quality or cost Develop alternate plans/ solutions to be implemented as per root cause analysis of critical problems Follow escalation matrix/ process as soon as a resolution gets complicated or isn’t resolved Provide knowledge transfer, sharing best practices with the team and motivate ? Team Management Resourcing Forecast talent requirements as per the current and future business needs Hire adequate and right resources for the team Train direct reportees to make right recruitment and selection decisions Talent Management Ensure 100% compliance to Wipro’s standards of adequate onboarding and training for team members to enhance capability & effectiveness Build an internal talent pool of HiPos and ensure their career progression within the organization Promote diversity in leadership positions Performance Management Set goals for direct reportees, conduct timely performance reviews and appraisals, and give constructive feedback to direct reports. Incase of performance issues, take necessary action with zero tolerance for ‘will’ based performance issues Ensure that organizational programs like Performance Nxtarewell understood and that the team is taking the opportunities presented by such programs to their and their levels below Employee Satisfaction and Engagement Lead and drive engagement initiatives for the team Track team satisfaction scores and identify initiatives to build engagement within the team Proactively challenge the team with larger and enriching projects/ initiatives for the organization or team Exercise employee recognition and appreciation ? Deliver No. Performance Parameter Measure 1. Continuous Integration, Deployment & Monitoring 100% error free on boarding & implementation 2. CSAT Manage service tools Troubleshoot queries Customer experience 3. Capability Building & Team Management % trained on new age skills, Team attrition %, Employee satisfaction score Mandatory Skills: Azure DevOps Operations. Experience5-8 Years. Reinvent your world. We are building a modern Wipro. We are an end-to-end digital transformation partner with the boldest ambitions. To realize them, we need people inspired by reinvention. Of yourself, your career, and your skills. We want to see the constant evolution of our business and our industry. It has always been in our DNA - as the world around us changes, so do we. Join a business powered by purpose and a place that empowers you to design your own reinvention. Come to Wipro. Realize your ambitions. Applications from people with disabilities are explicitly welcome.

Posted 1 month ago

Apply

Site Reliability Engineer 2 - Hadoop Phonepe

4 - 9 years

11 - 15 Lacs

Bengaluru

Work from Office

About PhonePe Group: PhonePe is Indias leading digital payments company with 50 crore (500 Million) registered users and 3.7 crore (37 Million) merchants covering over 99% of the postal codes across India. On the back of its leadership in digital payments, PhonePe has expanded into financial services (Insurance, Mutual Funds, Stock Broking, and Lending) as well as adjacent tech-enabled businesses such as Pincode for hyperlocal shopping and Indus App Store which is India's first localized App Store. The PhonePe Group is a portfolio of businesses aligned with the company's vision to offer every Indian an equal opportunity to accelerate their progress by unlocking the flow of money and access to services. Culture At PhonePe, we take extra care to make sure you give your best at work, Everyday! And creating the right environment for you is just one of the things we do. We empower people and trust them to do the right thing. Here, you own your work from start to finish, right from day one. Being enthusiastic about tech is a big part of being at PhonePe. If you like building technology that impacts millions, ideating with some of the best minds in the country and executing on your dreams with purpose and speed, join us! JOB DESCRIPTION Minimum of 1 year of experience in Linux/Unix Administration. Minimum of 2 years of hands on experience with managing infra on public cloud i.e Azure/AWS/GCP Over 4+ years of experience in Hadoop administration. Strong understanding of networking, open-source technologies, and tools. Familiar with best practices and IT operations for maintaining always-up, always-available services. Experience and participation during on-call rotation.Excellent communication skills. Solid expertise in Linux networking, including IP, iptables, and IPsec.Proficient in scripting and coding with languages such as Perl, Golang, or Python. Strong Knowledge of databases like Mysql,Nosql,Sql serverHand on experience with setting up , configuring and Managing Nginx as reverse proxy and load balancing in high traffic environments. Hands-on experience with both private and public cloud environments. Strong troubleshooting skills and operational expertise in areas such as system capacity, bottlenecks, memory, CPU, OS, storage, and networking. Practical experience with the Hadoop stack, including Hdfs,HBase,Hive, Pig, Airflow, YARN, HDFS, Ranger, Kafka, and Druid. Good to have experience with Design,develop and maintain Airflow DAGs and tasks to automate BAU processes,ensuring they are robust,scalable and efficient. Good to have experience with ELK stack administration. Experience in administering Kerberos and LDAP. Familiarity with open-source configuration management and deployment tools like Puppet, Salt, or Ansible.Responsible for the implementation and ongoing administration of Hadoop infrastructure. Experience in capacity planning and performance tuning of Hadoop clusters. Collaborate effectively with infrastructure, network, database, application, and business intelligence teams to ensure high data quality and availability. Develop tools and services to enhance debuggability and supportability.Work closely with security teams to apply Hadoop updates, OS patches, and version upgrades as needed. Troubleshoot complex production issues, identify root causes, and provide mitigation strategies. Work closely with teams to optimize the overall performance of the PhonePe Hadoop ecosystem. Experience with setting up & managing monitoring stack like OpenTsdb,Prometheus,ELK,Grafana,Loki PhonePe Full Time Employee Benefits (Not applicable for Intern or Contract Roles) Insurance Benefits - Medical Insurance, Critical Illness Insurance, Accidental Insurance, Life Insurance Wellness Program - Employee Assistance Program, Onsite Medical Center, Emergency Support System Parental Support - Maternity Benefit, Paternity Benefit Program, Adoption Assistance Program, Day-care Support Program Mobility Benefits - Relocation benefits, Transfer Support Policy, Travel Policy Retirement Benefits - Employee PF Contribution, Flexible PF Contribution, Gratuity, NPS, Leave Encashment Other Benefits - Higher Education Assistance, Car Lease, Salary Advance Policy Working at PhonePe is a rewarding experience! Great people, a work environment that thrives on creativity, the opportunity to take on roles beyond a defined job description are just some of the reasons you should work with us. Read more about PhonePe .

Posted 1 month ago

Apply

Cloud and Operations Consultant Nokia

5 - 10 years

10 - 15 Lacs

Noida

Work from Office

The Cloud and Operations Consultant is responsible for evaluating OSS Tools, processes, and technology needs, and translating requirements into the external clients' strategy. You will provide consulting services for OSS architecture, process, and operations design, while bringing excellence in Agile project execution with an inclusive customer engagement plan and effective tracking of project risks. You have: Master's or Bachelor's Degree in Computer Science, Engineering, or a related field (preferred) Minimum 5+ years of experience in OSS consulting or a related role within the telecommunications industry Knowledge of OSS functionalities and processes like inventory management, service provisioning, trouble ticketing, etc. Awareness of industry standards and frameworks (TM Forum Open APIs, AIOps, ODA, eTOM), experience in these areas is a key differentiator Previous Agile Project Management It would be nice if you also had: Excellent research and problem-solving skills. Awareness of Cloud Automation use cases and technology stack Analyze client requirements, identify OSS improvement opportunities, and review documentation like architecture diagrams, HLDs, and LLDs. Stay informed about OSS tools, industry standards (TM Forum, ETSI, 3GPP), and emerging trends like cloud-native technologies and automation. Develop understanding of Cloud Automation use cases (CI/CD automation, test automation, CM automation, Observability) and technology / open-source tools in GitOps (GitLab /Jenkins), Monitoring (Prometheus / Graphana), Logging (ElasticSearch), Service Mesh (K8/ Docker), SaaS/PaaS (AWS / Azure / GCP). Collaborate with other projects and cross-functional teams (development, operations, sales) to ensure successful project delivery. Develop and track Nokia Cloud and Network Services project plans in Jira, generate reports, and assess project risks while managing the consulting backlog. Create dashboards and financial models using Power BI and Excel, supporting data-driven decision-making. Research and enhance OSS & cloud operations best practices, focusing on processes, roles, tools, and metrics. Support senior team members in operations process improvements, market research, service development, and authoring reports.

Posted 1 month ago

Apply

Senior R&D Engineer Nokia

4 - 9 years

8 - 13 Lacs

Bengaluru

Work from Office

As a Senior R & D Engineer, youll design, automate, and optimize scalable cloud-native systems using Kubernetes, Helm, and CI/CD. You'll also manage MariaDB/MySQL, integrate Kafka for real-time streaming, and deploy microservices on cloud platforms. Your role involves troubleshooting, performance tuning, and working closely with teams to streamline processes. Your expertise in container orchestration, automation, and monitoring tools like Prometheus and ELK stack will be key to scalable infrastructure. You have: Bachelor's or Master's degree in Engineering or equivalent with 4+ years of experience in Kubernetes architecture, container orchestration, and Helm for managing applications. Experience with MariaDB/MySQL, including SQL query optimization and performance tuning. Expertise in microservices architecture, containerization, and cloud-native application development. Ability to diagnose and resolve issues using tools like Prometheus, Grafana, and ELK stack. Experience with Apache Kafka, including producers, consumers, and real-time data pipelines. It would be nice if you also had: Familiarity with automated deployment strategies, rollback mechanisms, and CI/CD tools. Knowledge of scaling strategies for cloud applications and database performance tuning. Understanding of secure development practices and cloud security best practices. Design, develop, and maintain scalable infrastructure using Kubernetes or Helm, implement and manage CI/CD pipelines for automated deployments. Administer and optimize relational databases such as MariaDB, MySQL and ensure data integrity, performance tuning, and regular backups. Integrate third-party OEM (Original Equipment Manufacturer) software and services. Implement and manage Kafka for real-time data streaming and integration. Develop and deploy cloud-native applications and microservices, utilizing cloud platforms (e.g., AWS, Azure, OCP) for application hosting and scalability. Perform routine deployments and updates, ensuring minimal downtime, troubleshoot and resolve infrastructure and application issues. Collaborate with development and operations teams to streamline processes along with document processes, configurations, and infrastructure changes.

Posted 1 month ago

Apply

Senior DevOps Engineer Manav Sansadhan Vikas Salaahkar

6 - 8 years

16 - 20 Lacs

Bengaluru

Work from Office

Senior DevOps Engineer Location: Bengaluru South, Karnataka, India Experience: 68 Years Compensation: 1620 LPA Industry: PropTech | AgriTech | Cloud Infrastructure | Platform Engineering Employment Type: Full-Time | On-Site/Hybrid Are you a DevOps Engineer passionate about building scalable and efficient infrastructure for innovative platforms? If you’re excited by the challenge of automating and optimizing cloud infrastructure for a mission-driven PropTech platform, this opportunity is for you. We are seeking a seasoned DevOps Engineer to be a key player in scaling a pioneering property-tech ecosystem that reimagines how people discover, trust, and own their dream land or property. Our ideal candidate thrives in dynamic environments, embraces automation, and values security, performance, and reliability. You’ll be working alongside a passionate and agile team that blends technology with sustainability, enabling seamless experiences for both property buyers and developers. Key Responsibilities Architect, deploy, and maintain highly available, scalable, and secure cloud infrastructure, preferably on AWS. Design, develop, and optimize CI/CD pipelines for automated software build, test, and deployment. Implement and manage Infrastructure as Code (IaC) using Terraform, CloudFormation, or similar tools. Set up and manage robust monitoring, logging, and alerting systems (Prometheus, Grafana, ELK, etc.). Proactively monitor and improve system performance, availability, and resilience. Ensure compliance, access control, and secrets management across environments using best-in-class DevSecOps practices. Collaborate closely with development, QA, and product teams to streamline software delivery lifecycles. Troubleshoot production issues, identify root causes, and implement long-term solutions. Optimize infrastructure costs while maintaining performance SLAs. Build and maintain internal tools and automation scripts to support development workflows. Stay updated with the latest in DevOps practices, cloud technologies, and infrastructure design. Participate in on-call support rotation for critical incidents and infrastructure health. Preferred Qualifications Bachelor's degree in Computer Science, Engineering, or related field. 6–8 years of hands-on experience in DevOps, SRE, or Infrastructure roles. Strong proficiency in AWS (EC2, S3, RDS, Lambda, ECS/EKS). Expert-level scripting skills in Python, Bash, or Go. Solid experience with CI/CD tools such as Jenkins, GitLab CI, CircleCI, etc. Expertise in Docker, Kubernetes, and container orchestration at scale. Experience with configuration management tools like Ansible, Chef, or Puppet. Solid understanding of networking, DNS, SSL, firewalls, and load balancing. Familiarity with relational and non-relational databases (PostgreSQL, MySQL, etc.) is a plus. Excellent troubleshooting and analytical skills with a performance- and security-first mindset. Experience working in agile, fast-paced startup environments is a strong plus. Nice to Have Experience working in PropTech, AgriTech, or sustainability-focused platforms. Exposure to geospatial mapping systems, virtual land visualization, or real-time data platforms. Prior work with DevSecOps, service meshes like Istio, or secrets management with Vault. Passion for building tech that positively impacts people and the planet. Why Join Us? Join India’s first revolutionary PropTech platform, blending human-centric design with cutting-edge technology to empower property discovery and ownership. Be part of a company that doesn’t just build products—it builds ecosystems: for urban buyers, rural farmers, and the environment. Work with a forward-thinking leadership team from one of India’s most respected sustainability and land stewardship organizations. Collaborate across cross-disciplinary teams solving real-world challenges at the intersection of tech, land, and sustainability.

Posted 1 month ago

Apply

Site Reliability Engineer Central Business Solutions

4 - 9 years

10 - 15 Lacs

Bengaluru

Work from Office

- Strong Linux or Kubernetes experience - JFrog Artifactory experience - Task automation experience in any programming language - Experience of observability stack such as Prometheus, Grafana

Posted 1 month ago

Apply

BMC Helix Operations Management Engineer (BHOM) Emergys

5 - 10 years

10 - 20 Lacs

Pune

Work from Office

Job Description: We are looking for a passionate and detail-oriented BHOM Administrator to join our SaaS Operations team. In this role, youll be responsible for maintaining and optimizing our BMC Helix Operations Management (BHOM) environment, ensuring high availability, accurate alerting, and actionable insights. Youll work closely with our monitoring and automation teams, leveraging Prometheus for metrics collection and shell scripting to automate operational tasks and streamline workflows. Key Responsibilities: Administer and support BMC Helix Operations Management (BHOM) including configuration, integration, and health monitoring. Develop and manage Prometheus-based monitoring solutions; write and tune PromQL queries. Automate routine monitoring and administrative tasks using shell scripts. Troubleshoot issues related to infrastructure observability and improve system reliability. Contribute to documentation and best practices for monitoring and automation. Required Skills & Experience: 2 - 3 years of hands-on experience with BMC Helix Operations Management. overall 5-6 years experience in TSOM+ BHOM Strong knowledge of Prometheus and metric-based monitoring systems. Proficient in Linux shell scripting (Bash or equivalent). Good understanding of incident management, alert tuning, and operational observability. Ability to work in a collaborative, fast-paced environment. Preferred: Familiarity with Grafana for dashboarding. Basic understanding of microservices, containers (Docker/Kubernetes), and cloud platforms.

Posted 1 month ago

Apply

DevOps Engineer Siemens

3 - 5 years

5 - 7 Lacs

Pune

Work from Office

We are looking for DevOps Engineer How do you craft the future Smart Buildings? Were looking for the makers of tomorrow, the hardworking individuals ready to help Siemens transform entire industries, cities and even countries. Get to know us from the inside, develop your skills on the job. Youll make a difference by Designing, deploying, and managing AWS cloud infrastructure, including compute, storage, networking, and security services. Implementing and maintaining CI/CD pipelines using tools like GitLab CI, Jenkins, or similar technologies to automate build, test, and deployment processes. Collaborating with development teams to streamline development workflows and improve release cycles. Monitor and troubleshoot infrastructure and application issues, ensuring high availability and performance. Implementing infrastructure as code (IaC) using tools like Terraform or CloudFormation to automate provisioning and configuration management. Maintaining version control systems and Git repositories for codebase management and collaboration. Implementing and enforce security best practices and compliance standards in cloud environments. Continuously evaluate and embrace new technologies, tools, and best practices to improve efficiency and reliability. There are a lot of learning opportunities for our new team member. An openness to learn more about data analytics (including AI) offerings is part of your motivation Your defining qualities A University degree in Computer Science or a comparable education, we are flexible if a high quality of code is ensured. Proven experience (3-5 years) with common DevOps practices such as CI/CD pipelines (GitLab), Container and orchestration (Docker, ECS, EKS, Helm) and infrastructure as code (Terraform) Working knowledge of TypeScript, JavaScript, and Node.js. Good exposure to AWS cloud Thriving in working independently, i.e., can break down high-level objectives into concrete key results and implement those. Able to work with AWS from day one, familiarity with AWS services beyond EC2 (e.g., Fargate, RDS, IAM, Lambda) is something we expect from applicants. Having good knowledge of configuring logging and monitoring infrastructure with ELK, Prometheus, CloudWatch, Grafana. When it comes to methodologies, having knowledge of agile software development processes would be highly valued. Having the right demeanor, allowing you to navigate within a complex global organization and getting things done. We need a person with an absolute willingness to support the team, a proactive and stress-resistant personality. Business fluency in English

Posted 1 month ago

Apply

Lead Engineer – Core Wireless Testing (AMF/MME) Tejas Networks

4 - 8 years

10 - 14 Lacs

Bengaluru

Work from Office

Job TitleLead Engineer (Core Wireless Testing) LocationBengaluru Work EmploymentFull time DepartmentProduct Engineering DomainProduct Validation Reporting toManager About Us: Tejas Networks is a global broadband, optical and wireless networking company, with a focus on technology, innovation and R&D. We design and manufacture high-performance wireline and wireless networking products for telecommunications service providers, internet service providers, utilities, defence and government entities in over 75 countries. Tejas has an extensive portfolio of leading-edge telecom products for building end-to-end telecom networks based on the latest technologies and global standards with IPR ownership. We are a part of the Tata Group, with Panatone Finvest Ltd. (a subsidiary of Tata Sons Pvt. Ltd.) being the majority shareholder. Tejas has a rich portfolio of patents and has shipped more than 900,000 systems across the globe with an uptime of 99.999%. Our product portfolio encompasses wireless technologies (4G/5G based on 3GPP and O-RAN standards), fiber broadband (GPON/XGS-PON), carrier-grade optical transmission (DWDM/OTN), packet switching and routing (Ethernet, PTN, IP/MPLS) and Direct-to-Mobile and Satellite-IoT communication platforms. Our unified network management suite simplifies network deployments and service implementation across all our products with advanced capabilities for predictive fault detection and resolution. As an R&D-driven company, we recognize that human intelligence is a core asset that drives the organization’s long-term success. Over 60% of our employees are in R&D, we are reshaping telecom networks, one innovation at a time. Why Join Tejas: We are on a journey to connect the world with some of the most innovative products and solutions in the wireless and wireline optical networking domains. Would you like to be part of this journey and do something truly meaningful? Challenge yourself by working in Tejas’ fast-paced, autonomous learning environment and see your output and contributions become a part of live products worldwide. At Tejas, you will have the unique opportunity to work with cutting-edge technologies, alongside some of the industry’s brightest minds. From 5G to DWDM/ OTN, Switching and Routing, we work on technologies and solutions that create a connected society. Our solutions power over 500 networks across 75+ countries worldwide, and we’re constantly pushing boundaries to achieve more. If you thrive on taking ownership, have a passion for learning and enjoy challenging the status quo, we want to hear from you! Who We Are Product Engineering team is responsible for Platform and software validation for the entire product portfolio. They will develop automation Framework for the entire product portfolio. Team will develop and deliver customer documentation and training solutions. Compliance with technical certifications such as TL9000 and TSEC is essential for ensuring industry standards and regulatory requirements are met. Team works closely with PLM, HW and SW architects, sales and customer account teams to innovate and develop network deployment strategy for a broad spectrum of networking products and software solutions. As part of this team, you will get an opportunity to validate, demonstrate and influence new technologies to shape future optical, routing, fiber broadband and wireless networks. What You Work: As a Lead Engineer you will be responsible for driving technical projects, managing resources effectively, balancing team workloads. You will design solutions, oversee testing, and mentor junior engineers to ensure productivity and skill development. Also, you will manage resources, troubleshoot, debug issues, writing and reviewing test cases to ensure code quality, and collaborate with cross-functional teams to deliver high-quality products on time. Knowledge of software development methodology, build tools, and product life cycle Build a 5G Cloud-native test solution in a virtualized environment with end to end understanding of 5G Network functions (i.e., AMF, SMF, UPF and PCF) and protocols Exposure to customer deployment models and configuration of large mobile packet core solutions Have 4-12 years of Industry experience in Mobile packet core technologies with validation background and solid exposure in automation You have End to End or System Testing background Good knowledge in Kubernetes, docker and Cloud Native solutions Experience in bringing up Open stack , VMWare based test setups Interest & Passion in Automation and framework development using Python and Robot Framework Exposure in Spirent Landslide, Mobilium DsTest or Ixia simulators Exposure in automation frameworks like pyats and robot framework. Certification in Kubernetes, Exposure to Grafana and Prometheus is added advantage. Experience in CI/CD tools Jenkins and GIT Mandatory skills: Solid experience in 5G core End to End validation Kubernetes, Docker, OpenStack Working exposure on AMF and UPF Have been to customer escalation role Spirent Landslide Python, Shell Scripting, Robot framework Desired skills: Certification in Kubernetes, Exposure to Grafana and Prometheus is added advantage. Experience in CI/CD tools Jenkins and GIT Preferred Qualifications Experience 6 to 10 years of relevant experience Education B.Tech/BE or any other equivalent degree, PG in communication field Diversity and Inclusion Statement : Tejas Networks is an equal opportunity employer. We celebrate diversity and are committed to creating all-inclusive environment for all employees. We welcome applicants of all backgrounds regardless of race color, religion, gender, sexual orientation, age or veteran status. Our goal is to build a workforce that reflects the diverse communities we serve and to ensure every employee feels valued and respected.

Posted 1 month ago

Apply

Lead Engineer -CI CD Devops Tejas Networks

4 - 8 years

13 - 18 Lacs

Bengaluru

Work from Office

Job TitleLead Engineer – CI CD Devops LocationBengaluru Work EmploymentFull time DepartmentWireline DomainSoftware Reporting toGroup Engineer About Us Tejas Networks is a global broadband, optical and wireless networking company, with a focus on technology, innovation and R&D. We design and manufacture high-performance wireline and wireless networking products for telecommunications service providers, internet service providers, utilities, defence and government entities in over 75 countries. Tejas has an extensive portfolio of leading-edge telecom products for building end-to-end telecom networks based on the latest technologies and global standards with IPR ownership. We are a part of the Tata Group, with Panatone Finvest Ltd. (a subsidiary of Tata Sons Pvt. Ltd.) being the majority shareholder. Tejas has a rich portfolio of patents and has shipped more than 900,000 systems across the globe with an uptime of 99.999%. Our product portfolio encompasses wireless technologies (4G/5G based on 3GPP and O-RAN standards), fiber broadband (GPON/XGS-PON), carrier-grade optical transmission (DWDM/OTN), packet switching and routing (Ethernet, PTN, IP/MPLS) and Direct-to-Mobile and Satellite-IoT communication platforms. Our unified network management suite simplifies network deployments and service implementation across all our products with advanced capabilities for predictive fault detection and resolution. As an R&D-driven company, we recognize that human intelligence is a core asset that drives the organization’s long-term success. Over 60% of our employees are in R&D, we are reshaping telecom networks, one innovation at a time. Why join Tejas ? We are on a journey to connect the world with some of the most innovative products and solutions in the wireless and wireline optical networking domains. Would you like to be part of this journey and do something truly meaningful? Challenge yourself by working in Tejas’ fast-paced, autonomous learning environment and see your output and contributions become a part of live products worldwide. ? ? At Tejas, you will have the unique opportunity to work with cutting-edge technologies, alongside some of the industry’s brightest minds. From 5G to DWDM/ OTN, Switching and Routing, we work on technologies and solutions that create a connected society. Our solutions power over 500 networks across 75+ countries worldwide, and we’re constantly pushing boundaries to achieve more. If you thrive on taking ownership, have a passion for learning and enjoy challenging the status quo, we want to hear from you! ? Who we are:? ? In the dynamic world of enterprise technology, the shift towards cloud-native solutions is not just a trend but a necessity. ? As we embark on developing a state-of-the-art Network Management System (NMS) and Reporting tool, our goal is to leverage the latest technologies to create a robust, scalable, and efficient solution. ? This initiative is crucial for ensuring our network’s optimal performance, security, and reliability while providing insightful analytics through advanced reporting capabilities. ? Our project aims to design and implement a cloud-native NMS and reporting tool that will revolutionize how we manage and monitor our network infrastructure. By utilizing cutting-edge technologies, we will ensure that our solution is not only future-proof but also capable of adapting to the ever-evolving demands of our enterprise environment. What you work ? Develop and implement automation strategies for software build, deployment, and infrastructure management. ? Design and maintain CI/CD pipelines to enable frequent and reliable software releases. ? Collaborate with development, QA, and operations teams to optimize workflows and enhance software quality. ? Automate repetitive tasks and processes to improve efficiency and reduce manual intervention. ? Monitor and troubleshoot CI/CD pipelines to ensure smooth operation and quick resolution of issues. ? Implement and maintain robust monitoring and alerting tools to ensure system reliability. ? Work with various tools and technologies such as Git, Jenkins, Docker, Kubernetes, and cloud platforms (e.g., AWS, Azure). ? Ensure compliance with security standards and best practices throughout the development lifecycle. ? Continuously improve the CI/CD processes by incorporating new tools, techniques, and best practices. ? Provide training and guidance to team members on DevOps principles and practices. ? You will be responsible for leading a team and guiding them for optimum output. Mandatory skills ? Strong experience in software development and system administration. ? Proficiency in programming languages such as Python, Java, or similar. ? Strong understanding of CI/CD concepts and experience with tools like Jenkins, Git, Docker, and Kubernetes. ? Experience with cloud platforms such as AWS, Azure, or Google Cloud. ? Excellent problem-solving skills and attention to detail. ? Strong communication and collaboration skills. ? Ability to work in a fast-paced, dynamic environment. ? Desired skills ? Experience with infrastructure such as code (IaC) tools like Terraform or Ansible. ? Knowledge of container orchestration tools like Kubernetes or Rancher ? Familiarity with monitoring and logging tools such as Prometheus, Grafana, or ELK stack. ? Certification in AWS, Azure, or other relevant technologies. ? Preferred Qualifications:? ? Experience 6 to 9 years’ experience from Telecommunication or Networking background. ? Education B.Tech/BE (CSE/ECE/EEE/IS) or any other equivalent degree ? Candidate should be good at coding skills in CI CD, Devops with Java . Diversity and Inclusion Statement :??? Tejas Networks is an equal opportunity employer. We celebrate diversity and are committed to creating all inclusive environment for all employees. ? We welcome applicants of all backgrounds regardless of race color, religion, gender, sexual orientation, age or veteran status. ? Our goal is to build a workforce that reflects the diverse communities we serve and to ensure every employee feels valued and respected. ?

Posted 1 month ago

Apply

Senior Engineer -CI CD Devops Tejas Networks

3 - 5 years

12 - 16 Lacs

Bengaluru

Work from Office

Job TitleSenior Engineer – CI CD Devops LocationBengaluru Work EmploymentFull time DepartmentWireline DomainSoftware Reporting toGroup Engineer About Us Tejas Networks is a global broadband, optical and wireless networking company, with a focus on technology, innovation and R&D. We design and manufacture high-performance wireline and wireless networking products for telecommunications service providers, internet service providers, utilities, defence and government entities in over 75 countries. Tejas has an extensive portfolio of leading-edge telecom products for building end-to-end telecom networks based on the latest technologies and global standards with IPR ownership. We are a part of the Tata Group, with Panatone Finvest Ltd. (a subsidiary of Tata Sons Pvt. Ltd.) being the majority shareholder. Tejas has a rich portfolio of patents and has shipped more than 900,000 systems across the globe with an uptime of 99.999%. Our product portfolio encompasses wireless technologies (4G/5G based on 3GPP and O-RAN standards), fiber broadband (GPON/XGS-PON), carrier-grade optical transmission (DWDM/OTN), packet switching and routing (Ethernet, PTN, IP/MPLS) and Direct-to-Mobile and Satellite-IoT communication platforms. Our unified network management suite simplifies network deployments and service implementation across all our products with advanced capabilities for predictive fault detection and resolution. As an R&D-driven company, we recognize that human intelligence is a core asset that drives the organization’s long-term success. Over 60% of our employees are in R&D, we are reshaping telecom networks, one innovation at a time. Why join Tejas ? We are on a journey to connect the world with some of the most innovative products and solutions in the wireless and wireline optical networking domains. Would you like to be part of this journey and do something truly meaningful? Challenge yourself by working in Tejas’ fast-paced, autonomous learning environment and see your output and contributions become a part of live products worldwide. ? ? At Tejas, you will have the unique opportunity to work with cutting-edge technologies, alongside some of the industry’s brightest minds. From 5G to DWDM/ OTN, Switching and Routing, we work on technologies and solutions that create a connected society. Our solutions power over 500 networks across 75+ countries worldwide, and we’re constantly pushing boundaries to achieve more. If you thrive on taking ownership, have a passion for learning and enjoy challenging the status quo, we want to hear from you! ? Who we are:? ? In the dynamic world of enterprise technology, the shift towards cloud-native solutions is not just a trend but a necessity. ? As we embark on developing a state-of-the-art Network Management System (NMS) and Reporting tool, our goal is to leverage the latest technologies to create a robust, scalable, and efficient solution. ? This initiative is crucial for ensuring our network’s optimal performance,security, and reliability while providing insightful analytics through advanced reporting capabilities. ? ? ? Our project aims to design and implement a cloud-native NMS and reporting tool that will revolutionize how we manage and monitor our network infrastructure. By utilizing cutting-edge technologies, we will ensure that our solution is not only future-proof but also capable of adapting to the ever-evolving demands of our enterprise environment. What you work ? Develop and implement automation strategies for software build, deployment, and infrastructure management. ? Design and maintain CI/CD pipelines to enable frequent and reliable software releases. ? Collaborate with development, QA, and operations teams to optimize workflows and enhance software quality. ? Automate repetitive tasks and processes to improve efficiency and reduce manual intervention. ? Monitor and troubleshoot CI/CD pipelines to ensure smooth operation and quick resolution of issues. ? Implement and maintain robust monitoring and alerting tools to ensure system reliability. ? Work with various tools and technologies such as Git, Jenkins, Docker, Kubernetes, and cloud platforms (e.g., AWS, Azure). ? Ensure compliance with security standards and best practices throughout the development lifecycle. ? Continuously improve the CI/CD processes by incorporating new tools, techniques, and best practices. ? Provide training and guidance to team members on DevOps principles and practices. ? Mandatory skills ? Strong experience in software development and system administration. Proficiency in programming languages such as Python, Java, or similar. Strong understanding of CI/CD concepts and experience with tools like Jenkins, Git, Docker, and Kubernetes. Experience with cloud platforms such as AWS, Azure, or Google Cloud. Excellent problem-solving skills and attention to detail. Strong communication and collaboration skills. Ability to work in a fast-paced, dynamic environment. Desired skills ? Experience with infrastructure such as code (IaC) tools like Terraform or Ansible. Knowledge of container orchestration tools like Kubernetes or Rancher? Familiarity with monitoring and logging tools such as Prometheus, Grafana, or ELK stack. Certification in AWS, Azure, or other relevant technologies. Preferred Qualifications:? ? Experience 3 to 5 years’ experience from Telecommunication or Networking background. Education B.Tech/BE (CSE/ECE/EEE/IS) or any other equivalent degree? Candidate should be good at coding skills in CI CD, Devops with Java. Diversity and Inclusion Statement :?? ? Tejas Networks is an equal opportunity employer. We celebrate diversity and are committed to creating all inclusive environment for all employees. We welcome applicants of all backgrounds regardless of race color, religion, gender, sexual orientation, age or veteran status. Our goal is to build a workforce that reflects the diverse communities we serve and to ensure every employee feels valued and respected.

Posted 1 month ago

Apply

Login to

Please Verify Your Phone or Email

Confirm Action

Search

Profile

Upskill and Grow with AI

577 Prometheus Jobs - Page 20

Job Alert

Start Your Job Search Today

Please Verify Your Phone or Email

Job Application AI Bot

Download the Mobile App

Setup Job Alerts

Featured Companies

Before You Leave... Find Your Perfect Job!

Login to

Please Verify Your Phone or Email

Confirm Action

Contact Us

Search

Profile

Upskill and Grow with AI

577 Prometheus Jobs - Page 20

Job Alert

Upload Resume

AI Job Matching Summary

Pros

Cons

Summary

Start Your Job Search Today

Please Verify Your Phone or Email

Job Application AI Bot

Download the Mobile App

Setup Job Alerts

Featured Companies