Jobs
Interviews

922 Prometheus Jobs - Page 30

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

4.0 - 8.0 years

5 - 9 Lacs

Hyderabad, Bengaluru

Work from Office

Whats in it for you? Pay above market standards The role is going to be contract based with project timelines from 2 12 months, or freelancing Be a part of an Elite Community of professionals who can solve complex AI challenges Work location could be: Remote (Highly likely) Onsite on client location Deccan AIs Office: Hyderabad or Bangalore Responsibilities: Design and architect enterprise-scale data platforms, integrating diverse data sources and tools Develop real-time and batch data pipelines to support analytics and machine learning Define and enforce data governance strategies to ensure security, integrity, and compliance along with optimizing data pipelines for high performance, scalability, and cost efficiency in cloud environments Implement solutions for real-time streaming data (Kafka, AWS Kinesis, Apache Flink) and adopt DevOps/DataOps best practices Required Skills: Strong experience in designing scalable, distributed data systems and programming (Python, Scala, Java) with expertise in Apache Spark, Hadoop, Flink, Kafka, and cloud platforms (AWS, Azure, GCP) Proficient in data modeling, governance, warehousing (Snowflake, Redshift, BigQuery), and security/compliance standards (GDPR, HIPAA) Hands-on experience with CI/CD (Terraform, Cloud Formation, Airflow, Kubernetes) and data infrastructure optimization (Prometheus, Grafana) Nice to Have: Experience with graph databases, machine learning pipeline integration, real-time analytics, and IoT solutions Contributions to open-source data engineering communities What are the next steps? Register on our Soul AI website

Posted 1 month ago

Apply

4.0 - 8.0 years

13 - 17 Lacs

Hyderabad, Bengaluru

Work from Office

Responsibilities: Design and architect enterprise-scale data platforms, integrating diverse data sources and tools Develop real-time and batch data pipelines to support analytics and machine learning Define and enforce data governance strategies to ensure security, integrity, and compliance along with optimizing data pipelines for high performance, scalability, and cost efficiency in cloud environments Implement solutions for real-time streaming data (Kafka, AWS Kinesis, Apache Flink) and adopt DevOps/DataOps best practices Required Skills: Strong experience in designing scalable, distributed data systems and programming (Python, Scala, Java) with expertise in Apache Spark, Hadoop, Flink, Kafka, and cloud platforms (AWS, Azure, GCP) Proficient in data modeling, governance, warehousing (Snowflake, Redshift, Big Query), and security/compliance standards (GDPR, HIPAA) Hands-on experience with CI/CD (Terraform, Cloud Formation, Airflow, Kubernetes) and data infrastructure optimization (Prometheus, Grafana) Nice to Have: Experience with graph databases, machine learning pipeline integration, real-time analytics, and IoT solutions Contributions to open-source data engineering communities

Posted 1 month ago

Apply

4.0 - 8.0 years

9 - 13 Lacs

Mumbai, Bengaluru, Delhi / NCR

Work from Office

We are looking for Indias top 1% Platform Engineers for a unique job opportunity to work with the industry leaders Who can be a part of the community? We are looking for Platform Engineers focusing on building scalable and high-performance AI/ML platforms Strong background in cloud architecture, distributed systems, Kubernetes, and infrastructure automation is expected If you have experience in this field then this is your chance to collaborate with industry leaders Whats in it for you? Pay above market standards The role is going to be contract based with project timelines from 2 12 months, or freelancing Be a part of an Elite Community of professionals who can solve complex AI challenges Work location could be: Remote (Highly likely) Onsite on client location Deccan AIs Office: Hyderabad or Bangalore Responsibilities: Architect and maintain scalable cloud infrastructure on AWS, GCP, or Azure using tools like Terraform and Cloud Formation Design and implement Kubernetes clusters with Helm, Kustomize, and Service Mesh (Istio, Linkerd) Develop CI/CD pipelines using GitHub Actions, GitLab CI/CD, Jenkins, and Argo CD for automated deployments Implement observability solutions (Prometheus, Grafana, ELK stack) for logging, monitoring, and tracing & automate infrastructure provisioning with tools like Ansible, Chef, Puppet, and optimize cloud costs and security Required Skills: Expertise in cloud platforms (AWS, GCP, Azure) and infrastructure as code (Terraform, Pulumi) with strong knowledge of Kubernetes, Docker, CI/CD pipelines, and scripting (Bash, Python) Experience with observability tools (Prometheus, Grafana, ELK stack) and security practices (RBAC, IAM) Familiarity with networking (VPC, Load Balancers, DNS) and performance optimization Nice to Have: Experience with Chaos Engineering (Gremlin, LitmusChaos), Canary or Blue-Green deployments Knowledge of multi-cloud environments, FinOps, and cost optimization strategies Location-Delhi NCR,Bangalore,Chennai,Pune,Kolkata,Ahmedabad,Mumbai,Hyderabad

Posted 1 month ago

Apply

8.0 - 13.0 years

20 - 35 Lacs

Bengaluru

Work from Office

Manage CMMS master data, asset hierarchies, preventive maintenance plans, user roles, configuration changes, and digital tool support (JDE, Maximo, SAP) to ensure maintenance data quality and asset reliability in refinery and LNG assets. Required Candidate profile 5–15 years’ experience in CMMS systems (JDE mandatory), master data management, and maintenance workflows in complex industrial facilities (LNG, refinery, petrochemical).

Posted 1 month ago

Apply

7.0 - 11.0 years

7 - 16 Lacs

Hyderabad

Hybrid

Job role name - SRE Devops Work mode - Hybrid Location - Hyderabad Experience - 7+ years Requirement : SRE Devops , Grafana, Prometheus, Datadog , Azure/AWS cloud Should be okay with work from office

Posted 1 month ago

Apply

5.0 - 6.0 years

27 - 42 Lacs

Pune

Work from Office

Primary & Mandatory Skill: AWS, DevOps Engineer with containerization technology (EKS/Docker/Kubernetes) Client Round (Yes/ No): Yes Location Constraint if any : India – Pune (Preferred), Mumbai, Bangalore, Hyderabad, Chennai Shift timing: 1Pm-11 PM (UK/Prague Shift) Fitment Level : A/SA Job Description : Design, implement and manage Infrastructure as code using tools like AWS CloudFormation or Terraform. Create and maintain reusable templates and configurations to provision and manage AWS resources Build and maintain CI/CD pipeline to automate build, test and deployment processed In depth understanding of containerization technology such as Docker or Kubernetes, EKS Utilizes tools such as GithubActions, CloudBees to enable reliable software releases Set up monitoring and alerting systems using AWS tools such as CloudWatch or third-party tools such as Grafana, Prometheus Implement security and compliance best practises for infrastructure components Configure access control, encryption, and security measures Implement auto-scaling, load balancing, caching mechanism etc. to improve application availability and performance Create and maintain documentation for infrastructure deployment and standard operating procedures

Posted 1 month ago

Apply

7.0 - 9.0 years

27 - 42 Lacs

Pune

Work from Office

Primary & Mandatory Skill: Kubernetes Administrator and Helm Chart Certification Mandatory: CKA (Certified Kubernetes Administrator) OR CKAD (Certified Kubernetes Application Developer) Level: SA/M Client Round (Yes/ No): Yes Location Constraint if any : PAN India Shift timing: General shift JD: Should have very good understanding of various components of various types of Kubernetes clusters (Community/AKS/GKE/OpenShift) Should have provisioning experience of various type of Kubernetes clusters(Community/AKS/GKE/OpenShift) Should have Upgradation and monitoring experience of various type of Kubernetes clusters (Community/AKS/GKE/OpenShift) Should have good experience of sizing the Kubernetes clusters Should have very good experience on Container Security & Container Storage Should have hands-on development experience on "GO or JavaScript or Java" Should have very good experience on CICD workflow (Preferable Azure DevOps, Ansible and Jenkin) Should have good experience / knowledge of cloud platforms preferably Azure / Google / OpenStack Should have good understanding of application life cycle management on container platform Should have very good understating of container registry Should have very good understanding of Helm and Helm Charts Should have very good understanding of container monitoring tools like Prometheus, Grafana and ELK Should have very good experience on Linux operating system Should have basis understanding of enterprise networks and container networks Should be able to handle Severity#1 and Severity#2 incidents very good communication skills Should have analytical and problem-solving capabilities, ability to work with teams Good to have knowledge of ITIL Process

Posted 1 month ago

Apply

5.0 - 7.0 years

27 - 42 Lacs

Pune

Work from Office

J Primary & Mandatory Skill: AWS, DevOps Engineer with containerization technology (EKS/Docker/Kubernetes) Client Round (Yes/ No): Yes Location Constraint if any : India – Pune (Preferred), Mumbai, Bangalore, Hyderabad, Chennai Shift timing: 1Pm-11 PM (UK/Prague Shift) Fitment Level : A/SA Job Description : Design, implement and manage Infrastructure as code using tools like AWS CloudFormation or Terraform. Create and maintain reusable templates and configurations to provision and manage AWS resources Build and maintain CI/CD pipeline to automate build, test and deployment processed In depth understanding of containerization technology such as Docker or Kubernetes, EKS Utilizes tools such as GithubActions, CloudBees to enable reliable software releases Set up monitoring and alerting systems using AWS tools such as CloudWatch or third-party tools such as Grafana, Prometheus Implement security and compliance best practises for infrastructure components Configure access control, encryption, and security measures Implement auto-scaling, load balancing, caching mechanism etc. to improve application availability and performance Create and maintain documentation for infrastructure deployment and standard operating procedures

Posted 1 month ago

Apply

2.0 - 5.0 years

7 - 12 Lacs

Gurugram

Work from Office

Redhat Openshift Engineer with 3+ years of hands-on experience in Red Hat OpenShift . The ideal candidate will be responsible for managing, configuring, and maintaining container orchestration and cloud infrastructure environments to support enterprise-grade applications and services. Key Responsibilities: Deploy, configure, and maintain OpenShift clusters in production and development environments. Monitor system performance, availability, and capacity planning. Automate infrastructure provisioning and application deployment using CI/CD pipelines. Troubleshoot and resolve issues related to container orchestration, cloud networking, and virtualized environments. Implement security best practices for containerized and cloud-native applications. Collaborate with development, QA, and operations teams to ensure seamless delivery pipelines. Create and maintain documentation related to architecture, processes, and troubleshooting. Required Skills: Strong hands-on experience with Red Hat OpenShift (v4.x preferred). Experience with Kubernetes concepts, Helm charts, and Operators. Familiarity with Linux system administration (RHEL/CentOS). Proficiency in scripting languages like Bash, Python, or Ansible. Understanding of CI/CD pipelines and tools like Jenkins, GitLab CI, or Tekton. Knowledge of cloud networking, load balancers, firewalls, and DNS. Preferred Qualifications: RHCSA/RHCE or OpenShift certification (EX280/EX180). Exposure to monitoring tools such as Prometheus, Grafana, or ELK stack. Experience with GitOps workflows (e.g., ArgoCD or Flux). Basic understanding of ITIL processes and DevOps culture. Education: Bachelors degree in Computer Science, Information Technology, or related field.

Posted 1 month ago

Apply

4.0 - 6.0 years

27 - 42 Lacs

Chennai

Work from Office

Skill – Aks , Istio service mesh Shift timing - Afternoon Shift Location - Chennai, Kolkata, Bangalore Excellent AKS, GKE or Kubernetes admin experience. Good troubleshooting experience on istio service mesh, connectivity issues. Experience with Github Actions or similar ci/cd tool to build pipelines.Working experience on any cloud, preferably Azure, Google with good networking knowledge. Experience on python or shell scripting. Experience on building dashboards, configure alerts using prometheus and Grafana.

Posted 1 month ago

Apply

21.0 - 31.0 years

50 - 70 Lacs

Bengaluru

Work from Office

What we’re looking for As a member of the infrastructure team at Survey Monkey, you will have a direct impact in designing, engineering and maintaining our Cloud, Messaging and Observability Platform. Solutioning with best practices, deployment processes, architecture, and support the ongoing operation of our multi-tenant AWS environments. This role presents a prime opportunity for building world-class infrastructure, solving complex problems at scale, learning new technologies and offering mentorship to other engineers. What you'll be working on Architect, build, and operate AWS environments at scale with well-established industry best practices. Automating infrastructure provisioning, DevOps, and/or continuous integration/delivery. Provide Technical Leadership & Mentorship Mentor and guide senior engineers to build technical expertise and drive a culture of excellence in software development. Foster collaboration within the engineering team, ensuring the adoption of best practices in coding, testing, and deployment. Review code and provide constructive feedback to ensure code quality and adherence to architectural principles. Collaboration & Cross-Functional Leadership Collaborate with cross-functional teams (Product, Security, and other Engineering teams) to drive the roadmap and ensure alignment with business objectives. Provide technical leadership in meetings and discussions, influencing key decisions on architecture, design, and implementation. Innovation & Continuous Improvement Propose, evaluate, and integrate new tools and technologies to improve the performance, security, and scalability of the cloud platform. Drive initiatives for optimizing cloud resource usage and reducing operational costs without compromising performance. Write libraries and APIs that provide a simple, unified interface to other developers when they use our monitoring, logging, and event-processing systems. Participate in on-call rotation. Support and partner with other teams on improving our observability systems to monitor site stability and performance We’d love to hear from people with 12+ years of relevant professional experience with cloud platforms such as AWS, Heroku. Extensive experience leading design sessions and evolving well-architected environments in AWS at scale. Extensive experience with Terraform, Docker, Kubernetes, scripting (Bash/Python/Yaml), and helm. Experience with Splunk, OpenTelemetry, CloudWatch, or tools like New Relic, Datadog, or Grafana/Prometheus, ELK (Elasticsearch/Logstash/Kibana). Experience with metrics and logging libraries and aggregators, data analysis and visualization tools – Specifically Splunk and Otel. Experience instrumenting PHP, Python, Java and Node.js applications to send metrics, traces, and logs to third-party Observability tooling. Experience with GitOps and tools like ArgoCD/fluxcd. Interest in Instrumentation and Optimization of Kubernetes Clusters. Ability to listen and partner to understand requirements, troubleshoot problems, or promote the adoption of platforms. Experience with GitHub/GitHub Actions/Jenkins/Gitlab in either a software engineering or DevOps environment. Familiarity with databases and caching technologies, including PostgreSQL, MongoDB, Elasticsearch, Memcached, Redis, Kafka and Debezium. Preferably experience with secrets management, for example Hashicorp Vault. Preferably experience in an agile environment and JIRA. SurveyMonkey believes in-person collaboration is valuable for building relationships, fostering community, and enhancing our speed and execution in problem-solving and decision-making. As such, this opportunity is hybrid and requires you to work from the SurveyMonkey office in Bengaluru 3 days per week. #LI - Hybrid

Posted 1 month ago

Apply

2.0 - 5.0 years

3 - 7 Lacs

Hyderabad

Work from Office

What you will do Let’s do this. Let’s change the world. In this vital role you will be responsible for designing, developing, and maintaining software applications and solutions that meet business needs and ensuring the availability and performance of critical systems and applications in the Human Resources – Talent & Performance area. This role involves working closely with product managers, designers, and other engineers to create high-quality, scalable software solutions and automating operations, monitoring system health, and responding to incidents to minimize downtime. Roles & Responsibilities: Take ownership of complex software projects from conception to deployment. Manage software delivery scope, risk, and timeline. Possesses strong rapid prototyping skills and can quickly translate concepts into working code. Provide technical guidance and mentorship to junior developers. Contribute to both front-end and back-end development using cloud technology including software development tools like React.js and Python. Develop innovative solution using generative AI technologies including OpenAI and MS CoPilot. Conduct code reviews to ensure code quality and consistency to standard processes. Create and maintain documentation on software architecture, design, deployment, disaster recovery, and operations. Identify and resolve technical challenges effectively. Stay updated with the latest trends and advancements. Work closely with product team, business team, and other collaborators . Design, develop, and implement applications and modules, including custom reports, interfaces, and enhancements . Analyze and understand the functional and technical requirements of applications, solutions and systems and translate them into software architecture and design specifications. Develop and implement unit tests, integration tests, and other testing strategies to ensure the quality of the software. Identify and resolve software bugs and performance issues. Work closely with multi-functional teams, including product management, design, and QA, to deliver high-quality software on time. Maintain detailed documentation of software designs, code, and development processes. Customize modules to meet specific business requirements . Work on integrating with other systems and platforms to ensure seamless data flow and functionality. Provide ongoing support and maintenance for applications, ensuring that they operate smoothly and efficiently . What we expect of you We are all different, yet we all use our unique contributions to serve patients. Basic Qualifications: Master’s degree and 1 to 3 years of Computer Science, IT or related field experience OR Bachelor’s degree and 3 to 5 years of Computer Science, IT or related field experience OR Diploma and 7 to 9 years of Computer Science, IT or related field experience Functional Skills: Must-Have Skills: Strong understanding of user experience (UX) design principles and their application in software development. Proven experience in using Jira for project management and agile development processes. Hands-on experience with the Software Development Life Cycle (SDLC), including standard processes in coding, testing, and deployment, and methodologies, including Agile and Scrum. Proficiency in programming languages such as Python, JavaScript preferred or other programming languages. Good-to-Have Skills: Experience with monitoring and logging tools (e.g., Prometheus, Grafana, Splunk) Experience with data processing tools like Hadoop, Spark, or similar Experience with Human Resources systems Professional Certifications: Relevant certifications such as CISSP, CompTIA Network+, or MCSE (preferred) Soft Skills: Excellent analytical and troubleshooting skills. Strong verbal and written communication skills. Ability to work effectively with global, virtual teams . High degree of initiative and self-motivation. Ability to manage multiple priorities successfully. Team-oriented, with a focus on achieving team goals. Strong presentation and public speaking skills. What you can expect of us As we work to develop treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we’ll support your journey every step of the way. In addition to the base salary, Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards. Apply now for a career that defies imagination Objects in your future are closer than they appear. Join us. careers.amgen.com As an organization dedicated to improving the quality of life for people around the world, Amgen fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients. Together, we compete in the fight against serious disease. Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Posted 1 month ago

Apply

1.0 - 4.0 years

3 - 7 Lacs

Hyderabad

Work from Office

Join Amgen’s Mission of Serving Patients At Amgen, if you feel like you are part of something bigger, it’s because you are. Our shared mission—to serve patients living with serious illnesses—drives all that we do. Since 1980, we’ve helped pioneer the world of biotech in our fight against the world’s toughest diseases. With our focus on four therapeutic areas –Oncology, Inflammation, General Medicine, and Rare Disease– we reach millions of patients each year. As a member of the Amgen team, you’ll help make a lasting impact on the lives of patients as we research, manufacture, and deliver innovative medicines to help people live longer, fuller happier lives. Our award-winning culture is collaborative, innovative, and science based. If you have a passion for challenges and the opportunities that lay within them, you’ll thrive as part of the Amgen team. Join us and transform the lives of patients while transforming your career. What you will do Let’s do this. Let’s change the world. In this vital role you will responsible for designing, developing, and maintaining software applications and solutions that meet business needs and ensuring the availability and performance of critical systems and applications in the Human Resources – Talent & Performance area. This role involves working closely with product managers, designers, and other engineers to create high-quality, scalable software solutions and automating operations, monitoring system health, and responding to incidents to minimize downtime. Roles & Responsibilities: Take ownership of complex software projects from conception to deployment. Manage software delivery scope, risk, and timeline. Possesses strong rapid prototyping skills and can quickly translate concepts into working code. Contribute to both front-end and back-end development using cloud technology including software development tools like React.js and Python. Develop innovative solution using generative AI technologies including OpenAI and MS CoPilot. Conduct code reviews to ensure code quality and alignment to best practices. Create and maintain documentation on software architecture, design, deployment, disaster recovery, and operations. Identify and resolve technical challenges effectively. Stay updated with the latest trends and advancements. Work closely with product team, business team, and other collaborators. Design, develop, and implement applications and modules, including custom reports, interfaces, and enhancements. Analyze and understand the functional and technical requirements of applications, solutions and systems and translate them into software architecture and design specifications. Develop and implement unit tests, integration tests, and other testing strategies to ensure the quality of the software. Identify and resolve software bugs and performance issues. Work closely with multi-functional teams, including product management, design, and QA, to deliver high-quality software on time. Maintain detailed documentation of software designs, code, and development processes. Customize modules to meet specific business requirements. Work on integrating with other systems and platforms to ensure seamless data flow and functionality. Provide ongoing support and maintenance for applications, ensuring that they operate smoothly and efficiently. What we expect of you We are all different, yet we all use our unique contributions to serve patients. Basic Qualifications: Bachelor’s degree and 0 to 3 years of Computer Science, IT or related field experience OR Diploma and 4 to 7 years of Computer Science, IT or related field experience Functional Skills: Must-Have Skills: Good understanding of user experience (UX) design principles and their application in software development. Proven experience in applying Jira for project management and agile development processes. Hands-on experience with the Software Development Life Cycle (SDLC), including standard processes in coding, testing, and deployment, and methodologies, including Agile and Scrum. Proficiency in programming languages such as Python, JavaScript preferred or other programming languages. Good-to-Have Skills: Experience with monitoring and logging tools (e.g., Prometheus, Grafana, Splunk) Experience with data processing tools like Hadoop, Spark, or similar Experience with Human Resources systems Professional Certifications: Relevant certifications such as CISSP, CompTIA Network+, or MCSE (preferred) Soft Skills: Excellent analytical and troubleshooting skills Strong verbal and written communication skills Ability to work effectively with global, virtual teams High degree of initiative and self-motivation Ability to manage multiple priorities successfully Team-oriented, with a focus on achieving team goals Strong presentation and public speaking skills What you can expect of us As we work to develop treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we’ll support your journey every step of the way. In addition to the base salary, Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards. Apply now for a career that defies imagination Objects in your future are closer than they appear. Join us. careers.amgen.com As an organization dedicated to improving the quality of life for people around the world, Amgen fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients. Together, we compete in the fight against serious disease. Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Posted 1 month ago

Apply

2.0 - 6.0 years

6 - 10 Lacs

Hyderabad

Work from Office

Site Reliability Engineer ABOUT AMGEN Amgen harnesses the best of biology and technology to fight the world’s toughest diseases, and make people’s lives easier, fuller and longer. We discover, develop, manufacture and deliver innovative medicines to help millions of patients. Amgen helped establish the biotechnology industry more than 40 years ago and remains on the cutting-edge of innovation, using technology and human genetic data to push beyond what’s known today. What you will do Roles & Responsibilities Ensure high system reliability and uptime. Develop and maintain monitoring systems. Lead incident response and root cause analysis. Automate repetitive tasks for efficiency. Perform capacity planning and resource scaling. Lead infrastructure as code (e.g., Terraform, Kubernetes). Collaborate with development and operations teams. Maintain clear documentation and share knowledge. Optimize system and application performance. Ensure security and compliance standards are met. Define, measure, and monitor Service Level Objectives (SLOs) and Service-Level Agreements (SLAs) to align with business goals. Drive continuous process and system improvements. Define guidelines, standards, strategies, security policies and organizational change policies to support the Data Lake What we expect of you Basic Qualifications and Experience: Master’s degree in computer science or engineering field and 1 to 3 years of relevant experience OR Bachelor’s degree in computer science or engineering field and 3 to 5 years of relevant experience OR Diploma and Minimum of 8+ years of relevant work experience Must-Have Skills: Proficiency in programming/scripting (Python, Java). Experience in Linux/Unix system administration. Experience with cloud platforms (AWS, Databricks, Azure, Snowflake). Proficiency in containerization and orchestration (Docker, Kubernetes). Knowledge of Infrastructure as Code (Terraform, Ansible). Familiarity with monitoring and logging tools (Prometheus, Grafana). Understanding of CI/CD pipelines (Jenkins, GitLab CI/CD). Strong networking knowledge and troubleshooting skills. Understanding of security principles and compliance. Familiarity with database management (SQL and NoSQL). Strong troubleshooting and debugging skills. Experience in performance optimization. Experience with backup and storage solutions. Good-to-Have Skills: Familiarity with the use of AI for development productivity, such as GitHub Copilot, Databricks Assistant, Amazon Q Developer or equivalent. Knowledge of Agile and DevOps practices. Skills in disaster recovery planning. Familiarity with load testing tools (JMeter, Gatling). Basic understanding of AI/ML for monitoring. Knowledge of distributed systems and microservices. Data visualization skills (Tableau, Power BI). Strong communication and leadership skills. Understanding of compliance and auditing requirements. Soft Skills: Excellent analytical and solve skills Excellent written and verbal communications skills (English) in translating technology content into business-language at various levels Ability to work effectively with global, virtual teams High degree of initiative and self-motivation Ability to handle multiple priorities successfully Team-oriented, with a focus on achieving team goals Strong problem-solving and analytical skills. Strong time and task leadership skills to estimate and successfully meet project timeline with ability to bring consistency and quality assurance across various projects. Apply now for a career that defies creativity Objects in your future are closer than they appear. Join us. careers.amgen.com As an organization dedicated to improving the quality of life for people around the world, Amgen fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients. Together, we compete in the fight against serious disease. Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Posted 1 month ago

Apply

6.0 - 8.0 years

8 - 12 Lacs

Bengaluru

Work from Office

The Opportunity Join our dynamic and forward-thinking Platform Engineering team at a world-class analytics company. Our solutions power critical decisions in fraud, risk, marketing, and customer management for thousands of businesses worldwide. As part of this team, youll design and develop resilient, scalable services and automation pipelines, ensuring an outstanding developer experience and accelerating innovation across the organization. Sr. Director, 1ES Engineering What Youll Contribute Platform Services: Collaborate with cross-functional teams to architect, build, and maintain platform services that provide reliable, scalable, and secure solutions. Automation & Integration: Develop and integrate automation tools and services, streamlining workflows and ensuring continuous delivery of software across multiple environments. DevOps Pipelines: Own and evolve CI/CD pipeline capabilities, championing best practices that optimize speed, quality, and reliability of deployments. Developer Experience: Innovate and implement tools, frameworks, and processes that enhance developer productivity, reduce friction, and improve self-service capabilities. Performance & Scalability: Identify and mitigate bottlenecks, optimize performance, and ensure high availability and fault tolerance across all services. Continuous Improvement: Stay current on emerging technologies and best practices in Platform Engineering, proactively suggesting enhancements and improvements for organizational benefit. Collaboration & Mentorship: Partner with diverse teams to share knowledge, provide technical guidance, and promote a culture of learning and growth. What Were Seeking Strong Platform Engineering Background: Proven experience designing, implementing, and managing highly available, scalable, and secure platform services. DevOps Expertise: Deep understanding of modern DevOps practices, including CI/CD, Infrastructure as Code (IaC), automated testing, and observability. Cloud & Containerization: Hands-on experience with public cloud providers (e.g., AWS), container orchestration (Kubernetes), and containerization technologies (Docker). Automation Proficiency: Skilled in scripting and configuration management (e.g., Ansible, Terraform, Crossplane) to drive efficiencies and reduce manual overhead. Programming Skills: Proficient in one or more programming languages (Python, Go, NodeJS, etc.) with a focus on building robust, testable code. Monitoring & Logging: Familiarity with tools such as DataDog, CloudWatch, Prometheus, Grafana, and best practices for monitoring, logging, and incident management. Collaboration & Communication: Excellent interpersonal skills to collaborate effectively with both technical and non-technical stakeholders. Educational Background: A Bachelors degree in Computer Science, or a related field (or equivalent experience).

Posted 1 month ago

Apply

3.0 - 8.0 years

1 - 5 Lacs

Bengaluru

Work from Office

Project Role : Infra Tech Support Practitioner Project Role Description : Provide ongoing technical support and maintenance of production and development systems and software products (both remote and onsite) and for configured services running on various platforms (operating within a defined operating model and processes). Provide hardware/software support and implement technology at the operating system-level across all server and network areas, and for particular software solutions/vendors/brands. Work includes L1 and L2/ basic and intermediate level troubleshooting. Must have skills : Site Reliability Engineering, Database Architecture Good to have skills : NAMinimum 3 year(s) of experience is required Educational Qualification : 15 years full time education Summary :As an Infra Tech Support Practitioner, you will be responsible for providing ongoing technical support and maintenance of production and development systems and software products, both remote and onsite. You will work within a defined operating model and processes, implementing technology at the operating system-level across all server and network areas, and performing basic and intermediate level troubleshooting tasks. Roles & Responsibilities:- Expected to perform independently and become an SME.- Required active participation/contribution in team discussions.- Contribute in providing solutions to work-related problems.- Ensure timely resolution of technical issues.- Collaborate with cross-functional teams to address system and software problems.- Maintain documentation of system configurations and troubleshooting procedures.- Implement best practices for system reliability and performance optimization.- Provide training and guidance to junior team members. Professional & Technical Skills: - Must To Have Skills: Proficiency in Site Reliability Engineering, Database Architecture.- Strong understanding of system architecture and infrastructure.- Experience with cloud platforms such as AWS or Azure.- Knowledge of scripting languages like Python or Shell scripting.- Hands-on experience with monitoring tools like Nagios or Prometheus. Additional Information:- The candidate should have a minimum of 3 years of experience in Site Reliability Engineering.- This position is based at our Bengaluru office.- A 15 years full-time education is required. Qualification 15 years full time education

Posted 1 month ago

Apply

3.0 - 8.0 years

3 - 7 Lacs

Bengaluru

Work from Office

Project Role : Application Support Engineer Project Role Description : Act as software detectives, provide a dynamic service identifying and solving issues within multiple components of critical business systems. Must have skills : OpenShift Virtualization Good to have skills : NAMinimum 3 year(s) of experience is required Educational Qualification : 15 years full time education Summary :As an Application Support Engineer, you will act as software detectives, providing a dynamic service identifying and solving issues within multiple components of critical business systems. Your day will involve troubleshooting, resolving technical issues, and ensuring seamless operation of applications. Roles & Responsibilities:- Expected to perform independently and become an SME.- Required active participation/contribution in team discussions.- Contribute in providing solutions to work related problems.- Proactively identify and resolve application issues.- Collaborate with cross-functional teams to troubleshoot and resolve technical problems.- Develop and maintain technical documentation for support processes.- Participate in on-call rotation to provide 24/7 support.- Conduct root cause analysis for recurring issues and implement preventive measures. Professional & Technical Skills: - Must To Have Skills: Proficiency in OpenShift Virtualization.- Strong understanding of cloud computing principles.- Experience with containerization technologies like Docker and Kubernetes.- Knowledge of scripting languages such as Python or Bash.- Familiarity with monitoring tools like Prometheus or Grafana. Additional Information:- The candidate should have a minimum of 3 years of experience in OpenShift Virtualization.- This position is based at our Bengaluru office.- A 15 years full time education is required. Qualification 15 years full time education

Posted 1 month ago

Apply

3.0 - 8.0 years

5 - 9 Lacs

Pune

Work from Office

Project Role : Application Developer Project Role Description : Design, build and configure applications to meet business process and application requirements. Must have skills : Apache Kafka Good to have skills : NAMinimum 3 year(s) of experience is required Educational Qualification : 15 years full time education Summary :As an Application Developer, you will be responsible for designing, building, and configuring applications to meet business process and application requirements. You will play a crucial role in developing innovative solutions to enhance business operations and user experience. Roles & Responsibilities:- Expected to perform independently and become an SME.- Required active participation/contribution in team discussions.- Contribute in providing solutions to work related problems.- Collaborate with cross-functional teams to design, develop, and implement applications.- Conduct code reviews and ensure code quality standards are met.- Troubleshoot and debug applications to optimize performance.- Stay updated with the latest technologies and trends in application development.- Provide technical guidance and support to junior team members. Professional & Technical Skills: - Must To Have Skills: Proficiency in Apache Kafka.- Strong understanding of distributed systems and event-driven architecture.- Experience with microservices architecture and containerization technologies like Docker and Kubernetes.- Hands-on experience in developing scalable and high-performance applications using Apache Kafka.- Knowledge of monitoring tools like Prometheus and Grafana. Additional Information:- The candidate should have a minimum of 3 years of experience in Apache Kafka.- This position is based at our Pune office.- A 15 years full time education is required. Qualification 15 years full time education

Posted 1 month ago

Apply

6.0 - 8.0 years

10 - 14 Lacs

Bengaluru

Work from Office

Project Role : Application Lead Project Role Description : Lead the effort to design, build and configure applications, acting as the primary point of contact. Must have skills : Websphere Application Server & Portal Administration Good to have skills : NAMinimum 5 year(s) of experience is required Educational Qualification : 15 years full time education Summary :As an Application Lead, you will lead the effort to design, build, and configure applications, acting as the primary point of contact. Your typical day will involve collaborating with various teams to ensure that application requirements are met, overseeing the development process, and providing guidance to team members. You will also engage in problem-solving activities, ensuring that the applications are functioning optimally and meeting the needs of the organization. Your role will require effective communication and coordination with stakeholders to align project goals and deliverables. Roles & Responsibilities:- Expected to be an SME.- Collaborate and manage the team to perform.- Responsible for team decisions.- Engage with multiple teams and contribute on key decisions.- Provide solutions to problems for their immediate team and across multiple teams.- Facilitate knowledge sharing and mentoring within the team to enhance overall performance.- Monitor project progress and ensure timely delivery of application features and updates. Knowledge, Skills and Experience- 6-8 years WebSphere Portal/HCl DX(Digital Experience) experience- Strong expertise performing WebSphere Portal/HCL DX(Digital Experience) and WebSphere Application Servers Administration in a cluster environment on Linux- Proven experience installing, upgrading and supporting WebSphere Portal/HCL DX(Digital Experience) with Oracle RDBMS as a backend database and Apache/IHS as frontend webserver.- Good understanding of web analytics, pmi metrics- Must have completed at least 2 upgrades/migrations- Experience with xmlaccess, configengine, wsadmin, jacl, jython, perl, python and shell scripting- Fluent in English- Knowledge in other supporting products such as Grafana/Prometheus, Splunk, Watson Enterprise Search, SVN/Bitbucket/GIT/Maven/Artifactory/Jenkins is preferred- WebSphere Portal Administration certification is desirable Professional & Technical Skills: - Must To Have Skills: Proficiency in Websphere Application Server & Portal Administration.- Strong understanding of application design and architecture principles.- Experience with troubleshooting and resolving application issues.- Familiarity with deployment processes and application lifecycle management.- Ability to work collaboratively in a team environment and communicate effectively with stakeholders. Additional Information:- The candidate should have minimum 5 years of experience in Websphere Application Server & Portal Administration.- This position is based at our Bengaluru office.- A 15 years full time education is required. Qualification 15 years full time education

Posted 1 month ago

Apply

15.0 - 20.0 years

10 - 14 Lacs

Navi Mumbai

Work from Office

Project Role : Application Lead Project Role Description : Lead the effort to design, build and configure applications, acting as the primary point of contact. Must have skills : Automation in Application Maintenance Good to have skills : NAMinimum 7.5 year(s) of experience is required Educational Qualification : 15 years full time educationRole Description :The SRE and Automations Manager will be responsible for driving the reliability, scalability, and efficiency of AMS operations by leading the automation initiatives and SRE practices across both SAP and non-SAP landscapes. This individual will work closely with application support, infrastructure, DevOps, and ITSM teams to ensure high availability and performance of critical business applications.Key Responsibilities:SRE Responsibilities:- Establish and implement SRE practices such as myWizard and GenWizard app components across supported applications.- Collaborate with support teams to identify improvement areas in incident handling through runbooks, self-healing scripts, and observability tools.- Design and enforce proactive monitoring and alerting strategies for SAP and non-SAP applications using availabl .- Participate in capacity planning, performance tuning, and disaster recovery strategy formulation for delivery teams.Automation Responsibilities:- Define and execute the automation strategy for repetitive operational tasks including system health checks, report generation, job monitoring, user provisioning, and ticket triaging.- Drive the development of automation scripts using Python, PowerShell, Shell, ABAP (for SAP), or other tools as needed.- Partner with application SMEs and functional teams to identify automation use cases and deliver continuous value.- Ensure all automation activities are documented, version-controlled, and aligned with security policies.________________________________________Technical Skills & Tools:- Strong knowledge of SRE principles and automation frameworks.- Familiarity with non-SAP technologies such as Java, .NET, Oracle, SQL, or custom-built apps.- Tools:ServiceNow, Splunk, AppDynamics, Grafana, Prometheus, Jenkins, Git, Ansible, Python, Shell scripting, ABAP (basic automation).- Good to have exposure to cloud platforms (AWS/Azure/GCP) and hybrid environments.________________________________________Leadership & Soft Skills: - Ability to lead a small team of SREs and automation engineers.- Excellent analytical, problem-solving, and communication skills.- Strong stakeholder management skills and experience working in multi-vendor environments.- Agile/DevOps mindset with a focus on continuous improvement. Additional Information:- The candidate should have minimum 7.5 years of experience in Automation in Application Maintenance.- This position is based in Mumbai.- A 15 years full time education is required. Qualification 15 years full time education

Posted 1 month ago

Apply

3.0 - 5.0 years

5 - 9 Lacs

Bengaluru

Work from Office

Job Title: DevOps Engineer Location: Bangalore, KA Mode of Work: Work From Office (5 Days a Week) Job Type: Full-Time Department: Engineering/Operations : We are looking for a skilled DevOps Engineer to join our team in Bangalore . The ideal candidate will have hands-on experience with a range of technologies including Docker , Kubernetes (K8s) , JFrog Artifactory , SonarQube , CI/CD tools , monitoring tools , Ansible , and auto-scaling strategies. This role is key to driving automation, improving the deployment pipeline, and optimizing infrastructure for seamless development and production operations. You will collaborate with development teams to design, implement, and manage systems that improve the software development lifecycle and ensure a high level of reliability, scalability, and performance. Responsibilities: Containerization & Orchestration: Design, deploy, and manage containerized applications using Docker . Manage, scale, and optimize Kubernetes (K8s) clusters for container orchestration. Troubleshoot and resolve issues related to Kubernetes clusters, ensuring high availability and fault tolerance. Collaborate with the development team to containerize new applications and microservices. CI/CD Pipeline Development & Maintenance: Implement and optimize CI/CD pipelines using tools such as Jenkins , GitLab CI , or similar. Integrate SonarQube for continuous code quality checks within the pipeline. Ensure seamless integration of JFrog Artifactory for managing build artifacts and repositories. Automate and streamline build, test, and deployment processes to support continuous delivery. Monitoring & Alerts: Implement and maintain monitoring solutions using tools like Prometheus , Grafana , or others. Set up real-time monitoring, logging, and alerting systems to proactively identify and address issues. Create and manage dashboards for operational insights into application health, performance, and system metrics. Automation & Infrastructure as Code: Automate infrastructure provisioning and management using Ansible or similar tools. Implement Auto-Scaling solutions to ensure the infrastructure dynamically adjusts to workload demands, ensuring optimal performance and cost efficiency. Define, deploy, and maintain infrastructure-as-code practices for consistent and reproducible environments. Collaboration & Best Practices: Work closely with development and QA teams to integrate DevOps best practices into the software development lifecycle. Ensure a high standard of security and compliance within the CI/CD pipelines. Provide technical leadership and mentorship for junior team members on DevOps practices and tools. Participate in cross-functional teams to define, design, and deliver scalable software solutions. Debugging & Issue Resolution: Troubleshoot complex application and infrastructure issues across development, staging, and production environments. Apply root cause analysis to incidents and implement long-term fixes to prevent recurrence. Continuously improve monitoring and debugging tools for faster issue resolution.

Posted 1 month ago

Apply

6.0 - 8.0 years

6 - 10 Lacs

Pune

Work from Office

: Job TitleProduction Specialist, Associate LocationPune, India Role Description Our organization within Deutsche Bank is AFC Production Services. We are responsible for providing technical L2 application support for business applications. The AFC (Anti-Financial Crime) line of business has a current portfolio of 25+ applications. The organization is in process of transforming itself using Google Cloud and many new technology offerings. Your role will include hands-on production support and be actively involved in technical issues resolution across multiple applications. Deutsche Banks Corporate Bank division is a leading provider of cash management, trade finance and securities finance. We complete green-field projects that deliver the best Corporate Bank - Securities Services products in the world. Our team is diverse, international, and driven by shared focus on clean code and valued delivery. At every level, agile minds are rewarded with competitive pay, support, and opportunities to excel. You will work as part of a cross-functional agile delivery team. You will bring an innovative approach to software development, focusing on using the latest technologies and practices, as part of a relentless focus on business value. You will be someone who sees engineering as team activity, with a predisposition to open code, open discussion and creating a supportive, collaborative environment. You will be ready to contribute to all stages of software delivery, from initial analysis right through to production support." What we'll offer you As part of our flexible scheme, here are just some of the benefits that youll enjoy, Best in class leave policy. Gender neutral parental leaves 100% reimbursement under childcare assistance benefit (gender neutral) Sponsorship for Industry relevant certifications and education Employee Assistance Program for you and your family members Comprehensive Hospitalization Insurance for you and your dependents Accident and Term life Insurance Complementary Health screening for 35 yrs. and above Your key responsibilities Provide technical support by handling and consulting on BAU, Incidents/emails/alerts for the respective applications. Perform post-mortem, root cause analysis using ITIL standards of Incident Management, Service Request fulfillment, Change Management, Knowledge Management, and Problem Management. Analyze occurred errors out of the batch processing and interfaces of related systems. Resolution or Workaround determination and implementation Supporting the resolution of high impact incidents on our applications, including attendance at incident bridge calls Escalate incident tickets timely and communicate effectively with business users, development teams, and stakeholders. Providing resolution for open problems or ensuring that the appropriate parties have been tasked with doing so. Supporting the handover from new Projects / Applications into Production Services with Service Transition before Go Life Phase. Assist in the process to approve application code releases as well as tasks assigned to support to perform. Keep key stakeholders informed using communication templates. Automate routine tasks and enhance operational efficiencies through scripts and tools. Support the transition of applications to Google Cloud and new technologies offering. Proactively Identify performance bottlenecks and suggest optimization strategies. Support audit, compliance, and regulatory requirements related to AFC applications. The candidate will have to work in shifts as part of a Rota covering APAC and EMEA hours between 07:00 IST and 09:00 PM IST (2 shifts). In the event of major outages or issues we may ask for flexibility to help provide appropriate cover. Supporting On Call-Support activities Your skills and experience 4-8 years of experience in providing hands on IT application support. Bachelors degree from an accredited college or university with a concentration in Computer Science or IT-related discipline (or equivalent work experience/diploma/certification). Preferred: ITIL v3 foundation certification or higher. Clear and concise documentation in general and especially a proper documentation of the status of incidents, problems, and service requests in the Service Management tool. Monitoring ToolsKnowledge of Elastic Search, Control M, Grafana, Geneos, OpenShift, Prometheus, Google Cloud Monitoring,Airflow, Splunk Red Hat Enterprise Linux (RHEL) professional skill in searching logs, process commands, start/stop processes, use of OS commands to aid in tasks needed to resolve or investigate issues. Shell scripting knowledge a plus. Understanding of database concepts and exposure in working with Oracle, MS SQL, Big Query etc. databases. Ability to work across countries, regions, and time zones with a broad range of cultures and technical capability. Skills That Will Help You Excel Strong written and oral communication skills, including the ability to communicate technical information to a non-technical audience and good analytical and problem-solving skills. Analytical and problem-solving skills, with a structured approach to troubleshooting, issue resolution and its documentation. Able to train, coach, and mentor and know where each technique is best applied. Experience with GCP or another public cloud provider to build applications. Experience in an investment bank, financial institution or large corporation using enterprise hardware and software. Knowledge of Actimize, Mantas, and case management software is good to have. Working knowledge of Big Data Hadoop/Secure Data Lake is a plus. Prior experience in automation projects is great to have. Exposure to python, shell, Ansible or other scripting language for automation and process improvement Strong stakeholder management skills ensuring seamless coordination between business, development, and infrastructure teams. How we'll support you Training and development to help you excel in your career. Coaching and support from experts in your team A culture of continuous learning to aid progression. A range of flexible benefits that you can tailor to suit your needs.

Posted 1 month ago

Apply

6.0 - 8.0 years

12 - 16 Lacs

Bengaluru

Work from Office

: Job TitleSite Reliability Engineer LocationBangalore, India Corporate TitleAssociate Role Description You will work closely with application teams to ensure stable, well monitored applications that are resilient to faults. You will agree and review Service Level Objectives (SLOs) to achieve high availability for applications based on their criticality. You will maintain Error Budgets for the application teams and prevent releases in the event of production instability and reduced availability. You will focus on reducing manual toil, improving operational reliability and driving automation-first practices. This is a hands-on role with strong focus on implementing SRE practices and reducing toil for Developer Tools. What we'll offer you As part of our flexible scheme, here are just some of the benefits that youll enjoy Best in class leave policy Gender neutral parental leaves 100% reimbursement under childcare assistance benefit (gender neutral) Sponsorship for Industry relevant certifications and education Employee Assistance Program for you and your family members Comprehensive Hospitalization Insurance for you and your dependents Accident and Term life Insurance Complementary Health screening for 35 yrs. and above Your key responsibilities Drive stability, performance and reliability improvements for TDI Engineering applications. Build Monitoring and alerting solutions to alert in the event of failures/performance issues across TDI Engineering applications to help us providing the optimum service level to the users. Provide feedback loops to continually improve the application resilience across multiple application teams. Collaborate with product owners and engineering team to prioritize reliability and stability of these applications. Define, measure and maintain SLOs and Error Budgets to ensure availability for end users and to achieve appropriate levels of application stability. Identify opportunities for automation and self-service capabilities and implement them to eliminate toil for both the application teams and the SRE team to optimise effectiveness Manage outage resolution and agree actions to reduce the likelihood of failure happening in future by owning RCA and conducting blameless postmortems. Your skills and experience Bachelors degree from an accredited college or university with a concentration in Computer Science or IT-related discipline (or equivalent work experience or diploma). 4+ Years of Experience in IT in large corporate environments, specifically in controlled production environments. Demonstrable Site Reliability Engineering experience of at least 2+ Years. Excellent analytical and problem-solving skills Experience in implementing observability solution using any industry standard tools Scripting skills (Groovy, shell, Bash, Cron or any equivalent) Experience in mid-range technologies and platforms, i.e. UNIX/LINUX, ORACLE database and Nginx experience. Good to have: Understanding and experience in Developer Tools (Jira, Confluence, Bitbucket, TeamCity, Artifactory, Udeploy) as an enterprise level Administrator experienced in managing applications with large user base. Knowledge and experience of observability tools like Grafana, Prometheus. How we'll support you Training and development to help you excel in your career Coaching and support from experts in your team A culture of continuous learning to aid progression A range of flexible benefits that you can tailor to suit your needs

Posted 1 month ago

Apply

6.0 - 8.0 years

10 - 15 Lacs

Bengaluru

Work from Office

: Job TitleSite Reliability Engineer LocationBangalore,India Corporate TitleAnalyst Role Description You will work closely with application teams to ensure stable, well monitored applications that are resilient to faults. You will agree and review Service Level Objectives (SLOs) to achieve high availability for applications based on their criticality. You will maintain Error Budgets for the application teams and prevent releases in the event of production instability and reduced availability. You will focus on reducing manual toil, improving operational reliability and driving automation-first practices. This is a hands-on role with strong focus on implementing SRE practices and reducing toil for Developer Tools. What we'll offer you As part of our flexible scheme, here are just some of the benefits that youll enjoy Best in class leave policy Gender neutral parental leaves 100% reimbursement under childcare assistance benefit (gender neutral) Sponsorship for Industry relevant certifications and education Employee Assistance Program for you and your family members Comprehensive Hospitalization Insurance for you and your dependents Accident and Term life Insurance Complementary Health screening for 35 yrs. and above Your key responsibilities Drive stability, performance and reliability improvements for TDI Engineering applications. Build Monitoring and alerting solutions to alert in the event of failures/performance issues across TDI Engineering applications to help us providing the optimum service level to the users. Provide feedback loops to continually improve the application resilience across multiple application teams. Collaborate with product owners and engineering team to prioritize reliability and stability of these applications. Define, measure and maintain SLOs and Error Budgets to ensure availability for end users and to achieve appropriate levels of application stability. Identify opportunities for automation and self-service capabilities and implement them to eliminate toil for both the application teams and the SRE team to optimise effectiveness Manage outage resolution and agree actions to reduce the likelihood of failure happening in future by owning RCA and conducting blameless postmortems. Your skills and experience Bachelors degree from an accredited college or university with a concentration in Computer Science or IT-related discipline (or equivalent work experience or diploma). 2+ Years of Experience in IT in large corporate environments, specifically in controlled production environments. Demonstrable Site Reliability Engineering experience of at least 1+ Years. Excellent analytical and problem-solving skills Experience in implementing observability solution using any industry standard tools Scripting skills (Groovy, shell, Bash, Cron or any equivalent) Experience in mid-range technologies and platforms, i.e. UNIX/LINUX, ORACLE database and Nginx experience . Good to have Understanding and experience in Developer Tools (Jira, Confluence, Bitbucket, TeamCity, Artifactory, Udeploy) as an enterprise level Administrator experienced in managing applications with large user base. Knowledge and experience of observability tools like Grafana, Prometheus. How we'll support you Training and development to help you excel in your career Coaching and support from experts in your team A culture of continuous learning to aid progression A range of flexible benefits that you can tailor to suit your needs

Posted 1 month ago

Apply

6.0 - 8.0 years

37 - 40 Lacs

Pune

Work from Office

: Job TitleProduction Specialist, AVP LocationPune, India Role Description Our organization within Deutsche Bank is AFC Production Services. We are responsible for providing technical L2 application support for business applications. The AFC (Anti-Financial Crime) line of business has a current portfolio of 25+ applications. The organization is in process of transforming itself using Google Cloud and many new technology offerings. As an Assistant Vice President, your role will include hands-on production support and be actively involved in technical issues resolution across multiple applications. You will also be working as application lead and will be responsible for technical & operational processes for all application you support. Deutsche Banks Corporate Bank division is a leading provider of cash management, trade finance and securities finance. We complete green-field projects that deliver the best Corporate Bank - Securities Services products in the world. Our team is diverse, international, and driven by shared focus on clean code and valued delivery. At every level, agile minds are rewarded with competitive pay, support, and opportunities to excel. You will work as part of a cross-functional agile delivery team. You will bring an innovative approach to software development, focusing on using the latest technologies and practices, as part of a relentless focus on business value. You will be someone who sees engineering as team activity, with a predisposition to open code, open discussion and creating a supportive, collaborative environment. You will be ready to contribute to all stages of software delivery, from initial analysis right through to production support." What we'll offer you As part of our flexible scheme, here are just some of the benefits that youll enjoy, Best in class leave policy. Gender neutral parental leaves 100% reimbursement under childcare assistance benefit (gender neutral) Sponsorship for Industry relevant certifications and education Employee Assistance Program for you and your family members Comprehensive Hospitalization Insurance for you and your dependents Accident and Term life Insurance Complementary Health screening for 35 yrs. and above Your key responsibilities Provide technical support by handling and consulting on BAU, Incidents/emails/alerts for the respective applications. Perform post-mortem, root cause analysis using ITIL standards of Incident Management, Service Request fulfillment, Change Management, Knowledge Management, and Problem Management. Manage regional L2 team and vendor teams supporting the application. Ensure the team is up to speed and picks up the support duties. Build up technical subject matter expertise on the applications being supported including business flows, application architecture, and hardware configuration. Define and track KPIs, SLAs and operational metrics to measure and improve application stability and performance. Conduct real time monitoring to ensure application SLAs are achieved and maximum application availability (up time) using an array of monitoring tools. Build and maintain effective and productive relationships with the stakeholders in business, development, infrastructure, and third-party systems / data providers & vendors. Assist in the process to approve application code releases as well as tasks assigned to support to perform. Keep key stakeholders informed using communication templates. Approach support with a proactive attitude, desire to seek root cause, in-depth analysis, and strive to reduce inefficiencies and manual efforts. Mentor and guide junior team members, fostering technical upskill and knowledge sharing. Provide strategic input into disaster recovery planning, failover strategies and business continuity procedures Collaborate and deliver on initiatives and install these initiatives to drive stability in the environment. Perform reviews of all open production items with the development team and push for updates and resolutions to outstanding tasks and reoccurring issues. Drive service resilience by implementing SRE(site reliability engineering) principles, ensuring proactive monitoring, automation and operational efficiency. Ensure regulatory and compliance adherence, managing audits,access reviews, and security controls in line with organizational policies. The candidate will have to work in shifts as part of a Rota covering APAC and EMEA hours between 07:00 IST and 09:00 PM IST (2 shifts). In the event of major outages or issues we may ask for flexibility to help provide appropriate cover. Weekend on-call coverage needs to be provided on rotational/need basis. Your skills and experience 9-15 years of experience in providing hands on IT application support. Experience in managing vendor teams providing 24x7 support. Preferred Team lead role experience, Experience in an investment bank, financial institution. Bachelors degree from an accredited college or university with a concentration in Computer Science or IT-related discipline (or equivalent work experience/diploma/certification). Preferred ITIL v3 foundation certification or higher. Knowledgeable in cloud products like Google Cloud Platform (GCP) and hybrid applications. Strong understanding of ITIL /SRE/ DEVOPS best practices for supporting a production environment. Understanding of KPIs, SLO, SLA and SLI Monitoring ToolsKnowledge of Elastic Search, Control M, Grafana, Geneos, OpenShift, Prometheus, Google Cloud Monitoring, Airflow,Splunk. Working Knowledge of creation of Dashboards and reports for senior management Red Hat Enterprise Linux (RHEL) professional skill in searching logs, process commands, start/stop processes, use of OS commands to aid in tasks needed to resolve or investigate issues. Shell scripting knowledge a plus. Understanding of database concepts and exposure in working with Oracle, MS SQL, Big Query etc. databases. Ability to work across countries, regions, and time zones with a broad range of cultures and technical capability. Skills That Will Help You Excel Strong written and oral communication skills, including the ability to communicate technical information to a non-technical audience and good analytical and problem-solving skills. Proven experience in leading L2 support teams, including managing vendor teams and offshore resources. Able to train, coach, and mentor and know where each technique is best applied. Experience with GCP or another public cloud provider to build applications. Experience in an investment bank, financial institution or large corporation using enterprise hardware and software. Knowledge of Actimize, Mantas, and case management software is good to have. Working knowledge of Big Data Hadoop/Secure Data Lake is a plus. Prior experience in automation projects is great to have. Exposure to python, shell, Ansible or other scripting language for automation and process improvement Strong stakeholder management skills ensuring seamless coordination between business, development, and infrastructure teams. Ability to manage high-pressure issues, coordinating across teams to drive swift resolution. Strong negotiation skills with interface teams to drive process improvements and efficiency gains. How we'll support you Training and development to help you excel in your career. Coaching and support from experts in your team A culture of continuous learning to aid progression. A range of flexible benefits that you can tailor to suit your needs. About us and our teams Please visit our company website for further information: https://www.db.com/company/company.htm We strive for a culture in which we are empowered to excel together every day. This includes acting responsibly, thinking commercially, taking initiative and working collaboratively. Together we share and celebrate the successes of our people. Together we are Deutsche Bank Group. We welcome applications from all people and promote a positive, fair and inclusive work environment.

Posted 1 month ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies