Application Support Engineer- SRE ORACLE FINANCIAL SERVICES SOFTWARE LIMITED

6.0 - 10.0 years

0 Lacs

karnataka

On-site

As an AMS/SRE Lead, within the Oracle Banking Cloud Services (OBCS) SaaS team, you will assist in designing building, deploying, and operating a micro services-based cloud native SaaS services with extremely high availability and scalability requirements. You will work as a lead member of our OBCS site reliability engineering team who provides guidance to SRE team. You will work in collaboration with product engineering and SaaS DevOps teams to evolve systems/products for better scalability, reliability and enable developer velocity. You will be responsible to ensure our services and systems are designed and build from the start with reliability, scalability, and observability as a critical feature. You will also author, review and maintain operational run books to help reduce incident resolution time and be responsible for managing and triaging operational tickets pertaining to the OBCS services. Emphasis on driving prioritization and execution of work based on business impact is a must. Responsibilities displayed in the job posting Responsibilities: Providing leadership, direction, and strategy to the AMS/SRE team Deploy software to SaaS environments with the key goals of improving the availability, scalability, and efficiency of Oracle products and services. Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning. Work as a member of the development team and share full stack ownership of a collection of services and/or technology area. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Articulate technical characteristics of services and technology areas and guide development teams to engineer and add capabilities to internal Oracle services. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and the dependencies required to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on micro services-based cloud native SaaS services. Understand and explain SaaS application availability, RTO (Recovery Time Objective) and RPO (Recovery Point Objective) and its impacts as part of incidents and system down time Serve as part of a 24x7 On Call rotation in support of the OBCS SaaS Suite Professional curiosity and a desire to a develop deep understanding of services and technologies. Mandatory Qualifications: Minimum 6+ years of experience in the banking and financial services industry Minimum 3+ years of experience working with cloud (IaaS/PaaS) / SaaS based application deployments, monitoring and production support including Kubernetes / Docker based deployments Experience working with fully managed fault tolerant, highly available, high throughput, multi-tenant, scalable systems Execute, with excellence, delivery of interim patches and hotfixes as required High level Oracle database administration / operations knowledge Experience with Monitoring and Observability technologies like Prometheus, Grafana, OCI Logging or equivalents like ELK Experience with CI/CD pipelines including GitLab Multi Fault Domain (FD), Availability Domain (AD) and Availability Region (AR) based SaaS services deployments Familiarity with security practices in web application delivery and general knowledge of network topology SaaS environment capacity management Experience in working with Agile development frameworks Aptitude to be a good team player and the desire to learn and implement new Cloud technologies as needed,

Posted 2 days ago

Apply

Database Administrator II Halodoc

3.0 - 7.0 years

0 Lacs

karnataka

On-site

As a Database Administrator (DBA) at Halodoc, you will play a crucial role in ensuring the smooth operation and optimization of our database systems. Your expertise in both relational databases like MySql and PostgreSQL, as well as NoSQL databases such as documentDB and MongoDB on AWS, will be essential for maintaining the integrity and security of our data. Your responsibilities will include implementing best practices for database access and security, handling database migration tasks using AWS services, and collaborating with the Site Reliability Engineering (SRE) team on database automation. You will be expected to monitor database performance metrics using tools like PMM, AWS RDS console, performance insights, and cloudwatch to proactively identify and resolve potential issues. With your hands-on experience in planning and executing database activities with minimal downtime, you will work closely with the development team to design effective database solutions. Proficiency in SQL, PL/SQL, and database scripting languages will be essential for your success in this role, along with a good understanding of different file formats like xml, yml, json, and parquet. Additionally, your role will involve engaging in service capacity planning, demand forecasting, performance analysis, and system tuning in AWS. You will be expected to write runbooks effectively and automate repeatable actions to enhance operational efficiency. Familiarity with tools like Jenkins, Gitlab, terraform, and other development tools will be advantageous. To qualify for this position, you should have 3 to 6 years of industry experience and exposure to AWS services like DynamoDB, DocumentDB, Redshift, DMS, and CloudWatch. An interest in learning DevOps on AWS will be a valuable asset as you contribute to our dynamic and innovative work environment. At Halodoc, you will have the opportunity to work with cutting-edge technologies, receive comprehensive medical insurance benefits, use MacBooks provided for work, and enjoy a hybrid work mode for flexibility. Join us in revolutionizing healthcare in Indonesia and beyond by becoming a part of our dedicated and forward-thinking team.,

Posted 2 days ago

Apply

Senior Infrastructure Engineer - zOS MQ engineering Wells Fargo

4.0 - 9.0 years

7 - 17 Lacs

Bengaluru

Work from Office

About this role: Wells Fargo is seeking a Mainframe Infrastructure Senior Engineer to support zOS MQ engineering and support. In this role, you will: Lead or participate in high level technical concepts spanning technology and business Develop specifications for complex infrastructure systems, design and test solutions Contribute to the testing of business, application and technical infrastructure requirements Drive solutions to reduce recovery Review and analyze solutions for cloud security, secrets management and key rotations Design, code, test, debug and document programs using Agile development practices Design complex system upgrades Resolve troublesome trends as they develop Develop a long range plan designed to resolve problems and prevent them from recurring Direct the daily risk and control flow of operations, focusing on policies, procedures and work standards to ensure success Collaborate and consult with peers, colleagues and managers to resolve issues and achieve goals Required Qualifications: 4+ years of Technology Infrastructure Engineering and Solutions experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education Desired Qualifications: Good hands-on experience on MQ cluster concepts, CSQUTIL utility and 'IR360' tool. Install, configure, and maintain IBM MQ (WebSphere MQ) on z/OS mainframes. Manage and monitor MQ components including Queue Managers, Queues, Channels, and Listeners. Perform troubleshooting and root cause analysis of MQ-related issues. Work with application development teams to design and implement MQ messaging solutions. Ensure MQ environments are secure, stable, and compliant with internal standards. Perform system tuning, capacity planning, and disaster recovery planning. Develop and maintain MQ-related documentation, including configuration, architecture diagrams, and SOPs. Support and automate MQ operations using tools such as REXX, JCL, or scripting (e.g., Python, Shell). Participate in on-call rotation and provide off-hours support as needed. Collaborate with cross-functional teams including security, networking, and application support. Knowledge in Mainframe infrastructure support functions Experience in Agile/Kanban board methodologies Product operating model concepts Exposure to AI tools/practices Job Expectations: On-call, morning/afternoon shifts as per business need Work with global infrastructure/operations, application, risk/compliance management partners etc.

Posted 1 week ago

Apply

Site Reliability Developer - 2 Oracle

3.0 - 5.0 years

0 Lacs

, India

On-site

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning. Career Level - IC3 Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.

Posted 3 weeks ago

Apply

Site Reliability Developer 2 Oracle

3.0 - 5.0 years

0 Lacs

Bengaluru / Bangalore, Karnataka, India

On-site

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning. Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies. Career Level - IC2

Posted 1 month ago

Apply

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.