Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
4.0 - 6.0 years
7 - 17 Lacs
Bengaluru
Work from Office
About this role: Wells Fargo is seeking a Senior Systems Operations Engineer. We believe in the power of working together because great ideas can come from anyone. Through collaboration, any employee can have an impact and make a difference for the entire company. Explore opportunities with us for a career in a supportive environment where you can learn and grow In this role, you will: Lead or participate in managing all installed systems and infrastructure within the Systems Operations functional area Contribute in increasing system efficiencies and lowering the human intervention time on related tasks Review and analyze moderately complex operational support systems, application software, and system management tools to ensure the highest levels of systems and infrastructure availability Work with vendors and other technical personnel for problem resolution Lead team to meet technical deliverables while leveraging solid understanding of technical process controls or standards Collaborate with vendors and other technical personnel to resolve technical issues and achieve highest levels of systems and infrastructure availability Required Qualifications: 4+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education Desired Qualifications: Lead and participate in incident response activities, including identifying, investigating, and resolving incidents to minimize impact on service availability and performance. Conduct post-incident reviews (postmortems) to identify root causes and implement preventative measures. Define and monitor SLOs and SLIs for critical services to ensure they meet performance and reliability targets. Regularly review and adjust these metrics as necessary Continuously evaluate and improve processes, tools, and infrastructure to enhance reliability, efficiency, and scalability. Stay up-to-date with industry trends, emerging technologies, and best practices, and drive innovation within the organization. Monitor system health and performance using monitoring tools and alerting systems, and respond promptly to alerts and incidents. Drive efficiency by automating repetitive tasks and processes. Evaluate and implement technology options for managing our enterprise SaaS products in the cloud. Enhance our platform by identifying areas for improvement based on monitoring data. Work closely with the development team to create a development environment that fosters productivity and innovation. Propose and drive adoption of new solutions that enhance our platform. Job Expectations: Hold a Bachelor degree in Engineering or a related field. Minimum 4 years of relevant experience in Platform Engineering, SRE, and/or DevOps in production environments. Expertise in Clous setup with 3+ years of hands-on experience. Proven track record of owning the uptime of distributed cloud-based systems. Possess at least 3 years of experience with scripting languages and related automation projects. Experience in building and using Observability frameworks for a microservice based distributed cloud setup with tools such as Prometheus, Grafana, AppDynamics, Splunk etc. Proficient in setting up and managing CI/CD pipelines and deployment tools (e.g., Jenkins, Git, GitHub etc). Strong Database knowledge is required :Oracle MongoDB Experienced is 24x7 Support model for Cloud uptime and maintenance activities Strong spoken and written English communication skills. Self-driven, responsible, eager to learn, and proactive. Independent, goal-oriented, and proactive attitude. Disciplined and effective in remote work environments.
Posted 1 week ago
7.0 - 12.0 years
1 - 5 Lacs
Bengaluru
Work from Office
About The Role Project Role : Application Tech Support Practitioner Project Role Description : Act as the ongoing interface between the client and the system or application. Dedicated to quality, using exceptional communication skills to keep our world class systems running. Can accurately define a client issue and can interpret and design a resolution based on deep product knowledge. Must have skills : DevOps Good to have skills : NAMinimum 7.5 year(s) of experience is required Educational Qualification : 15 years full time education Summary :As an Application Tech Support Practitioner, you will act as the ongoing interface between the client and the system or application. You will be dedicated to quality, using exceptional communication skills to keep our world-class systems running. With your deep product knowledge, you will accurately define client issues and design resolutions. Your typical day will involve collaborating with clients, interpreting and resolving issues, and ensuring smooth system operations. Roles & Responsibilities:- Expected to be an SME, collaborate and manage the team to perform.- Responsible for team decisions.- Engage with multiple teams and contribute on key decisions.- Provide solutions to problems for their immediate team and across multiple teams.- Manage and prioritize support tickets to ensure timely resolution.- Identify and analyze system or application issues and provide effective solutions.- Communicate with clients to understand their needs and provide technical assistance.- Conduct regular system audits to ensure optimal performance. Professional & Technical Skills: - Must To Have Skills: Proficiency in DevOps.- Experience with cloud platforms such as AWS or Azure.- Strong knowledge of CI/CD pipelines and automation tools like Jenkins or GitLab.- Familiarity with containerization technologies like Docker and Kubernetes.- Good understanding of infrastructure-as-code principles.- Experience with monitoring and logging tools like Prometheus and ELK stack. Additional Information:- The candidate should have a minimum of 7.5 years of experience in DevOps.- This position is based at our Bengaluru office.- A 15 years full-time education is required. Qualification 15 years full time education
Posted 1 week ago
5.0 - 10.0 years
0 - 0 Lacs
Bengaluru
Work from Office
Job Purpose We are seeking an Observability Architect to join our team and be responsible for the design, implementation, and maintenance of our company's observability practices and tools. The Observability Architect will work closely with the Lead Engineers, Managers and Architects of other departments to gather requirements and provide solutions for ensuring our systems' reliability, availability, and performance. They will also implement monitoring, logging, and alerting strategies and be responsible for monitoring and optimizing system performance. This role will require hands on experience and skills. Role & responsibilities Design, implement, and maintain observability practices and tools Work closely with Lead Observability Engineers and Architects/engineers of Other departments to gather requirements and provide solutions Implement monitoring, logging, and alerting strategies Develop and implement dashboards, alerts, and metrics to track system health and performance Monitor and optimize system performance Identify and resolve system-related issues Keep up-to-date with new technologies and industry trends Skills: Strong knowledge of monitoring and logging tools such as Prometheus, Grafana, and Elasticsearch Experience with APM tools like Dynatrace, New Relic, Data Dog, Splunk Experience with Cloud monitoring service like AWS CloudWatch, Azure Monitor, GCP StackDriver Strong understanding of distributed systems and containerization technologies such as Kubernetes, and Cloud GCP/AWS Strong problem-solving and analytical skills Excellent communication and teamwork abilities Be able to mentor and build a team of engineers who will be specialized in observability engineering
Posted 1 week ago
7.0 - 12.0 years
4 - 8 Lacs
Chennai
Work from Office
This is for a Lead Support Analyst role, reporting to the Application Support Manager within ISPL ALMT APS.The role includes ITIL Operations, Transition & Design activity, project management, managing ISPL team, strongly collaborating with the global team and the other Technology teams in ISPL. This is an excellent opportunity for a highly motivated and skilled candidate to join a very dynamic company and work in an exciting environment. APS team member would be working on standard banking softwares & in-house applications etc. The APS team member is responsible for providing production functional support, maintenance of key application platforms, deployment within the CIB ALMT APS domain. Responsibilities Direct Responsibilities Candidate must work as level 1/2 support analyst to bring technical and product issues to resolve. Responsible for monitoring production environment and act proactively to prevent performance issues or application crash. Responsible for resolving support issue by using his functional/ technical expertise and flexible enough to look for solutions that may be out of the box. Handling ITIL Methodologies like Change, Incident, Problem, and Service Management Monitoring night batch and ensuring reports are generated well and transferred to client by adhering the SLA defined. Monitor the recurrent incidents, perform problem management and escalate to the next level of support or development team when required Coordinate with Infrastructure teams on events of patching & up gradation of servers to ensure the applications are stable & running after the infra work Responsible for UAT/PROD deployment & validation, Analyzing/documenting problems, recommending solutions, & initiating corrective action Plan and implement application releases, load tests and configuration changes. Customize production tools (monitoring, batch scheduling, backups etc. Contributing Responsibilities Providing coaching and mentoring to junior colleagues, transferring skills and expertise as required. Participate to DRP activities Should have good experience knowledge on L1/L2 Functional Support Eager to learn Banking Domain Knowledge (Corporate & Investment) Technical & Behavioral Competencies Technical:- Must have technologies (Hands on experience) PRIMARY SKILLS BPM Tools None Databases Knowledge on Sybase IQ/Sybase ASE and SQL server UNIX Knowledge on Linux/Unix systems Knowledge on Windows Mounts/NFS ITIL Good Understanding of ITIL framework, especially on Incident, Request, Problem and Change management. Should have worked in ITIL process and best practices. Middleware Knowledge on Apache, Kubernetes, Kafka, MQ series, Consul, Signal Deployment Process Scripting Scripting (Ansible, Python, Shell, SQL ...) and development skills for administration. DEVOPS Tools Knowledge on Jenkins, Ansible Tower, bitbucket Scheduler & Monitoring Tools: Knowledge on Schedulers, Crontab, Autosys /DollarU Knowledge on Monitoring Tools such as Dynatrace, Prometheus, Grafana SECONDARY SKILLS Networking Network topologies, ability to identify Network issues. Database File Transfer/Security Knowledge on CFT, security authentication, certificate renewals Windows/Unix/Linux OS Knowledge on UNIX/ Linux /Windows Operating System andDOS Commands FUNCTIONAL KNOWLEDGE DOMAIN Excellent knowledge on various banking, and finance applications Excellent knowledge on various financial Products in FX, Derivatives Excellent knowledge on Securities domain Excellent knowledge on Payments Tools, Trade Settlement Excellent knowledge on accounting BEHAVIOURAL Proactive with good verbal and written communication skills Good analytical and problem-solving skills Ability to understand and interpret complex technical issues and identify solutions, quickly and efficiently. Ability to prioritize workloads and manage conflicting requests on time in a continually fast-moving environment. Enthusiastic to work in challenging environment Effective problem-solving abilities Ability to adapt to change Exhibits positive interpersonal and team skills Commitment to company quality standards including issue resolution timescales, quality improvement and commitment to resolving issues correctly. Aptitude for learning Specific Qualifications(if required) Skills Referential Behavioural Skills : (Please select up to 4 skills) Communication skills - oral & written Creativity & Innovation / Problem solving Attention to detail / rigor Ability to collaborate / Teamwork Transversal Skills: Analytical Ability Ability to develop and adapt a process Ability to develop others & improve their skills Choose an item. Choose an item. Education Level: Bachelor Degree or equivalent Experience Level At least 7 years -
Posted 1 week ago
15.0 - 20.0 years
9 - 14 Lacs
Mumbai
Work from Office
This position is for Site reliability Engineer within Client Engagement and Protection APS team. The primary purpose is to be accountable for all core engineering / transformation activities of ISPL Transversal CEP APS Responsibilities Direct Responsibilities Automate away toil using a combination of scripting, tooling, and process improvements Drive transformation strategies involving infrastructure hygiene / end of life Implementing new technologies or processes to improve efficiency and reduce costs eg:- CI/CD implementation Monitoring system performance and capacity levels to ensure high availability of applications with minimal downtime Investigating any service disruptions or other service issues to identify their causes Performing regular audits of computer systems to check for signs of degradation or malfunction Developing and implementing new methods of measuring service quality and customer satisfaction Conducting capacity planning to ensure that new technologies can be accommodated without impacting existing users Conducting post-mortem examinations of failed systems to identify and address root cause Drive various Automation, Monitoring & Tooling common purpose initiatives across CEP APS and other teams within CIB APS Accountable for generation, reporting and improvements of various Production KPIs, SLs and dashboards for APS teams Accountable for improvements in service and presentations for all governances and steering committees Accountable for maintenance and improvement of IT continuity plans (ICP) Contributing Responsibilities Technical & Behavioral Competencies Strong knowledge of DevOps methodology and toolsets Strong knowledge of Cloud based applications/services Strong knowledge of APM Tools i.e. Dynatrace / AppDynamics Strong Distributed Computing and Database technologies skillset Strong knowledge of Jenkin, Ansible, Python, Scripting etc. Good understanding of Log aggregators i.e. Splunk/ELK Good understanding of observability tools i.e. Grafana / Prometheus Ability to work with various APS, Development, Operations stakeholders, locally and globally Dynamic, proactive and teamwork oriented Independent, self-starter and fast learner Good communication and interpersonal skills Practical knowledge of change, incident & problem management tools Innovative and transformational mindset Flexible attitude Ability to perform under pressure Strong analytical skills Preferred to have ITIL Dockers/Kubernetes Prior knowledge on Site Reliability Engineering / Dev-Ops / Application Production Support / Development background Specific Qualifications (if required) Graduate in any discipline or Bachelor in Information Technology 15 of IT experience Skills Referential Behavioural Skills : Ability to collaborate / Teamwork Creativity & Innovation / Problem solving Ability to deliver / Results driven Communication skills - oral & written Transversal Skills: Ability to manage a project Ability to set up relevant performance indicators Ability to anticipate business / strategic evolution Ability to develop and adapt a process Analytical Ability Education Level: Bachelor Degree or equivalent Experience Level At least 15 years
Posted 1 week ago
3.0 - 5.0 years
3 - 7 Lacs
Gurugram
Work from Office
Senior Software Engineer Location-Gurugram Designation-Senior Software Engineer Experience-3 - 5 Years Key Responsibilities: 5G Core Network Testing: Strong Hands-on knowledge of 3G/4G Signalling and Userplane flows. Conduct functional, performance, and regression testing for 5G Core Network elements such as AMF, SMF, UPF, AUSF, NRF, UDM, PCF, NSSF, and NWDAF. Validate 3GPP compliance for protocols including HTTP/2, PFCP, SCTP, NGAP, and N2/N3 interfaces. Automation and Test Frameworks: Develop, execute, and maintain automated test scripts using tools like Robot Framework, Selenium, or custom Python/Go-based solutions. Collaborate with the DevOps team to integrate automated test suites into CI/CD pipelines. Test Lab Setup and Maintenance: Set up and configure 5G Core test environments, including simulators, real-time network elements, and protocol analyzers. Manage lab resources, including test servers, virtual machines, Kubernetes clusters, and network traffic generators. Performance and Load Testing: Utilize tools like Spirent, IXIA, or Keysight for performance testing and load generation. Analyze and troubleshoot issues related to scalability, throughput, latency, and resilience. Defect Management and Reporting: Identify, document, and track defects using tools like JIRA. Provide comprehensive test reports and metrics to stakeholders. Collaboration and Standards: Work closely with development, operations, and QA teams to ensure alignment on testing requirements. Stay updated with 3GPP specifications, testing standards, and industry trends. Required Skills and Qualifications: Bachelors or Masters degree in Computer Science, Telecommunications, or a related field. Experience: 3-5 years in testing and validating telecom networks, preferably 5G Core or LTE EPC. Technical Expertise: Strong understanding of 3GPP specifications, especially TS 23.501, TS 23.502, TS 23.503, and TS 29.xxx series. Hands-on experience with protocol testing tools such as Wireshark, DS Tester, or TShark. Proficiency in scripting languages (Python, Bash) and/or programming (Golang). Tools and Platforms: Experience with Kubernetes, Docker, and cloud platforms (AWS, Azure, or GCP). Familiarity with Grafana, Prometheus, or other monitoring tools. Soft Skills: Strong analytical and troubleshooting skills.Effective communication and documentation skills.
Posted 1 week ago
3.0 - 6.0 years
5 - 10 Lacs
Mumbai, Mumbai Suburban, Navi Mumbai
Work from Office
Description Education B.E./B.Tech/MCA in Computer Science Experience 3 to 6 Years of Experience in Kubernetes/GKE/AKS/OpenShift Administration Mandatory Skills ( Docker and Kubernetes) Should have good understanding of various components of various types of kubernetes clusters (Community/AKS/GKE/OpenShift) Should have provisioning experience of various type of kubernetes clusters (Community/AKS/GKE/OpenSHIFT) Should have Upgradation and monitoring experience of variouos type of kubernetes clusters (Community/AKS/GKE/OpenSHIFT) Should have good experience on Conatiner Security Should have good experience of Container storage Should have good experience on CICD workflow (Preferable Azure DevOps, Ansible and Jenkin) Should have goood experiene / knowlede of cloud platforms preferably Azure / Google / OpenStack Should have good experience of container runtimes like docker/cotainerd Should have basic understanding of application life cycle management on container platform Should have good understatning of container registry Should have good understanding of Helm and Helm Charts Should have good understanding of container monitoring tools like Prometheus, Grafana and ELK Should have good exeperince on Linux operating system Should have basis understanding of enterprise networks and container networks Should able to handle Severity#2 and Severity#3 incidents Good communication skills Should have capability to provide the support Should have analytical and problem solving capabilities, ability to work with teams Should have experince on 24*7 operation support framework) Should have knowledge of ITIL Process Preferred Skills/Knowledge Container Platforms - Docker, Kubernetes, GKE, AKS OR OpenShift Automation Platforms - Shell Scripts, Ansible, Jenkin Cloud Platforms - GCP/AZURE/OpenStack Operating System - Linux/CentOS/Ubuntu Container Storage and Backup Desired Skills 1. Certified Kubernetes Administrator OR 2. Certified Redhat OpenShift Administrator 3. Certification of administration of any Cloud Platform will be an added advantage Soft Skills 1. Must have good troubleshooting skills 2. Must be ready to learn new technologies and acquire new skills 3. Must be a Team Player 4. Should be good in Spoken and Written English
Posted 1 week ago
4.0 - 6.0 years
4 - 8 Lacs
Gurugram
Work from Office
Software Engineer Location-Gurugram Designation-Senior Software Engineer (Python) Experience-4 - 6 Years Technical Expertise: Strong experience with OpenStack architecture and services (Nova, Neutron, Cinder, Keystone, Glance, etc.). Knowledge of NFV architecture, ETSI standards, and VIM (Virtualized Infrastructure Manager). Hands-on experience with containerization platforms like Kubernetes or OpenShift. Familiarity with SDN (Software-Defined Networking) solutions such as OpenDaylight or Tungsten Fabric. Experience with Linux-based systems and scripting languages (Python, Bash, etc.). Understanding of networking protocols (e.g., VXLAN, BGP, OVS, SR-IOV). Knowledge of Ceph or other distributed storage solutions. Tools: Experience with monitoring and logging tools (Prometheus, Grafana, ELK Stack, etc.). Configuration management tools like Ansible, Puppet, or Chef. Proficiency in CI/CD tools (Jenkins, GitLab CI, etc.). Certifications (Preferred): OpenStack Certified Administrator (COA). Red Hat Certified Engineer (RHCE). VMware Certified Professional (VCP). Kubernetes certifications like CKA or CKAD.
Posted 1 week ago
4.0 - 6.0 years
6 - 10 Lacs
Gurugram
Work from Office
Strong experience with OpenStack architecture and services (Nova, Neutron, Cinder, Keystone, Glance, etc.). Knowledge of NFV architecture, ETSI standards, and VIM (Virtualized Infrastructure Manager).Experience with monitoring and logging tools (Prometheus, Grafana, ELK Stack, etc.). Configuration management tools like Ansible, Puppet, or Chef. Proficiency in CI/CD tools (Jenkins, GitLab CI, etc
Posted 1 week ago
4.0 - 6.0 years
6 - 8 Lacs
Gurugram
Work from Office
Technical Expertise: Strong experience with OpenStack architecture and services (Nova, Neutron, Cinder, Keystone, Glance, etc.). Knowledge of NFV architecture, ETSI standards, and VIM (Virtualized Infrastructure Manager). Hands-on experience with containerization platforms like Kubernetes or OpenShift. Familiarity with SDN (Software-Defined Networking) solutions such as OpenDaylight or Tungsten Fabric. Experience with Linux-based systems and scripting languages (Python, Bash, etc.). Understanding of networking protocols (e.g., VXLAN, BGP, OVS, SR-IOV). Knowledge of Ceph or other distributed storage solutions. Tools: Experience with monitoring and logging tools (Prometheus, Grafana, ELK Stack, etc.). Configuration management tools like Ansible, Puppet, or Chef. Proficiency in CI/CD tools (Jenkins, GitLab CI, etc.). Certifications (Preferred): OpenStack Certified Administrator (COA). Red Hat Certified Engineer (RHCE). VMware Certified Professional (VCP). Kubernetes certifications like CKA or CKAD.
Posted 1 week ago
5.0 - 8.0 years
6 - 16 Lacs
Hyderabad, Bengaluru, Mumbai (All Areas)
Work from Office
Job Title : DevOps Engineer Location: Mumbai/Bangalore/Chennai/Delhi NCR/Hyderabad Experience Required: 5+ Years Job Description Key Responsibilities: • Implement and maintain the cloud infrastructure • Ensure the smooth operation of environment • Evaluate new technologies in the field of infrastructure automation and cloud computing • Look for opportunities to improve performance, reliability and automation • Provide DevOps capability to team mates and customers • Perform code deployments • Release management activities • Resolve incidents and change requests • Document Solutions and communicate it to the users • Perform optimizations on existing solutions • Diagnose, troubleshoot, and resolve ensuring smooth operation of services. • Shows attitude and aptitude for owning responsibility of own work done and collaborate with other team member in their activities • Updates job knowledge by self-learning or participating in learning initiatives provided by organization Required Skills & Qualifications: • Bachelors degree in IT, computer science, computer engineering, or similar • 6 years of Overall experience with 3+years as Devops Engineer • Advanced experience with Cloud Infrastructure / Cloud Services (preferable on Microsoft Azure) • Container Orchestration (Kubernetes, docker ,Helm) • Experience with Linux incl. Scripting (Bash, Python) • Log and metrics management (ELK Stack), Monitoring (Prometheus,loki,Grafana,dynatrace) • Infrastructure as code / Deployment and configuration automation ( Terraform) • Continuous Integration / Continuous Delivery (Gitlab CI, Jenkins, Nexus etc.) • Infrastructure Security Principles • Advance experience in Helm and CI/CD pipelines • Advance Experience in configuration of DevOps Tools such as Jenkins ,sonarqube, Nexus etc • Exposure to SDLC & Agile process • Experience with SSO integrations • Knowledgeable on AI tools & efficient usage in day to day work • Attitude, Soft & Communication Skills • Experience in handling technically critical escalated situations, drive team of experts & come-up with best-in-class workarounds / solutions • Critical thinking generated by observation, experience, reflection, reasoning, and communication. • DevOps mindset (you build it, you run it; taking e2e responsibility and accountability) • Able to demonstrate how customer centric thinking is expressed and reinforced through the digital product design processes • Fluent English (written and spoken) is a must, other languages (e.g. German, French, etc.) are a plus. Nice to Have: • Knowledge in python • Databases (e.g. PostgreSQL, Elasticsearch)
Posted 1 week ago
3.0 - 7.0 years
0 Lacs
karnataka
On-site
Genpact (NYSE: G) is a global professional services and solutions firm dedicated to creating outcomes that shape the future. With a workforce of over 125,000 individuals spread across more than 30 countries, our team is distinguished by an innate curiosity, entrepreneurial agility, and a commitment to delivering enduring value to our clients. Fueled by our purpose of the relentless pursuit of a world that works better for people, we engage with and transform leading enterprises, including the Fortune Global 500, leveraging our profound business and industry expertise, digital operations services, and proficiency in data, technology, and AI. We are currently seeking applications for the position of Lead Consultant, Python Developer. As a Lead Consultant, your primary responsibility will be to deliver Enhancement & Development services within a back-end environment. You will play a crucial role in designing, testing, and maintaining UI applications while collaborating closely with cross-functional teams to provide robust software solutions. **Responsibilities:** - Design and execute large-scale backend infrastructure and APIs. - Develop high-quality code that is resilient, easily readable, and scalable. - Demonstrate a strong commitment to delving deep into challenges, thriving, and making progress even in ambiguous situations. - Foster and facilitate knowledge sharing within the team and external groups. - Operate within an agile environment that prioritizes the most critical deliverables for our clients. - Hands-on experience in Python, NoSQL databases such as MongoDB or ElasticSearch, caching technologies like Redis or Memcached, and streaming technologies like Kafka or RabbitMQ. - Hold a Bachelor's Degree in Computer Science or a related field with more than 3 years of work experience, or a Master's Degree with over 3 years of experience in Software Development. - Possess solid Computer Science fundamentals in Data Structures, Algorithms, Complexity Analysis, Object-Oriented Design, and the design of Large Scale Data-Intensive Applications. - Excellent analytical and communication skills, including the ability to communicate effectively with both technical and business audiences and collaborate on a global scale. **Qualifications we seek in you!** **Minimum Qualifications/Skills:** - BE/B Tech/MCA/MBA degree. - Exceptional written and verbal communication abilities. - Strong problem-solving skills. **Preferred Qualifications/Skills:** - Proficiency in Java, Django, Tornado, or Flask frameworks. - Experience with ELK Stack (Elasticsearch, Logstash, Kibana) or Prometheus + Grafana. - Knowledge of Linux, Bash, JSON, and SQL. - Familiarity with Credit products such as corporate bonds/loans, credit default swaps, total return swaps is a plus. - Experience working with cloud computing systems. - Proficiency in networking, including TCP, HTTP, DNS, SSL certificates. - Understanding of Slang/SecDB. **Job Details:** - Job Title: Consultant - Primary Location: India-Bangalore - Schedule: Full-time - Education Level: Bachelor's/Graduation/Equivalent - Job Posting: Sep 2, 2024, 12:40:31 AM - Unposting Date: Feb 28, 2025, 7:10:31 PM - Master Skills List: Consulting - Job Category: Full Time,
Posted 1 week ago
5.0 - 9.0 years
0 Lacs
pune, maharashtra
On-site
As a QA Automation Architect at Nitor Infotech, an Ascendion company, you will be responsible for designing and implementing comprehensive test automation frameworks using tools such as Playwright, pytest, and Cucumber. Your role will involve developing and executing performance tests utilizing Apache JMeter, as well as integrating automated tests with CI/CD pipelines through GitHub Actions. Utilizing Azure monitoring and services will be crucial to ensure the reliability and performance of applications. In addition to test automation, you will implement and manage Infrastructure as Code (IaC) testing using Terraform. Setting up and maintaining logging and monitoring systems using ELK, Prometheus, and Grafana will be part of your responsibilities. Collaboration with development, DevOps, and product teams is essential to guarantee high-quality releases. Regular code reviews and mentorship to QA engineers will also be expected from you. Ensuring comprehensive test coverage for various Azure services such as Azure Cloud CDN, Azure Service Bus, Azure Postgres, Cosmos DB, Blob Storage, and Azure Vnet will be a key aspect of your role. Identifying and addressing gaps in test coverage and testing processes, as well as developing and maintaining documentation for test plans, test cases, and test scripts, are critical tasks. Participation in Agile/Scrum processes, including sprint planning, daily stand-ups, and retrospectives, is part of the collaborative environment at Nitor Infotech. To be successful in this role, you should possess a Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent work experience. Proven experience as a QA Automation Architect or in a similar role is required. Strong proficiency in Playwright, pytest, and Cucumber, as well as experience with Apache JMeter for performance testing, are essential skills. Proficiency in integrating automated tests with GitHub Actions, in-depth knowledge of Azure monitoring and services, experience with Infrastructure as Code (IaC) testing using Terraform, and familiarity with logging and monitoring tools like ELK, Prometheus, and Grafana are also necessary. Strong problem-solving skills, attention to detail, excellent communication, and leadership skills are attributes that will contribute to your success in this role.,
Posted 1 week ago
4.0 - 8.0 years
0 Lacs
noida, uttar pradesh
On-site
As a highly experienced and motivated Backend Solution Architect, you will be responsible for leading the design and implementation of robust, scalable, and secure backend systems. Your expertise in Node.js and exposure to Python will be crucial in architecting end-to-end backend solutions using microservices and serverless frameworks. You will play a key role in ensuring scalability, maintainability, and security, while also driving innovation through the integration of emerging technologies like AI/ML. Your primary responsibilities will include designing and optimizing backend architecture, managing AWS-based cloud solutions, integrating AI/ML components, containerizing applications, setting up CI/CD pipelines, designing and optimizing databases, implementing security best practices, developing APIs, monitoring system performance, and providing technical leadership and collaboration with cross-functional teams. To be successful in this role, you should have at least 8 years of backend development experience with a minimum of 4 years as a Solution/Technical Architect. Your expertise in Node.js, AWS services, microservices, event-driven architectures, Docker, Kubernetes, CI/CD pipelines, authentication/authorization mechanisms, and API development will be critical. Additionally, hands-on experience with AI/ML workflows, React, Next.js, Angular, and AWS Solution Architect Certification will be advantageous. At TechAhead, a global digital transformation company, you will have the opportunity to work on cutting-edge AI-first product design thinking and bespoke development solutions. By joining our team, you will contribute to shaping the future of digital innovation worldwide and driving impactful results with advanced AI tools and strategies.,
Posted 1 week ago
3.0 - 7.0 years
0 Lacs
udaipur, rajasthan
On-site
As a Backend Engineer (SDE-II) at YoCharge, a leading Electric Vehicle Charging & Energy Management SaaS startup, you will play a crucial role in scaling YoCharge's back-end platform and services to facilitate smart charging for electric vehicles. Based in Udaipur, this full-time on-site position offers an exciting opportunity to contribute to the advancement of the EV and Energy domain on a global scale. With 3-5 years of backend development experience using Python, Django, and FastAPI, you will leverage your expertise in WebSockets, async programming, and real-time APIs to enhance the efficiency and effectiveness of YoCharge's operations. Experience in scaling high-traffic distributed systems and familiarity with OCPP protocols and EV charging infrastructure will be beneficial. Your proficiency in SQL & NoSQL databases, such as PostgreSQL, Redis, and time-series databases, will be essential for optimizing performance, while your knowledge of DevOps practices, CI/CD pipelines, containerization (Docker, Kubernetes), and cloud services (AWS/Azure/GCP) will ensure seamless operations. Additionally, experience with monitoring and logging tools like Prometheus, Grafana, and ELK Stack will be advantageous. If you have prior exposure to IoT, energy management systems, or smart grid technologies, it will be considered a valuable asset. A Bachelor's degree in Computer Science or a related field is required, along with excellent communication skills, strong teamwork abilities, and the capacity to thrive in a dynamic and fast-paced environment while meeting deadlines. If you have a passion for Electric Vehicles & Energy, enjoy building innovative products, thrive in startup environments, and have experience in developing solutions at scale, you are the ideal candidate for this role at YoCharge. In return, you will have the opportunity to work on cutting-edge EV and clean energy solutions that are shaping the future of mobility, tackle real-world scalability and ML challenges, and collaborate with a diverse team of engineers, data scientists, and industry experts. Furthermore, competitive salary, performance-driven incentives, and ample growth opportunities await you as part of the YoCharge team.,
Posted 1 week ago
5.0 - 9.0 years
0 Lacs
delhi
On-site
You are a skilled Senior AWS DevOps Engineer with 5 to 8 years of experience in DevOps, cloud computing, and infrastructure engineering. You will play a crucial role in our team by leveraging your expertise in AWS cloud services, infrastructure automation, CI/CD pipelines, and security best practices to design, implement, and manage scalable, secure, and reliable cloud-based solutions. Your responsibilities will include architecting, building, and maintaining highly scalable AWS infrastructure, managing CI/CD pipelines using tools like Jenkins, Bitbucket, or AWS CodePipeline, and developing Infrastructure as Code (IaC) using Terraform, CloudFormation, or AWS CDK. You will automate deployment, monitoring, and scaling of applications and infrastructure while optimizing cloud costs and performance through effective resource management and scaling strategies. As a Senior AWS DevOps Engineer, you will also manage Kubernetes clusters (EKS) and containerized applications using Docker, monitor system performance, troubleshoot issues, and enforce security best practices such as IAM policies, network security, and compliance with industry standards. Collaboration with developers, architects, and security teams will be essential to enhance DevOps best practices and drive continuous improvement in deployment efficiency and system resilience. To excel in this role, you should possess expertise in AWS services like EC2, S3, Lambda, RDS, IAM, VPC, CloudWatch, ECS, and EKS, proficiency in IaC tools, strong knowledge of Kubernetes and container orchestration, and proficiency in scripting and automation using languages like Python, Bash, or Go. Experience with CI/CD pipelines, monitoring and logging tools, networking, security best practices, IAM policies, and configuration management tools will be beneficial. Experience in Agile/Scrum development environments and AWS certifications such as AWS Certified DevOps Engineer Professional are preferred qualifications. Knowledge of serverless architectures and Service Mesh architectures will also be advantageous for this role. If you are a proactive problem solver with a passion for optimizing cloud performance and cost, we look forward to welcoming you to our team as our Senior AWS DevOps Engineer.,
Posted 1 week ago
5.0 - 12.0 years
0 Lacs
karnataka
On-site
Job Description: As an Engineering Manager, you will lead a high-performing team of 8-12 engineers and engineering leads in the end-to-end delivery of software applications through sophisticated CI/CD pipelines. Your role involves mentoring engineers to build scalable, resilient, and robust cloud-based solutions for Walmart's suite of products, contributing to quality and agility. Within Enterprise Business Services, the Risk Tech/Financial Services Compliance team focuses on designing, developing, and operating large-scale data systems and real-time applications. The team works on creating pipelines, aggregating data on Google Cloud Platform, and collaborating with various teams to provide technical solutions. Key Responsibilities: - Manage a team of engineers and engineering leads across multiple technology stacks, including Java, NodeJS, and Spark with Scala on GCP. - Drive design, development, and documentation processes. - Establish best engineering and operational practices based on product and scrum metrics. - Interact with Walmart engineering teams globally, contribute to the tech community, and collaborate with product and business stakeholders. - Work with senior leadership to plan the future roadmap of products, participate in hiring and mentoring, and lead technical vision and roadmap development. - Prioritize feature development aligned with strategic objectives, establish clear expectations with team members, and engage in organizational events. - Collaborate with business owners and technical teams globally, and develop mid-term technical vision and roadmap to meet future requirements. Qualifications: - Bachelor's/Master's degree in Computer Science or related field with a minimum of 12+ years of software development experience and at least 5+ years of managing engineering teams. - Experience in managing agile technology teams, building Java, Scala-Spark backend systems, and working in cloud-based solutions. - Proficiency in JavaScript, NodeJS, ReactJS, NextJS, CS Fundamentals, Microservices, Data Structures, and Algorithms. - Strong skills in CI/CD development environments/tools, writing modular and testable code, microservices architecture, and working with relational and NoSQL databases. - Hands-on experience with technologies like Spring Boot, concurrency, RESTful services, and cloud platforms such as Azure, GCP. - Knowledge of containerization tools like Docker, Kubernetes, and monitoring/alert tools like Prometheus, Splunk. - Ability to lead a team, contribute to technical design, and collaborate across geographies. About Walmart Global Tech: Walmart Global Tech is a team of software engineers, data scientists, and service professionals at the forefront of retail disruption. We innovate to impact millions and reimagine the future of retail, offering opportunities for personal growth, skill development, and innovation at scale. Flexible Work Approach: Our hybrid work model combines in-office and virtual presence, ensuring collaboration, flexibility, and personal development opportunities across our global team. Benefits: In addition to competitive compensation, we offer incentive awards, best-in-class benefits, maternity/paternal leave, health benefits, and more. Equal Opportunity Employer: Walmart, Inc. is committed to diversity, inclusivity, and valuing unique identities, experiences, and opinions. We strive to create an inclusive environment where all individuals are respected and valued. Minimum Qualifications: - Bachelor's degree in computer science or related field with 5 years of experience in software engineering or 7 years of experience in software engineering with 2 years of supervisory experience. Preferred Qualifications: - Master's degree in computer science or related field with 3 years of experience in software engineering. Location: Pardhanani Wilshire II, Cessna Business Park, Kadubeesanahalli Village, Varthur Hobli, India R-1998235.,
Posted 1 week ago
5.0 - 9.0 years
0 Lacs
noida, uttar pradesh
On-site
The client's product enables the utilization of customer data through cutting-edge technologies to: - Enhance understanding of customer behavior to a previously unattainable level. - Determine the exact impact of advertising and promotions. - Create real-time profiles of customer segments. - Uncover the relationship between team member performance and customer loyalty. You should have: - Over 5 years of commercial experience as a DevOps professional. - Practical experience in cloud infrastructure provisioning, deployment, and monitoring on Azure for at least 2 years. - Strong familiarity with best DevOps practices and methodologies. - Good understanding of Computer Science and Computing Theory, including network interactions, protocols, deployment patterns, security patterns, software architecture (e.g., microservices, event-driven design), orchestration, and containerization (Docker, Kubernetes). - Hands-on experience with Infrastructure as Code (IaC), especially with ARM templates/Terraform. - Knowledge of logging and monitoring technologies like Zabbix, NewRelic, PagerDuty, Prometheus, and ELK stack. - Experience with CI/CD processes using AzureDevOps, Docker, Kubernetes (AKS), and product services written in .NET. - Proficiency in different delivery methodologies such as SCRUM, Agile, and Kanban. - Upper-Intermediate English language skills. Desirable qualifications include certifications in Azure and Kubernetes, along with practical experience in data engineering, Big Data stack, high-load systems, and microservices in a production environment. As part of the DevOps team, your responsibilities will include: - Collaborating on the creation of Azure infrastructure and setting up K8s clusters (AKS). - Managing CI/CD pipelines and automation processes. - Overseeing release management and infrastructure maintenance. - Participating in decision-making regarding infrastructure design. - Creating and managing dashboards for environments/builds. - Ensuring security controls do not adversely affect production by working with architects and developers. - Communicating effectively with various stakeholders including PM, PO, software developers, architects, and QA. GlobalLogic offers a stimulating work environment with diverse projects in industries like High-Tech, communication, media, healthcare, retail, and telecom. You will have the opportunity to collaborate with a talented team and enjoy work-life balance, professional development programs, competitive benefits, and fun perks. About GlobalLogic: GlobalLogic is a digital engineering leader that helps brands worldwide design and develop innovative products and digital experiences. Headquartered in Silicon Valley, GlobalLogic operates globally, assisting clients across various industries to envision and realize digital transformations.,
Posted 1 week ago
4.0 - 8.0 years
0 Lacs
pune, maharashtra
On-site
The L2 NOC Engineer position based in Pune requires 4-8 years of experience. As an L2 NOC Engineer, you will be responsible for monitoring and maintaining the health and performance of the IoT satellite network infrastructure. This includes overseeing various components such as UE devices, firewalls, gateways, and cloud platforms to ensure stability and efficiency. You will also be involved in alarm monitoring, ticket resolution, and acting as the primary point of contact for the Network Operations Center. Your role will involve promptly identifying issues, correlating them, and initiating troubleshooting procedures. You will assess the potential impact of these issues and coordinate with relevant teams for quick and effective resolutions. Collaborating with engineering teams to troubleshoot and resolve issues efficiently is a key aspect of this position. Additionally, you should have a fundamental understanding of the Google Cloud platform. Other responsibilities include evaluating network performance, creating improvement strategies, monitoring network performance and uptime, and providing Tier 2 technical assistance to internal and external clients. You will be required to maintain documentation on network architecture, configuration, and troubleshooting procedures, as well as participate in an on-call rotation for after-hours support. To excel in this role, you should have a strong grasp of Network Operations Center best practices, a Bachelor's degree in Computer Science, Information Technology, or a related field, and a minimum of 4 years of experience in Network Operations, preferably within the telecommunications or wireless network domains. Proficiency in Kubernetes architecture and components, along with setting up monitoring and logging solutions for Kubernetes clusters, is essential. Strong Linux system administration skills, including command line, user and group management, and understanding of Linux networking concepts, are also required. Effective communication skills are crucial for this position.,
Posted 1 week ago
3.0 - 7.0 years
0 Lacs
karnataka
On-site
As a DevOps Engineer at Facility & Energy Management, you will be an integral part of a dedicated team of innovators and problem solvers working towards sustainable impact. You will play a crucial role in crafting cutting-edge software solutions that help facilities optimize energy consumption, reduce costs, and enhance operational efficiency. If you have a passion for technology and delivering high-quality software solutions, this opportunity is for you! In this role, you will be responsible for designing and implementing comprehensive infrastructure plans and strategies. Your collaboration with cross-functional teams, including product owners, designers, and architects, will ensure alignment and innovation in software development. Additionally, you will have the opportunity to mentor junior DevOps engineers, fostering their growth and ensuring adherence to best practices and DevOps standards. You will work in a dynamic environment with more than 30 colleagues, striving to create market-leading software solutions for Facility & Energy Management. Your work will involve utilizing cutting-edge technologies and processes such as Kubernetes, Docker, AI, Pair Programming, Mob Programming, Continuous Integration, Continuous Learning, and Microservices. Key Responsibilities: - Develop and maintain infrastructure as code (IaC) using tools like Terraform & Azure Bicep - Implement and maintain Continuous Integration and Continuous Delivery (CI/CD) pipelines (Github CI) - Perform system monitoring using Prometheus and Grafana - Manage log management and distributed tracing with Tempo, Azure Application Insights, OpenTelemetry, and Grafana - Manage cloud environments, primarily Azure with exposure to AWS - Utilize Docker and Kubernetes for containerization and orchestration Ideal Candidate Requirements: - Bachelor's degree or higher in computer science/information science or equivalent - Experience with Docker and Kubernetes - Proficiency in SQL Server, Azure Database, and PostgreSQL - Solid understanding of infrastructure and automation programming Nice-to-Have Skills: - Experience with C# development on .Net tech stack - Familiarity with microservices architecture - Experience with Kafka - Experience in setting up and monitoring Azure App Services, Function App & Logic App If you are ready to dive into the world of cutting-edge technology, drive innovation at lightning speed, and contribute to a sustainable and efficient future, we would love to connect with you for this exciting opportunity. Apply now and be part of a team that is shaping the future of Facility & Energy Management software solutions.,
Posted 1 week ago
3.0 - 7.0 years
0 Lacs
karnataka
On-site
As a DevOps Engineer / Site Reliability Engineer (SRE) at our client's new healthcare company, you will play a crucial role in ensuring the reliability, scalability, and performance of infrastructure and applications. Your responsibilities will include designing and implementing CI/CD pipelines, managing cloud infrastructure, building monitoring and alerting systems, collaborating with development teams, and ensuring system security and compliance. You will have the opportunity to work with dynamic teams to pioneer game-changing innovations at the intersection of health, material, and data science. By leveraging your expertise in cloud platforms, containerization tools, Infrastructure as Code (IaC), CI/CD tools, monitoring and logging tools, scripting languages, networking, and security protocols, you will contribute to the betterment of patients" lives and the optimization of healthcare professionals" workflows. To succeed in this role, you should possess a Bachelor's Degree or higher in Computer Science, Engineering, or a related field, along with at least 3 years of experience in a DevOps or Site Reliability Engineering role in a cloud environment. Strong proficiency in cloud platforms like AWS, Azure, Google Cloud, containerization tools such as Docker and Kubernetes, IaC tools like Terraform, CloudFormation, or Ansible, CI/CD tools, version control systems, monitoring and logging tools, and scripting languages is essential. Additionally, familiarity with Agile methodologies, DevSecOps practices, automated testing, problem-solving skills for troubleshooting complex production issues, and cost optimization practices in cloud environments will further enhance your success in this role. By providing technical guidance to team members and stakeholders, you will contribute to the continuous improvement of system reliability, automation, and scalability. In summary, as a DevOps Engineer / Site Reliability Engineer (SRE) at our client's new healthcare company, you will have the opportunity to make a significant impact by ensuring the seamless operation of critical healthcare infrastructure and applications through innovative solutions and best practices.,
Posted 2 weeks ago
7.0 - 12.0 years
8 - 13 Lacs
Bengaluru
Work from Office
Site Reliability Engineer - System Expeience7+ Years Summary We are seeking a skilled and proactive Site Reliability Engineer (SRE) to join ourteam. The ideal candidate will have extensive experience in Linux systemsadministration, understanding of database management, and a proven trackrecord of troubleshooting complex, system-level issues. You will be responsiblefor ensuring the reliability, performance, and scalability of our productionenvironments, balancing system and database stability through robustmonitoring, debugging, and automation practices. Responsibilities: Lead incident response and resolution: Proactively troubleshoot, debug,and resolve complex system-level incidents and outages, encompassingLinux operating systems, applications, and database technologies. Conduct deep-dive root cause analysis: Perform thorough post-incident analysis to identify underlying issues in production environments, implementing sustainable solutions. Design and implement robust monitoring: Develop, maintain, andenhance comprehensive system and database monitoring, alerting, andobservability solutions (e.g., Grafana, Prometheus, PMM). Drive automation and efficiency: Automate Linux system administrationtasks, operational runbooks, and database maintenance to improvesystem reliability, consistency, and operational efficiency. Collaborate on resilient deployments: Partner with development andengineering teams to ensure seamless, reliable, and secure softwaredeployments and infrastructure changes. Architect scalable infrastructure: Contribute to the architectural designand implementation of highly scalable, resilient, and performantinfrastructure solutions. Enhance on-call effectiveness: Participate in and continuously improveon-call rotations, developing tools and processes to reduce alert fatigueand minimize human error. Foster technical growth: Mentor and guide junior Site ReliabilityEngineers (SREs), promoting knowledge sharing and skill developmentwithin the team. Qualifications: Extensive Linux Expertise: Proven experience in advanced Linux systems administration, including deep understanding of file systems, kernel tuning (Sysctl), and performance optimization. Advanced Troubleshooting & Debugging: Exceptional ability to debugand rapidly resolve complex, distributed system-level issues inhigh-pressure production environments. Configuration Management: Hands-on experience with industry-standardconfiguration management tools (e.g., SaltStack, Ansible, Puppet). Load Balancing & Proxying: Practical experience with load balancing technologies (e.g., Nginx, HAProxy, LVS) and their configuration for highavailability. Containerization & Orchestration Strong understanding and practicalexperience with containerization (e.g., Docker) and container orchestrationplatforms (e.g., Kubernetes, Mesosphere). Monitoring & Alerting Tooling Proficiency in implementing, maintaining,and leveraging system and database monitoring platforms (e.g., Grafana,Prometheus, PMM) and custom scripting for alerts. Automation & Scripting Mastery: Highly proficient in developingautomation solutions using scripting languages (e.g., Python, Shellscripting, Go) for operational tasks. Networking Fundamentals: Solid understanding of core networkingconcepts and protocols (e.g., TCP/IP, DNS, DHCP, BGP, IPTables, IP &Routing protocols). Database Administration Fundamentals: Strong grasp of relationaldatabase concepts and practical experience with database administrationprinciples. Preferred Qualifications Cloud Infrastructure Experience: Experience managing and troubleshooting private/on-premise cloud environments, with a focus on identifying and mitigating hardware-related issues and their impact. Relational Database Specialization: Deep practical experience withMariaDB, Percona Server, and/or MySQL, encompassing advanceddatabase administration, performance tuning, and complex replicationtopologies. Backup & Recovery Expertise Hands-on experience with robust backupand restore technologies, including ZFS. Message Queuing Systems: Familiarity with message queuing systemslike RabbitMQ (RMQ). PhonePe Full Time Employee Benefits (Not applicable for Intern or Contract Roles) Insurance Benefits - Medical Insurance, Critical Illness Insurance, Accidental Insurance, Life Insurance Wellness Program - Employee Assistance Program, Onsite Medical Center, Emergency Support System Parental Support - Maternity Benefit, Paternity Benefit Program, Adoption Assistance Program, Day-care Support Program Mobility Benefits - Relocation benefits, Transfer Support Policy, Travel Policy Retirement Benefits - Employee PF Contribution, Flexible PF Contribution, Gratuity, NPS, Leave Encashment Other Benefits - Higher Education Assistance, Car Lease, Salary Advance Policy Working at PhonePe is a rewarding experience! Great people, a work environment that thrives on creativity, the opportunity to take on roles beyond a defined job description are just some of the reasons you should work with us. Read more about PhonePe on our blog. Life at PhonePe PhonePe in the news
Posted 2 weeks ago
7.0 - 12.0 years
10 - 14 Lacs
Bengaluru
Work from Office
Site Reliability Engineer - Database Experience7+ Years We are seeking a highly skilled and experienced SRE Engineer (7+ years of experience) with deep expertise in MySQL database administration and a solid foundation in Linux systems engineering. You will play a critical role in ensuring the resilience, scalability, and performance of our distributed, high-volume database infrastructure spanning tens of terabytes of data across multiple data centers. In this role, you will be expected to design, build, and lead initiatives to improve reliability and efficiency across the database stack, mentor SRE/DBA team members, and drive strategic improvements to infrastructure. Responsibilities Database Architecture & Management: Lead the design, provisioning, and lifecycle management of large-scale MySQL/Galera multi-master clusters across multiple geographic locations. Reliability Engineering: Develop and implement database reliability strategies, including automated failure recovery and disaster recovery solutions. Troubleshooting & Support: Investigate and resolve database-related issues, including performance problems, connectivity issues, and data corruption. Performance, optimization & Security: Own and continuously improve performance tuning, including query optimization, indexing, and resource management, security hardening, and high availability of database systems. Operational Excellence : Standardize and automate database operational tasks such as upgrades, backups, schema changes, and replication management. Drive capacity planning , monitoring, and incident response across infrastructure. Incident Management Proactively identify, diagnose, and resolve complexproduction issues in collaboration with the engineering team. On-Call & Tooling: Participate in and enhance on-call rotations, implementing tools to reduce alert fatigue and human error. Develop and maintain observability tooling for database systems. Leadership & Mentorship: Mentor and guide junior and mid-level SREs andDBAs, fostering knowledge sharing and skill development within the team. Skills and Qualifications Core Expertise: Expertise in Linux systems administration, scripting (Bash/Python), filesystems, disk management, and debugging system-level performanceissues. 78+ years of hands-on experience in MySQL database administration inlarge-scale, high-availability environments. Deep understanding of MySQL internals, InnoDB storage engine,replication mechanisms (async, semi-sync, Galera), and tuningparameters. Proven experience managing 100+ production clusters and databaseslarger than 1TB in size. Preferred Experience: Hands-on experience with Galera clusters is a strong plus. Familiarity with Infrastructure-as-Code tools like Ansible, Terraform, orsimilar. Experience with observability tools such as Prometheus, Grafana, orPercona Monitoring & Management. Exposure to other NOSQL (e.g., Aerospike) will be a plus. Experience working in on-premise environments is highly desirable. Leadership & Communication: Proven ability to lead cross-functional initiatives, including databasemigrations, major version upgrades, and scaling efforts. Excellent communication skills with a demonstrated track record ofmentoring and technical leadership. PhonePe Full Time Employee Benefits (Not applicable for Intern or Contract Roles) Insurance Benefits - Medical Insurance, Critical Illness Insurance, Accidental Insurance, Life Insurance Wellness Program - Employee Assistance Program, Onsite Medical Center, Emergency Support System Parental Support - Maternity Benefit, Paternity Benefit Program, Adoption Assistance Program, Day-care Support Program Mobility Benefits - Relocation benefits, Transfer Support Policy, Travel Policy Retirement Benefits - Employee PF Contribution, Flexible PF Contribution, Gratuity, NPS, Leave Encashment Other Benefits - Higher Education Assistance, Car Lease, Salary Advance Policy Working at PhonePe is a rewarding experience! Great people, a work environment that thrives on creativity, the opportunity to take on roles beyond a defined job description are just some of the reasons you should work with us. Read more about PhonePe on our blog. Life at PhonePe PhonePe in the news
Posted 2 weeks ago
6.0 - 11.0 years
12 - 16 Lacs
Bengaluru
Work from Office
About the Role: This role is responsible for managing and maintaining complex, distributed big data ecosystems. It ensures the reliability, scalability, and security of large-scale production infrastructure. Key responsibilities include automating processes, optimizing workflows, troubleshooting production issues, and driving system improvements across multiple business verticals. Roles and Responsibilities: Manage, maintain, and support incremental changes to Linux/Unix environments. Lead on-call rotations and incident responses, conducting root cause analysis and driving postmortem processes. Design and implement automation systems for managing big data infrastructure, including provisioning, scaling, upgrades, and patching clusters. Troubleshoot and resolve complex production issues while identifying root causes and implementing mitigating strategies. Design and review scalable and reliable system architectures. Collaborate with teams to optimize overall system performance. Enforce security standards across systems and infrastructure. Set technical direction, drive standardization, and operate independently. Ensure availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning. Resolve, analyze, and respond to system outages and disruptions and implement measures to prevent similar incidents from recurring. Develop tools and scripts to automate operational processes, reducing manual workload, increasing efficiency and improving system resilience. Monitor and optimize system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. Collaborate with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle. Stay informed of industry technology trends and innovations, and actively contribute to the organization's technology communities. Develop and enforce SRE best practices and principles. Align across functional teams on priorities and deliverables. Drive automation to enhance operational efficiency. Skills Required: Over 6 years of experience managing and maintaining distributed big data ecosystems. Strong expertise in Linux including IP, Iptables, and IPsec. Proficiency in scripting/programming with languages like Perl, Golang, or Python. Hands-on experience with the Hadoop stack (HDFS, HBase, Airflow, YARN, Ranger, Kafka, Pinot). Familiarity with open-source configuration management and deployment tools such as Puppet, Salt, Chef, or Ansible. Solid understanding of networking, open-source technologies, and related tools. Excellent communication and collaboration skills. DevOps toolsSaltstack, Ansible, docker, Git. SRE Logging and monitoring toolsELK stack, Grafana, Prometheus, opentsdb, Open Telemetry. Good to Have: Experience managing infrastructure on public cloud platforms (AWS, Azure, GCP). Experience in designing and reviewing system architectures for scalability and reliability. Experience with observability tools to visualize and alert on system performance. PhonePe Full Time Employee Benefits (Not applicable for Intern or Contract Roles) Insurance Benefits - Medical Insurance, Critical Illness Insurance, Accidental Insurance, Life Insurance Wellness Program - Employee Assistance Program, Onsite Medical Center, Emergency Support System Parental Support - Maternity Benefit, Paternity Benefit Program, Adoption Assistance Program, Day-care Support Program Mobility Benefits - Relocation benefits, Transfer Support Policy, Travel Policy Retirement Benefits - Employee PF Contribution, Flexible PF Contribution, Gratuity, NPS, Leave Encashment Other Benefits - Higher Education Assistance, Car Lease, Salary Advance Policy Working at PhonePe is a rewarding experience! Great people, a work environment that thrives on creativity, the opportunity to take on roles beyond a defined job description are just some of the reasons you should work with us. Read more about PhonePe on our blog. Life at PhonePe PhonePe in the news
Posted 2 weeks ago
5.0 - 7.0 years
12 - 16 Lacs
Bengaluru
Work from Office
About the Role As an SRE (5 to 7 years) (Big Data) Engineer at PhonePe, you will be responsible for ensuring the stability, scalability, and performance of distributed systems operating at scale. You will collaborate with development, infrastructure, and data teams to automate operations, reduce manual efforts, handle incidents, and continuously improve system reliability. This role requires strong problem-solving skills, operational ownership, and a proactive approach to mentoring and driving engineering excellence. Roles and Responsibilities Ensure the ongoing stability, scalability, and performance of PhonePes Hadoop ecosystem and associated services. Manage and administer Hadoop infrastructure including HDFS, HBase, Hive, Pig, Airflow, YARN, Ranger, Kafka, Pinot, and Druid. Automate BAU operations through scripting and tool development. Perform capacity planning, system tuning, and performance optimization. Set-up, configure, and manage Nginx in high-traffic environments. Administration and troubleshooting of Linux + Bigdata systems, including networking (IP, Iptables, IPsec). Handle on-call responsibilities, investigate incidents, perform root cause analysis, and implement mitigation strategies. Collaborate with infrastructure, network, database, and BI teams to ensure data availability and quality. Apply system updates, patches, and manage version upgrades in coordination with security teams. Build tools and services to improve observability, debuggability, and supportability. Participate in Kerberos and LDAP administration. Experience in capacity planning and performance tuning of Hadoop clusters. Work with configuration management and deployment tools like Puppet, Chef, Salt, or Ansible. Skills Required Minimum 1 year of Linux/Unix system administration experience. Over 4 years of hands-on experience in Hadoop administration. Minimum 1 years of experience managing infrastructure on public cloud platforms like AWS, Azure, or GCP (optional ) . Strong understanding of networking, open-source tools, and IT operations. Proficient in scripting and programming (Perl, Golang, or Python). Hands-on experience with maintaining and managing the Hadoop ecosystem components like HDFS, Yarn, Hbase, Kafka . Strong operational knowledge in systems (CPU, memory, storage, OS-level troubleshooting). Experience in administering and tuning relational and NoSQL databases. Experience in configuring and managing Nginx in production environments. Excellent communication and collaboration skills. Good to Have Experience designing and maintaining Airflow DAGs to automate scalable and efficient workflows. Experience in ELK stack administration. Familiarity with monitoring tools like Grafana, Loki, Prometheus, and OpenTSDB. Exposure to security protocols and tools (Kerberos, LDAP). Familiarity with distributed systems like elasticsearch or similar high-scale environments. PhonePe Full Time Employee Benefits (Not applicable for Intern or Contract Roles) Insurance Benefits - Medical Insurance, Critical Illness Insurance, Accidental Insurance, Life Insurance Wellness Program - Employee Assistance Program, Onsite Medical Center, Emergency Support System Parental Support - Maternity Benefit, Paternity Benefit Program, Adoption Assistance Program, Day-care Support Program Mobility Benefits - Relocation benefits, Transfer Support Policy, Travel Policy Retirement Benefits - Employee PF Contribution, Flexible PF Contribution, Gratuity, NPS, Leave Encashment Other Benefits - Higher Education Assistance, Car Lease, Salary Advance Policy Working at PhonePe is a rewarding experience! Great people, a work environment that thrives on creativity, the opportunity to take on roles beyond a defined job description are just some of the reasons you should work with us. Read more about PhonePe on our blog. Life at PhonePe PhonePe in the news
Posted 2 weeks ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
31458 Jobs | Dublin
Wipro
16542 Jobs | Bengaluru
EY
10788 Jobs | London
Accenture in India
10711 Jobs | Dublin 2
Amazon
8660 Jobs | Seattle,WA
Uplers
8559 Jobs | Ahmedabad
IBM
7988 Jobs | Armonk
Oracle
7535 Jobs | Redwood City
Muthoot FinCorp (MFL)
6170 Jobs | New Delhi
Capgemini
6091 Jobs | Paris,France