Name: Jobpe
Address: T-Hub, Plot No 1/C, Sy No 83/1, Raidurgam panmaktha, Knowledge City Rd, Hyderabad, Telangana, 500081, IN
Telephone: +91-83339-09630
Price range: Free

Senior Infra Engineer Softnotions Technologies Private Limited

0 years

0 Lacs

Trivandrum, Kerala, India

On-site

Job Role/Job Description (Please fill in the exact role expected out of the candidate) · Assist in Tier 3, 24x7 technology support for escalated Infrastructure Engineering issues. · Assist with strategic projects to design, deploy, manage and maintain infrastructure systems across the technology verticals of: Network, Storage, Compute, Virtualization, Telephony, Cloud Infrastructure and Automation & Orchestration. · Will assist with system monitoring, performance tuning, capacity planning, and lifecycle managment to ensure the highest level of system and infrastructure availability. · Work to expand working knowledge of infrastructure related security strategies and concepts. Apply that to knowledge to ensure systems meet or exceed standards. · Collaborate with vendors and other relevant IT personnel as required in support of enterprise applications, and hardware. · Work to increase individual technology skill proficiencies through continued education and training opportunities related to areas of responsibility and interest. · Maintain and respect all confidential information. · Demonstrate professional, friendly interaction with peers, members and vendors which are consistent with organizational values and philosophies. · Able to handle aggressive mission critical deadlines and dynamic workloads. · Self-starter · Familiarity with technical documentation standards, tools and processes · Critical thinker · Willingness to see out and learn from senior team mentors to improve overall understanding and skills. · Eagerness to take on tasks outside routine or comfort zone. · Effective communication skills both written and verbal. · Coordinating with US team and India team. · Promotes honest and open communication throughout the credit union. · Understanding of monitoring and alerting systems. · Self-sufficient troubleshooter able to seek out answers or solutions to technical challenges with minimal assistance. · Willingness to seek direction and guidance from teammates and managers. · Must be able to work in a team environment. · Must be detail oriented with ability to maintain and organize workloads. · Must be able to maintain highly confidential information. · Ability to communicate effectively in writing and orally. · Ability to resolve interpersonal conflict and miscommunications. · Plan and coordinate lifecycle and capacity management · What is the expectation from the candidate’s current role/profile? · Must and in-depth experience in Palo Alto. · Expert in Network Technologies: Cisco ACI, Nexus/Catalyst, AVI Load Balancer, Wiring. · Expert in Micro-segmentation experience (ACI/NSX). · Expertise in Telecom and networking (preference is i3) · Experience with Cisco UCS infrastructure. · Good hands-on in Storage Technologies: Pure, Cohesity. · Experience managing and implementing identity providers and federated identity services. (Microsoft ADFS, SAML 2.0, Azure Authentication and Conditional Access) · Experience architecting, deploying, and supporting technologies leveraging cloud-based platforms such as Microsoft Azure Networking. · Experience with scripting, automation, and orchestration technologies to create efficiencies. PowerShell, Python, Puppet/Chef, Ansible, Terraform, etc. · Experience performing enterprise level patch and vulnerability management. Familiarity with change management and ITSM.

Posted 4 days ago

Apply

Distinguished/Sr Tech Lead DevOps Thales

0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

Location: Noida Berger Tower, India Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more. More than 30,000 organizations already rely on us to verify the identities of people and things, grant access to digital services, analyze vast quantities of information and encrypt data to make the connected world more secure. Present in India since 1953, Thales is headquartered in Noida, Uttar Pradesh, and has operational offices and sites spread across Bengaluru, Delhi, Gurugram, Hyderabad, Mumbai, Pune among others. Over 1800 employees are working with Thales and its joint ventures in India. Since the beginning, Thales has been playing an essential role in India’s growth story by sharing its technologies and expertise in Defence, Transport, Aerospace and Digital Identity and Security markets. Looking for seasoned devops profession who is skilled in Deployment, Monitoring of complex Kubernetes/micro service architecture based applications on any Cloud Provider (Azure/AWS/GCP) . Experience : 9-14 Yrs. Essential Key Skills : Ability to demonstrate solid skills in Azure/AWS/GCP, Kubernetes, and Unix/Linux Platform. Ability to demonstrate knowledge about Cluster, Cloud/VM-based solution deployment, and management, including knowledge about networking, servers, and storage. Experience in DevOps with Kubernetes. Strong knowledge of CI/CD tools (Jenkins, Bamboo, etc.) Experience in cloud platforms and infrastructure automation. Expertise in Python or a similar scripting/language. Must have completed minimum one project end-to-end in a technical DevOps role, preferably in a global organization. Practical understanding of Ansible, Docker, and implementation of the solutions based on these tools is preferred. Ability to handle escalation. Demonstrated experience as an individual contributor with customer focus and service orientation, with solid leadership and coaching skills. Excellent written and verbal communication skills in English. At Thales we provide CAREERS and not only jobs. With Thales employing 80,000 employees in 68 countries our mobility policy enables thousands of employees each year to develop their careers at home and abroad, in their existing areas of expertise or by branching out into new fields. Together we believe that embracing flexibility is a smarter way of working. Great journeys start here, apply now!

Posted 4 days ago

Apply

Senior Application Support Engineer NiCE

4.0 years

0 Lacs

Pune, Maharashtra, India

Remote

At NiCE, we don’t limit our challenges. We challenge our limits. Always. We’re ambitious. We’re game changers. And we play to win. We set the highest standards and execute beyond them. And if you’re like us, we can offer you the ultimate career opportunity that will light a fire within you. At NiCE, we don’t limit our challenges. We challenge our limits. Always. We’re ambitious. We’re game changers. And we play to win. We set the highest standards and execute beyond them. And if you’re like us, we can offer you the ultimate career opportunity that will light a fire within you. So, what’s the role all about? We are looking for talented and motivated professionals, interested in delivering the latest in Application Operations (SaaS) using AWS, in a culture that encourages autonomous productive teams. You are Someone who loves learning how to configure backend systems, and infrastructure that will help us build our global SaaS platform. You have a comprehensive understanding of Amazon AWS platform and know how to light and put fires out. You are an engineer with confidence in his / her skill set, who is not afraid to look under the hood and break stuff to make stuff. If yes, then come be a part of NiCE Customer Services, a team of software engineers re-inventing Application Operations. How will you make an impact? You will get to team up with highly talented and highly motivated engineers and architects, using the latest in AWS, working on cutting edge technology. As a part of this team, you will be working in a fast-paced environment deploying, monitoring, automating & supporting a highly scalable real-time critical platform(s) impacting, millions of individuals & Billions of dollars. Implementing, configuring custom changes, and deploying new Application release upgrades Setup new environments & deploying solutions. Building proactive Monitoring & alerting service. Automation using ansible, python, Perl scripting. Setup & securing new Application instances Change management, Building deployment and rollback plans and procedures Creating and maintaining knowledge base for various technical resolutions Create and setup deployment scripts for different environments (i.e. Test properties vs Prod properties) Configure and optimize instances and web servers for optimal performance. (ex: adjusting default connection limits, adjusting request queuing thresholds) AWS troubleshooting support Support, Architect and Implement alongside Technical & Operations teams to meet our customers' individual needs for their infrastructure & application deployments. Work on critical, highly complex customer problems that will span multiple AWS services (dealing daily with high severity incidents). Help build and improve customer operations through scripts to automate and deploy AWS resources seamlessly with as little manual intervention as possible. Collaborate and help build utilities and tools for internal use that enable you and your fellow Engineers to operate safely at high speed / wide scale. Drive customer communication during critical events. Provide on-call off hour support and flexible to work in 24*7 shift environment Have you got what it takes? 4+ years of relevant experience Excellent hands-on experience in managing Application Support (3 tier/2 tier apps) Strong problem solving, analytical and communication skills Exposure in handling complex application performance issues Exposure to APM tools like AppDynamics, Dynatrace Excellent skills on managing containerized / cloud-based application with exposure to various cloud services (EC2, S3, IAM, ELB, VPC, VPN). Good experience in a DevOps environment / Operations team / Infrastructure Operations team. Excellent Troubleshooting skills OS level knowledge (Windows or Linux) Database skills ( SQL ,Oracle or Postgres / Casandra) Application Server ( skills on any of Middleware technologies e.g. – Tomcat , WebLogic , WebSphere) Ability to identify the underlying root cause of performance issues & mitigate bottlenecks Good understanding on Networking , Load balancers Good communication both written and verbal Exposure to scripting language (Ansible, Perl, Python, Ruby, Shell script, Powershell etc.) Experience in working with tools like OpsGenie, Nagios, Rundeck, Good understanding in Kubernetes. Cloud / Application level Security experience Experience in Banking & Financial domain Has worked in an Agile / Sprint development model. What’s in it for you? Join an ever-growing, market disrupting, global company where the teams – comprised of the best of the best – work in a fast-paced, collaborative, and creative environment! As the market leader, every day at NiCE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate, innovative, and excited to constantly raise the bar, you may just be our next NiCEr! Enjoy NiCE-FLEX! At NiCE, we work according to the NiCE-FLEX hybrid model, which enables maximum flexibility: 2 days working from the office and 3 days of remote work, each week. Naturally, office days focus on face-to-face meetings, where teamwork and collaborative thinking generate innovation, new ideas, and a vibrant, interactive atmosphere. Requisition ID: 8002 Reporting into: Tech Manager Role Type: Individual Contributor About NiCE NICE Ltd. (NASDAQ: NICE) software products are used by 25,000+ global businesses, including 85 of the Fortune 100 corporations, to deliver extraordinary customer experiences, fight financial crime and ensure public safety. Every day, NiCE software manages more than 120 million customer interactions and monitors 3+ billion financial transactions. Known as an innovation powerhouse that excels in AI, cloud and digital, NiCE is consistently recognized as the market leader in its domains, with over 8,500 employees across 30+ countries. NiCE is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, age, sex, marital status, ancestry, neurotype, physical or mental disability, veteran status, gender identity, sexual orientation or any other category protected by law.

Posted 4 days ago

Apply

Specialist Application Support Engineer NiCE

8.0 years

0 Lacs

Pune, Maharashtra, India

Remote

At NiCE, we don’t limit our challenges. We challenge our limits. Always. We’re ambitious. We’re game changers. And we play to win. We set the highest standards and execute beyond them. And if you’re like us, we can offer you the ultimate career opportunity that will light a fire within you. So, what’s the role all about? We are looking for talented and motivated professionals, interested in delivering the latest in Application Operations (SaaS) using AWS, in a culture that encourages autonomous productive teams. You are Someone who loves learning how to configure backend systems, and infrastructure that will help us build our global SaaS platform. You have a comprehensive understanding of Amazon AWS platform and know how to light and put fires out. You are an engineer with confidence in his / her skill set, who is not afraid to look under the hood and break stuff to make stuff. If yes, then come be a part of NiCE Customer Services, a team of software engineers re-inventing Application Operations. How will you make an impact? You will get to team up with highly talented and highly motivated engineers and architects, using the latest in AWS, working on cutting edge technology. As a part of this team, you will be working in a fast-paced environment deploying, monitoring, automating & supporting a highly scalable real-time critical platform(s) impacting, millions of individuals & Billions of dollars. Implementing, configuring custom changes, and deploying new Application release upgrades Setup new environments & deploying solutions. Building proactive Monitoring & alerting service. Automation using ansible, python, Perl scripting. Setup & securing new Application instances Change management, Building deployment and rollback plans and procedures Creating and maintaining knowledge base for various technical resolutions Create and setup deployment scripts for different environments (i.e. Test properties vs Prod properties) Configure and optimize instances and web servers for optimal performance. (ex: adjusting default connection limits, adjusting request queuing thresholds) AWS troubleshooting support Support, Architect and Implement alongside Technical & Operations teams to meet our customers' individual needs for their infrastructure & application deployments. Work on critical, highly complex customer problems that will span multiple AWS services (dealing daily with high severity incidents). Help build and improve customer operations through scripts to automate and deploy AWS resources seamlessly with as little manual intervention as possible. Collaborate and help build utilities and tools for internal use that enable you and your fellow Engineers to operate safely at high speed / wide scale. Drive customer communication during critical events. Provide on-call off hour support and flexible to work in 24*7 shift environment Have you got what it takes? 8+ years of relevant experience Excellent hands-on experience in managing Application Support (3 tier/2 tier apps) Strong problem solving, analytical and communication skills Exposure in handling complex application performance issues Exposure to APM tools like AppDynamics, Dynatrace Excellent skills on managing containerized / cloud-based application with exposure to various cloud services (EC2, S3, IAM, ELB, VPC, VPN). Good experience in a DevOps environment / Operations team / Infrastructure Operations team. Excellent Troubleshooting skills OS level knowledge (Windows or Linux) Database skills ( SQL ,Oracle or Postgres / Casandra) Application Server ( skills on any of Middleware technologies e.g. – Tomcat , WebLogic , WebSphere) Ability to identify the underlying root cause of performance issues & mitigate bottlenecks Good understanding on Networking , Load balancers Good communication both written and verbal Exposure to scripting language (Ansible, Perl, Python, Ruby, Shell script, Powershell etc.) Experience in working with tools like OpsGenie, Nagios, Rundeck, Good understanding in Kubernetes. Cloud / Application level Security experience Experience in Banking & Financial domain Has worked in an Agile / Sprint development model. What’s in it for you? Join an ever-growing, market disrupting, global company where the teams – comprised of the best of the best – work in a fast-paced, collaborative, and creative environment! As the market leader, every day at NiCE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate, innovative, and excited to constantly raise the bar, you may just be our next NiCEr! Enjoy NiCE-FLEX! At NiCE, we work according to the NiCE-FLEX hybrid model, which enables maximum flexibility: 2 days working from the office and 3 days of remote work, each week. Naturally, office days focus on face-to-face meetings, where teamwork and collaborative thinking generate innovation, new ideas, and a vibrant, interactive atmosphere. Requisition ID: 8003 Reporting into: Tech Manager Role Type: Individual Contributor About NiCE NICE Ltd. (NASDAQ: NICE) software products are used by 25,000+ global businesses, including 85 of the Fortune 100 corporations, to deliver extraordinary customer experiences, fight financial crime and ensure public safety. Every day, NiCE software manages more than 120 million customer interactions and monitors 3+ billion financial transactions. Known as an innovation powerhouse that excels in AI, cloud and digital, NiCE is consistently recognized as the market leader in its domains, with over 8,500 employees across 30+ countries. NiCE is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, age, sex, marital status, ancestry, neurotype, physical or mental disability, veteran status, gender identity, sexual orientation or any other category protected by law.

Posted 4 days ago

Apply

Software Engineer II Medtronic

5.0 years

0 Lacs

Pune, Maharashtra, India

On-site

At Medtronic you can begin a life-long career of exploration and innovation, while helping champion healthcare access and equity for all. You’ll lead with purpose, breaking down barriers to innovation in a more connected, compassionate world. A Day in the Life Medtronic is hiring a Software Engineer. As a Software engineer, you will be focused on building automation, developing API integrations, enhancing monitoring and observability capabilities, and providing overall software development support to the IT Operations team. The ideal candidate will have experience working with platforms such as Salesforce, Jira, and SAP, while also having a strong foundation in software engineering and a deep understanding of modern cloud environments. This role is pivotal in enabling the IT Operations team to improve reliability, scalability, and efficiency across our technology stack. This position is an exciting opportunity to work with Medtronic's Diabetes business. Medtronic has announced its intention to separate the Diabetes division to promote future growth and innovation within the business and reallocate investments and resources across Medtronic, subject to applicable information and consultation requirements. This separation provides our team with a bold opportunity to unleash our potential, enabling us to operate with greater speed and agility. As a separate entity, we anticipate leveraging increased investments to drive meaningful innovation and enhance our impact on patient care. Responsibilities may include the following and other duties may be assigned: Design, develop, and maintain automation scripts, frameworks, and tools to reduce manual tasks and enhance operational efficiency. Focus on integrating with platforms like Salesforce, Jira, and SAP. Build and maintain custom integrations with internal and external APIs, including Salesforce, Jira, and SAP, to streamline processes, improve data flow, and enable better monitoring and alerting capabilities. Develop and integrate solutions for improved system monitoring and observability. Implement automated alerts, dashboards, and reporting mechanisms using tools like Prometheus, Grafana, Datadog, Splunk, or similar platforms to ensure proactive issue identification and resolution. Create and implement software solutions to support IT Operations, including custom monitoring tools, dashboards, and other utilities that drive system efficiency and reliability. Develop and implement automation and data synchronization solutions across Salesforce, Jira, and SAP to ensure optimal functionality and performance. Provide software development support for troubleshooting, root cause analysis, and resolution of incidents impacting the availability and performance of applications and services. Work closely with SREs, DevOps, and other engineering teams to identify opportunities for automation and software-driven solutions, ensuring alignment with operational and business goals. Create comprehensive documentation for code, APIs, monitoring tools, and processes to facilitate knowledge sharing and ease of maintainability. Actively participate in post-incident reviews and contribute to postmortems, recommending software-driven enhancements to prevent recurrence and improve overall system reliability and observability. Required Knowledge and Experience: Proficiency in programming languages such as Python, Go, or Java. Hands-on experience with automation frameworks and tools such as Ansible, Terraform, or Jenkins. Strong knowledge of RESTful API design, development, and integration. Experience working with Salesforce APIs, Jira APIs, and SAP connectors or integrations. Familiarity with cloud platforms (e.g., AWS, Azure, GCP) and cloud-native application development. Experience with containerization and orchestration tools like Docker and Kubernetes. Advanced understanding of monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk) to create dashboards, alerts, and automated reporting. Familiarity with CI/CD pipelines and version control systems like Git. Strong analytical and problem-solving skills with the ability to troubleshoot complex systems. Must Have Qualifications Bachelor’s degree in Computer Science, Information Technology, or a related field. 5+ years of experience as a Software Developer or in a similar role. Proven experience working within SRE, DevOps, or IT Operations teams. Experience integrating and automating processes with Salesforce, Jira, and SAP. Hands-on experience with monitoring and observability platforms (e.g., Prometheus, Grafana, Datadog, Splunk). Nice To Have Qualifications Experience with Infrastructure as Code principles. Understanding of network and security principles. Experience working in Agile or DevOps teams. Experience developing custom monitoring solutions and internal tools. Professional certifications related to cloud platforms, automation, or DevOps practices. Physical Job Requirements The above statements are intended to describe the general nature and level of work being performed by employees assigned to this position, but they are not an exhaustive list of all the required responsibilities and skills of this position. Benefits & Compensation Medtronic offers a competitive Salary and flexible Benefits Package A commitment to our employees lives at the core of our values. We recognize their contributions. They share in the success they help to create. We offer a wide range of benefits, resources, and competitive compensation plans designed to support you at every career and life stage. This position is eligible for a short-term incentive called the Medtronic Incentive Plan (MIP). About Medtronic We lead global healthcare technology and boldly attack the most challenging health problems facing humanity by searching out and finding solutions. Our Mission — to alleviate pain, restore health, and extend life — unites a global team of 95,000+ passionate people. We are engineers at heart— putting ambitious ideas to work to generate real solutions for real people. From the R&D lab, to the factory floor, to the conference room, every one of us experiments, creates, builds, improves and solves. We have the talent, diverse perspectives, and guts to engineer the extraordinary. Learn more about our business, mission, and our commitment to diversity here

Posted 4 days ago

Apply

Specialist - IT Transformation Services Accelya

40.0 years

0 Lacs

Pune/Pimpri-Chinchwad Area

On-site

For more than 40 years, Accelya has been the industry’s partner for change, simplifying airline financial and commercial processes and empowering the air transport community to take better control of the future. Whether partnering with IATA on industry-wide initiatives or enabling digital transformation to simplify airline processes, Accelya drives the airline industry forward and proudly puts control back in the hands of airlines so they can move further, faster. Duties & Responsibilities Middleware Management: Install, configure, and maintain middleware platforms including Apache Tomcat, RedHat JBoss, Microsoft IIS, and Oracle Weblogic. Network Security: Implement and manage network encryption technologies such as SSL and HTTPS. Authentication and Authorization: Manage SSO solutions, particularly with Active Directory, and implement MFA. System Administration: Perform basic Linux administration tasks and manage VPNs, DNS, and IPAM. Messaging Technologies: Work with messaging technologies like IBM MQ and Rabbit MQ. Infrastructure-as-Code: Use tools like Terraform, Ansible, and Git for infrastructure management. Shared Services Support: Support applications delivering MFA, data tokenization, and data encryption. Continuous Improvement: Identify and implement improvements to middleware and shared service applications. Agile Methodologies: Participate in Agile environments using Scrum and/or Kanban. Knowledge, Experience & Skills Proficiency with Apache Tomcat, RedHat JBoss, Microsoft IIS, and Oracle Weblogic. Kafka and Citrix experience is a plus. Strong written and verbal English skills. Excellent problem-solving and analytical skills. Strong teamwork and communication abilities. At least one role at a major, global organisation with 100s or 1000s of servers to manage Working with colleagues/customers in the UK and US Some evidence of automation experience, ideally mentioning keywords 'Terraform', 'Ansible', 'Python', 'Powershell' Some evidence the candidate has had to think for themselves - e.g. they have been a consultant, team lead, project lead or just generally someone who has autonomy in one or more of their roles Evidence that they have written documents and/or mentored/coached more junior colleagues or clients What does the future of the air transport industry look like to you? Whether you’re an industry veteran or someone with experience from other industries, we want to make your ambitions a reality!

Posted 4 days ago

Apply

Oracle Cloud Infra Admin-Oracle Devops- Kubernetes-Senior EY

3.0 - 7.0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

At EY, you’ll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture and technology to become the best version of you. And we’re counting on your unique voice and perspective to help EY become even better, too. Join us and build an exceptional experience for yourself, and a better working world for all. Devops Engineer Primary Responsibilities and Accountabilities: Implement automation tools and frameworks for automatic code deployment (CI/CD) for middleware and packaged applications using tools like Jenkins, Azure Devops, GIT etc. Perform Cloud Administration tasks includes provisioning of Server, Identity and Access Management, configure network policies, backups and restore techniques on public cloud. Provision and manage container platforms like Docker, Kubernetes to provide scalable and High Available environments Defining and setting development, test, release, update, and support processes for DevOps operation. Automation using scripts like Perl, Ant, Groovy, Shell Script, Python, Maven. Build tools to reduce occurrences of errors and improve customer experience. Requirements (including Experience, Skills, And Additional Qualifications) Experience: 3 -7 years of experience working on multiple Devops tools and Cloud Infrastructure. Extensive Experience building CI/CD pipelines using Nexus, GIT, Jenkins, Azure Devops or any other standard marketplace tool. Worked on any one cloud platform (AWS, GCP, Azure, Oracle Cloud) for Infrastructure and Role Provisioning as Cloud Administrator Worked on creating JIRA workflows, dashboards to capture sprints, user stories. Excellent written/Verbal Communication, Presentation, Interactive skills with team across geography. Strong Experience in Cloud formation and Configuration Management tools like Ansible, Terraform, Chef, Puppet. Experience in container management on tools like Docker, Kubernetes will be a Plus. Good Knowledge on Ant/Shell/Maven, Python scripting will be a Plus to work on build deployments scripts. Cloud certification in relevant technology will be an added advantage. Willingness to work in 24/7 Model. Competencies / Skills: Possesses good communication skills to coordinate, resolve queries of both external and internal customers Self-motivator and result oriented to complete task as per timelines Must have good analytical and problem-solving skills Willingness to learn and work on new technologies Should have a consulting mindset Education: BTech / MTech / MCA / MBA EY | Building a better working world EY exists to build a better working world, helping to create long-term value for clients, people and society and build trust in the capital markets. Enabled by data and technology, diverse EY teams in over 150 countries provide trust through assurance and help clients grow, transform and operate. Working across assurance, consulting, law, strategy, tax and transactions, EY teams ask better questions to find new answers for the complex issues facing our world today.

Posted 4 days ago

Apply

Software Engineer NetApp

3.0 - 5.0 years

15 - 27 Lacs

Bengaluru

Work from Office

Job Summary As a Interop Test Engineer, you will work as part of a team responsible for Interop testing of NetApp software solutions Interoperability with other hardware and software solutions. You will test storage and data management services to be deployed as containers in Kubernetes or native virtual machines and work with various public and private cloud providers with high focus on customer requirements and product quality meting the project timelines. As part of your work, you would be required to work with other partner teams, engineer and developers to understand the test requirements, execution of required testing and automation of the testcases. Debug and troubleshoot issues seen during setup and testing. Job Requirements • Should have relevant experience working on storage or Host OS Interoperability testing • Should be familiar with Server Hardware and Operating systems • Should have skills for deploying and troubleshooting OS on server (OS install, driver, firmware upgrades etc) • Should have skills in deploying and troubleshooting Kubernetes environments (Vanilla Kubernetes / OpenShift / Anthos / Tanzu / Rancher) • Should have skills in managing, Administrating and troubleshooting cloud (AWS, GCP and Azure) • Experience working with python for test automation or development • Strong oral and written communication skills • Ability to work collaboratively within a team to meet aggressive goals and high quality standards • Strong aptitude for learning new technologies Nice to have skills: • Experience with REST API, Ansible, Terraform, Helm, Golang • OS Administration skills ( Linux, Windows, ESXi) • Familiarity with AWS, Azure or Google Cloud compute. • Working experience of configuring and troubleshooting NAS/SAN environments • Have relevant experience working on storage or Host OS Interoperability testing • Familiar with Server Hardware and Operating systems • Experience with ONTAP, CVS, or other NetApp products, or cloud-native storage platforms would be a plus. Education • Bachelor's or Masters in Computer Science Engineering with 2-4 years of relevant work experience

Posted 4 days ago

Apply

Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote) CrowdStrike

5.0 years

0 Lacs

Chennai, Tamil Nadu, India

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift timings - 12PM -9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Posted 4 days ago

Apply

Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote) CrowdStrike

5.0 years

0 Lacs

Chennai, Tamil Nadu, India

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Bonus Points SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Posted 4 days ago

Apply

Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote) CrowdStrike

5.0 years

0 Lacs

Delhi, India

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift timings - 12PM -9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Posted 4 days ago

Apply

Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote) CrowdStrike

5.0 years

0 Lacs

Punjab, India

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift timings - 12PM -9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Posted 4 days ago

Apply

Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote) CrowdStrike

5.0 years

0 Lacs

Punjab, India

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Bonus Points SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Posted 4 days ago

Apply

Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote) CrowdStrike

5.0 years

0 Lacs

Odisha, India

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift timings - 12PM -9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Posted 4 days ago

Apply

Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote) CrowdStrike

5.0 years

0 Lacs

Chhattisgarh, India

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Bonus Points SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Posted 4 days ago

Apply

Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote) CrowdStrike

5.0 years

0 Lacs

Chhattisgarh, India

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift timings - 12PM -9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Posted 4 days ago

Apply

Sr Database Administrator-Amazon RDS, Amazon Aurora Job express

10.0 years

10 - 20 Lacs

Tamil Nadu, India

On-site

We are looking for 10+ years of experience Database Administrator (DBA) to provide production support for Amazon RDS, Amazon Aurora, Cassandra, and Document-based databases . This role will support the ongoing operations, performance, and availability of our cloud-based and NoSQL database environments. The ideal candidate will be adept at managing large-scale distributed systems, cloud-native services, and hybrid data platforms in a 24/7 production environment. Technical Skills Proven experiene in Amazon RDS, Amazon Aurora, Cassandra, and Document-based databases Administrator or similar role in production support environments. 10+ years of experience in database administration and production support roles. Hands-on experience managing and supporting Apache Cassandra in a production environment. Strong knowledge of Cassandra internals, architecture, data modeling, replication, and consistency levels. Proficiency in troubleshooting cluster issues, performance tuning, and capacity planning. Strong hands-on experience with: Amazon RDS (PostgreSQL, MySQL, or SQL Server). Apache Cassandra (including setup, scaling, repairs, and performance tuning). Document Databases (e.g., MongoDB, Amazon DocumentDB, Couchbase). Experience with cloud infrastructure (preferably AWS ), including EC2, S3, IAM, and VPC. Proficiency in Linux/Unix environments and shell scripting. Strong SQL and NoSQL data modeling and query optimization skills. Familiarity with backup/recovery tools, high availability, and disaster recovery strategies. Strong troubleshooting and problem-solving skills. Excellent communication and collaboration abilities. Ability to work in a 24/7 support rotation and handle urgent production issues. Responsibilities Production Support & Monitoring Provide 24x7 production support for RDS (PostgreSQL/MySQL/SQL Server), Cassandra, and Document databases. Monitor database health, availability, and performance using AWS CloudWatch, Prometheus, Grafana, or similar tools. Triage and resolve database-related incidents and s in a timely manner. Perform root cause analysis and implement preventive measures for recurring issues. Database Maintenance & Administration Manage database configurations, backup, restore, and disaster recovery for all supported platforms. Perform software upgrades, security patches, and maintenance tasks with minimal downtime. o Maintain cluster integrity and support scaling operations for Cassandra and DocumentDB . Performance Tuning & Optimization Tune queries, indexes, and configurations for optimal performance across RDS, Cassandra, and Document stores. Analyze workloads and suggest improvements to data access patterns and schema design. Security & Compliance Support audits, compliance checks, and data protection policies across environments. Implement and enforce database security best practices (encryption, IAM policies, VPCs, RBAC, etc.). Automation & CI/CD Integration Automate operational tasks and deployments using scripts (Bash, Python) and IaC tools like Terraform or CloudFormation. Collaborate with DevOps teams to integrate database changes into CI/CD pipelines. Collaboration & Documentation Work closely with development, SRE, and infrastructure teams to support application rollouts and releases. Maintain detailed runbooks, SOPs, and incident post-mortem documentation. Qualification Experience with AWS-native database services : Amazon DocumentDB, Keyspaces (for Cassandra), or DynamoDB. Familiarity with container orchestration (e.g., Kubernetes) and cloud-native application patterns. Hands-on experience with automation tools like Ansible, Terraform, or Jenkins. Certifications such as AWS Certified Database – Specialty, or AWS Solutions Architect Associate. Exposure to monitoring/logging tools like CloudWatch, DataDog, ELK stack, or Prometheus/Grafana. Skills PRIMARY COMPETENCY : Data Engineering PRIMARY SKILL : Amazon RDS PRIMARY SKILL PERCENTAGE : 60 SECONDARY COMPETENCY : Data Engineering SECONDARY SKILL : Amazon Aurora SECONDARY SKILL PERCENTAGE : 20 TERTIARY COMPETENCY : Data Engineering TERTIARY SKILL : Cassandra TERTIARY SKILL PERCENTAGE : 20 Skills: amazon rds,,amazon aurora,nosql,ci/cd integration,cassandra,document-based databases,automation tools (bash, python, terraform, cloudformation),amazon rds,sql,document-based,cloud infrastructure (aws),linux/unix,monitoring tools (aws cloudwatch, grafana, prometheus),shell scripting

Posted 4 days ago

Apply

Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote) CrowdStrike

5.0 years

0 Lacs

Himachal Pradesh, India

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift timings - 12PM -9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Posted 4 days ago

Apply

Senior Network Engineer Index Exchange

20.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

At Index Exchange, we’re reinventing how digital advertising works—at scale. As a global advertising supply-side platform, we empower the world’s leading media owners and marketers to thrive in a programmatic, privacy-first ecosystem. We’re a proud industry pioneer with over 20 years of experience accelerating the ad technology evolution. Our proprietary tech is trusted by some of the world’s largest brands and media owners and plays a crucial role in keeping the internet open, accessible, and largely free. We process more than 550 billion real-time auctions every day (in comparison, Google processes 8.5 billion searches per day) with ultra-low latency. Our platform is vertically integrated from servers to networks and runs primarily on our own metal and cloud infrastructure. This end-to-end infrastructure is designed to provide both stability and agility, enabling us to adapt quickly as the market evolves. At the core of it all is our engineering-first culture. Our engineers tackle internet-scale problems across tight-knit, global teams. From moving petabytes of data and optimizing with AI to making real-time infrastructure decisions, Indexers have the agency and influence to shape the future of advertising. We move fast, build thoughtfully, and stay grounded in our core values. About The Role We are seeking a Senior Network Engineer with a proven track record to further the development of our next-generation network architectures. This role will entail deploying and operating advanced networking solutions with a strong emphasis on high availability, low-latency, and security measures. This position reports directly to the Engineering Lead Manager, Networking based in Canada and will work closely with members of the Technical Operations team. Here’s What You’ll Be Doing Network troubleshooting to isolate and diagnose network problems Analyzing business requirements to develop technical network solutions and their framework Developing implementation plans, test plans and project timelines for various projects Working with technology vendors Staying abreast of how technology infrastructures are currently impacting and driving competitors Writing function requirements/specifications documents Enhancing operational efficiency and quality by implementing network automation practices to streamline processes. Solving complex problems with many variables Participating in a 24x7 on-call rotation to provide timely response and resolution to network incidents and emergencies Here's What You Need 8-10+ years’ experience in network design, operations, and support Exceptional written and verbal communication skills Demonstrated expertise in many of the following protocols and technologies: TCP/IP, BGP, IPv6, QoS, Netflow, EVPN, VXLAN, DMVPN, GRE. Stong expertise in Routing, Switching, Enterprise, and Data Center networking with Cisco (NXOS, IOS-XE, and IOS-XR) and Arista (EOS) platforms. Experience with Arista’s AVD and CVP is a plus. Experience with L4-L7 load balancing solutions, such as Netscalers and HAProxy, Nginx. Expertise with Cisco security solutions (ASA, FirePower, Anyconnect), as well as other security vendors such as Palo Alto or Fortinet, demonstrating an understanding of network security principles and technologies. Familiarity with network automation using tools such as Ansible, Python, and Nornir to support network provisioning, configuration management, and troubleshooting. Understanding of Kubernetes networking concepts and Container Network Interface (CNI) standards. Networking certifications such as Cisco Certified (CCIE, CCNP) in at least one of the following is a plus: Data Centre, Enterprise, Security, DevNet. Knowledge of Linux and scripting Why You’ll Love Working Here Comprehensive health, dental, and vision plans for you and your dependents Paid time off, health days, and personal obligation days plus flexible work schedules Competitive retirement matching plans Equity packages Company contribution to Provident Fund Monthly internet stipend Generous parental leave available to birthing, non-birthing, and adoptive parents Annual well-being allowance plus fitness discounts and group wellness activities Employee assistance program Mental health first aid program that provides an in-the-moment point of contact and reassurance One day of volunteer time off per year and a donation-matching program Bi-weekly town halls and regular community-led team events Multiple resources and programming to support continuous learning A workplace that supports a diverse, equitable, and inclusive environment – learn more here Equal employment opportunity At Index Exchange, we believe that successful products are built by teams just as diverse as the audience who uses them. As such, we are committed to equal employment opportunities. We celebrate diversity of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or expression, or veteran status. Additionally, we realize that diversity is deeper than any status or classification—diversity is the human experience. For those who show grit, passion, and humility—Index will welcome you. Accessibility For Applicants With Disabilities Index Exchange welcomes and encourages individuals with disabilities to apply to work with us. If you require an accommodation, please share the details of your request and any information how we can assist you with the hiring recruiter when they contact you. Index Exchange will make reasonable efforts to ensure accommodation requests are met throughout the recruitment process. Index Everywhere, Index Anywhere Our corporate headquarters are in Toronto, with major offices in New York, Montreal, Kitchener, London, San Francisco, and many other global cities. As a major global advertising exchange, we are committed to operating as a tightly knit global team and embracing and empowering talent wherever our colleagues may be.

Posted 4 days ago

Apply

Lead DevOps Engineer Zeta Global

7.0 - 12.0 years

9 - 14 Lacs

Bengaluru

Work from Office

Summary: As a Lead DevOps Engineer, you will be leading projects targeted at supporting production and development environments, creating new and improving existing tools and processes, automating deployment and monitoring procedures. You will lead continuous integration and deployment effort, administering source control systems, deploying and maintaining production infrastructure and applications. Main Responsibilities: Technical leadership of infrastructure projects. Driving automation of performance testing environments. Leading containerization and IaC projects. Leading automation of engineering and operations processes. Defining and implementing HA and DR strategies. Design and optimization of CI/CD pipelines. Runbooks automation. On-call support of production systems. Requirements: 7+ years of experience in SRE, DevOps, or TechOps. 5+ years of experience leading technical projects or teams. 3+ years of tools development or automation. 3+ years of containerization and orchestration experience. Proficiency in shell scripting, as well as Python or Go. Ability to define project requirements and milestones. Experience leading cross-functional projects and teams. Solid experience in managing AWS production environments. Monitoring and observability expertise: OTEL, Prometheus, Grafana tools. Experience with at least two of the following: Puppet, Salt, Ansible, Terraform.

Posted 4 days ago

Apply

Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote) CrowdStrike

5.0 years

0 Lacs

Hyderabad, Telangana, India

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift timings - 12PM -9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Posted 4 days ago

Apply

Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote) CrowdStrike

5.0 years

0 Lacs

Hyderabad, Telangana, India

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Bonus Points SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Posted 4 days ago

Apply

Sr Database Administrator-Amazon RDS, Amazon Aurora Job express

10.0 years

10 - 20 Lacs

Telangana, India

On-site

We are looking for 10+ years of experience Database Administrator (DBA) to provide production support for Amazon RDS, Amazon Aurora, Cassandra, and Document-based databases . This role will support the ongoing operations, performance, and availability of our cloud-based and NoSQL database environments. The ideal candidate will be adept at managing large-scale distributed systems, cloud-native services, and hybrid data platforms in a 24/7 production environment. Technical Skills Proven experiene in Amazon RDS, Amazon Aurora, Cassandra, and Document-based databases Administrator or similar role in production support environments. 10+ years of experience in database administration and production support roles. Hands-on experience managing and supporting Apache Cassandra in a production environment. Strong knowledge of Cassandra internals, architecture, data modeling, replication, and consistency levels. Proficiency in troubleshooting cluster issues, performance tuning, and capacity planning. Strong hands-on experience with: Amazon RDS (PostgreSQL, MySQL, or SQL Server). Apache Cassandra (including setup, scaling, repairs, and performance tuning). Document Databases (e.g., MongoDB, Amazon DocumentDB, Couchbase). Experience with cloud infrastructure (preferably AWS ), including EC2, S3, IAM, and VPC. Proficiency in Linux/Unix environments and shell scripting. Strong SQL and NoSQL data modeling and query optimization skills. Familiarity with backup/recovery tools, high availability, and disaster recovery strategies. Strong troubleshooting and problem-solving skills. Excellent communication and collaboration abilities. Ability to work in a 24/7 support rotation and handle urgent production issues. Responsibilities Production Support & Monitoring Provide 24x7 production support for RDS (PostgreSQL/MySQL/SQL Server), Cassandra, and Document databases. Monitor database health, availability, and performance using AWS CloudWatch, Prometheus, Grafana, or similar tools. Triage and resolve database-related incidents and s in a timely manner. Perform root cause analysis and implement preventive measures for recurring issues. Database Maintenance & Administration Manage database configurations, backup, restore, and disaster recovery for all supported platforms. Perform software upgrades, security patches, and maintenance tasks with minimal downtime. o Maintain cluster integrity and support scaling operations for Cassandra and DocumentDB . Performance Tuning & Optimization Tune queries, indexes, and configurations for optimal performance across RDS, Cassandra, and Document stores. Analyze workloads and suggest improvements to data access patterns and schema design. Security & Compliance Support audits, compliance checks, and data protection policies across environments. Implement and enforce database security best practices (encryption, IAM policies, VPCs, RBAC, etc.). Automation & CI/CD Integration Automate operational tasks and deployments using scripts (Bash, Python) and IaC tools like Terraform or CloudFormation. Collaborate with DevOps teams to integrate database changes into CI/CD pipelines. Collaboration & Documentation Work closely with development, SRE, and infrastructure teams to support application rollouts and releases. Maintain detailed runbooks, SOPs, and incident post-mortem documentation. Qualification Experience with AWS-native database services : Amazon DocumentDB, Keyspaces (for Cassandra), or DynamoDB. Familiarity with container orchestration (e.g., Kubernetes) and cloud-native application patterns. Hands-on experience with automation tools like Ansible, Terraform, or Jenkins. Certifications such as AWS Certified Database – Specialty, or AWS Solutions Architect Associate. Exposure to monitoring/logging tools like CloudWatch, DataDog, ELK stack, or Prometheus/Grafana. Skills PRIMARY COMPETENCY : Data Engineering PRIMARY SKILL : Amazon RDS PRIMARY SKILL PERCENTAGE : 60 SECONDARY COMPETENCY : Data Engineering SECONDARY SKILL : Amazon Aurora SECONDARY SKILL PERCENTAGE : 20 TERTIARY COMPETENCY : Data Engineering TERTIARY SKILL : Cassandra TERTIARY SKILL PERCENTAGE : 20 Skills: amazon rds,,amazon aurora,nosql,ci/cd integration,cassandra,document-based databases,automation tools (bash, python, terraform, cloudformation),amazon rds,sql,document-based,cloud infrastructure (aws),linux/unix,monitoring tools (aws cloudwatch, grafana, prometheus),shell scripting

Posted 4 days ago

Apply

Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote) CrowdStrike

5.0 years

0 Lacs

Andhra Pradesh, India

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift timings - 12PM -9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Posted 4 days ago

Apply

Sr. IT Monitoring Engineer/Site Reliability Engineer (Shift -12PM-9PM IST) (Remote) CrowdStrike

5.0 years

0 Lacs

Andhra Pradesh, India

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Bonus Points SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.

Posted 4 days ago

Apply

Login to

Please Verify Your Phone or Email

Confirm Action

Search

Profile

Upskill and Grow with AI

10726 Ansible Jobs - Page 28

Job Alert

Start Your Job Search Today

Please Verify Your Phone or Email

Job Application AI Bot

Download the Mobile App

Setup Job Alerts

Featured Companies

Before You Leave... Find Your Perfect Job!

Login to

Please Verify Your Phone or Email

Confirm Action

Contact Us

Search

Profile

Upskill and Grow with AI

Personal Settings

10726 Ansible Jobs - Page 28

Job Alert

Upload Resume

AI Job Matching Summary

Pros

Cons

Summary

Start Your Job Search Today

Please Verify Your Phone or Email

Job Application AI Bot

Download the Mobile App

Setup Job Alerts

Featured Companies