5.0 years
0 Lacs
Chhattisgarh, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Bonus Points SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
5.0 years
0 Lacs
Chhattisgarh, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift timings - 12PM -9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
10.0 years
10 - 20 Lacs
Tamil Nadu, India
On-site
We are looking for a Database Administrator (DBA) with 10+ years of experience to provide production support for Amazon RDS, Amazon Aurora, Cassandra, and document-based databases. This role will support the ongoing operations, performance, and availability of our cloud-based and NoSQL database environments. The ideal candidate will be adept at managing large-scale distributed systems, cloud-native services, and hybrid data platforms in a 24/7 production environment. Technical Skills Proven experience as an administrator of Amazon RDS, Amazon Aurora, Cassandra, and document-based databases, or in a similar role, in production support environments. 10+ years of experience in database administration and production support roles. Hands-on experience managing and supporting Apache Cassandra in a production environment. Strong knowledge of Cassandra internals, architecture, data modeling, replication, and consistency levels. Proficiency in troubleshooting cluster issues, performance tuning, and capacity planning. Strong hands-on experience with: Amazon RDS (PostgreSQL, MySQL, or SQL Server). Apache Cassandra (including setup, scaling, repairs, and performance tuning). Document Databases (e.g., MongoDB, Amazon DocumentDB, Couchbase). Experience with cloud infrastructure (preferably AWS), including EC2, S3, IAM, and VPC. Proficiency in Linux/Unix environments and shell scripting. Strong SQL and NoSQL data modeling and query optimization skills. Familiarity with backup/recovery tools, high availability, and disaster recovery strategies. Strong troubleshooting and problem-solving skills. Excellent communication and collaboration abilities. Ability to work in a 24/7 support rotation and handle urgent production issues. Responsibilities Production Support & Monitoring Provide 24x7 production support for RDS (PostgreSQL/MySQL/SQL Server), Cassandra, and Document databases. Monitor database health, availability, and performance using AWS CloudWatch, Prometheus, Grafana, or similar tools.
Triage and resolve database-related incidents in a timely manner. Perform root cause analysis and implement preventive measures for recurring issues. Database Maintenance & Administration Manage database configurations, backup, restore, and disaster recovery for all supported platforms. Perform software upgrades, security patches, and maintenance tasks with minimal downtime. Maintain cluster integrity and support scaling operations for Cassandra and DocumentDB. Performance Tuning & Optimization Tune queries, indexes, and configurations for optimal performance across RDS, Cassandra, and Document stores. Analyze workloads and suggest improvements to data access patterns and schema design. Security & Compliance Support audits, compliance checks, and data protection policies across environments. Implement and enforce database security best practices (encryption, IAM policies, VPCs, RBAC, etc.). Automation & CI/CD Integration Automate operational tasks and deployments using scripts (Bash, Python) and IaC tools like Terraform or CloudFormation. Collaborate with DevOps teams to integrate database changes into CI/CD pipelines. Collaboration & Documentation Work closely with development, SRE, and infrastructure teams to support application rollouts and releases. Maintain detailed runbooks, SOPs, and incident post-mortem documentation. Qualifications Experience with AWS-native database services: Amazon DocumentDB, Keyspaces (for Cassandra), or DynamoDB. Familiarity with container orchestration (e.g., Kubernetes) and cloud-native application patterns. Hands-on experience with automation tools like Ansible, Terraform, or Jenkins. Certifications such as AWS Certified Database – Specialty, or AWS Solutions Architect Associate. Exposure to monitoring/logging tools like CloudWatch, Datadog, ELK stack, or Prometheus/Grafana.
Skills PRIMARY COMPETENCY: Data Engineering PRIMARY SKILL: Amazon RDS PRIMARY SKILL PERCENTAGE: 60 SECONDARY COMPETENCY: Data Engineering SECONDARY SKILL: Amazon Aurora SECONDARY SKILL PERCENTAGE: 20 TERTIARY COMPETENCY: Data Engineering TERTIARY SKILL: Cassandra TERTIARY SKILL PERCENTAGE: 20 Skills: amazon rds, amazon aurora, nosql, ci/cd integration, cassandra, document-based databases, automation tools (bash, python, terraform, cloudformation), sql, document-based, cloud infrastructure (aws), linux/unix, monitoring tools (aws cloudwatch, grafana, prometheus), shell scripting
Posted 3 days ago
5.0 years
0 Lacs
Himachal Pradesh, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift timings - 12PM -9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
20.0 years
0 Lacs
Bengaluru, Karnataka, India
On-site
At Index Exchange, we’re reinventing how digital advertising works—at scale. As a global advertising supply-side platform, we empower the world’s leading media owners and marketers to thrive in a programmatic, privacy-first ecosystem. We’re a proud industry pioneer with over 20 years of experience accelerating the ad technology evolution. Our proprietary tech is trusted by some of the world’s largest brands and media owners and plays a crucial role in keeping the internet open, accessible, and largely free. We process more than 550 billion real-time auctions every day (in comparison, Google processes 8.5 billion searches per day) with ultra-low latency. Our platform is vertically integrated from servers to networks and runs primarily on our own metal and cloud infrastructure. This end-to-end infrastructure is designed to provide both stability and agility, enabling us to adapt quickly as the market evolves. At the core of it all is our engineering-first culture. Our engineers tackle internet-scale problems across tight-knit, global teams. From moving petabytes of data and optimizing with AI to making real-time infrastructure decisions, Indexers have the agency and influence to shape the future of advertising. We move fast, build thoughtfully, and stay grounded in our core values. About The Role We are seeking a Senior Network Engineer with a proven track record to further the development of our next-generation network architectures. This role will entail deploying and operating advanced networking solutions with a strong emphasis on high availability, low-latency, and security measures. This position reports directly to the Engineering Lead Manager, Networking based in Canada and will work closely with members of the Technical Operations team. 
Here’s What You’ll Be Doing Network troubleshooting to isolate and diagnose network problems Analyzing business requirements to develop technical network solutions and their framework Developing implementation plans, test plans and project timelines for various projects Working with technology vendors Staying abreast of how technology infrastructures are currently impacting and driving competitors Writing functional requirements/specifications documents Enhancing operational efficiency and quality by implementing network automation practices to streamline processes Solving complex problems with many variables Participating in a 24x7 on-call rotation to provide timely response and resolution to network incidents and emergencies Here's What You Need 8-10+ years’ experience in network design, operations, and support Exceptional written and verbal communication skills Demonstrated expertise in many of the following protocols and technologies: TCP/IP, BGP, IPv6, QoS, NetFlow, EVPN, VXLAN, DMVPN, GRE. Strong expertise in Routing, Switching, Enterprise, and Data Center networking with Cisco (NX-OS, IOS-XE, and IOS-XR) and Arista (EOS) platforms. Experience with Arista’s AVD and CVP is a plus. Experience with L4-L7 load balancing solutions, such as NetScaler, HAProxy, and Nginx. Expertise with Cisco security solutions (ASA, Firepower, AnyConnect), as well as other security vendors such as Palo Alto or Fortinet, demonstrating an understanding of network security principles and technologies. Familiarity with network automation using tools such as Ansible, Python, and Nornir to support network provisioning, configuration management, and troubleshooting. Understanding of Kubernetes networking concepts and Container Network Interface (CNI) standards. Networking certifications such as Cisco CCIE or CCNP in at least one of the following tracks are a plus: Data Centre, Enterprise, Security, DevNet.
Knowledge of Linux and scripting Why You’ll Love Working Here Comprehensive health, dental, and vision plans for you and your dependents Paid time off, health days, and personal obligation days plus flexible work schedules Competitive retirement matching plans Equity packages Company contribution to Provident Fund Monthly internet stipend Generous parental leave available to birthing, non-birthing, and adoptive parents Annual well-being allowance plus fitness discounts and group wellness activities Employee assistance program Mental health first aid program that provides an in-the-moment point of contact and reassurance One day of volunteer time off per year and a donation-matching program Bi-weekly town halls and regular community-led team events Multiple resources and programming to support continuous learning A workplace that supports a diverse, equitable, and inclusive environment Equal employment opportunity At Index Exchange, we believe that successful products are built by teams just as diverse as the audience who uses them. As such, we are committed to equal employment opportunities. We celebrate diversity of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or expression, or veteran status. Additionally, we realize that diversity is deeper than any status or classification; diversity is the human experience. For those who show grit, passion, and humility, Index will welcome you. Accessibility For Applicants With Disabilities Index Exchange welcomes and encourages individuals with disabilities to apply to work with us. If you require an accommodation, please share the details of your request and any information on how we can assist you with the hiring recruiter when they contact you. Index Exchange will make reasonable efforts to ensure accommodation requests are met throughout the recruitment process.
Index Everywhere, Index Anywhere Our corporate headquarters are in Toronto, with major offices in New York, Montreal, Kitchener, London, San Francisco, and many other global cities. As a major global advertising exchange, we are committed to operating as a tightly knit global team and embracing and empowering talent wherever our colleagues may be.
Posted 3 days ago
7.0 - 12.0 years
9 - 14 Lacs
Bengaluru
Work from Office
Summary: As a Lead DevOps Engineer, you will lead projects that support production and development environments, create new tools and processes and improve existing ones, and automate deployment and monitoring procedures. You will lead continuous integration and deployment efforts, administer source control systems, and deploy and maintain production infrastructure and applications. Main Responsibilities: Technical leadership of infrastructure projects. Driving automation of performance testing environments. Leading containerization and IaC projects. Leading automation of engineering and operations processes. Defining and implementing HA and DR strategies. Design and optimization of CI/CD pipelines. Runbook automation. On-call support of production systems. Requirements: 7+ years of experience in SRE, DevOps, or TechOps. 5+ years of experience leading technical projects or teams. 3+ years of tools development or automation. 3+ years of containerization and orchestration experience. Proficiency in shell scripting, as well as Python or Go. Ability to define project requirements and milestones. Experience leading cross-functional projects and teams. Solid experience in managing AWS production environments. Monitoring and observability expertise: OTEL, Prometheus, Grafana tools. Experience with at least two of the following: Puppet, Salt, Ansible, Terraform.
Posted 3 days ago
5.0 years
0 Lacs
Hyderabad, Telangana, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
5.0 years
0 Lacs
Hyderabad, Telangana, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Bonus Points SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
10.0 years
10 - 20 Lacs
Telangana, India
On-site
We are looking for a Database Administrator (DBA) with 10+ years of experience to provide production support for Amazon RDS, Amazon Aurora, Cassandra, and document-based databases. This role will support the ongoing operations, performance, and availability of our cloud-based and NoSQL database environments. The ideal candidate will be adept at managing large-scale distributed systems, cloud-native services, and hybrid data platforms in a 24/7 production environment. Technical Skills Proven experience as an administrator of Amazon RDS, Amazon Aurora, Cassandra, and document-based databases, or in a similar role, in production support environments. 10+ years of experience in database administration and production support roles. Hands-on experience managing and supporting Apache Cassandra in a production environment. Strong knowledge of Cassandra internals, architecture, data modeling, replication, and consistency levels. Proficiency in troubleshooting cluster issues, performance tuning, and capacity planning. Strong hands-on experience with: Amazon RDS (PostgreSQL, MySQL, or SQL Server). Apache Cassandra (including setup, scaling, repairs, and performance tuning). Document databases (e.g., MongoDB, Amazon DocumentDB, Couchbase). Experience with cloud infrastructure (preferably AWS), including EC2, S3, IAM, and VPC. Proficiency in Linux/Unix environments and shell scripting. Strong SQL and NoSQL data modeling and query optimization skills. Familiarity with backup/recovery tools, high availability, and disaster recovery strategies. Strong troubleshooting and problem-solving skills. Excellent communication and collaboration abilities. Ability to work in a 24/7 support rotation and handle urgent production issues. Responsibilities Production Support & Monitoring Provide 24x7 production support for RDS (PostgreSQL/MySQL/SQL Server), Cassandra, and document databases. Monitor database health, availability, and performance using AWS CloudWatch, Prometheus, Grafana, or similar tools. 
Triage and resolve database-related incidents in a timely manner. Perform root cause analysis and implement preventive measures for recurring issues. Database Maintenance & Administration Manage database configurations, backup, restore, and disaster recovery for all supported platforms. Perform software upgrades, security patches, and maintenance tasks with minimal downtime. Maintain cluster integrity and support scaling operations for Cassandra and DocumentDB. Performance Tuning & Optimization Tune queries, indexes, and configurations for optimal performance across RDS, Cassandra, and document stores. Analyze workloads and suggest improvements to data access patterns and schema design. Security & Compliance Support audits, compliance checks, and data protection policies across environments. Implement and enforce database security best practices (encryption, IAM policies, VPCs, RBAC, etc.). Automation & CI/CD Integration Automate operational tasks and deployments using scripts (Bash, Python) and IaC tools like Terraform or CloudFormation. Collaborate with DevOps teams to integrate database changes into CI/CD pipelines. Collaboration & Documentation Work closely with development, SRE, and infrastructure teams to support application rollouts and releases. Maintain detailed runbooks, SOPs, and incident post-mortem documentation. Qualifications Experience with AWS-native database services: Amazon DocumentDB, Keyspaces (for Cassandra), or DynamoDB. Familiarity with container orchestration (e.g., Kubernetes) and cloud-native application patterns. Hands-on experience with automation tools like Ansible, Terraform, or Jenkins. Certifications such as AWS Certified Database – Specialty, or AWS Solutions Architect Associate. Exposure to monitoring/logging tools like CloudWatch, Datadog, ELK stack, or Prometheus/Grafana. 
Skills PRIMARY COMPETENCY: Data Engineering PRIMARY SKILL: Amazon RDS PRIMARY SKILL PERCENTAGE: 60 SECONDARY COMPETENCY: Data Engineering SECONDARY SKILL: Amazon Aurora SECONDARY SKILL PERCENTAGE: 20 TERTIARY COMPETENCY: Data Engineering TERTIARY SKILL: Cassandra TERTIARY SKILL PERCENTAGE: 20 Skills: amazon rds, amazon aurora, nosql, ci/cd integration, cassandra, document-based databases, automation tools (bash, python, terraform, cloudformation), sql, document-based, cloud infrastructure (aws), linux/unix, monitoring tools (aws cloudwatch, grafana, prometheus), shell scripting
Posted 3 days ago
5.0 years
0 Lacs
Andhra Pradesh, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
5.0 years
0 Lacs
Andhra Pradesh, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Bonus Points SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
5.0 - 10.0 years
10 - 20 Lacs
Noida, Pune, Bengaluru
Work from Office
Description: Boomi India Lab (11013020) Requirements: Years of experience: 5-6 years. Skills needed: Java, Spring Boot, IntelliJ, Mockito, JUnit, Hibernate, MySQL, PostgreSQL, design patterns, AWS stack, proficiency in web services technologies (REST, SOAP, WSDL), Git, Maven. Good to have: Amazon Q, Copilot. Job Responsibilities: Bamboo/Jenkins/Harness, Ansible, Snyk, SonarQube - NewRelic, AppD, Splunk - Google Suite, Slack, Atlassian Suite (Jira, Confluence) What We Offer: Exciting Projects: We focus on industries like high-tech, communication, media, healthcare, retail and telecom. Our customer list is full of fantastic global brands and leaders who love what we build for them. Collaborative Environment: You can expand your skills by collaborating with a diverse team of highly talented people in an open, laid-back environment, or even abroad in one of our global centers or client facilities! Work-Life Balance: GlobalLogic prioritizes work-life balance, which is why we offer flexible work schedules, opportunities to work from home, and paid time off and holidays. Professional Development: Our dedicated Learning & Development team regularly organizes communication skills training (GL Vantage, Toastmasters), stress management programs, professional certifications, and technical and soft skills training. Excellent Benefits: We provide our employees with competitive salaries, family medical insurance, Group Term Life Insurance, Group Personal Accident Insurance, NPS (National Pension Scheme), periodic health awareness programs, extended maternity leave, annual performance bonuses, and referral bonuses. Fun Perks: We want you to love where you work, which is why we host sports events, cultural activities, and corporate parties, and offer food at subsidized rates. Our vibrant offices also include dedicated GL Zones, rooftop decks and a GL Club where you can drink coffee or tea with your colleagues over a game of table, and we offer discounts for popular stores and restaurants!
Posted 3 days ago
5.0 years
0 Lacs
Madhya Pradesh, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Bonus Points SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
5.0 years
0 Lacs
Madhya Pradesh, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
5.0 years
0 Lacs
Dehradun, Uttarakhand, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Bonus Points SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
5.0 years
0 Lacs
Dehradun, Uttarakhand, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
5.0 years
0 Lacs
Kerala, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
5.0 years
0 Lacs
Kerala, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Bonus Points SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
5.0 years
0 Lacs
Bihar, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Bonus Points SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift Timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
5.0 years
0 Lacs
Bihar, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 
What You’ll Need 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX)) Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation Experience with log management platforms (ELK stack, Splunk, LogScale) Working knowledge of cloud services monitoring (AWS CloudWatch, GCP) Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring Knowledge of SRE principles, SLOs, error budgets, and incident management Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes) Strong incident triage, root cause analysis, and documentation skills Experience participating in on-call rotations and emergency response What You'll Do Monitoring and Reliability Design and maintain comprehensive monitoring solutions across infrastructure and applications Configure appropriate alerting thresholds to ensure timely response to potential issues Define and track SLOs and error budgets for critical services Create and maintain dashboards providing real-time visibility into system health Conduct regular reviews of system reliability and recommend improvements Incident Management and Operations Participate in on-call rotation to respond to alerts and incidents Lead incident response efforts and conduct thorough post-incident reviews Document incidents, resolutions, and lessons learned Develop and refine incident response procedures to improve MTTR Implement proactive monitoring to detect potential issues before they impact users Automation and Collaboration Develop scripts and automation to streamline monitoring tasks and reduce manual effort Create self-healing systems that can automatically remediate common issues Integrate monitoring tools with other operational systems Work closely with 
development, infrastructure, and security teams Provide guidance on monitoring best practices and observability Maintain comprehensive documentation for monitoring systems and procedures Continuous Improvement Stay current with industry trends in monitoring and site reliability engineering Analyze monitoring data to identify patterns and improvement opportunities Implement metrics to track the effectiveness of monitoring processes Contribute to the evolution of the organization's monitoring strategy Preferred Qualifications SRE, cloud platform, or monitoring tool certifications ITIL Foundation certification Bachelor's degree in Computer Science, Information Technology, or related field Shift timings: 12PM - 9PM IST Benefits Of Working At CrowdStrike Remote-friendly and flexible work culture Market leader in compensation and equity awards Comprehensive physical and mental wellness programs Competitive vacation and holidays for recharge Paid parental and adoption leaves Professional development opportunities for all employees regardless of level or role Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections Vibrant office culture with world class amenities Great Place to Work Certified™ across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. 
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
0 years
0 Lacs
Mumbai, Maharashtra, India
On-site
Acts as liaison between Development teams and Platform (PaaS) teams to translate requirements into technical tasks or support requests Uses coding languages or scripting methodologies to solve a problem with a custom workflow Documents problems, articulates solutions or workarounds Acts as a key contributor on given projects to brainstorm about the best way to tackle a complex technological infrastructure, security or development problem Learns methodologies to perform incremental testing actions on code using a test-driven approach (TDD) where possible Oral and written communication skills with a keen sense of customer service Problem-solving and troubleshooting skills Process-oriented with great documentation skills Knowledge of best practices in a micro-service architecture in an always-up, always-available service Experience with or knowledge of Agile Software Development methodologies Knowledge of security best practices in a containerized or cloud-based architecture Familiarity with event-driven architecture and related concepts Experience Familiarity with container orchestration services, preferably Kubernetes Competency with container runtimes like Docker, CRI-O, Mesos, rkt (CoreOS) Working knowledge of Kubernetes templating tools such as Helm or Kustomize Proficiency in infrastructure scripting / templating solutions such as Bash, Go, Python Demonstrated experience with Infrastructure as Code tools such as Terraform, CloudFormation, Chef, Puppet, SaltStack, Ansible or equivalent Competency administering and deploying development lifecycle tooling such as Git, Jira, GitLab, CircleCI or Jenkins Knowledge of logging and monitoring tools such as Splunk, Logz.io, Prometheus, Grafana or full suites of tools like Datadog or New Relic Significant experience with multiple Linux operating systems on both virtual and containerized platforms. 
Experience with Infrastructure as Code principles utilizing GitOps Experience with secrets management tools such as Vault, AWS Secrets Manager, Azure Key Vault or equivalent
Posted 3 days ago
5.0 - 8.0 years
13 - 18 Lacs
Bengaluru
Work from Office
Your Impact: The role of the Cloud Application Engineer is to build solutions that enhance the availability, performance, and stability of OpenText services and to automate away repetitive work as part of a cloud ops organization. This role would be a great fit for someone with creative and innovative problem-solving skills. You will develop and implement solutions that operate at scale. Our teams are empowered and expected to improve our products to truly deliver a reliable experience to customers. What the role offers: Collaborate with Engineering, Professional Services teams, sustain and business partners, and contribute significantly to developing specifications that resolve problems and address enhancement needs, focusing on logging, monitoring and metrics for operational readiness Use technical knowledge, creativity, and company practices to drive down occurrences of incidents through development of proactive monitoring and alerting. Provide attention to incidents according to Service Level Agreements. Provide continuous feedback to development teams on system stability, defect analysis and system enhancements Develop runbooks and patterns to sustain applications in a production environment Participate in technical discussions and drive transition to sustain activities with the development teams Work with IT business and development partners to gather input to develop new capabilities in displaying/monitoring/alerting on key performance indicators (KPIs) by tracking business transactions (BT) in real time Partner with application owners to develop creative and effective solutions to mitigate risk and successfully remediate any audit issues, providing quality and timely responses Take ownership and accountability for the incident resolution process, participating in RCA and SWAT investigations. Plan for validation and verification of changes deployed by infrastructure and development teams. 
Participate in day-to-day, real-time, advanced-level technical support and troubleshooting on issues reported from the user/customer base. Provide guidance in resolving performance-related issues and designing solutions for any technical issues faced by the application Establish and maintain a good relationship with team members, Product Development, Product Management, Customer Service, Client Management and other cross-functional teams. Participate in training and information sharing activities. Act as backup for other team members when necessary. Requires rotating shift work as needed. On-call rotation is required, as 7x24x365 support is required. What you need to succeed: The ability to understand and maintain scripting software Deep understanding of Linux systems Hands-on experience with cloud infrastructure: Google, AWS or Azure Experience with PaaS technologies such as Cloud Foundry, Kubernetes, BOSH and Anthos. Good understanding and operational experience with container technologies. Good understanding and working experience with microservices and RESTful architecture. Experience with continuous delivery tools like Ansible, Rundeck or Argo CD to set up automated pipelines as needed. Strong working knowledge of PaaS or application operations best practices. Operational understanding or experience with message brokers such as Apache MQ Operational understanding or experience with search technologies such as Solr search or Elasticsearch. Experience in supporting middleware technologies such as Apache, Tomcat, Spring. Experience with at least one scripting language such as shell, Perl, Python, JavaScript, etc. Experience with installing and configuring Apache and Tomcat. Experience in supporting Java applications built using frameworks such as Spring, Struts, Spark, etc. Experience and knowledge in Oracle and Postgres. 
Deep expertise in monitoring distributed systems application architectures and the ability to correlate environment conditions and metrics to application events. Experience with APM tools such as New Relic, Dynatrace or AppDynamics. Experience with monitoring tools such as Zabbix or check_mk. Knowledge of and familiarity with centralized logging systems such as Graylog or Kibana. Strong understanding of ITIL principles; certification is a plus. Is passionate about getting under the hood of systems and technologies to understand their inner workings, and fix what needs fixing. This requires diagnosing and troubleshooting user-facing service incidents and outages Diagnosing and resolving problems in high-throughput web applications and network services Proven problem-solving and analytical ability. Excellent organizational/time management skills. Ability to handle multiple tasks concurrently. Ability to lead, drive and implement highly scalable and complex solutions A strong understanding of security best practices. A proven record of being able to work independently and collaboratively.
Posted 3 days ago
5.0 years
0 Lacs
Pune, Maharashtra, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About The Role The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur. 

What You’ll Need
- 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX))
- Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation
- Experience with log management platforms (ELK stack, Splunk, LogScale)
- Working knowledge of cloud services monitoring (AWS CloudWatch, GCP)
- Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring
- Knowledge of SRE principles, SLOs, error budgets, and incident management
- Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring
- Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes)
- Strong incident triage, root cause analysis, and documentation skills
- Experience participating in on-call rotations and emergency response

What You'll Do
Monitoring and Reliability
- Design and maintain comprehensive monitoring solutions across infrastructure and applications
- Configure appropriate alerting thresholds to ensure timely response to potential issues
- Define and track SLOs and error budgets for critical services
- Create and maintain dashboards providing real-time visibility into system health
- Conduct regular reviews of system reliability and recommend improvements
Incident Management and Operations
- Participate in on-call rotation to respond to alerts and incidents
- Lead incident response efforts and conduct thorough post-incident reviews
- Document incidents, resolutions, and lessons learned
- Develop and refine incident response procedures to improve MTTR
- Implement proactive monitoring to detect potential issues before they impact users
Automation and Collaboration
- Develop scripts and automation to streamline monitoring tasks and reduce manual effort
- Create self-healing systems that can automatically remediate common issues
- Integrate monitoring tools with other operational systems
- Work closely with development, infrastructure, and security teams
- Provide guidance on monitoring best practices and observability
- Maintain comprehensive documentation for monitoring systems and procedures
Continuous Improvement
- Stay current with industry trends in monitoring and site reliability engineering
- Analyze monitoring data to identify patterns and improvement opportunities
- Implement metrics to track the effectiveness of monitoring processes
- Contribute to the evolution of the organization's monitoring strategy

Preferred Qualifications
- SRE, cloud platform, or monitoring tool certifications
- ITIL Foundation certification
- Bachelor's degree in Computer Science, Information Technology, or related field

Shift timings: 12 PM - 9 PM IST

Benefits Of Working At CrowdStrike
- Remote-friendly and flexible work culture
- Market leader in compensation and equity awards
- Comprehensive physical and mental wellness programs
- Competitive vacation and holidays for recharge
- Paid parental and adoption leaves
- Professional development opportunities for all employees regardless of level or role
- Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
- Vibrant office culture with world class amenities
- Great Place to Work Certified™ across the globe

CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment.
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago
5.0 years
0 Lacs
Pune, Maharashtra, India
Remote
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed: we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you.

About The Role
The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications. You will work at the intersection of operations and development, applying software engineering principles to operations tasks while focusing on system reliability and automation. This position requires a proactive approach to identifying and resolving issues before they impact business operations, as well as participating in on-call rotations to address incidents when they occur.

What You’ll Need
- 5+ years of experience with enterprise monitoring tools (Prometheus, LogicMonitor, Datadog, ThousandEyes, Zscaler Digital Experience (ZDX))
- Strong proficiency in scripting languages (Python, Bash, PowerShell) for automation
- Experience with log management platforms (ELK stack, Splunk, LogScale)
- Working knowledge of cloud services monitoring (AWS CloudWatch, GCP)
- Experience with application performance monitoring (APM), digital experience monitoring (DEM) and infrastructure monitoring
- Knowledge of SRE principles, SLOs, error budgets, and incident management
- Experience with automated alerting, remediation workflows, and CI/CD pipeline monitoring
- Familiarity with Infrastructure as Code (Terraform, Ansible) and containerization (Docker, Kubernetes)
- Strong incident triage, root cause analysis, and documentation skills
- Experience participating in on-call rotations and emergency response

What You'll Do
Monitoring and Reliability
- Design and maintain comprehensive monitoring solutions across infrastructure and applications
- Configure appropriate alerting thresholds to ensure timely response to potential issues
- Define and track SLOs and error budgets for critical services
- Create and maintain dashboards providing real-time visibility into system health
- Conduct regular reviews of system reliability and recommend improvements
Incident Management and Operations
- Participate in on-call rotation to respond to alerts and incidents
- Lead incident response efforts and conduct thorough post-incident reviews
- Document incidents, resolutions, and lessons learned
- Develop and refine incident response procedures to improve MTTR
- Implement proactive monitoring to detect potential issues before they impact users
Automation and Collaboration
- Develop scripts and automation to streamline monitoring tasks and reduce manual effort
- Create self-healing systems that can automatically remediate common issues
- Integrate monitoring tools with other operational systems
- Work closely with development, infrastructure, and security teams
- Provide guidance on monitoring best practices and observability
- Maintain comprehensive documentation for monitoring systems and procedures
Continuous Improvement
- Stay current with industry trends in monitoring and site reliability engineering
- Analyze monitoring data to identify patterns and improvement opportunities
- Implement metrics to track the effectiveness of monitoring processes
- Contribute to the evolution of the organization's monitoring strategy

Bonus Points
- SRE, cloud platform, or monitoring tool certifications
- ITIL Foundation certification
- Bachelor's degree in Computer Science, Information Technology, or related field

Shift Timings: 12 PM - 9 PM IST

Benefits Of Working At CrowdStrike
- Remote-friendly and flexible work culture
- Market leader in compensation and equity awards
- Comprehensive physical and mental wellness programs
- Competitive vacation and holidays for recharge
- Paid parental and adoption leaves
- Professional development opportunities for all employees regardless of level or role
- Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
- Vibrant office culture with world class amenities
- Great Place to Work Certified™ across the globe

CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment.
The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at recruiting@crowdstrike.com for further assistance.
Posted 3 days ago