Job Description
Tool Development: Design, build, and maintain scalable tools and frameworks to automate operational workflows (e.g., anomaly detection, root cause analysis, predictive alerting).
AIOps Integration: Implement AI/ML models into operational pipelines (e.g., log analysis, metric forecasting, incident triage) using Python, Go, or similar languages.
Observability & Automation: Enhance monitoring systems with AI-driven insights and self-healing capabilities.
Collaboration: Work with cross-functional teams to identify pain points in operations and develop AI-powered solutions.
Performance Optimization: Improve system reliability by reducing noise in alerts and automating remediation tasks.
Best Practices: Advocate for AIOps adoption by documenting use cases, conducting demos, and mentoring junior engineers.
Required education: Bachelor's Degree
Preferred education: Master's Degree
Required technical and professional expertise
7+ years of experience.
Proficient in Linux.
Expert in configuration management tools like Ansible.
Knowledgeable in creating CI/CD pipelines, with Jenkins as a preference.
Skilled in optimizing container builds.
Hands-on experience with Kubernetes or OpenShift.
Comfortable writing scripts in Bash and Python.
Practical experience in building React front-end applications with strong proficiency in JavaScript/TypeScript.
Expertise in developing backend services and APIs, particularly using Python frameworks.
Strong understanding of both SQL and NoSQL databases.
Familiar with messaging and task-queue tools such as Kafka, Redis, and Celery for asynchronous task processing.
Experience with AI/ML models and integrating them into operational pipelines.
Preferred Professional and Technical Expertise
Kubernetes/OpenShift: Experience working with production Kubernetes/OpenShift environments is strongly preferred.
Automation/Scripting: In-depth experience with Ansible, Python, Terraform, and CI/CD tools such as Jenkins, IBM Continuous Delivery, and ArgoCD.
Monitoring/Observability: Hands-on experience crafting alerts and dashboards using tools such as Instana, New Relic, and Grafana/Prometheus.