We are seeking an experienced Application Production Support having experience in platform support to join our team focused on production infrastructure management.
The ideal candidate will have strong skills in automation, monitoring, troubleshooting, and incident management across a variety of tools and platforms.
This position is ideal for an engineer who enjoys working in a dynamic, fast-paced environment and has a passion for production support, infrastructure optimization, automation and inclination towards problem solving
Responsibilities
Direct Responsibilities
Automation & Scripting:
Drive automation initiatives using Ansible/Shell scripting, and/or Python to optimize operational workflows, deployments, and system configuration.
Middleware & Application Debugging:
Monitor, debug, and maintain platform i.e. IIS / Apache to ensure stable application operations and uptime.
Support production migrations to cloud/ virtual machines to enhance system performance and reliability.
Production Support & Incident Management:
Manage and resolve incidents on IIS / Apache, and database components, focusing on areas such as long-running queries and server performance.
Perform root cause analysis (RCA) and implement corrective actions to prevent future occurrences.
Handle MQ troubleshooting and performance issue/failures.
Lead deployments and manage certificate renewals, ensuring seamless deployments and minimizing downtime.
Monitoring & Alerting:
Set up and manage Dynatrace alerts, analyze performance during incidents, create and maintain dashboards to provide real-time insights.
Exposure building custom dashboards in Grafana for production system visibility.
Capacity Planning:
Analyze current and projected application capacity to ensure adequate resources are provisioned.
Plan for capacity upgrades and scaling strategies to meet future demand.
Log Management & Analysis:
Exposure to ELK/OpenSearch for log analysis, dashboarding and troubleshooting, enabling faster root cause identification and resolution.
File Transfer as a Service:
Manage and support secure, reliable file transfer solutions across production systems.
Documentation & Process Management:
Skilled in documenting processes, incident reports, and application configurations for reference and compliance.
Strong attention to detail to maintain accurate and up-to-date KeDB.
Exposure to Cloud & Containerization Knowledge:
Provide high-level support and guidance on cloud architecture, Kubernetes, and containerized environments, enhancing system scalability and modernization.
Exposure to DevOps & CI/CD Knowledge:
Familiarity with DevOps practices and tools (e.g., Jenkins) for automated deployments and configuration management.
Understanding of CI/CD pipelines and version control to manage application releases and updates.
Technical & Behavioral Competencies
Required Skills and Qualifications:
Proven experience in Application Production Support / Platform Management.
Strong knowledge of monitoring / Log aggregation tools, including Dynatrace, Geneos, Grafana and the ELK stack.
Hands-on experience with automation using Ansible / Shell scripting /Power Shell/and Python.
Proficiency in managing incidents and performing root cause analysis on IIS / Apache, and database environments.
Familiarity with Jenkins for continuous integration and deployment, as well as certificate management and renewal processes.
Exposure to SQL skills for data extraction, debugging, and performance tuning.
Exposure of cloud architectures, Kubernetes, and containerized infrastructure.
Preferred to have
ITIL
Dockers/Kubernetes
Prior knowledge on Application Production Support / Platform / Development background
Skills Referential
Behavioural Skills :
Ability to deliver / Results driven
Communication skills - oral & written
Creativity & Innovation / Problem solving
Personal Impact / Ability to influence
Transversal Skills:
Ability to develop and adapt a process
Ability to anticipate business / strategic evolution
Ability to manage / facilitate a meeting, seminar, committee, training
Ability to understand, explain and support change
Ability to develop others & improve their skills