
922 Prometheus Jobs - Page 31

JobPe aggregates job listings for easy access; you apply directly on the original job portal.

4.0 - 8.0 years

6 - 10 Lacs

Bengaluru

Work from Office

Key Responsibilities:
Technical support: incident management, change management, problem management
Monitoring: Zabbix, Prometheus, ELK, Grafana
Troubleshooting: customer issues, application issues
Linux file manipulation
Perform system health checks
Investigate customer issues
Investigate system alarms
Lead or participate in incident management
Perform change reviews
Problem management: lead or participate in postmortems
Problem management: drive resolution of customer-impacting issues
Improve detection of issues (alarm tuning)
Fulfill daily requests
On-call duties
Required Qualifications To Be Successful In This Role:
Monitoring: Zabbix, Prometheus, ELK, Grafana, Dynatrace, Nagios
Incident management
Linux
Additional Information:
Job Type: Full Time
Work Profile: Hybrid (Work from Office/Remote)
Years of Experience: 3-7 years
Location: Bangalore
What We Offer:
Competitive salaries and comprehensive health benefits
Flexible work hours and remote work options
Professional development and training opportunities
A supportive and inclusive work environment

Posted 1 month ago


4.0 - 8.0 years

6 - 10 Lacs

Bengaluru

Work from Office

At BCE Global Tech, immerse yourself in exciting projects that are shaping the future of both consumer and enterprise telecommunications. This involves building innovative mobile apps to enhance user experiences and enable seamless connectivity on the go. Thrive in diverse roles like Full Stack Developer, Backend Developer, UI/UX Designer, DevOps Engineer, Cloud Engineer, Data Science Engineer, and Scrum Master, at a workplace that encourages you to freely share your bold and different ideas. If you are passionate about technology and eager to make a difference, we want to hear from you! Apply now to join our dynamic team in Bengaluru.
We're seeking a dedicated Site Reliability Engineer to join our team. In this role, you will be responsible for maintaining the reliability, scalability, and performance of our systems. You'll implement best practices for monitoring, incident response, and automation to ensure seamless operations. Your expertise will help us build resilient infrastructure, reduce downtime, and enhance the overall user experience.
Key Responsibilities:
Experience working with various monitoring tools (e.g. ELK, Dynatrace, CloudWatch, Cloud Logging, Cloud Monitoring, BMC Surveyor, BMC Patrol, Grafana, Prometheus)
Ensure monitoring and self-healing strategies are implemented and maintained to proactively prevent production incidents
Perform root cause analysis of production issues
Design and manage on-call and escalation processes (nice to have)
Participate in design reviews and production reviews for new features, products, or pieces of infrastructure
Design and implement ELK (Elasticsearch, Logstash, and Kibana) stack, Prometheus, and Grafana solutions for monitoring and alerting
Debug production issues across services and levels of the stack
Establish KPIs to demonstrate maturity, efficiency, and value to our business partners
Work as an integral part of the DevOps team with complementary skills and common goals
L3 support experience is an asset
Work to create a release management process and help with out-of-business-hours deployments and support (in rotation with team members)
Familiar and comfortable with agile development techniques
Technology Skills (Mandatory):
ELK, Dynatrace, CloudWatch, Cloud Logging, Cloud Monitoring, BMC Surveyor, BMC Patrol, Grafana, Prometheus
Required Qualifications To Be Successful In This Role:
Bachelor's degree in computer science, engineering, or a related field
8-10 years of experience as an SRE
Proven experience as an SRE, DevOps engineer, or similar role
Strong programming skills in languages such as Python, Go, Java, or Ruby
Strong problem-solving skills and ability to work under pressure
Excellent communication and collaboration skills
Flexibility to work in the EST time zone (9-5 EST)
What We Offer:
Competitive salaries and comprehensive health benefits
Flexible work hours and remote work options
Professional development and training opportunities
A supportive and inclusive work environment

Posted 1 month ago


4.0 - 8.0 years

6 - 10 Lacs

Bengaluru

Work from Office

JD: DevOps Engineer
A DevOps Specialist should be able to take ownership of the entire DevOps process, including automated CI/CD pipelines and deployment to production. They should also be comfortable with risk analysis and prioritization. Leadership in managing a team and providing guidance on best practices is crucial. Strong communication skills are required to deal with clients, stakeholders, and cross-functional teams. Automation expertise is a key requirement, as automation is a growing focus in many organizations. Telecom domain experience, especially in Retail (One View), is a huge plus.
Skills Required:
CI/CD Pipeline Automation: Expertise in tools like Jenkins, Azure DevOps, or GitHub Actions to automate build, test, and deployment processes
IIS and .NET Deployment Knowledge: Strong understanding of IIS configuration, .NET application deployment, and tools like MSDeploy or PowerShell scripts for automating IIS setups
Scripting and Programming: Proficiency in scripting languages like PowerShell or Python for automating deployment tasks and managing configurations
Infrastructure as Code (IaC): Familiarity with tools like Terraform or Ansible to automate infrastructure provisioning and configuration
Monitoring and Troubleshooting: Skills in monitoring tools (e.g., Nagios, Prometheus) and log analysis to ensure smooth deployments and quick issue resolution
What We Offer:
Competitive salaries and comprehensive health benefits
Flexible work hours and remote work options
Professional development and training opportunities
A supportive and inclusive work environment

Posted 1 month ago


4.0 - 8.0 years

6 - 10 Lacs

Bengaluru

Work from Office

As a Platform Support Engineer (APIGEE), you will have a solid understanding of API management platforms, such as Apigee, and cloud infrastructure (GCP), and possess a deep knowledge of networking, authentication, and monitoring. This role involves troubleshooting and providing support for API integrations, working with both internal teams and external clients to resolve issues efficiently and maintain a smooth operational flow.
Key Responsibilities:
API Management Support: Troubleshoot, diagnose, and resolve issues related to API proxies and flows within the Apigee environment, including both Apigee X and Apigee Hybrid
API Transaction Debugging: Use debugging tools to analyze API transactions and identify where problems may exist in the flow between the API Gateway and backend services
Backend Integration Troubleshooting: Support tenant teams using Apigee, providing guidance on identifying and resolving issues with their APIs after backend upgrades or changes
Platform Monitoring: Monitor and interpret data from platforms such as ELK, Dynatrace, Datadog, New Relic, Grafana/Prometheus, and other monitoring tools to proactively detect and troubleshoot API issues
Cloud Infrastructure Management (GCP): Utilize GCP services, including Compute Engine, Load Balancers, IAM/Roles permissions, Stackdriver/Cloud Logging, and Kubernetes clusters (GKE), to manage and troubleshoot platform issues
Networking Troubleshooting: Assist in troubleshooting network-related issues, such as DNS, load balancers, and firewalls, and investigate HTTPS protocol and certificate management issues
Authentication Support: Address API authentication issues, including LDAP, JWT, API Key, OIDC, and OAuth2 authentication flows
Support Incident Management: Coordinate and troubleshoot complex support scenarios, including debugging pipeline errors, analyzing logs, and providing solutions to client-facing issues
Terraform & CI/CD Pipeline Management: Use Terraform for infrastructure as code and GitLab CI/CD pipelines to deploy and maintain infrastructure changes
Incident Recovery: Be able to identify and recover from issues such as network appliance crashes or deleted GSLB entries, and assist in the recovery of southbound network appliances via the GCP console
Support Engagement Expectations:
API Access Issues: Resolve issues when a tenant team cannot access their APIs after a backend upgrade, including analyzing transaction flows and identifying whether the issue lies with Apigee, the backend, or the client
GSLB Issues: Investigate and restore GSLB configurations when necessary, using Terraform pipelines to repair configurations
System Crashes: Analyze logs and troubleshoot error states in network appliance clusters, using GCP console tools for recovery
Pipeline Errors: Investigate and resolve errors in GitLab CI/CD pipelines, identifying issues with governance rules or pipeline status
Requirements:
API Management Knowledge: Strong understanding of API protocols (REST, SOAP, GraphQL, gRPC) and the role of API Gateways and proxies in API management
Apigee Expertise: 2+ years of experience with Apigee X or Apigee Hybrid, including troubleshooting of API proxy flows, policies, and transactions
Cloud Infrastructure (GCP): Basic understanding of GCP services, such as Compute Engine, Load Balancers, IAM/Roles permissions, Stackdriver, and Kubernetes (GKE)
Networking & Security: Familiarity with firewall management, DNS, Load Balancers (Global/Regional), HTTPS protocol, and certificate management
Authentication Systems: Knowledge of LDAP, JWT, API Key-based authentication, OIDC, and OAuth2 authentication flows
Monitoring Tools: Experience using data analytics and monitoring platforms like ELK, Dynatrace, Datadog, New Relic, Grafana/Prometheus, and interpreting the results
Linux & Automation: Experience working with Linux CLI, Terraform for infrastructure as code, and Python/bash scripting for automation tasks
CI/CD Pipelines: Familiarity with GitLab CI/CD-based pipelines for code deployment and troubleshooting pipeline issues
Troubleshooting: Strong troubleshooting and diagnostic skills to handle complex API system integrations and identify the root cause of issues
What We Offer:
Competitive salaries and comprehensive health benefits
Flexible work hours and remote work options
Professional development and training opportunities
A supportive and inclusive work environment

Posted 1 month ago


4.0 - 8.0 years

6 - 10 Lacs

Bengaluru

Work from Office

JD: DevOps Engineer
A DevOps Specialist should be able to take ownership of the entire DevOps process, including automated CI/CD pipelines and deployment to production. They should also be comfortable with risk analysis and prioritization. Leadership in managing a team and providing guidance on best practices is crucial. Strong communication skills are required to deal with clients, stakeholders, and cross-functional teams. Automation expertise is a key requirement, as automation is a growing focus in many organizations. Telecom domain experience, especially in Retail (One View), is a huge plus.
Skills Required:
CI/CD Pipeline Automation: Expertise in tools like Jenkins, Azure DevOps, or GitHub Actions to automate build, test, and deployment processes
IIS and .NET Deployment Knowledge: Strong understanding of IIS configuration, .NET application deployment, and tools like MSDeploy or PowerShell scripts for automating IIS setups
Scripting and Programming: Proficiency in scripting languages like PowerShell or Python for automating deployment tasks and managing configurations
Infrastructure as Code (IaC): Familiarity with tools like Terraform or Ansible to automate infrastructure provisioning and configuration
Monitoring and Troubleshooting: Skills in monitoring tools (e.g., Nagios, Prometheus) and log analysis to ensure smooth deployments and quick issue resolution
What We Offer:
Competitive salaries and comprehensive health benefits
Flexible work hours and remote work options
Professional development and training opportunities
A supportive and inclusive work environment

Posted 1 month ago


4.0 - 8.0 years

6 - 10 Lacs

Hyderabad

Work from Office

AI Opportunities with Soul AI's Expert Community!
Are you an MLOps Engineer ready to take your expertise to the next level? Soul AI (by Deccan AI) is building an elite network of AI professionals, connecting top-tier talent with cutting-edge projects.
Why Join:
Above market-standard compensation
Contract-based or freelance opportunities (2-12 months)
Work with industry leaders solving real AI challenges
Flexible work locations: Remote | Onsite | Hyderabad/Bangalore
Your Role:
Architect and optimize ML infrastructure with Kubeflow, MLflow, SageMaker Pipelines
Build CI/CD pipelines (GitHub Actions, Jenkins, GitLab CI/CD)
Automate ML workflows (feature engineering, retraining, deployment)
Scale ML models with Docker, Kubernetes, Airflow
Ensure model observability, security, and cost optimization in cloud (AWS/GCP/Azure)
Must-Have Skills:
Proficiency in Python, TensorFlow, PyTorch, CI/CD pipelines
Hands-on experience with cloud ML platforms (AWS SageMaker, GCP Vertex AI, Azure ML)
Expertise in monitoring tools (MLflow, Prometheus, Grafana)
Knowledge of distributed data processing (Spark, Kafka)
Bonus: Experience in A/B testing, canary deployments, serverless ML
Next Steps:
Register on Soul AI's website
Get shortlisted and complete screening rounds
Join our Expert Community and get matched with top AI projects
Don't just find a job. Build your future in AI with Soul AI!

Posted 1 month ago


8.0 - 13.0 years

10 - 15 Lacs

Bengaluru

Work from Office

About The Team: The Cloud Platform Engineering (CPE) group is responsible for developing and managing platforms that allow Myntra's tech products to be deployed and run at scale. The CPE team builds and maintains centralized and high-scale platforms for sophisticated application security frameworks, log collection, monitoring systems, access management, secret management, database access, change management systems, build, release and deployment. You will be part of the SRE team under the CPE division.
Position: Technical Lead - Site Reliability Engineering (SRE)
Location: Bengaluru
Employment Type: Full-time
Role Overview: As a Technical Lead in Site Reliability Engineering (SRE), you will be responsible for leading a team of talented engineers and overseeing the design, implementation, and maintenance of our ecommerce platform's infrastructure. You will collaborate closely with cross-functional teams, including software development, operations, and program management, to ensure the reliability, availability, and performance of our systems. Your expertise will be essential in proactively identifying and resolving operational issues, improving system performance, and driving automation initiatives.
Responsibilities: Hosting infrastructure and setting up the core platform forms the backbone of any system. As part of this team, you will be responsible for the following:
1. Lead and mentor a team of Site Reliability Engineers, providing technical guidance and support, and fostering a culture of continuous learning and development.
2. Collaborate with software development teams to ensure the seamless integration of new features and enhancements into the existing infrastructure.
3. Oversee the design, implementation, and maintenance of highly available and scalable systems, ensuring optimal performance and reliability.
4. Develop and implement monitoring and alerting systems to proactively identify and resolve operational issues, ensuring maximum uptime.
5. Conduct regular performance analysis and capacity planning to identify potential bottlenecks, optimize system performance, and plan for future growth.
6. Define and enforce best practices for incident management, change management, and problem resolution, ensuring adherence to SLAs.
7. Drive automation initiatives to streamline operational tasks, increase efficiency, and reduce manual intervention.
8. Collaborate with cross-functional teams to identify opportunities for system improvements, scalability enhancements, and cost optimizations.
9. Stay up to date with industry trends, emerging technologies, and best practices in Site Reliability Engineering, and look for opportunities to implement them in our infrastructure and operations.
10. Foster a culture of innovation, continuous improvement, and operational excellence within the team.
Requirements:
1. Bachelor's or master's degree in Computer Science, Engineering
2. Experience (8+ years) in a similar role as a Technical Lead or Senior Site Reliability Engineer
3. Strong knowledge of infrastructure design, cloud-based platforms (Azure, GCP, AWS), and containerization technologies (Docker, Kubernetes).
4. Expertise in designing and implementing highly available, scalable, and fault-tolerant systems.
5. Solid understanding of networking, distributed systems, and database technologies.
6. Proficiency in scripting (Python, Bash) and automation tools (Ansible, Terraform).
7. Experience with monitoring and logging tools (Prometheus, Grafana, logging (ELK/EFK) stack).
8. Strong problem-solving and troubleshooting skills, with the ability to diagnose and resolve complex system issues.
9. Excellent leadership and communication skills, with the ability to effectively collaborate with cross-functional teams.
10. Strong organizational and project management skills, with the ability to prioritize and manage multiple initiatives simultaneously.

Posted 1 month ago


3.0 - 6.0 years

5 - 8 Lacs

Bengaluru

Work from Office

About The Team: The Cloud Platform Engineering (CPE) group is responsible for developing and managing platforms that allow Myntra's tech products to be deployed and run at scale. The CPE team builds and maintains centralized and high-scale platforms for sophisticated application security frameworks, log collection, monitoring systems, access management, secret management, database access, change management systems, build, release and deployment. You will be part of the SRE team under the CPE division.
Position: M2 - Site Reliability Engineering (SRE)
Location: Bengaluru
Employment Type: Full-time
Role Overview: As an SRE at M2 level, you will play an important role in the team responsible for the availability, reliability, scalability and performance of Myntra's production site. As part of the role, you will work on the cloud platform, container platform and observability stack. This will also include developing automation tools, mainly in Bash and Python and occasionally in Golang.
Responsibilities: Hosting infrastructure and setting up the core platform forms the backbone of any system. As part of this team, you will be responsible for the following:
1. Collaborate with the lead and architect in the team to design, test and implement scalable and highly available solutions.
2. Collaborate with software development teams to ensure the adoption of the platforms and platform components for high visibility.
3. Participate in incident response as part of the team's on-call duties and provide solutions (short term and long term) along with RCAs for incidents.
4. Work closely within the team to proactively identify and rectify system issues and help prevent outages and incidents.
5. Develop and implement monitoring and alerting systems to proactively identify and resolve operational issues, ensuring maximum uptime.
6. Define and enforce best practices for incident management, change management, and problem resolution, ensuring adherence to SLAs.
7. Drive automation initiatives to streamline operational tasks, increase efficiency, and reduce manual intervention.
8. Collaborate with cross-functional teams to identify opportunities for system improvements, scalability enhancements, and cost optimizations.
9. Contribute to the creation and maintenance of documentation related to system architecture, configurations, and operational procedures, and actively participate in knowledge-sharing initiatives within the team.
10. Foster a culture of innovation, continuous improvement, and operational excellence within the team.
Requirements:
1. Bachelor's in Computer Science, Engineering or equivalent
2. Experience (3-6 years) in a similar role as a Technical Lead or Senior Site Reliability Engineer
3. Strong knowledge of infrastructure design, cloud-based platforms (Azure, GCP, AWS), and containerization technologies (Docker, Kubernetes).
4. Solid understanding of networking, distributed systems, and database technologies.
5. Proficiency in scripting (Python, Bash) and infra automation tools (Ansible, Terraform).
6. Good knowledge of security and its best practices, and experience implementing security controls in a production environment.
7. Experience with monitoring and logging tools (Prometheus, Grafana, logging (ELK/EFK) stack).
8. Strong problem-solving and troubleshooting skills, with the ability to diagnose and resolve complex system issues.
9. Excellent collaboration and communication skills.
10. Experience in handling large-scale distributed systems such as Elasticsearch

Posted 1 month ago


5.0 - 8.0 years

7 - 11 Lacs

Chennai

Work from Office

Overview: DevOps Engineer - OpenShift (OCP) Specialist
Job Summary: FSS is seeking a highly skilled DevOps Engineer with hands-on experience in Red Hat OpenShift Container Platform (OCP) and associated tools like Argo CD, Jenkins, and Data Grid. The ideal candidate will drive automation, manage containerized environments, and ensure smooth CI/CD pipelines across hybrid infrastructure to support our financial technology solutions.
Required Skills & Qualifications (also listed as essential and desired skills):
Technical Skills:
Strong hands-on experience with OpenShift (v4.x) administration and operations.
Proficiency in CI/CD tools: Jenkins, Argo CD, GitHub Actions, GitLab CI/CD.
Deep understanding of Kubernetes, Docker, and container orchestration.
Experience with Red Hat Data Grid or other in-memory data grids.
Skilled in IaC tools: Terraform, Ansible, CloudFormation.
Familiarity with monitoring and logging tools (Prometheus, Grafana, ELK, Splunk).
Proficient in scripting languages: Bash, Python, or Shell.
Soft Skills:
Excellent problem-solving and analytical skills.
Strong communication and collaboration abilities across cross-functional teams.
Candidates should be able to work independently.
Candidates should be able to provide solutions based on customer requirements and work with the customer's DevOps team during project implementation.
Key Responsibilities:
OpenShift Platform Engineering: Deploy, manage, and maintain applications on OpenShift Container Platform. Configure and manage Operators, Helm charts, and OpenShift GitOps (Argo CD). Manage Red Hat Data Grid deployments and integrations. Support OCP cluster upgrades, patching, and troubleshooting.
CI/CD Implementation & Automation: Design, implement, and manage CI/CD pipelines using Jenkins and Argo CD. Ensure seamless code integration, testing, and deployment processes with development teams.
Infrastructure as Code (IaC): Automate infrastructure provisioning with tools like Terraform and Ansible. Manage hybrid infrastructure across on-prem and public clouds (AWS, Azure, or GCP).
Monitoring & Performance Optimization: Implement and manage observability stacks (Prometheus, Grafana, ELK, etc.) for OCP and underlying services. Proactively identify and resolve system performance bottlenecks.
Security & Compliance: Enforce security best practices in containerized and cloud environments. Conduct vulnerability assessments and ensure compliance with industry standards.
Collaboration & Support: Collaborate with developers, QA, and IT teams to optimize DevOps workflows. Provide ongoing support and incident response for production and non-production environments.
Qualifications: BE, B.Tech, MCA or equivalent degree. Payment gateway, bank reconciliation, card.

Posted 1 month ago


3.0 - 10.0 years

22 - 26 Lacs

Hyderabad

Work from Office

Skillsoft is the global leader in eLearning, trusted by the world's leading organizations, including 65% of the Fortune 500. Our 100,000+ courses, videos and books are accessed over 100 million times every month, across more than 100 countries. At Skillsoft, we believe knowledge is the fuel for innovation and innovation is the fuel for business growth. Join us in our quest to democratize learning and help individuals unleash their edge.
Are you ready to shape the future of learning through cutting-edge AI? As a Principal AI/Machine Learning Engineer at Skillsoft, you'll dive into the heart of innovation, crafting intelligent systems that empower millions worldwide. From designing generative AI solutions to pioneering agentic workflows, you'll collaborate with multiple teams to transform knowledge into a catalyst for growth, unleashing your edge while helping others do the same. Join us in redefining eLearning for the world's leading organizations!
Responsibilities:
Hands-on AI/ML software engineer
Prompt engineering, agentic workflow development and testing
Work with product owners to understand requirements and guide new features
Collaborate to identify new feature impacts
Evaluate new AI/ML technology advancements and socialize findings
Research, prototype, and select appropriate COTS and develop in-house AI/ML technology
Consult with external partners to review and guide development and integration of AI technology
Collaborate with teams to design and guide AI development and enhancements
Document designs and implementation to ensure consistency and alignment with standards
Create documentation including system and sequence diagrams
Create appropriate data pipelines for AI/ML training and inference
Analyze, curate, cleanse, and preprocess data
Utilize and apply generative AI to increase productivity for yourself and the organization
Periodically explore new technologies and design patterns with proof-of-concept
Participate in developing best practices and improving operational processes
Present research and work to socialize and share knowledge across the organization
Contribute to patentable AI innovations
Environment, Tools & Technologies:
Agile/Scrum
Operating Systems: Mac, Linux
JavaScript, Node.js, Python
PyTorch, TensorFlow, Keras, OpenAI, Anthropic, and friends
LangChain, LangGraph, etc.
APIs: GraphQL, REST
Docker, Kubernetes
Amazon Web Services (AWS), MS Azure
SQL: Postgres RDS
NoSQL: Cassandra, Elasticsearch (VectorDb)
Messaging: Kafka, RabbitMQ, SQS
Monitoring: Prometheus, ELK
GitHub, IDE (your choice)
Skills & Qualifications (8+ years experience):
Experience with LLMs and fine-tuning models
Development experience including unit testing
Design and documentation experience of new APIs, data models, service interactions
Familiarity with and ability to explain: system and API security techniques; data privacy concerns; microservices architecture; vertical vs horizontal scaling; generative AI, NLP, DNN, auto-encoders, etc.
Attributes for Success:
Proactive, independent, adaptable
Collaborative team player
Customer service minded with an ownership mindset
Excellent analytic and communication skills
Ability and desire to coach and mentor other developers
Passionate, curious, open to new ideas, and able to research and learn new technologies

Posted 1 month ago


6.0 - 8.0 years

13 - 17 Lacs

Noida, Hyderabad, Chennai

Hybrid

Role & responsibilities:
Design, deploy, and maintain AWS infrastructure using infrastructure as code (IaC) with tools such as Terraform and CloudFormation
Build and deploy applications in a repeatable and automated way
Design and implement serverless architecture using AWS services such as Lambda, API Gateway, DynamoDB, S3, and others
Monitor, troubleshoot, and optimize performance of cloud-based applications using monitoring and analytics tools such as New Relic, Grafana and Prometheus
Collaborate with development teams to ensure the reliability, scalability, and security of our systems
Automate processes using CI/CD tools such as Azure DevOps, TeamCity or Jenkins
Implement security best practices and ensure compliance with regulatory requirements
Continuously improve our infrastructure and processes to meet evolving business needs and technology trends
Mandatory Skills:
6+ years of experience in a DevOps role, with a focus on AWS services and infrastructure as code
Experience with Terraform or other IaC tools such as CloudFormation or CDK
Strong understanding of serverless architectures, microservices, and containerization using Kubernetes or other container orchestration tools
Experience with monitoring and analytics tools such as Grafana, Prometheus, and New Relic
Familiarity with CI/CD tools such as Azure DevOps, Jenkins, GitLab, or CircleCI
Proficient in at least one scripting language (Bash, Python, JavaScript)
Proficiency with Linux administration/engineering
Deep understanding of cloud-scale and micro/macro-services architectures, with experience operating high-performance, highly scalable, and fault-tolerant multi-tenant SaaS-based applications
Strong problem-solving skills and the ability to troubleshoot issues in a complex environment
Excellent communication and collaboration skills to work effectively with cross-functional teams
A passion for continuous learning and keeping up with the latest technology trends in the DevOps and cloud computing space
Preferred candidate profile:
Looking for immediate joiners (minimum 15 days)
PF history mandatory for all companies

Posted 1 month ago


4.0 - 8.0 years

5 - 15 Lacs

Bengaluru

Work from Office

Azure Monitor, Application Insights, Log Analytics
Prometheus / Datadog / Dynatrace
Grafana, Power BI
Python, REST API
Required Skills:
Network Watcher, Databricks Logs, System tables, REST API
Bash, PowerShell

Posted 1 month ago


10.0 - 15.0 years

12 - 17 Lacs

Hyderabad

Work from Office

DevOps Manager - J49058
Job Summary: We are looking for an experienced DevOps Manager with 10+ years of experience to lead our DevOps initiatives across AWS and GCP platforms. The ideal candidate will have expertise in cloud migration, CDN deployment, infrastructure automation, and stakeholder reporting. This role requires managing a team of 6 mid-level DevOps engineers and ensuring high availability, security, and scalability of our cloud infrastructure.
Key Responsibilities:
Cloud & Infrastructure Management: Manage and optimize cloud infrastructure on AWS and GCP. Lead cloud migration projects from on-premise or other cloud environments. Deploy and manage CDN solutions for improved performance and scalability. Ensure cost optimization, high availability, and disaster recovery best practices.
Infrastructure Automation & CI/CD: Implement Infrastructure as Code (IaC) using Terraform, Ansible, or similar tools. Automate deployment pipelines using CI/CD tools (GitHub Actions, Jenkins, AWS DevOps, or Google Cloud Build). Drive DevOps best practices, including containerization (Docker, Kubernetes) and serverless architectures.
Monitoring, Security & Compliance: Set up logging, monitoring, and alerting using tools like Prometheus, Grafana, AWS Monitor, and GCP Stackdriver. Ensure security best practices, including identity management, access controls, and compliance with industry standards. Conduct periodic security audits and vulnerability assessments.
Stakeholder Communication & Reporting: Prepare and send detailed reports on system performance, uptime, cost, and incidents to all stakeholders. Work closely with engineering, product, and security teams to align DevOps strategies with business goals. Maintain documentation for infrastructure, processes, and best practices.
Team Leadership & Collaboration: Lead and mentor a team of 6 mid-level DevOps/SRE engineers. Conduct training and knowledge-sharing sessions to upskill the team. Establish KPIs and performance metrics to track team progress and efficiency.
Required Skills & Experience:
10+ years of DevOps experience, with at least 3 years in a leadership role.
Hands-on experience with AWS and GCP cloud platforms.
Expertise in cloud migration and CDN deployment (e.g., Cloudflare, Akamai, CloudFront, AWS CDN).
Strong knowledge of Infrastructure as Code (IaC) tools like Terraform, Ansible, or CloudFormation.
Experience with CI/CD pipelines (AWS DevOps, Jenkins, GitHub Actions, GCP Cloud Build).
Proficiency in Kubernetes, Docker, and container orchestration.
Strong monitoring and logging skills using AWS Monitor, GCP Stackdriver, Prometheus, Grafana, Splunk.
Excellent communication skills for stakeholder reporting and cross-functional collaboration.
Ability to lead and mentor a team, ensuring high efficiency and skill growth.
Nice to Have:
Experience with multi-cloud environments (AWS, GCP).
Knowledge of serverless architectures (AWS Functions, Google Cloud Functions).
Familiarity with FinOps for cloud cost management.
Location & Work Mode:
Location: Hyderabad/Bangalore
Work Mode: Office
Why Join Us:
Opportunity to work with cutting-edge cloud technologies and automation.
Lead a talented team and drive impactful cloud transformation projects.
Competitive salary, benefits, and career growth opportunities.
Required Candidate Profile:
Candidate experience should be: 10 to 15 years
Candidate degree should be: BA, BBA, BBA/BMS, BBI, BCA, BCom, BCS, BDES, BE-Comp/IT, BEd, BE-Other, BFA, BFM, BIS, BIT, BMS, BSc-Comp/IT, BSc-Other, BTech-Comp/IT, BTech-Other, CA, CS, DCA, DCS, DE-Comp/IT, DE-Other, Diploma, ICWA, LLB, MA, MBA, MBBS, MCA, MCM, MCom, MCS, ME-Comp/IT, ME-Other, MIS, MIT, MMS, MSc-Comp/IT, MS-Comp/IT, MSc-Other, MS-Other, MTech-Comp/IT

Posted 1 month ago

Apply

7.0 - 12.0 years

27 - 35 Lacs

Pune

Work from Office

Key Responsibilities: OpenShift Platform Engineering: Deploy, manage, and maintain applications on OpenShift Container Platform. Configure and manage Operators, Helm charts, and OpenShift GitOps (Argo CD). Manage Red Hat Data Grid deployments and integrations. Support OCP cluster upgrades, patching, and troubleshooting. CI/CD Implementation & Automation: Design, implement, and manage CI/CD pipelines using Jenkins and Argo CD. Ensure seamless code integration, testing, and deployment processes with development teams. Infrastructure as Code (IaC): Automate infrastructure provisioning with tools like Terraform and Ansible. Manage hybrid infrastructure across on-prem and public clouds (AWS, Azure, or GCP). Monitoring & Performance Optimization: Implement and manage observability stacks (Prometheus, Grafana, ELK, etc.) for OCP and underlying services. Proactively identify and resolve system performance bottlenecks. Security & Compliance: Enforce security best practices in containerized and cloud environments. Conduct vulnerability assessments and ensure compliance with industry standards. Collaboration & Support: Collaborate with developers, QA, and IT teams to optimize DevOps workflows. Provide ongoing support and incident response for production and non-production environments. Required Skills & Qualifications: Technical Skills: Strong hands-on experience with OpenShift (v4.x) administration and operations. Proficiency in CI/CD tools: Jenkins, Argo CD, GitHub Actions, GitLab CI/CD. Deep understanding of Kubernetes, Docker, and container orchestration. Experience with Red Hat Data Grid or other in-memory data grids. Skilled in IaC tools: Terraform, Ansible, CloudFormation. Familiarity with monitoring and logging tools (Prometheus, Grafana, ELK, Splunk). Proficient in scripting languages: Bash, Python, or Shell. Soft Skills: Excellent problem-solving and analytical skills. Strong communication and collaboration abilities across cross-functional teams. 
Candidates should be able to work independently, provide solutions based on customer requirements, and work with the customer's DevOps team during project implementation.

Posted 1 month ago

Apply

5.0 - 9.0 years

12 - 15 Lacs

Pune

Hybrid

We are hiring a "Sr. DevOps Engineer" for one of our product-based MNCs in Pune. Experience: 5-10 Years. Mode: Permanent. Work Mode: Hybrid. Mandatory Skills: Experience in monitoring and troubleshooting of infrastructure and applications. Experience with a cloud platform (AWS / Azure / Google). Scripting languages like Bash, Python, or Perl. Hands-on with CI/CD tools (Jenkins, GitHub / GitLab). Familiarity with Docker or a similar tool for containerization and Kubernetes or a similar tool for orchestration. Knowledge of or experience with deployment and configuration management tools like Ansible, Puppet, or similar. Knowledge of monitoring tools such as AppDynamics and Splunk. Basic understanding of security practices integrated into DevOps workflows. Relevant certifications like AWS Certified DevOps Engineer or Docker Certified Associate are beneficial. Ability to handle multiple tasks, prioritize, and work under pressure. Ability to learn and apply new skills and processes quickly.

Posted 1 month ago

Apply

6.0 - 8.0 years

40 - 50 Lacs

Mumbai, Pune

Hybrid

Congratulations, you have taken the first step towards bagging a career-defining role. Join the team of superheroes that safeguard data wherever it goes. What should you know about us? Seclore protects and controls digital assets to help enterprises prevent data theft and achieve compliance. Permissions and access to digital assets can be granularly assigned and revoked, or dynamically set at the enterprise-level, including when shared with external parties. Asset discovery and automated policy enforcement allow enterprises to adapt to changing security threats and regulatory requirements in real-time and at scale. Know more about us at www.seclore.com You would love our tribe: If you are a risk-taker, innovator, and fearless problem solver who loves solving challenges of data security, then this is the place for you! Role: Lead Product Engineer - Developer Productivity Experience: 6 - 8 Years Location: Mumbai/Pune A sneak peek into the role: We are seeking a highly motivated and experienced Lead, Developer Productivity & Platform Engineering to spearhead our efforts in building, scaling, and continuously improving our internal developer platform. In this critical role, you will be responsible for empowering our development teams with the tools, infrastructure, and processes necessary to achieve exceptional productivity, accelerate software delivery, and enhance their overall experience. You will drive the vision, strategy, and execution of our IDP initiatives, with a strong focus on measuring and improving developer effectiveness. Here's what you will get to explore: Leadership: This role blends the responsibilities of an individual contributor with the need to lead a team as the practice grows. While the primary focus is on individual contributions and expertise, the role also requires guiding, mentoring, and coordinating the work of others. Foster a collaborative, innovative, and results-oriented team culture. 
Define clear roles, responsibilities, and performance expectations for team members. Platform Vision, Strategy & Roadmap: Define and articulate a clear vision, strategy, and roadmap for our internal developer platform (IDP), aligning with overall engineering and business objectives. Identify and prioritize key features and improvements for the IDP based on developer needs and productivity goals. Stay abreast of industry trends and emerging technologies in platform engineering, developer experience, and IDPs (e.g., Backstage). Collaboration & Stakeholder Management: Work closely with application development teams, product managers, security teams, operations, and other stakeholders to understand their pain points, needs, and requirements for the IDP. Effectively communicate the value and progress of the IDP to both technical and non-technical audiences. IDP Design, Development & Maintenance: Lead the design, development, and maintenance of core components of our internal developer platform, emphasizing self-service capabilities, automation, standardization, and a seamless developer experience. Drive the adoption of Infrastructure as Code (IaC), Continuous Integration/Continuous Delivery (CI/CD), and robust observability practices within the platform. Ensure the IDP is scalable, reliable, secure, and cost-effective. Focus on Developer Productivity & Measurement: Define and track key metrics to measure the impact of the IDP on developer productivity (e.g., deployment frequency, lead time for changes, time to recovery, developer satisfaction). Implement mechanisms for collecting and analyzing data related to developer workflows and platform usage. Identify and implement solutions to streamline developer workflows, reduce toil, and accelerate application delivery based on data and feedback. Potentially lead initiatives to integrate and leverage tools like Backstage to enhance developer experience and provide a centralized platform. 
Tooling & Integration: Evaluate and integrate relevant tools and technologies into the IDP ecosystem, including CI/CD systems, monitoring tools, logging solutions, security scanners, and potentially IDP frameworks like Backstage. Ensure seamless integration between different platform components and existing development tools. We can see the next Entrepreneur At Seclore if you: 6+ years of relevant experience in software engineering, platform engineering, or DevOps roles, with increasing levels of responsibility. Proven experience leading and managing engineering teams, including hiring, mentoring, and performance management. Strong understanding of the software development lifecycle and common developer workflows. Deep technical expertise in cloud platforms (e.g., AWS, Azure, GCP) and cloud-native technologies (e.g., Kubernetes, Docker, serverless). Extensive experience with Infrastructure as Code (IaC) tools (e.g., Terraform, CloudFormation). Significant experience designing and implementing CI/CD pipelines using tools like Jenkins, GitLab CI, GitHub Actions, CircleCI, Argo CD, or Flux CD. Solid understanding of observability principles and hands-on experience with monitoring tools (e.g., Prometheus, Grafana, Datadog), logging solutions (e.g., ELK stack, Splunk), and distributed tracing (e.g., Jaeger, Zipkin). Strong understanding of security best practices for cloud environments and containerized applications, and experience with security scanning tools and secrets management. Experience in managing and configuring Code Quality tools like SonarQube Experience in managing and configuring Git tools like Gitlab Proficiency in at least one Programming language (e.g., Python, Go) for automation. Understanding of API design principles (REST, GraphQL) and experience with building and consuming APIs. Experience with data collection and analysis to identify trends and measure the impact of platform initiatives. 
Excellent communication, collaboration, and interpersonal skills, with the ability to influence and build consensus across teams. Strong problem-solving and analytical abilities. Experience working in an Agile development environment. Prior experience building and maintaining an Internal Developer Platform (IDP). Hands-on experience with IDP frameworks like Backstage, including setup, configuration, plugin development, and integration with other tools. Familiarity with developer productivity frameworks and methodologies. Experience with other programming languages commonly used by development teams (e.g., Java, Node.js, C++). Experience with service mesh technologies. Knowledge of cost management and optimization in the cloud. Experience in defining and tracking developer productivity metrics. Experience with data visualization tools (e.g., Grafana, Tableau). Why do we call Seclorites Entrepreneurs not Employees? We value and support those who take the initiative and calculate risks. We have an attitude of a problem solver and an aptitude that is tech agnostic. You get to work with the smartest minds in the business. We are thriving not living. At Seclore, it is not just about work but about creating outstanding employee experiences. Our supportive and open culture enables our team to thrive. Excited to be the next Entrepreneur, apply today! Don't have some of the above points in your resume at the moment? Don't worry. We will help you build it. Let's build the future of data security at Seclore together.

Posted 1 month ago

Apply

9.0 - 12.0 years

20 Lacs

Hyderabad, Pune, Bengaluru

Work from Office

We are looking for a "Board Architect" with a minimum of 9 years of experience. Contact: Atchaya (95001 64554). Required Candidate profile: 7-8 years of BOARD experience with architecture design; 3-4 years of system administration / performance management; experience with performance monitoring tools (e.g., AppDynamics, New Relic, Prometheus, Grafana).

Posted 1 month ago

Apply

13.0 - 18.0 years

35 - 55 Lacs

Bengaluru

Hybrid

SRE Manager. About Ushur | Ushur XOS | Ushur GenAI. Location: Bangalore. Work Mode: Hybrid. Experience: 12 to 18 Years. The Role: Our fast-growing team is seeking a Manager of SRE to join us as we pioneer Customer Experience Automation™ as an industry category. As the Manager of SRE you will be responsible for two important charters: operating and managing Ushur's production cloud, and building a white-glove customer support and incident management function. The ideal candidate for this role will be passionate about building a healthy high-performing team, and bring strong technical leadership, a customer-centric focus, and results-oriented action. You will begin as a player/coach while building and continuously improving execution, processes, tools/technology and analytics. Responsibilities: Build and manage a world-class SRE team. Design a 24x7 follow-the-sun organization including seamless handover across regions. Mentor and grow a team focused on delivering white-glove support and incident management service. Drive data-driven SRE strategy by defining and prioritizing SRE Objectives and Key Results (OKRs) aligned with the company mission. This includes setting measurable targets for key service level agreements. Manage the Enterprise Support function to deliver exceptional white-glove experiences at scale in close partnership with our Customer Success, Solution Consulting and Engineering teams. Ensure that the Ushur platform runs reliably in production. Partner with the DevOps, Security and Engineering teams to automate deployment, monitoring and observability of the production cloud. Bring deep technical expertise in Ushur Customer Experience Automation. Provide customers with ongoing technical support and incident management for complex issues and support escalations. 
Optimize and automate support processes, including improving the reliability of on-call processes, managing incidents, updating runbooks and documentation, reviewing RCAs and recommending solutions to prevent the recurrence and reduce the severity of incidents. Work cross-functionally to drive positive customer outcomes: engage with Product, Sales, Customer Success, Solution Consulting, Security, and Engineering as necessary to make customers successful on our platform. Qualifications: 5+ years of experience in an SRE/CloudOps Manager/Lead role in Enterprise SaaS. Track record of developing and mentoring great talent, and of building and motivating high-achieving teams. Ability to lead diverse teams across multiple time zones. Business acumen: ability to quickly grasp and adapt to a variety of customer verticals, geographies, and business structures. Excellent verbal, written, and presentation skills, with the ability to absorb complex technical concepts and communicate them to a non-technical audience. Highly organized, collaborative and detail-oriented. Deep experience with AWS cloud services, REST APIs, and Linux. Experience with DevOps processes and build, deployment, and orchestration technologies. Passion for technology and for being a part of a fast-growing SaaS startup where we move quickly and wear many hats. Flexible approach, able to operate effectively with uncertainty and change. Driven, self-motivated, and enthusiastic, with a can-do attitude. Benefits: Great Company Culture. We pride ourselves on having a values-based culture that is welcoming, intentional, and respectful. Bring your whole self to work. We are focused on building a diverse culture, with innovative ideas where you and your ideas are valued. We are a start-up and know that every person has a significant impact! Rest and Relaxation. 20 days of flexible leaves per year, Monthly Wellness Day (aka a day off to care for yourself) and more! Health Benefits. 
Preventive health checkups, Medical Insurance covering the dependents, wellness sessions, and health talks at the office Keep learning. One of our core values is Growth Mindset - we believe in lifelong learning. Certification courses are reimbursed. Ushur Community offers wide resources for our employees to learn and grow. Flexible Work. In-office or hybrid working model, depending on position and location. We seek to create an environment for all our employees where they can thrive in both their profession and personal life. Why join us? We are passionate about Ushur, our product, and helping our employees grow and develop in their career in a caring, collaborative environment. We offer a very competitive compensation plan & stock options for the ideal candidates.

Posted 1 month ago

Apply

3.0 - 6.0 years

10 - 14 Lacs

Mumbai

Work from Office

About this role: The Aladdin Studio team is focused on developing a world-class digital experience which will help developers of all types build faster and more effectively on Aladdin. We are evolving the Studio Developer platform as an integrated digital application where you can discover data, build your own financial apps, and access industry-leading content, documentation and insight, with all of this delivered via a design-forward and client-centric experience. As a member of the Studio Developer Operations team, you will interact, engage and solve problems for some of the most technically sophisticated users of Aladdin. Our team is also responsible for delivering the monitoring, logging, alerting and observability framework of Studio Developer to ensure our product is scalable and resilient as we enter a period of significant growth. Role Description: 3-5 years of hands-on experience working as part of Platform Operations, Site Reliability Engineering, DevOps or related engineering teams. Building your skills as a domain expert on the functionality and capabilities of the platform. Triaging and timely resolution of client inquiries. Enable user best practice execution on the platform, including training and adoption of new platform features. Understanding and acting on platform telemetry alerts, including invocation of our Incident Management response plays. Look for opportunities to automate our workflows to improve our team's effectiveness and efficiency. Reporting and metrics generation on platform reliability as well as user inquiry trends. Contribute to building out our observability framework to enhance our platform. Desirable Skills: Experience building, managing and supporting large-scale platforms. Understanding of the K8s Operator Pattern, with the comfort and courage to wade into (predominantly Golang-based) operator implementation code bases. Hands-on experience deploying log management and observability platform tooling: Splunk / Prometheus / Grafana, 
AlertManager. Strong attention to detail and focus on high-quality delivery. Comfortable reading and writing Python code. Comfortable working with clients and partners at all levels of the business. Our Benefits: To help you stay energized, engaged and inspired, we offer a wide range of benefits including a strong retirement plan, tuition reimbursement, comprehensive healthcare, support for working parents and Flexible Time Off (FTO) so you can relax, recharge and be there for the people you care about. Our hybrid work model: BlackRock's hybrid work model is designed to enable a culture of collaboration and apprenticeship that enriches the experience of our employees, while supporting flexibility for all. Employees are currently required to work at least 4 days in the office per week, with the flexibility to work from home 1 day a week. Some business groups may require more time in the office due to their roles and responsibilities. We remain focused on increasing the impactful moments that arise when we work together in person, aligned with our commitment to performance and innovation. As a new joiner, you can count on this hybrid model to accelerate your learning and onboarding experience here at BlackRock. About BlackRock: At BlackRock, we are all connected by one mission: to help more and more people experience financial well-being. Our clients, and the people they serve, are saving for retirement, paying for their children's educations, buying homes and starting businesses. Their investments also help to strengthen the global economy: support businesses small and large; finance infrastructure projects that connect and power cities; and facilitate innovations that drive progress. This mission would not be possible without our smartest investment: the one we make in our employees.

Posted 1 month ago

Apply

5.0 - 7.0 years

7 - 11 Lacs

Jaipur, Bengaluru

Work from Office

In Time Tec is an award-winning IT & software company. In Time Tec offers progressive software development services, enabling its clients to keep their brightest and most valuable talent focused on innovation. In Time Tec has a leadership team averaging 15 years in software/firmware R&D, and 20 years building onshore/offshore R&D teams. We are looking for rare talent to join us. People having a positive mindset and great organizational skills will be drawn to the position. Your capacity to take initiative and solve problems as they emerge, flexibility, and honesty will be key factors for your success at In Time Tec. We’re looking for an Interactive Backend Engineer – Python & DevOps who will be responsible for managing the release pipeline. This person will be involved not just in scripting but also in development, and will directly support the development and content teams that create and publish content on the most trafficked websites. The ideal candidate is someone who has worked in a build/release role previously, has strong communication skills, and knows how to handle unexpected scenarios. Roles and Responsibilities: Backend Engineer – Python & DevOps. Skills: Strong programming experience in Python (not just scripting, but real development). Experience with CI/CD tools like Jenkins. Proficient in Git and source control workflows. Experience with Docker, Kubernetes, and Linux environments. Familiarity with scripting languages like Bash, optionally Groovy or Go. Knowledge of web application servers and deployment processes. Good understanding of DevOps principles, cloud environments, and automation. Nice to Have: Experience with monitoring/logging tools (e.g., Prometheus, Grafana, ELK stack). Exposure to configuration management tools like Ansible. Experience in performance tuning and scaling backend systems.

Posted 1 month ago

Apply

5.0 - 9.0 years

9 - 13 Lacs

Bengaluru

Work from Office

Job Title: Software Engineer (Contractual) Location: Bengaluru (Work From Office) Experience: 5 – 6 Years Salary: 9 LPA – 13 LPA Employment Type: Contractual (1 Year + Extendable) Client: MicroGenesis On Payroll of: Nyxtech Hiring Contact: Yash Sharma (LinkedIn: linkedin.com/in/yashsharma1608) Notice Period: Immediate to 15 Days Job Description: We are looking for a skilled Software Engineer with expertise in GitLab administration, AWS infrastructure management with Terraform, containerization technologies, and monitoring tools to join our client MicroGenesis on a contractual basis. The role requires hands-on experience with cloud infrastructure, DevOps tools, and Linux environments. Responsibilities: Manage and administer GitLab for CI/CD pipelines and repository management. Design and implement AWS infrastructure using Terraform. Manage Linux-based environments and troubleshoot system issues. Containerize applications using Docker and orchestrate using Kubernetes. Monitor system health and application performance with Prometheus, Grafana, and CloudWatch. Collaborate with development and operations teams to streamline deployment and monitoring processes. Required Skills & Experience: 5 to 6 years of relevant experience in software engineering or DevOps roles. Strong experience with GitLab administration. Proficient with AWS and infrastructure-as-code using Terraform. Solid Linux knowledge and troubleshooting skills. Hands-on experience with Docker and Kubernetes. Familiarity with monitoring tools such as Prometheus, Grafana, and CloudWatch. Ability to work onsite in Bengaluru. Immediate to 15 days notice period preferred. Contract Details: Initial contract for 1 year, extendable based on performance and business needs. Candidate will be on payroll of Nyxtech, working with client MicroGenesis.

Posted 1 month ago

Apply

5.0 - 6.0 years

15 - 16 Lacs

Chennai

Work from Office

Job Description: We are looking for a highly skilled DevOps Engineer with strong experience in Red Hat OpenShift Container Platform (v4.x) and related DevOps tools like Argo CD, Jenkins, and Red Hat Data Grid. The ideal candidate will be responsible for automation, managing containerized environments, and ensuring robust CI/CD pipelines across hybrid cloud infrastructure supporting our fintech solutions. Key Responsibilities: OpenShift Platform Engineering: Deploy, manage, and maintain apps on OpenShift v4.x. Manage Operators, Helm charts, and OpenShift GitOps (Argo CD). Handle Red Hat Data Grid deployments. Perform OCP upgrades, patching, and troubleshooting. CI/CD & Automation: Implement CI/CD pipelines using Jenkins, Argo CD, GitHub Actions. Ensure seamless code integration and automated deployment. Infrastructure as Code (IaC): Automate infrastructure using Terraform, Ansible, CloudFormation. Manage infrastructure on AWS, Azure, or GCP. Monitoring & Optimization: Set up observability stacks (Prometheus, Grafana, ELK, Splunk). Troubleshoot and optimize system performance. Security & Collaboration: Apply DevSecOps best practices and ensure compliance. Collaborate with development and DevOps teams for solution implementation. Desired Candidate Profile: Technical Skills: Red Hat OpenShift (v4.x) administration & operations. CI/CD tools: Jenkins, Argo CD, GitHub Actions, GitLab CI/CD. Kubernetes, Docker, Helm, GitOps. Red Hat Data Grid or other in-memory data grids. IaC tools: Terraform, Ansible, CloudFormation. Monitoring tools: Prometheus, Grafana, ELK, Splunk. Scripting: Bash, Python, or Shell. Soft Skills: Excellent analytical and problem-solving skills. Strong communication and collaboration abilities. Ability to work independently and with customer DevOps teams. Education: BE / B.Tech / MCA or equivalent in Computer Science or related fields. Work Location: Chennai

Posted 1 month ago

Apply

4.0 - 7.0 years

9 - 13 Lacs

Pune

Hybrid

So, what’s the role all about? We are seeking a skilled and experienced DevOps Engineer to design, produce, and test high-quality software that meets specified functional and non-functional requirements within the time and resource constraints given. How will you make an impact? Design, implement, and maintain CI/CD pipelines using Jenkins to support automated builds, testing, and deployments. Manage and optimize AWS infrastructure for scalability, reliability, and cost-effectiveness. Develop automation scripts and tools using shell scripting and other programming languages to streamline operational workflows. Collaborate with cross-functional teams (Development, QA, Operations) to ensure seamless software delivery and deployment. Monitor and troubleshoot infrastructure, build failures, and deployment issues to ensure high availability and performance. Implement and maintain robust configuration management practices and infrastructure-as-code principles. Document processes, systems, and configurations to ensure knowledge sharing and maintain operational consistency. Perform ongoing maintenance and upgrades (production and non-production). Occasional weekend or after-hours work as needed. Have you got what it takes? Experience: 4-7 years in DevOps or a similar role. Cloud Expertise: Proficient in AWS services such as EC2, S3, RDS, Lambda, IAM, CloudFormation, or similar. CI/CD Tools: Hands-on experience with Jenkins pipelines (declarative and scripted). Scripting Skills: Proficiency in either shell scripting or PowerShell. Programming Knowledge: Familiarity with at least one programming language (e.g., Python, Java, or Go). IMP: Scripting/Programming is integral to this role and will be a key focus in the interview process. Version Control: Experience with Git and Git-based workflows. Monitoring Tools: Familiarity with tools like CloudWatch, Prometheus, or similar. Problem-solving: Strong analytical and troubleshooting skills in a fast-paced environment. 
CDK Knowledge in AWS DevOps. You will have an advantage if you also have: Prior experience in Development or Automation is a significant advantage. Windows system administration is a significant advantage. Experience with monitoring and log analysis tools is an advantage. Jenkins pipeline knowledge What’s in it for you? Join an ever-growing, market disrupting, global company where the teams – comprised of the best of the best – work in a fast-paced, collaborative, and creative environment! As the market leader, every day at NICE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate, innovative, and excited to constantly raise the bar, you may just be our next NICEr! Enjoy NICE-FLEX! At NICE, we work according to the NICE-FLEX hybrid model, which enables maximum flexibility: 2 days working from the office and 3 days of remote work, each week. Naturally, office days focus on face-to-face meetings, where teamwork and collaborative thinking generate innovation, new ideas, and a vibrant, interactive atmosphere. Requisition ID: 6119 Reporting into: Tech Manager Role Type: Individual Contributor

Posted 1 month ago

Apply

2.0 - 3.0 years

4 - 5 Lacs

Rajkot

Work from Office

Technical Requirements: Excellent understanding of Linux commands. Thorough knowledge of CI/CD pipelines, automation, and debugging, particularly with Jenkins. Intermediate to advanced understanding of Docker and container orchestration platforms. Hands-on experience with web servers (Apache, Nginx), database servers (MongoDB, MySQL, PostgreSQL), and application servers (PHP, Node.js). Knowledge of proxies and reverse proxies is required. Good understanding and hands-on experience with site reliability tools such as Prometheus, Grafana, New Relic, Datadog, and Splunk. (Hands-on experience with at least one tool is highly desirable.) Ability to identify and fix security vulnerabilities at the OS, database, and application levels. Knowledge of cloud platforms, specifically AWS and DigitalOcean, and their commonly used services. Other Requirements: Good communication skills. Out-of-the-box problem-solving capabilities, especially in the context of technology automation and application architecture reviews. Hands-on experience with GKE, AKS, EKS, or ECS is a plus. Excellent understanding of how to craft effective AI prompts to solve specific issues.

Posted 1 month ago

Apply

5.0 - 7.0 years

15 - 27 Lacs

Bangalore Rural, Bengaluru

Work from Office

DevOps, Site Reliability Engineering, Cloud platforms, GCP, Infrastructure as Code tools (Terraform, Ansible, CloudFormation), Prometheus, Grafana, ELK stack, Python, Bash, Go, Istio, Linkerd

Posted 1 month ago

Apply

Featured Companies