7.0 - 12.0 years
20 - 35 Lacs
Chennai
Hybrid
Key Skills: Cluster, Performance Optimization, Python, Shell.

Roles and Responsibilities: Design, implement, and support high-performance computing (HPC) clusters. Demonstrate solid knowledge of HPC systems, including CPU/GPU architecture, scalable and robust storage solutions, high-bandwidth interconnects, and cloud-based computing architectures. Generate hardware bills of materials (BOMs) for HPC clusters, manage vendor relationships, and oversee hardware release activities. Use strong Linux OS skills to configure appropriate operating systems for HPC systems. Understand and consolidate project specifications and performance requirements at both subsystem and system levels. Adhere to project timelines and ensure timely completion of program deliverables. Support the design and release of new products to manufacturing and customers, delivering quality golden images, procedures, scripts, and documentation to the manufacturing and support teams.

Experience Requirement: 7-14 years of experience with Kubernetes, Prometheus, and Grafana for container orchestration and monitoring. In-depth, distribution-agnostic knowledge of Linux systems (SUSE, Red Hat, Rocky, Ubuntu). Experience in designing and maintaining robust storage systems. Strong knowledge of HPC hardware, including servers, GPUs, networking, storage, BIOS, and BMC. Experience with systemd, network boot/PXE, and Linux High Availability (HA). Strong understanding of TCP/IP fundamentals and protocols such as DNS, DHCP, HTTP, LDAP, and SMTP. Proficiency in Shell and Python scripting. Experience with configuration management tools such as Salt, Chef, or Puppet. Strong DevOps orientation, including experience with continuous integration tools like Jenkins and Git-based repositories. Familiarity with container technologies such as Singularity and Docker.

Education: B.Tech/M.Tech (Dual), B.E., B.Tech.
Posted 1 week ago
2.0 - 6.0 years
2 - 6 Lacs
Hyderabad / Secunderabad, Telangana, India
On-site
Job Description:
Number of resources required: 2
Resource type (Dev / Test): SRE Cloud Engineer

Skill Set: Experience working on Linux-based infrastructure. Must have hands-on experience with container orchestration tools such as Docker, Kubernetes, and Helm. Understanding of continuous integration and deployment concepts (parallelized build pipelines, automated testing). Hands-on experience with scripting languages such as Python and Shell. Strong debugging and troubleshooting skills. Experience with monitoring and logging solutions such as Prometheus and Grafana. Familiarity with continuous integration and deployment tools such as GitLab, Jenkins, and other CI/CD tools. Experience with modern cloud development practices (microservices architectures, REST interfaces, etc.). Working experience deploying and supporting highly scalable cloud-based applications and architectures without downtime. Experience with support and on-call responsibilities. Ability to dive deep into the software stack to troubleshoot as needed. Proactive approach to identifying problems, performance bottlenecks, and areas for improvement. Must have good communication skills.

Nice to have: Experience with Oracle Cloud Infrastructure is preferred. Experience with Kafka and databases would be an added advantage. Experience in monitoring and troubleshooting cloud-based production environments. Experience with Spring Boot and Java.

Detailed Job Description: We are looking for an SRE/Cloud Engineer who uses automation tools to monitor systems, respond to system alerts, and improve the reliability, performance, and availability of the organization's software systems. The engineer ensures that software applications run smoothly and without errors after deployment.
Posted 1 week ago