8 - 12 years
25 - 30 Lacs
Posted:19 hours ago|
Platform:
Work from Office
Full Time
We thrive in a fast-paced, experimentation-rich environment where new technologies arent just welcome theyre expected. Here, you'll work side-by-side with seasoned engineers, architects, and thinkers to craft the kind of iconic products that can reshape industries and unlock entirely new models of operation for the enterprise.
If you're energized by the challenge of solving hard problems, love working at the edge of what's possible, and want to help shape the future of AI infrastructure we'd love to meet you.
-The performance and efficiency of AI workloads on the node.
-The reliability and availability of AI systems for Ciscos customers.
-Advancements in AI and machine learning infrastructure, enabling better utilization and improving efficiency for applications across industries.
-Collaboration across internal teams to bring system level innovation across different cisco products.
Your contributions will help Cisco maintain its leadership in AI infrastructure development and influence the broader AI and machine learning community.
-Design and develop node-level infrastructure components to support high-performance AI workloads.
-Benchmark, analyze, and optimize the performance of AI infrastructure, including CUDA kernels and memory management for GPUs.
-Minimize downtimethrough seamless configandupgrade architecture for software components.
-Manage the installation and deployment of AI infrastructure on Kubernetes clusters, including the use of CRDs and operators.
-Develop and deploy efficient telemetry collection systems for nodes and hardware components without impacting workload performance.
-Work with distributed system fundamentals to ensure scalability, resilience, and reliability.
-Collaborate across teams and time zones to shape the overall direction of AI infrastructure development and achieve shared goals.
-Proficiency in programming languages such as C/C++, Golang, Python, or eBPF.
-Strong understanding of Linux operating systems, including user space and kernel-level components.
-Experience with Linux user space development, including packaging, logging, telemetry and lifecycle management of processes.
-Strong understanding of Kubernetes (K8s) and related technologies, such as custom resource definitions (CRDs).
-Strong debugging and problem-solving skills for complex system-level issues.
-Bachelors degree+ and relevant 8-12 years of Engineering work experience.
-Linux kernel and device driver hands-on expertise is a plus.
-Experience in GPU programming and optimization, including CUDA, UCX is a plus.
-Experience with high-speed data transfer technologies such as RDMA.
-Use of Nvidia GPU operators and Nvidia container toolkit and Nsight, CUPTI.
-Nvidia MIG and MPS concepts for managing GPU consumption.
Cisco
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Practice Python coding challenges to boost your skills
Start Practicing Python Now25.0 - 30.0 Lacs P.A.
25.0 - 30.0 Lacs P.A.
Salary: Not disclosed
10.0 - 14.0 Lacs P.A.
bengaluru
10.0 - 14.0 Lacs P.A.
pune
8.0 - 12.0 Lacs P.A.
bengaluru
7.0 - 17.0 Lacs P.A.
bengaluru
7.0 - 17.0 Lacs P.A.
bengaluru
3.0 - 7.0 Lacs P.A.
17.0 - 27.5 Lacs P.A.