Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
10.0 - 14.0 years
0 Lacs
karnataka
On-site
You are joining an innovation team with a mission to revolutionize how enterprises utilize AI. Operating with the agility of a startup and the focus of an incubator, we are assembling a close-knit group of AI and infrastructure experts fueled by bold ideas and a common objective: to reimagine systems from the ground up and deliver groundbreaking solutions that redefine what's achievable - faster, leaner, and smarter. In our fast-paced, experimentation-rich environment, where new technologies are not just embraced but expected, you will collaborate closely with seasoned engineers, architects, and visionaries to develop iconic products capable of reshaping industries and introducing entirely new operational models for enterprises. If you are invigorated by the prospect of tackling challenging problems, enjoy pushing the boundaries of what is possible, and are eager to contribute to shaping the future of AI infrastructure, we are excited to connect with you. As Cisco seeks a forward-thinking Architect for AI Infrastructure Software, you will play a pivotal role in spearheading the development of the next-generation AI infrastructure platform. This strategic leadership position at the intersection of software engineering and AI systems will require you to define the vision, architecture, and execution of high-performance software that directly influences how enterprises deploy, scale, and optimize AI workloads. Your responsibilities will include mentoring a high-caliber team, delivering robust control and data plane solutions, and operating them as a SaaS service with a relentless focus on uptime, quality, and customer success. Additionally, you will guide strategic decisions on resource usages in generative AI systems and collaborate across functions to align product direction with infrastructure capabilities. Key Responsibilities: - Architect and develop a SaaS control plane emphasizing ease of use, scalability, and reliability. - Design data models to drive APIs, ensuring best practices for usability and operations. - Utilize Kubernetes (K8s) to build scalable, resilient, and high-availability (HA) architectures. - Demonstrate a profound understanding of Nvidia and AMD metric collection and AI-driven analysis. - Plan and coordinate engineering work, map tasks to releases, conduct code reviews, and address technical challenges to facilitate releases. - Generate architecture specifications and develop proof-of-concept (POC) solutions for clarity as necessary. - Collaborate with product management to comprehend customer requirements and build architecturally sound solutions, working closely with engineers on implementation to ensure alignment with architectural requirements. - Manage technical debt with a strong emphasis on upholding product quality. - Integrate AI tools into everyday engineering practices, including code reviews, early bug detection, and test coverage automation. Required Skills: - Deep expertise in Golang, Python, C++, eBPF. - Proficiency in Kubernetes (K8s), Helm, Kubebuilder, K8S Operator pattern. - Hands-on experience with CI/CD pipelines and their impact on release quality. - Demonstrated experience in building and running SaaS services. - Strong design skills in distributed systems and large-scale data collection. - Familiarity with SLA/SLO principles and managing application scalability. - Practical experience with the NVIDIA stack and CUDA development. Minimum Qualifications: - Demonstrable experience in Golang development. - Leading CI/CD tools and API-first design practices. - Operations of Kubernetes for running SaaS services. - AI tools and generative AI applications for engineering. - Comprehensive understanding of software release processes, including the use of feature flags to ensure predictability. - Proficiency in utilizing agents during coding, review, CI, and CD processes. - Bachelor's degree or equivalent with 10+ years of engineering experience. Preferred Qualifications: - Proven leadership experience in building and guiding SaaS software teams in high-growth, dynamic environments. - Master's degree or equivalent. #WeAreCisco #WeAreCisco where every individual brings their unique skills and perspectives together to pursue our purpose of powering an inclusive future for all. Our passion is connection - we celebrate our employees" diverse set of backgrounds and focus on unlocking potential. Cisconians often experience one company, many careers where learning and development are encouraged and supported at every stage. Our technology, tools, and culture pioneered hybrid work trends, allowing all to not only give their best but be their best. We understand our outstanding opportunity to bring communities together, and at the heart of that is our people. One-third of Cisconians collaborate in our 30 employee resource organizations, called Inclusive Communities, to connect, foster belonging, learn to be informed allies, and make a difference. Dedicated paid time off to volunteer - 80 hours each year - allows us to give back to causes we are passionate about, and nearly 86% do! Our purpose, driven by our people, is what makes us the worldwide leader in technology that powers the internet. Helping our customers reimagine their applications, secure their enterprise, transform their infrastructure, and meet their sustainability goals is what we do best. We ensure that every step we take is a step towards a more inclusive future for all. Take your next step and be you, with us!,
Posted 5 days ago
3.0 - 5.0 years
2 - 5 Lacs
Bengaluru / Bangalore, Karnataka, India
On-site
The Exaleap Performance Engineering team drives performance optimization and efficiency improvements by working as a trusted expert to teams across the entire engineering organization. We are looking to add a talented, Linux system performance engineer to work on industry-leading performance observability and analysis tools. The team has pioneered and built extensive profiling and eBPF-based tracing tools/Node Exporter tools/GPU Profiling tools and utilities. You will work on building cutting-edge observability, visualization, and analytics tooling to help us stay at the forefront of our domain. What you will do: Build, enhance, and operate performance observability, data analysis and visualization, and benchmarking tools. Learn, evaluate, and integrate new tools and technologies Engage with performance engineers and end engineering users to understand their needs and improve their experience Maintain strong relationships with cross-functional teams through clear communication. Must-Have Skills: 2-5 years of professional Software Engineering experience Demonstrated proficiency in GO language Excellent communication skills and ability to work in a team environment. Good Knowledge Linux performance tools like eBPFBCC tools Nice-to-Have Skills: Good understanding of systems (server) and performance engineering concepts. Knowledge of statistical analysis and data visualization techniques. Exposure to observability tools for machine learning (ML) models(Nvidia -SMI HTApytorch profiler/ DCGM Exporter. Mandatory Key Skills data visualization, performance engineering, machine learning, performance optimization, GO language*,Linux*, eBPF*
Posted 1 month ago
3.0 - 5.0 years
16 - 18 Lacs
Bengaluru
Work from Office
The Role The Exaleap Performance Engineering team drives performance optimization and efficiency improvements by working as a trusted expert to teams across the entire engineering organization. We are looking to add a talented, Linux system performance engineer to work on industry-leading performance observability and analysis tools. The team has pioneered and built extensive profiling and eBPF-based tracing tools/Node Exporter tools/GPU Profiling tools and utilities. You will work on building cutting-edge observability, visualization, and analytics tooling to help us stay at the forefront of our domain. What you will do: Build, enhance, and operate performance observability, data analysis and visualization, and benchmarking tools. Learn, evaluate, and integrate new tools and technologies Engage with performance engineers and end engineering users to understand their needs and improve their experience Maintain strong relationships with cross-functional teams through clear communication. Must-Have Skills: 2-5 years of professional Software Engineering experience Demonstrated proficiency in GO language Excellent communication skills and ability to work in a team environment. Good Knowledge Linux performance tools like eBPFBCC tools Nice-to-Have Skills: Good understanding of systems (server) and performance engineering concepts. Knowledge of statistical analysis and data visualization techniques. Exposure to observability tools for machine learning (ML) models(Nvidia -SMI HTApytorch profiler/ DCGM Exporter. Mandatory Key Skills data visualization,performance engineering,machine learning,performance optimization,GO language*,Linux*,eBPF*
Posted 1 month ago
5 - 9 years
7 - 11 Lacs
Bengaluru
Work from Office
About the Job: The Red Hat Performance and Scale Engineering org is looking for a Senior Software Engineer to join us in the OpenShift Virtualization (OCPv) Performance and Scale team. Red Hat OpenShift Virtualization, an included feature of Red Hat OpenShift, provides a modern platform for organizations to run and deploy their new and existing virtual machine (VM) workloads. The solution allows for easy migration and management of traditional virtual machines onto a trusted, consistent, and comprehensive hybrid cloud application platform. As a senior member of the team, you will be responsible for providing comprehensive storage performance and scalability assessments of Red Hat OpenShift Virtualization (OCPv). Our goal is to make OCPv the platform of choice for Red Hat's enterprise customers for leveraging virtualization technologies. You will help us achieve such goals through targeted improvements in performance and scalability of the OCPv platform. This role needs an engineer that thinks creatively, adapts to rapid change, and has the willingness to learn and apply new technologies. You will be joining a vibrant open source culture, and helping promote performance and innovation in this Red Hat engineering team. The broader mission of the Performance and Scale team is to establish performance and scale leadership of the Red Hat product and cloud services portfolio. The scope includes component level, system and solution analysis and targeted enhancements. The team collaborates with engineering, product management, product marketing and customer support as well as Red Hat's hardware and software ecosystem partners. What will you do? Formulate test plans, and carry out performance and scalability benchmarks against various storage components/features of the OCPv platform to characterize performance, drive product performance improvements, and detect performance regressions through data analysis and visualization Develop tools and automation to aid the performance benchmarking work Collaborate with other engineering teams to resolve performance issues Triage, debug, and solve customer/partner cases related to virtualization storage performance and scale Publish results, conclusions, recommendations and best practices via internal test reports, presentations, external blogs and official documentation to support our partners and customers. Participate in internal and external conferences about your work and results What will you bring? Performance benchmarking, data capture, data analysis, and data Experience with storage systems and protocols (NAS, SAN, NFS, iSCSI, RBD, etc) Experience with testing windows technologies like MsSql, Win Desktop Citrix VDI, .Net etc. Experience with container technologies (podman, Kubernetes) Experience with systems performance engineering and metrics collection and analysis tools such as iostat, vmstat, sar, perf, pcp, prometheus, Grafana, Elasticsearch Programming experience in Python Experience working with the Linux operating system Excellent written and verbal language skills in English The following are considered as a plus: 5+ years of relevant experience Experience of working with virtualization technologies such as VMware Familiarity with storage APIs (snapshot, clone, provision, attach), Data Protection and Disaster Recovery Experience of working with Ansible automation platform Knowledge of performance observability/profiling tools like eBPF, Flame Graphs Bachelor degree in Computer Science or related fields Experience of Git or similar version control system
Posted 2 months ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
31458 Jobs | Dublin
Wipro
16542 Jobs | Bengaluru
EY
10788 Jobs | London
Accenture in India
10711 Jobs | Dublin 2
Amazon
8660 Jobs | Seattle,WA
Uplers
8559 Jobs | Ahmedabad
IBM
7988 Jobs | Armonk
Oracle
7535 Jobs | Redwood City
Muthoot FinCorp (MFL)
6170 Jobs | New Delhi
Capgemini
6091 Jobs | Paris,France