6 Llama.Cpp Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

0.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Job Requirements Work on latest machine learning technologies Work on supporting for latest Linux operating system Work on AMD next generation GPUs/Accelerators Work on optimizing latest Rocm drivers and improve performance Design new machine learning technologies Work Experience MS/BS degree in Computer Science or an equivalent Deep Knowledge of C/C++ and Python programming Experience with Linux Commands is must Experience with Scripting language like bash/powershell Understanding of various python ML frameworks like Pytorch, Transformers etc Understanding of various language and compiler for writing highly efficient custom Deep-Learning GPU Kernels. like Triton/Jax Hands on Debugging Exper...

Posted 1 day ago

AI Match Score
Apply

7.0 - 9.0 years

0 Lacs

chennai, tamil nadu, india

On-site

Job Title: Site Reliability Engineer (SRE) Azure & AI Experience: 7+ years Work Mode: Hybrid Work Location: Chennai/Mumbai/Gurgaon Job Summary: We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure , AI infrastructure , and automation . The ideal candidate will have a solid background in managing cloud environments using GitHub/Azure DevOps , and hands-on experience in AI model deployment and scaling . This role involves working closely with engineering teams to deliver reliable, secure, and scalable cloud infrastructure that supports AI workloads and enterprise applications. Key Responsibilities: Design, build, and maintain scalable cloud...

Posted 1 day ago

AI Match Score
Apply

2.0 - 4.0 years

0 Lacs

hyderabad, telangana, india

On-site

Company Qualcomm India Private Limited Job Area Engineering Group, Engineering Group > Software Engineering General Summary More details below: Join the exciting Generative AI team at Qualcomm focused on integrating cutting edge GenAI models on Qualcomm chipsets. The team uses Qualcomm chips extensive heterogeneous computing capabilities to allow inference of GenAI models on-device without a need for connection to the cloud. Our inference engine is designed to help developers run neural network models trained in a variety of frameworks on Snapdragon platforms at blazing speeds while still sipping the smallest amount of power. Utilize this power efficient hardware and Software stack to run La...

Posted 1 week ago

AI Match Score
Apply

5.0 - 7.0 years

0 Lacs

pune, maharashtra, india

Remote

Position Overview The Infrastructure Systems Engineering team is in the midst of building our next-generation private cloud infrastructure as part of a greenfield initiative to in-house existing internal use cases on public cloud and support new initiatives. The historical deployment primarily supported our database test system: A system that runs over 7.25 million tests per month, writes petabytes to storage daily, and we're currently scaling multiple times in the next year alone. As a Software Engineer, Private Cloud Systems , you will be tasked with the design, implementation, and continuous improvement of systems at lower abstraction levels. You'll collaborate closely with a small, highl...

Posted 1 month ago

AI Match Score
Apply

3.0 - 6.0 years

0 Lacs

hyderabad, telangana, india

On-site

Job Title : AI Systems Engineer GPU/ROCm/CUDA | ML Frameworks Optimization Location : : 3-6 [Mid-Senior] Job Description We are looking for a passionate and experienced AI Systems Engineer to join our team to work on next-generation Machine Learning technologies and optimize performance across AMD GPU accelerators. This role involves low-level GPU programming, custom ML kernel development, and working with state-of-the-art inference engines. Key Responsibilities Develop and optimize custom Deep Learning GPU kernels using ROCm/CUDA or shader languages Support and enhance ML model deployment on Linux platforms Optimize performance of ROCm drivers and inferencing engines for AI/ML workloads Col...

Posted 1 month ago

AI Match Score
Apply

6.0 - 8.0 years

0 Lacs

pune, maharashtra, india

On-site

Role: Gen AI Developer Total Experience: 6+ years with 2+ years working on GenAI initiatives Employment Type: Permanent & Full time Working Model: Hybrid (3 days work from office) Job Summary: We are seeking a Senior AI Developer with proven expertise in Generative AI technologies, a solid foundation in machine learning, and a strong understanding of data governance. The ideal candidate will have hands-on experience with both cloud-based LLM platforms, on-premise, open-source LLMs like Ollama, Llama.cpp, and GGUF-based models. You should also have good knowledge in Model Context Protocol (MCP). You will help architect and implement GenAI-powered products that are secure, scalable, and enterp...

Posted 1 month ago

AI Match Score
Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies