Jobs
Interviews

3 Inference Optimization Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

3.0 - 5.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

About the Role: --------------------- We are looking for 3-5 years of experience, forward-thinking LLMOps Engineers to join our team and help build the next generation of secure, scalable, and responsible Generative AI (GenAI) platforms. This role will focus on establishing monitoring, governance, security, and operational best practices while enabling development teams to build high-performing GenAI applications. You will also work closely with GenAI agents and integrate LLMs from multiple providers to support diverse use cases. Key Responsibilities: ------------------------- Design and implement governance frameworks for GenAI platforms, ensuring compliance with internal policies and external regulations (e.g., GDPR, AI Act). Define and enforce responsible AI practices including fairness, transparency, explainability, and auditability. Implement robust security protocols including IAM, data encryption, secure API access, and model sandboxing. Collaborate with security teams to conduct risk assessments and ensure secure deployment of LLMs. Build and maintain scalable LLMOps pipelines for model training, fine-tuning, evaluation, deployment, and monitoring. Automate model lifecycle management with CI/CD, versioning, rollback, and observability. Develop and manage GenAI agents capable of reasoning, planning, and tool use. Integrate and orchestrate LLMs from multiple providers (e.g., OpenAI, Anthropic, Cohere, Google, Azure OpenAI) to support hybrid and fallback strategies. Optimize prompt engineering, context management, and agent memory for production use. Ensure high availability, low latency, and cost-efficiency of GenAI workloads across cloud and hybrid environments. Implement monitoring and alerting for model drift, hallucinations, and performance degradation. Partner with GenAI developers to embed best practices and reusable components (SDKs, templates, APIs). Provide technical guidance and documentation to accelerate development and ensure platform consistency. Qualifications: ------------------ Bachelors or Masters degree in Computer Science, Engineering, or related field. 3-5 years of experience in MLOps, DevOps, or platform engineering, with 2+ years in LLM/GenAI environments. Deep understanding of LLMs, GenAI agents, prompt engineering, and inference optimization. Experience with LangChain, LlamaIndex, Langraph or similar agent frameworks. Hands-on with MLflow, or equivalent tools. Proficient in Python, containerization (Docker) and cloud platforms (AWS/GCP/Azure). Familiarity with AI governance frameworks and responsible AI principles. Experience with vector databases (e.g., FAISS, Pinecone), RAG pipelines, and model evaluation frameworks. Knowledge of Responsible AI, red-teaming, and OWASP security principles. Joining: ---------- We must fill this position urgently. Candidates who can start within 2 weeks will be our priority. Location: ----------- Bangalore Hybrid Show more Show less

Posted 1 week ago

Apply

0.0 years

0 Lacs

, India

Remote

Role : AI/ML/DL Specialist Exp Level : 0-2 Years Work Mode : Remote Notice Period : Immediate / 15 Days Role Summary: Were seeking a highly motivated and experienced AI/ML/DL Specialist with a strong foundation in contextual learning and language model development . Youll play a key role in building and training domain-specific models that can understand context, infer meaning, and provide intelligent recommendations including the development of small language models for niche applications. Key Responsibilities: Design and develop contextual AI/ML/DL models for real-world, policy-driven applications Build, train, and fine-tune domain-specific small language models (SLMs) Work closely with data engineers, domain experts, and policy consultants to identify data sources and training strategies Deploy and optimize models for performance, scalability, and accuracy Contribute to the vision and architecture of our AI product roadmap Required Skills & Qualifications: ? Proven experience (0-2 years) in AI/ML/DL model development ? Expertise in contextual learning models, including transformer architectures ? Hands-on experience with building and fine-tuning small language models (e.g., using HuggingFace, PyTorch, TensorFlow) ? Solid programming skills (Python, PyTorch, TensorFlow, etc.) ? Understanding of model deployment, scaling, and inference optimization ? Ability to work in a fast-paced startup or consulting environment ? Strong communication and collaboration skills Desirable: ? Familiarity with open-source LLM tools, tokenization methods, and low-resource domain adaptation ? Experience working with government or public-sector datasets ? Passion for applying AI to real-world social or governance challenges Desirable: ? Familiarity with open-source LLM tools, tokenization methods, and low-resource domain adaptation ? Experience working with government or public-sector datasets ? Passion for applying AI to real-world social or governance challenges Show more Show less

Posted 3 weeks ago

Apply

12 - 22 years

35 - 50 Lacs

Hyderabad, Pune, Chennai

Work from Office

Looking for a highly skilled GenAI Scientist with a strong expertise in the field of LLMs, including multimodal LLMs, agentic AI, fine-tuning, distillation, hands-on approach to research, development, and implementation of cutting-edge AI solutions.

Posted 3 months ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies