Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in gurugram
>
Kezan Consulting
>
AI Engineer(Immediate Joiner)

AI Engineer(Immediate Joiner)

Kezan Consulting

3 - 8 years

15 - 25 Lacs

gurugram bengaluru delhi / ncr

Posted:9 hours ago| Platform:

Apply

Skills Required

vector db artificial intelligence llm retrieval augmented generation python tensorflow langchain fast api chunking machine learning pytorch nlp llama machine learning algorithms

Work Mode

Work from Office

Job Type

Full Time

Job Description

Job Title: AI Engineer Python | RAG | LLM | Chunking | Vector DB

Location: Gurugram/Bangalore

Experience: 35 years

Employment Type: Full-Time

Job Summary

We are looking for an AI Engineer with deep expertise in Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) systems, and Vector Database architectures. The candidate should be skilled in Python-based AI pipelines, document chunking and embeddings, and model fine-tuning/integration for production-grade intelligent systems.

The role involves building end-to-end AI-driven solutions that enhance knowledge retrieval, automate reasoning, and deliver scalable conversational or cognitive experiences.

Key Responsibilities

RAG Architecture Design:

Develop and implement Retrieval-Augmented Generation pipelines using LLMs integrated with external knowledge sources and vector stores.

LLM Integration & Fine-Tuning:

Fine-tune or prompt-engineer models like GPT, Llama, Falcon, Mistral, T5, or Claude.

Optimize inference workflows for efficiency, context management, and accuracy.

Document Processing & Chunking:

Design intelligent text-splitting and chunking strategies for long documents.

Build embedding generation and context retrieval pipelines.

Vector Database Management:

Integrate and optimize vector stores like FAISS, Pinecone, Chroma, Weaviate, Milvus, or Qdrant.

Implement similarity search, hybrid retrieval, and ranking mechanisms.

Python-Based AI Development:

Build APIs and microservices using FastAPI / Flask / LangChain / LlamaIndex.

Create reusable AI pipelines for inference, retraining, and data ingestion.

Data Handling & Preprocessing:

Clean, transform, and index structured and unstructured data for efficient knowledge retrieval.

Performance Optimization & Monitoring:

Evaluate model performance using precision, recall, BLEU, ROUGE, or RAG-specific metrics.

Deploy and monitor models using Docker, MLflow, and cloud environments (AWS/GCP/Azure).

Collaboration:

Work cross-functionally with data scientists, backend engineers, and domain experts to integrate AI models into enterprise applications.

Required Skills & Tools

Core Skills

Programming: Python (mandatory), familiarity with TypeScript or Node.js is a plus

LLM Frameworks: LangChain, LlamaIndex, Hugging Face Transformers

Vector Databases: FAISS, Pinecone, Chroma, Weaviate, Milvus, Qdrant

Model Types: OpenAI GPT, Llama2/3, Mistral, Falcon, Claude, Gemini

Embedding Models: Sentence Transformers, OpenAI Embeddings, Instructor, or Custom Models

RAG Stack: Document loaders, text chunking, embedding generation, retrieval, context assembly

APIs & Deployment: FastAPI, Flask, Docker, MLflow, Streamlit

Version Control: Git, GitHub/GitLab

Cloud/Infra: AWS (S3, Lambda, SageMaker), GCP, or Azure AI

More Jobs at Kezan Consulting

Salesforce Support Engineer

Mumbai Suburbs, Mumbai, Mumbai (All Areas)

6 - 11 yrs

INR 5 - 14 Lacs

sailpoint L2

Pune, Bengaluru

3 - 8 yrs

INR 12 - 22 Lacs

CyberArk Conjur Admin

Pune, Bengaluru

2 - 7 yrs

INR 10 - 15 Lacs

AEM Content Author (Immediate Joiner)

Gurugram

2 - 6 yrs

INR 12 - 14 Lacs

Senior BA (Data Application & Integration)

Gurugram

8 - 13 yrs

INR 25 - 35 Lacs

Mock Interview

Practice Video Interview with JobPe AI

Start Artificial Intelligence Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

Kezan Consulting

Consulting

Business City

Login to

Please Verify Your Phone or Email

Confirm Action

AI Engineer(Immediate Joiner)