
73 Vector Database Jobs - Page 2

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

8.0 - 13.0 years

30 - 45 Lacs

Bhopal, Pune, Gurugram

Work from Office

We're Hiring | AI Lead, GenAI & LLMs | 8-10 Yrs | Multiple Xebia Locations | Hybrid
Locations: Bangalore | Hyderabad | Chennai | Pune | Bhopal | Jaipur | Gurugram (hybrid, 3 days/week in office)
Experience: 8 to 10 years
Joiners: Immediate to a maximum 2-week notice period ONLY
About the Role: Are you a visionary in the AI/ML space looking to build cutting-edge Generative AI solutions? We are seeking an accomplished AI Lead who will architect and drive enterprise-grade AI systems, leveraging LLMs, deep learning models, vision AI, and GenAI. You'll lead from the front, mentor top talent, and build transformative solutions that shape the future of intelligent applications.
Key Responsibilities: Architect and implement scalable Generative AI (GenAI) and RAG solutions using LLMs, vision models, and vector databases. Lead the design, development, and deployment of AI/ML and deep learning systems. Integrate AI platforms with Azure AI Studio, SharePoint, and Power BI. Guide the team in leveraging agentic frameworks like LlamaIndex. Align with business stakeholders to craft and deliver high-impact AI strategies. Drive innovation, technical mentorship, and excellence across the AI team. Establish coding best practices, performance tuning, and scalable solution design.
Must-Have Skills: 8-10 years of AI/ML experience, with 3+ years in leadership or architecture roles. Proficiency in Python, TensorFlow, PyTorch, and Scikit-learn. Strong experience in LLMs, OCR, vision AI, vector DBs, and RAG. Familiarity with Azure AI Studio, Azure Cloud Architecture, and Azure DevOps. Exposure to agentic frameworks and tools like LlamaIndex. Basic working knowledge of Power BI and SharePoint. Proven leadership and team management skills.
Good-to-Have: Experience with AWS or GCP. Knowledge of DevOps practices, CI/CD, and Kubernetes. Awareness of AI ethics, governance, and compliance. Exposure to advanced BI/visualization tools.
To Apply: Send your resume to vijay.s@xebia.com with the following details: full name, total experience, current CTC, expected CTC, current location, preferred Xebia location (from above), notice period / last working day (if serving), primary skills, and LinkedIn profile.
Important: Apply only if you're available to join immediately or within 2 weeks and are not currently in process with other roles at Xebia.
#AIJobs #GenerativeAI #LLMs #VisionAI #XebiaHiring #ImmediateJoiners #Python #AzureAI #AIML #LeadershipHiring #DataScienceJobs #AgenticFrameworks #ChennaiJobs #PuneJobs #BangaloreJobs #HyderabadJobs #GurugramJobs #JaipurJobs #BhopalJobs #TechHiring #AILead

Posted 1 month ago


8.0 - 13.0 years

16 - 30 Lacs

Chennai

Hybrid

Interested, suitable candidates may contact roopashree.ry@sutherlandglobal.com.
Experience level: 8-15 years. Immediate joiners preferred.
Role: Delivery Technical Manager
Detailed JD: We're looking for a Delivery Manager who thrives at the intersection of technical depth and execution leadership. As part of the Robility Platform team, you'll play a critical role in delivering high-performance automation capabilities while staying hands-on with modern technologies like .NET Core, Rust, Redis, Postgres, NoSQL, Nginx, and vector databases. You'll lead scalable backend architecture efforts, solve real-world performance challenges, and shape how automation is delivered to enterprises globally. This role is ideal for a builder-leader with a product mindset and engineering expertise.
Key Responsibilities: Lead the architecture and delivery of scalable backend services using .NET Core, Python, Rust, SQL, and NoSQL, driving real-time systems, search engines, and automation workflows. Design and implement technical solutions like matchmaking engines, real-time dashboards, and full-text search systems with Redis caching, vector databases, and modern API rate limiting. Optimize platform performance by debugging complex issues, improving query efficiency, and enhancing full-stack responsiveness across Django, React, Node.js, and PostgreSQL-based systems. Drive technology migration strategies, including replatforming Python services to Rust without user impact, while ensuring high system availability and performance. Architect infrastructure enhancements such as database sharding and Nginx proxy configurations to support secure, scalable, and resilient deployments. Collaborate cross-functionally with product, QA, DevOps, and engineering teams to ensure timely, high-quality feature delivery in a CI/CD and cloud-native environment.
Required Skills & Experience: Experience: 8-12 years. Strong hands-on expertise in .NET Core, C#, and modern backend development. Solid experience with SQL (e.g., PostgreSQL, MS SQL) and NoSQL (e.g., MongoDB, CosmosDB). Familiarity with Python and working knowledge of or interest in Rust for high-performance services. Deep understanding of Redis, API rate limiting, and vector databases (e.g., Pinecone, Weaviate). Experience in debugging, root cause analysis, and production issue resolution. Knowledge of database sharding, query optimization, and infrastructure scaling. Hands-on experience with React, Node.js, and Nginx for full-stack optimization and proxy configuration. Familiarity with cloud-native environments (Azure/AWS/GCP), Docker, Kubernetes, and CI/CD pipelines.
Nice to Have: Experience with RPA, low-code/no-code, or enterprise automation platforms. Exposure to observability stacks and performance telemetry tools. Comfort leading hybrid engineering teams and Agile project environments.
Why Join Us? Contribute to a platform powering automation across global enterprises. Tackle technically complex and high-impact architecture challenges. Work in a forward-thinking environment with the freedom to innovate and deliver. Own delivery outcomes while staying deeply hands-on with emerging technologies.

Posted 1 month ago


5.0 - 10.0 years

9 - 15 Lacs

Chennai, Bengaluru

Work from Office

Role & responsibilities
Role: Senior Technical Lead
Skills: AI/ML/GenAI, Python (NumPy, Pandas, Scikit-learn, Keras, Flask, SciPy, TensorFlow, NLTK), GPT/PaLM/BERT, BART, Azure Cloud
Billing Rate: USD 28
Number of positions: 1
Work Location: Chennai (preferred), Bangalore, Hyderabad, Noida, Pune
Notice Period: Immediate joiners
Interested candidates may send their resume to aswathy.rajan@hcltech.com

Posted 1 month ago


4.0 - 6.0 years

6 - 8 Lacs

Bengaluru, Bellandur

Hybrid

Hiring an AWS Data Engineer for a 6-month hybrid contractual role based in Bellandur, Bengaluru. The ideal candidate will have 4-6 years of experience in data engineering, with strong expertise in AWS services (S3, EC2, RDS, Lambda, EKS), PostgreSQL, Redis, Apache Iceberg, and Graph/Vector Databases. Proficiency in Python or Golang is essential. Responsibilities include designing and optimizing data pipelines on AWS, managing structured and in-memory data, implementing advanced analytics with vector/graph databases, and collaborating with cross-functional teams. Prior experience with CI/CD and containerization (Docker/Kubernetes) is a plus.

Posted 1 month ago


4.0 - 7.0 years

6 - 16 Lacs

Chennai, Coimbatore, Bengaluru

Work from Office

Role & responsibilities
We are seeking an enthusiastic and highly skilled Senior AI/ML Developer to join our team and work on exciting AI initiatives. The ideal candidate should have worked in the CRM domain (e.g., Zoho CRM, Freshworks) and have expert knowledge of AI/ML models, LLMs, vector databases, and RAG techniques to automate workflows, create conversational analytics assistants, and build similarity models using clustering and embeddings.
1. Develop CRM Agentic AI: Design and implement AI agents using frameworks like LangChain, AutoGPT, or ReAct. Automate CRM processes by enabling agents to handle emails, generate follow-ups, update pipelines, and classify tickets. Integrate models for sentiment analysis, intent recognition, entity extraction, and summarization. Connect agents to external APIs, calendars, and CRM tools (e.g., Salesforce, HubSpot). Manage agent memory and context via vector databases like FAISS or Pinecone.
2. Conversational KPI Dashboard Assistant: Build a chatbot interface that allows users to query CRM KPIs via natural language. Use Python, LangChain, and OpenAI APIs to parse queries and trigger backend data retrieval tools. Render results in both dashboard format (e.g., Recharts/Chart.js) and tabular views. Support queries like "What's the total expected revenue this quarter?" or "Show me the lead-to-win ratio by region."
3. Similarity and Recommendation Engines: Extract features from existing AI/ML libraries. Build and manage an embedding database for semantic similarity search. Implement clustering algorithms (e.g., K-Means) to group songs based on genre. Develop a scoring system to compare new tracks to a database of existing songs.
Required Experience and Qualifications: 4+ years of experience in AI/ML model development. Expert knowledge of the CRM domain, including workflows, forms, dashboards/reports, and data import/export. Proficient in Python, with strong experience in TensorFlow, PyTorch, and Scikit-learn. Working knowledge of LangChain, AutoGPT, and LLM integration. Expertise in developing agents and dashboards for CRM in a leading organisation. Expertise in implementing agentic AI solutions using LLMs, RAG, and vector databases. Proven experience in vector similarity, embedding models, and clustering techniques. Passionate about AI innovation and solving real-world challenges with automation and intelligence. Proven experience in designing platform architecture and managing API integrations. Strong background in microservices architecture and REST API design. Demonstrated experience with scalability and performance optimization for cloud-based platforms. Experience with Agile methodologies and leading cross-functional teams in a fast-paced environment. Excellent communication and presentation skills for delivering product demos and gathering customer feedback. Proven ability to manage a product roadmap and deliver on tight deadlines. Experience with data security, compliance, and industry best practices. A bachelor's degree in computer science, engineering, or a related field.

Posted 1 month ago


4.0 - 9.0 years

25 - 35 Lacs

Bengaluru

Remote

Summary: Gainwell is seeking LLM Ops Engineers and ML Ops Engineers to join our growing AI/ML team. This role is responsible for developing, deploying, and maintaining scalable infrastructure and pipelines for Machine Learning (ML) models and Large Language Models (LLMs). You will play a critical role in ensuring smooth model lifecycle management, performance monitoring, version control, and compliance while collaborating closely with Data Scientists and DevOps.
Your role in our mission
Core LLM Ops Responsibilities: Develop and manage scalable deployment strategies specifically tailored for LLMs (GPT, Llama, Claude, etc.). Optimize LLM inference performance, including model parallelization, quantization, pruning, and fine-tuning pipelines. Integrate prompt management, version control, and retrieval-augmented generation (RAG) pipelines. Manage vector databases, embedding stores, and document stores used in conjunction with LLMs. Monitor hallucination rates, token usage, and overall cost optimization for LLM APIs or on-prem deployments. Continuously monitor model performance and ensure an alerting system is in place. Ensure compliance with ethical AI practices, privacy regulations, and responsible AI guidelines in LLM workflows.
Core ML Ops Responsibilities: Design, build, and maintain robust CI/CD pipelines for ML model training, validation, deployment, and monitoring. Implement version control, model registry, and reproducibility strategies for ML models. Automate data ingestion, feature engineering, and model retraining workflows. Monitor model performance and drift, and ensure proper alerting systems are in place. Implement security, compliance, and governance protocols for model deployment. Collaborate with Data Scientists to streamline model development and experimentation.
What we're looking for: Bachelor's/Master's degree in Computer Science, Engineering, or related fields. Strong experience with ML Ops tools (Kubeflow, MLflow, TFX, SageMaker, etc.). Experience with LLM-specific tools and frameworks (LangChain, LangGraph, LlamaIndex, Hugging Face, OpenAI APIs, and vector DBs like Pinecone, FAISS, Weaviate, Chroma DB, etc.). Solid experience in deploying models in cloud (AWS, Azure, GCP) and on-prem environments. Proficient in containerization (Docker, Kubernetes) and CI/CD practices. Familiarity with monitoring tools like Prometheus, Grafana, and ML observability platforms. Strong coding skills in Python and Bash, and familiarity with infrastructure-as-code tools (Terraform, Helm, etc.). Knowledge of healthcare AI applications and regulatory compliance (HIPAA, CMS) is a plus. Strong skills in Giskard, DeepEval, etc.
What you should expect in this role: Fully remote opportunity: work from anywhere in India. Minimal travel required: occasional travel opportunities (0-10%). Opportunity to work on cutting-edge AI solutions in a mission-driven healthcare technology environment.

Posted 1 month ago


3.0 - 8.0 years

25 - 35 Lacs

Bengaluru

Remote

Gainwell Technologies LLC
Summary: Gainwell Technologies is seeking a highly skilled AI Engineer to design, develop, and deploy advanced AI and Generative AI (Gen AI) solutions across our healthcare technology platforms. This role involves building and optimizing AI/Gen AI technologies, integrating them into existing systems, and ensuring their effectiveness in improving healthcare outcomes and operational efficiency while maintaining compliance with industry standards.
Your role in our mission
AI/Gen AI Model Development: Design, build, and train machine learning, deep learning, time series, Gen AI (multimodal LLM), predictive analytics, Natural Language Processing, and image processing solutions for healthcare applications. Experienced in multiple LLM fine-tuning techniques. Experienced in building Gen AI solutions using RAG architecture. Skilled in both LangChain and LangGraph. Experience in agentic AI frameworks and workflows using LangChain, LangGraph, CrewAI, or OpenAI Swarm. Experienced in multiple vector databases as well as graph databases. Skilled in agentic AI frameworks and has built at least one solution using agentic AI.
End-to-End AI Solution Deployment: Develop, test, and deploy AI solutions in cloud and on-premises environments, ensuring reliability, scalability, and real-world impact.
Data Processing: Work with large healthcare datasets, performing data preprocessing, feature engineering, and model training while ensuring compliance with HIPAA and other regulatory standards.
System Integration: Skilled in API development and integrations. Implement and optimize AI models within Gainwell's existing technology stack, collaborating with software engineers to ensure seamless integration. Experienced in ML Ops and LLM Ops. Experienced in evaluating models and continuous performance monitoring of both ML/DL models and LLMs. Experienced in applying security measures in Gen AI solutions and implementing guardrails.
Performance Optimization: Continuously monitor, refine, and optimize AI/Gen AI models for accuracy, efficiency, and speed, leveraging ML Ops and LLM Ops best practices.
AI Research & Innovation: Stay updated with the latest AI/ML/Gen AI advancements, exploring new technologies and methodologies to enhance solution effectiveness.
Compliance & Security: Ensure AI implementations adhere to healthcare industry regulations, ethical AI principles, and data privacy standards.
Automation & Workflow Enhancement: Identify opportunities to automate workflows and optimize business processes using AI-driven solutions.
Front End Development: Streamlit is a must. React, Angular, or Vue.js is good to have.
What we're looking for: Master's or Ph.D. in Computer Science, AI, Data Science, or a related field. 4+ years of experience in AI/ML engineering, with a focus on developing and deploying AI solutions. Hands-on expertise in machine learning, deep learning, Gen AI (multimodal LLMs), NLP, and computer vision. Strong programming skills in Python, TensorFlow, PyTorch, and other AI frameworks. Experience developing, deploying, and fine-tuning LLMs (GPT, Gemini, Claude, or similar) for real-world applications, including prompt engineering, model optimization, and inference efficiency. Experience with cloud platforms (AWS, Azure, or GCP) and MLOps for scalable AI deployments. Proficiency in working with big data technologies (Spark, Hadoop, SQL, NoSQL databases). Strong problem-solving skills with the ability to translate business challenges into AI-driven solutions. Knowledge of healthcare AI applications and regulatory compliance (HIPAA, CMS) is a plus.
What you should expect in this role: Fully remote opportunity: work from anywhere in India. Minimal travel required: occasional travel opportunities (0-10%). Opportunity to work on cutting-edge AI solutions in a mission-driven healthcare technology environment.

Posted 1 month ago


2.0 - 5.0 years

12 - 22 Lacs

Mumbai, Mumbai Suburban, Mumbai (All Areas)

Work from Office

Russell Investments is actively hiring an Artificial Intelligence Analyst for its Mumbai, Goregaon (E) location. Interested applicants can share their updated resumes at rhule@russellinvestments.com.
Job Description: This role is responsible for supporting and growing the AI strategy, platform, and deliverables at Russell Investments. We are looking for a curious and analytical individual who will research, develop, implement, and maintain processes to meet the needs of our AI strategy and deliver on business objectives. This is an excellent opportunity to take advantage of emerging trends and technologies and make a real-world difference.
Years of Experience: Suitable candidates would have 2-5 years of programming/artificial intelligence experience along with some knowledge of machine learning.
Qualifications: Bachelor's degree in Computer Science, Engineering, Finance, Economics, Statistics, or a related field; advanced degree preferred. Proficient in Python and SQL (R or C# a plus). Exposure to TensorFlow, PyTorch, and NLP techniques. Proven experience in developing Generative AI applications. Strong experience with Selenium, Beautiful Soup, and/or other web crawling techniques. Experience working with large-scale datasets for speech, video, and text. Familiarity with Whisper models, speech-to-text, video intelligence, and chatbot frameworks. Experience with a DevOps toolkit is a plus. Strong analytical skill set with the ability to analyze complex data. Ability to read, analyze, and interpret financial reports, tax documents, etc. Excellent problem-solving and debugging skills. Ability to work collaboratively in a fast-paced environment.
Responsibilities: Support and develop our Python code infrastructure. Design and implement AI-powered speech and video processing solutions. Develop and optimize deep learning models for speech recognition, language modeling, and computer vision. Improve chatbot capabilities by integrating multimodal AI components. Create RAG workflows to ensure seamless AI tool integration. Stay updated with the latest AI research and bring innovative ideas to the team. Document workflows, models, and AI applications to ensure scalability and reproducibility. Work closely with business units to understand project requirements and deliver solutions that meet business objectives. Troubleshoot, debug, and optimize code to ensure high performance and reliability of AI applications. Stay abreast of the latest developments in AI, integrating new technologies into projects as appropriate. Stay familiar with ethical AI and web scraping principles.

Posted 1 month ago


5.0 - 8.0 years

25 - 37 Lacs

Hyderabad, Bengaluru, Delhi / NCR

Hybrid

Job Summary: We are seeking a highly skilled and innovative Senior AI/ML Engineer with strong experience in building Generative AI solutions from MVP to production. The ideal candidate should have hands-on expertise in multi-agent orchestration, prompt engineering, RAG architecture, and advanced Python frameworks such as LangChain, LangGraph, and AutoGen. Experience with Databricks and tool integrations (e.g., GitHub, UI components) is essential.
Key Responsibilities: Design and implement Generative AI solutions, taking projects from MVP to scalable production systems. Build, orchestrate, and optimize multi-agent architectures using frameworks like LangChain, LangGraph, and AutoGen. Integrate external tools and APIs into AI agents (e.g., GitHub, internal/external UI components). Lead the development and deployment of RAG (Retrieval-Augmented Generation) pipelines. Collaborate with cross-functional teams to align AI solutions with business goals. Leverage Databricks for scalable data processing, ML model development, and deployment. Develop, refine, and evaluate prompts for LLM-based tasks (prompt engineering). Monitor and optimize the performance and accuracy of AI systems in production.
Required Skills & Qualifications: 5+ years of experience in AI/ML, with at least 2 years focused on Generative AI solutions. Strong programming skills in Python and proficiency in GenAI frameworks (LangChain, LangGraph, AutoGen). Experience with building and orchestrating multi-agent systems. Proven track record in taking GenAI-based MVPs to full-scale production. Hands-on experience in integrating AI agents with tools and platforms (e.g., GitHub, custom UIs). Solid understanding of and implementation experience with RAG architecture and prompt engineering. Proficiency with Databricks for data and ML workflows. Strong problem-solving skills and the ability to work in a fast-paced, collaborative environment.
Nice to Have: Experience with vector databases (e.g., FAISS, Pinecone, Weaviate). Exposure to cloud environments (Azure, AWS, or GCP). Familiarity with CI/CD for ML pipelines.

Posted 1 month ago


5.0 - 8.0 years

2 - 4 Lacs

Hyderabad, Bengaluru

Work from Office

Role & responsibilities
Develop and deploy a high-performance mobile app for Android and iOS/iPad using React Native, Swift, or Kotlin. Build a responsive web version using React.js, Node.js, Django or Flask, Tailwind CSS, and Framer Motion. Design and implement RESTful APIs and integrate OAuth for secure authentication. Optimize database architecture using PostgreSQL, MongoDB, and a vector DB for AI/ML data handling. Create Figma wireframes/prototypes for UI/UX alignment; experience with design collaboration is expected. Deploy and manage cloud infrastructure on AWS (S3, EC2, Lambda) with cost-efficient scaling. Ensure cross-platform compatibility, security, and performance optimization. Integrate with AI models and data pipelines, using Python for AI/ML integration. Familiarity with AI model deployment (TensorFlow, PyTorch).
Preferred candidate profile
We are looking for an experienced and proven Full-Stack Developer to lead the development of our mobile and web platforms. The ideal candidate will have expertise in cross-platform mobile development, backend services, API integrations, database management, and cloud deployment, with a strong focus on performance, security, and cost optimization. Send your resume, portfolio/GitHub links, and a brief note on your relevant experience.

Posted 2 months ago


3.0 - 5.0 years

7 - 15 Lacs

Hyderabad, Bengaluru

Work from Office

Role & responsibilities
Build Doctor Agentic AI, a real-time diagnostic and decision-support system. Support patient-doctor interaction: transcribe, understand, and summarize voice notes, answer patient inquiries, and recommend tests or treatments. Integrate text, images, lab reports, and structured EHR data into AI systems that see the full patient picture and surface medical insights. Build live monitoring tools that detect anomalies in vitals, trigger adaptive questioning, and accelerate early interventions.
Preferred candidate profile
3+ years of experience in AI/ML with a focus on multimodal data and agent-based systems. Deep understanding of transformer models and medical LLMs. Experience with a multimodal health AI stack such as BioGPT, Med-PaLM, Meditron, ClinicalBERT, and OCR (AWS Textract, Google Document AI, Google Vision). Experience with CoMT (Chain-of-Medical-Thought reasoning) pipelines and RAG/MEDRAG pipelines for multimodal analysis of structured and unstructured medical data. Proficient with LangChain, OpenAI Agents, Responses AI, and tool-calling APIs. Able to use vector databases for semantic search and memory retrieval. Experience generating or managing synthetic patient data for fine-tuning. Skilled in Python and Node.js; able to connect models to backend infrastructure and ensure secure data pipelines. Familiar with AWS deployment and cost optimization.

Posted 2 months ago


6.0 - 9.0 years

0 - 3 Lacs

Pune, Chennai, Bengaluru

Hybrid

Job Description
Primary skills: Postgres, PgVector, vector databases, SQL.
Working experience with databases: Postgres, Hive, MS SQL.
Scripting: Python, UNIX shell, Spark.
Good communication and advanced analytical skills.
Working experience with AI projects is an added advantage.
Experience: 5+ years is an advantage.

Posted 2 months ago


3.0 - 6.0 years

17 - 32 Lacs

Gurugram

Work from Office

We are looking for a Senior AI/ML Engineer to join our team and help design, build, and deploy the product.
About Us: Zonka Feedback is a fast-growing, bootstrapped SaaS company building an AI-first platform that combines machine learning, GenAI, and large-scale analytics. We are working on cutting-edge applications of AI including LLMs, NLP, unsupervised clustering, vector databases, and retrieval-augmented generation (RAG) systems.
Key Responsibilities: Design and build scalable NLP pipelines, unsupervised clustering models, and semantic search solutions. Develop vectorization pipelines and implement RAG (Retrieval-Augmented Generation) architectures for efficient information retrieval. Fine-tune large language models (LLMs) and craft effective prompt engineering strategies for real-world performance. Evaluate and optimize AI/ML models using tools like LangChain, Evals, Hugging Face, and custom evaluation frameworks. Work closely with backend and product engineering teams to integrate AI capabilities into production systems. Continuously research and implement advancements in AI, ML, and LLMs to keep the product innovative and competitive.
Requirements: 3 to 6 years of hands-on experience in Machine Learning and NLP. Solid experience in unsupervised learning techniques (clustering, dimensionality reduction, topic modeling, etc.). Strong understanding of and experience with vector databases (e.g., FAISS, Pinecone, ChromaDB) and RAG system design. Experience with LLM fine-tuning, prompt engineering, and deployment of AI systems in production. Proficiency in Python and relevant ML/AI libraries such as TensorFlow, PyTorch, Hugging Face, LangChain, and OpenAI APIs. Strong analytical and problem-solving skills with an ability to work in a fast-paced environment. Prior experience building end-to-end AI features that are used in real products is a strong plus.
If this sounds like you, we would like to talk. Share your resume at hr@zonkafeedback.com

Posted 2 months ago


2.0 - 4.0 years

3 - 5 Lacs

Bengaluru

Work from Office

Job Description: AI Engineer (Junior / Associate Level)
About the Role: We are looking for a passionate and hands-on AI Developer with around 3 years of experience in building and deploying machine learning models and working with the latest AI tools and frameworks. You will be working closely with our data science and engineering teams to develop smart, scalable, and production-ready AI solutions.
Key Responsibilities: Design, develop, test, and deploy machine learning models for classification, regression, recommendation, or NLP tasks. Work with state-of-the-art AI tools and libraries such as LangChain, Hugging Face, OpenAI, LlamaIndex, etc. Integrate Large Language Models (LLMs) into applications via APIs or custom fine-tuning. Build data pipelines for training and inference, ensuring model performance and robustness. Collaborate with software developers, product managers, and data engineers to turn AI models into usable products. Stay updated with the latest research and innovations in the AI/ML space and bring new ideas into development. Optimize model performance and scale AI solutions for production.
Required Skills & Experience: Bachelor's or Master's degree in Computer Science, AI, Data Science, or a related field. 2 to 4 years of hands-on experience in machine learning and deep learning. Proficiency in Python and libraries like TensorFlow, PyTorch, Scikit-learn, Pandas, and NumPy. Good understanding of LLMs and experience working with platforms like OpenAI, Claude, Cohere, or Hugging Face Transformers. Experience in prompt engineering, fine-tuning, or using frameworks like LangChain, LlamaIndex, or Haystack. Exposure to cloud platforms (AWS, GCP, Azure) and tools like Docker, Git, and CI/CD workflows. Strong analytical and problem-solving skills. Understanding of data preprocessing, feature engineering, and model evaluation techniques.
Good to Have: Experience building AI chatbots or virtual assistants. Knowledge of vector databases (e.g., Chroma, Pinecone, Weaviate). Familiarity with MLOps tools like MLflow, Kubeflow, Vertex AI, etc. Experience with RESTful APIs or microservices architecture. Participation in Kaggle competitions or open-source AI projects.
What We Offer: Opportunity to work on real-world AI applications and products. A collaborative, learning-first environment. Exposure to enterprise-grade AI deployments and tools. Flexible working hours and remote work options. Competitive compensation and career growth path.

Posted 2 months ago

Apply

5.0 - 7.0 years

7 - 15 Lacs

Kolkata, New Delhi

Work from Office

DevOps, cloud infrastructure, and CI/CD. Strong hands-on experience with AWS services. Exposure to AI pipelines, especially speech-to-text and vector databases (e.g., Pinecone). Knowledge of PostgreSQL performance tuning and replication. Contact: Tanya, 9220166817.

Posted 2 months ago

Apply

2.0 - 5.0 years

6 - 16 Lacs

Pune

Work from Office

We're seeking a forward-thinking Gen AI Engineer to design, implement, and optimize cutting-edge agentic AI solutions using frameworks such as Crew.ai, LangChain, and LangGraph. You will work at the intersection of LLMs, Retrieval-Augmented Generation (RAG) systems, and NLP, enabling impactful AI applications across diverse domains. Key Responsibilities Design and deploy autonomous AI agents and multi-agent systems using LLMs such as GPT-4o, Claude, and LLaMA, leveraging Crew.ai, LangChain, and LangGraph. Own the AI solution lifecycle, including data acquisition, model experimentation, fine-tuning, deployment, and production monitoring. Develop scalable AI backends and pipelines using Python with frameworks like PyTorch or TensorFlow, deploying via REST APIs (FastAPI) on cloud platforms like AWS or Azure. Implement RAG-based systems by integrating open-source LLMs (via Hugging Face or Ollama) with vector databases (e.g., Pinecone, ChromaDB) and structured data stores (SQL/NoSQL). Collaborate with cross-functional teams to ensure reliable, maintainable, and impactful AI solutions. Required Skills & Experience 2 to 5 years of experience in AI/ML, NLP, and Generative AI, with a focus on agentic AI systems. Strong proficiency in Python and hands-on experience building RAG systems and deploying open-source LLMs. Experience developing AI-driven backends and services with a strong understanding of scalability and performance. Familiarity with vector search technologies, database integration, and cloud-native architectures. Excellent communication and collaboration skills; ability to work effectively in a team environment. Education B.Tech/B.E. or M.Tech/M.E. in Computer Science, AI/ML, or a related field. Role: Gen AI Engineer Designation: AI Developer Experience: 2-5 Years (AI/ML, NLP, Generative AI) Location: Pune

Posted 2 months ago

Apply

9.0 - 14.0 years

35 - 50 Lacs

Hyderabad, Pune, Bengaluru

Work from Office

Role - Senior Data Scientist / Senior Gen AI Engineer Exp Range - 8 to 18 yrs Position - Permanent, Full-time Company - Data Analytics & AIML MNC Location - Hyderabad, Pune, Bangalore (Relocation accepted) About the Role: We are seeking a Software Engineer with expertise in Generative AI and Microsoft technologies to design, develop, and deploy AI-powered solutions using the Microsoft ecosystem. You will work with cross-functional teams to build scalable applications leveraging generative AI models and Azure services. Skills Required: Experience with Large Language Models (LLMs) like GPT, LLaMA, Claude, etc. Proficiency in Python for building and fine-tuning AI/ML models. Familiarity with LangChain, LLMOps, or RAG (Retrieval-Augmented Generation) pipelines. Experience with vector databases (e.g., FAISS, Pinecone, Weaviate). Knowledge of prompt engineering and model evaluation techniques. Exposure to cloud platforms (Azure, AWS, or GCP) for deploying GenAI solutions. Preferred Skills: Experience with Azure OpenAI, Databricks, or Microsoft Fabric. Hands-on with Hugging Face Transformers, OpenAI APIs, or custom model training.

Posted 2 months ago

Apply

3.0 - 8.0 years

15 - 25 Lacs

Hyderabad

Work from Office

Company Name - Fission Labs Apply Here - https://app.fabrichq.ai/jobs/0e46cebe-8b91-4a96-9061-950b66dc4d54 About Us: Headquartered in Sunnyvale, with offices in Dallas & Hyderabad, Fission Labs is a leading software development company, specializing in crafting flexible, agile, and scalable solutions that propel businesses forward. With a comprehensive range of services, including product development, cloud engineering, big data analytics, QA, DevOps consulting, and AI/ML solutions, we empower clients to achieve sustainable digital transformation that aligns seamlessly with their business goals. Key Responsibilities Design and architect complex Generative AI solutions using AWS technologies Develop advanced AI architectures incorporating state-of-the-art GenAI technologies Create and implement Retrieval Augmented Generation (RAG) and GraphRAG solutions Architect scalable AI systems using AWS Bedrock and SageMaker Design and implement agentic AI systems with advanced reasoning capabilities Develop custom AI solutions leveraging vector databases and advanced machine learning techniques Evaluate and integrate emerging GenAI technologies and methodologies Technical Expertise Requirements Generative AI Technologies Expert-level understanding of: Retrieval Augmented Generation (RAG) Vector Database architectures Agentic AI design principles AWS AI Services Comprehensive expertise in: AWS Bedrock Amazon SageMaker AWS AI/ML services ecosystem Cloud-native AI solution design Technical Skills Advanced Python programming for AI/ML applications

Posted 2 months ago

Apply

4.0 - 8.0 years

20 - 30 Lacs

Hyderabad

Work from Office

Company Name - Fission Labs Apply Here - https://app.fabrichq.ai/jobs/0e46cebe-8b91-4a96-9061-950b66dc4d54 About Us: Headquartered in Sunnyvale, with offices in Dallas & Hyderabad, Fission Labs is a leading software development company, specializing in crafting flexible, agile, and scalable solutions that propel businesses forward. With a comprehensive range of services, including product development, cloud engineering, big data analytics, QA, DevOps consulting, and AI/ML solutions, we empower clients to achieve sustainable digital transformation that aligns seamlessly with their business goals. Key Responsibilities Design and architect complex Generative AI solutions using AWS technologies Develop advanced AI architectures incorporating state-of-the-art GenAI technologies Create and implement Retrieval Augmented Generation (RAG) and GraphRAG solutions Architect scalable AI systems using AWS Bedrock and SageMaker Design and implement agentic AI systems with advanced reasoning capabilities Develop custom AI solutions leveraging vector databases and advanced machine learning techniques Evaluate and integrate emerging GenAI technologies and methodologies Technical Expertise Requirements Generative AI Technologies Expert-level understanding of: Retrieval Augmented Generation (RAG) Vector Database architectures Agentic AI design principles AWS AI Services Comprehensive expertise in: AWS Bedrock Amazon SageMaker AWS AI/ML services ecosystem Cloud-native AI solution design Technical Skills Advanced Python programming for AI/ML applications

Posted 2 months ago

Apply

8.0 - 15.0 years

8 - 15 Lacs

Bengaluru / Bangalore, Karnataka, India

On-site

As a Manager, Machine Learning Engineering, you will collaborate with cross-functional teams to strategize, develop, and deliver machine learning models tailored to meet specific business objectives. You will be responsible for overseeing the entire lifecycle of these models, from data preprocessing and algorithm selection to performance evaluation and seamless integration into production systems. This role has a specific focus on Generative AI. Your Impact: What You'll Achieve As a Manager, Data Science specializing in Generative AI, you will: Lead AI-Driven Innovations: Drive the development of state-of-the-art AI and machine learning solutions that transform business strategies and deliver exceptional customer experiences. Strategic Collaboration: Work closely with cross-functional teams, including product managers, data engineers, and business stakeholders, to define and execute data-driven solutions aligned with organizational goals. Foster a High-Performance Team: Build, mentor, and lead a team of talented data scientists, cultivating a culture of innovation, collaboration, and continuous learning. Deliver Business Impact: Translate complex business problems into AI/ML solutions by leveraging advanced techniques such as generative AI, deep learning, and NLP, ensuring measurable outcomes. Optimize AI Pipelines: Oversee the development and deployment of scalable, efficient, and robust machine learning pipelines that address latency, responsiveness, and real-time data processing challenges. Customize AI Models: Direct the customization and fine-tuning of AI models, including large language models (LLMs) and other generative AI technologies, to meet domain-specific requirements. Promote Data-Driven Decision-Making: Advocate for data-centric approaches across teams, ensuring data quality, integrity, and readiness to maximize model performance and business impact.
Develop Intelligent AI Agents: Architect and refine AI agents that solve complex business challenges, leveraging LLMs to deliver personalized, user-centric solutions. Advance Generative AI Applications: Innovate with cutting-edge generative AI models such as LLMs, VLMs, GANs, and VAEs to create tailored applications for dynamic content creation, predictive analytics, and enhanced automation. Scale AI with Cloud Technology: Deploy and scale LLM-based solutions on platforms like GCP, AWS, and Azure to address real-world business problems with precision and efficiency. Stay at the Cutting Edge: Keep up-to-date with emerging trends and innovations in AI and data science, identifying opportunities to incorporate the latest advancements into projects. Responsibilities Design AI Systems: Build AI agents for tasks such as content compliance, asset decomposition, and contextual personalization. Develop NLP Pipelines: Implement advanced NLP solutions for search relevance, intent detection, and dynamic content generation. Integrate Multi-Modal Systems: Combine data modalities such as text, images, and metadata for enriched user interactions and insights. Optimize AI Pipelines: Innovate in latency reduction, scalability, and real-time responsiveness for AI systems in production. Collaborate on AI Innovation: Work with business stakeholders to identify opportunities and deliver impactful AI-driven solutions. Qualifications Your Skills & Experience Overall Experience: 8 to 15 years of experience. Generative AI Experience: At least 2 years of Gen AI experience. LLM Fine-tuning: Fine-tuning experience with Large Language Models (LLMs, VLLMs, or Vision models). Distributed Training/Inference: Experience with distributed training or inference frameworks like Ray, vLLM, OpenLLM, BentoML, etc. Generative AI Frameworks: Experience with frameworks like LangChain and LlamaIndex for building maintainable, scalable Generative AI applications.
LLM Deployment/Optimization: Deployment experience or optimized hosting experience of Large Language Models (LLMs, VLLMs, or Vision models). Vector Databases: Experience working with any vector database like Milvus, FAISS, ChromaDB, etc. Agent Development: Experience developing agents with frameworks like LangGraph, CrewAI, AutoGen, etc. Prompt Engineering: Experience with prompt engineering. Market Trends: Keeping up with the latest market trends. Open Source LLMs: Experience working with open-source large language models from Hugging Face. Cloud Providers: Experience working with at least one public cloud provider such as Azure, AWS, or GCP. Container Technology: Experience working with container technology like Docker, ECS, etc. DevOps & CI/CD: Experience with DevOps practices and CI/CD pipelines for data solutions. Production Deployment: Experience in deploying solutions to production with Kubernetes or OpenShift. ML Workflow Management: Experience with managing ML workflows with MLflow or Kubeflow.

Posted 2 months ago

Apply

8.0 - 12.0 years

12 - 18 Lacs

Chennai

Work from Office

We are seeking a full-stack software developer and team lead to work with our front-end and back-end software development team and our computational linguistics team. This role requires architectural decisions, end-to-end planning and follow-up.

Posted 2 months ago

Apply

4.0 - 8.0 years

14 - 24 Lacs

Bengaluru

Hybrid

Data Scientist / GenAI LLM Engineer. Working at the edge of AI/ML technologies, we help our clients leverage the value of unstructured data and uncover the hidden power of accumulated enterprise information. As an LLM engineer, you will have the opportunity to apply your deep expertise in LLM/GenAI technologies. Your main responsibility will be collaborating closely with clients to prototype, build, test, and deploy products powered by GenAI/LLM technology on a large scale. Additionally, you will play a key role in fine-tuning the hyperparameters of LLM models, optimizing their configuration to ensure and enhance overall model performance and drive positive outcomes for clients. Work with Python and LLM/GenAI frameworks and tools. Develop end-to-end AI/ML solutions, including CI/CD pipeline development, LLM model containerization, and deployment in the cloud or on premises. Test models and provide follow-up maintenance. Support all stages of the ML model life cycle. Design prototypes and POCs to demonstrate solution feasibility and value. Provide solution architecture. Research, design, build, and train innovative applications of LLMs to solve complex real-world problems. Provide technical guidance to clients adopting LLM technologies.
Required Qualifications: Bachelor's degree (final-year students may be considered) in Statistics, Applied Mathematics, Computer Science, or a related field. Experience in AI/ML technologies and software engineering: 3+ years of hands-on experience with Python, 2+ years of experience with shell scripting, and 1+ years of experience building and maintaining scalable API solutions. 2+ years of professional experience with NLP and 1+ years of professional experience with Large Language Model (LLM)/GenAI technology such as OpenAI API, ChatGPT, GPT-4, Bard, Synthesia, LangChain, Hugging Face Transformers, PyTorch, and similar. 1+ years of experience with prompt engineering and 1+ years of experience with vector databases. 2+ years of experience with AWS, GCP, or Microsoft Azure. 2+ years of experience with MLOps, CI/CD pipeline development, containerization, and model deployment in test and production environments. Team player, fluent in English, with the ability to clearly communicate complex LLM capabilities and limitations to non-technical stakeholders. Desired Qualifications: M.Sc. or Ph.D. in a corresponding field. Experience with Java, JavaScript, or Scala. 2+ years of experience with Snowflake or Databricks. Deep knowledge of a specific domain or industry, with a focus on NLP/LLM. In-depth understanding of Responsible AI standards and protocols. Applied research background leveraging frameworks to build LLM prototypes; knowledge of best practices for production LLM development.

Posted 2 months ago

Apply

5.0 - 8.0 years

2 - 5 Lacs

Bengaluru

Work from Office

Design, develop, and optimize prompts to improve user interactions with AI models and tools. Collaborate with cross-functional teams to integrate prompt engineering capabilities into various applications. Implement and maintain high-quality prompt solutions within CoPilot Studio, ensuring alignment with user experience goals. Utilize the Azure Communication Framework to enhance AI model performance in real-world applications. Continuously test, evaluate, and refine prompt workflows to ensure effectiveness and efficiency. Document prompt design processes, testing protocols, and configuration guidelines to support operational consistency. Provide support in troubleshooting and resolving prompt-related issues.

Posted 2 months ago

Apply

5.0 - 10.0 years

9 - 19 Lacs

Hyderabad, Chennai, Bengaluru

Work from Office

Role: Senior Technical Lead. Skills: AI/ML/GenAI, Python (NumPy, Pandas, Scikit-learn, Keras, Flask, SciPy, TensorFlow, NLTK), GPT/PaLM/BERT, BART, Azure Cloud. Billing Rate: USD 28. No. of positions: 1. Work Location: Chennai (preferred), Bangalore, Hyderabad, Noida, Pune. Notice Period: Immediate joiners. Those who are interested may send their resume to aswathy.rajan@hcltech.com

Posted 2 months ago

Apply

10.0 - 20.0 years

37 - 45 Lacs

Chandigarh

Remote

Job Title: AI/ML and Chatbot Lead Experience Level: 10+ Years (Lead/Architect level) Location: Remote Employment Type: Full-time No. of Positions: 1 Job Overview: We are seeking a visionary and hands-on AI/ML and Chatbot Lead to spearhead the design, development, and deployment of enterprise-wide Conversational and Generative AI solutions. This role will establish and scale our AI Lab function, define chatbot and multimodal AI strategies, and deliver intelligent automation solutions that enhance user engagement and operational efficiency. Key Responsibilities Define and lead the enterprise-wide strategy for Conversational AI, Multimodal AI, and Large Language Models (LLMs). Build an AI/Chatbot Lab, creating a roadmap and driving innovations across in-app, generative, and conversational AI. Architect scalable AI/ML systems including presentation, orchestration, AI, and data layers. Collaborate with business stakeholders to assess needs, conduct ROI analyses, and deliver impactful AI use cases. Identify and implement agentic AI capabilities and SaaS optimization opportunities. Deliver POCs, pilots, and MVPs, owning the design, development, and deployment lifecycle. Lead, mentor, and scale a high-performing team of AI/ML engineers and chatbot developers. Build multi-turn, memory-aware conversations using frameworks like LangChain or Semantic Kernel. Integrate bots with platforms like Salesforce, NetSuite, Slack, and custom applications via APIs/webhooks. Implement and monitor chatbot KPIs using tools like Kibana, Grafana, and custom dashboards. Champion ethical AI, governance, and data privacy/security best practices. Must-Have Skills 10+ years in AI/ML; demonstrable success in chatbot, conversational AI, and generative AI implementations. Experience building and operationalizing an AI/Chatbot architecture framework used enterprise-wide.
Expertise in: Python, LangChain, ElasticSearch, NLP (spaCy, NLTK, Hugging Face); LLMs (e.g., GPT, BERT), RAG, prompt engineering; chatbot platforms (Azure OpenAI, MS Bot Framework), CLU, CQA; AI solution deployment and monitoring at scale. Familiarity with: machine learning algorithms, deep learning, reinforcement learning; NLP techniques for NLU/NLG; cloud platforms (AWS, Azure, GCP), Docker, Kubernetes; vector DBs (Pinecone, Weaviate, Qdrant); semantic search, knowledge graphs, intelligent document processing. Strong grasp of AI governance, documentation, and compliance standards. Excellent team leadership, communication, and documentation skills. Good-to-Have Skills Experience with Glean, Perplexity.ai, Rasa, XGBoost. Familiarity with Salesforce, NetSuite, and business domains like Customer Success. Knowledge of RPA tools like UiPath and its AI Center. Interested candidates can call 7087707007.

Posted 3 months ago

Apply


Featured Companies