Posted: 1 week ago
Remote
Full Time
We are seeking a highly experienced Senior Data Scientist with 8+ years of expertise in machine learning, focusing on NLP, generative AI, and advanced LLM ecosystems. This role demands leadership in designing and deploying scalable AI systems that use recent tooling such as Google ADK, Agent Engine, and the Gemini LLM. You will lead the building of real-time inference pipelines and agentic AI solutions that power complex, multi-user applications.
Lead the architecture, development, and deployment of scalable machine learning and AI systems centered on real-time LLM inference for concurrent users.
Design, implement, and manage agentic AI frameworks using Google ADK, LangGraph, or custom-built agents.
Integrate foundation models (GPT, LLaMA, Claude, Gemini) and fine-tune them for domain-specific intelligent applications.
Build robust MLOps pipelines for end-to-end lifecycle management of models: training, testing, deployment, and monitoring.
Collaborate with DevOps teams to deploy scalable serving infrastructures using containerization (Docker), orchestration (Kubernetes), and cloud platforms.
Drive innovation by adopting new AI capabilities and tools, such as Google Gemini, to enhance AI model performance and interaction quality.
Partner cross-functionally to understand traffic patterns and design AI systems that handle real-world scale and complexity.
Bachelor's or Master's degree in Computer Science, AI, Machine Learning, or a related field.
7+ years in ML engineering, applied AI, or senior data scientist roles.
Strong programming expertise in Python and in frameworks including PyTorch, TensorFlow, and Hugging Face Transformers.
Deep experience with NLP, Transformer models, and generative AI techniques.
Practical knowledge of LLM inference scaling with tools like vLLM, Groq, Triton Inference Server, and Google ADK.
Hands-on experience deploying AI models to concurrent users with high throughput and low latency.
Skilled in cloud environments (AWS, GCP, Azure) and container orchestration (Docker, Kubernetes).
Familiarity with vector databases (FAISS, Pinecone, Weaviate) and retrieval-augmented generation (RAG).
Experience with agentic AI using ADK, LangChain, LangGraph, and Agent Engine.
Experience with Google Gemini and other advanced LLM innovations.
Contributions to open-source AI/ML projects or participation in applied AI research.
Knowledge of hardware acceleration and GPU/TPU-based inference optimization.
Exposure to event-driven architectures or streaming pipelines (Kafka, Redis).
EXL IT service management