Posted:1 week ago| Platform: Foundit logo

Apply

Work Mode

Remote

Job Type

Full Time

Job Description

Senior Manager - Senior Data Scientist (NLP & Generative AI)

Location:


About the Role

We are seeking a highly experienced Senior data scientist with 8+ years of expertise in machine learning, focusing on NLP, Generative AI, and advanced LLM ecosystems. This role demands leadership in designing and deploying scalable AI systems leveraging the latest advancements such as Google ADK, Agent Engine, and Gemini LLM. You will spearhead building real-time inference pipelines and agentic AI solutions that power complex, multi-user applications with cutting-edge technology.


Key Responsibilities

  • Lead the architecture, development, and deployment of scalable machine learning and AI systems centered on real-time LLM inference for concurrent users.

  • Design, implement, and manage agentic AI frameworks leveraging Google Adk, Langgraph or custom-built agents.

  • Integrate foundation models (GPT, LLaMA, Claude, Gemini) and fine-tune them for domain-specific intelligent applications.

  • Build robust MLOps pipelines for end-to-end lifecycle management of models-training, testing, deployment, and monitoring.

  • Collaborate with DevOps teams to deploy scalable serving infrastructures using containerization (Docker), orchestration (Kubernetes), and cloud platforms.

  • Drive innovation by adopting new AI capabilities and tools, such as Google Gemini, to enhance AI model performance and interaction quality.

  • Partner cross-functionally to understand traffic patterns and design AI systems that handle real-world scale and complexity.


Required Skills & Qualifications

  • Bachelor's or Master's degree in Computer Science, AI, Machine Learning, or related fields.

  • 7+ years in ML engineering, applied AI, or senior data scientist roles.

  • Strong programming expertise in Python and frameworks including PyTorch, TensorFlow, Hugging Face Transformers.

  • Deep experience with NLP, Transformer models, and generative AI techniques.

  • Practical knowledge of LLM inference scaling with tools like vLLM, Groq, Triton Inference Server, and Google ADK.

  • Hands-on experience deploying AI models to concurrent users with high throughput and low latency.

  • Skilled in cloud environments (AWS, GCP, Azure) and container orchestration (Docker, Kubernetes).

  • Familiarity with vector databases (FAISS, Pinecone, Weaviate) and retrieval-augmented generation (RAG).

  • Experience with agentic AI using Adk, LangChain, Langgraph and Agent Engine


Preferred Qualifications

  • Experience with Google Gemini and other advanced LLM innovations.

  • Contributions to open-source AI/ML projects or participation in applied AI research.

  • Knowledge of hardware acceleration and GPU/TPU-based inference optimization.

  • Exposure to event-driven architectures or streaming pipelines (Kafka, Redis).

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You

Noida, Uttar Pradesh, India

Gurgaon, Haryana, India

Gurgaon, Haryana, India