6 Voice AI Jobs

JobPe aggregates listings for easy access; applications are submitted directly on the original job portal.

1.0 - 5.0 years

0 Lacs

Hyderabad, Telangana

On-site

NTT DATA is hiring for the role "GCP Python Gen AI LLM RAG Vertex AI" in Hyderabad, Telangana, India.

Required:
- At least 4 years of software engineering experience, or equivalent demonstrated through work experience, training, military experience, or education.
- At least 2 years of experience with GCP (Google Cloud Platform) or another public/hybrid cloud, delivering products with cloud services and cloud architectures at scale.
- 2+ years of experience with Python.
- 3+ years of experience with GenAI, LLMs, RAG, vector databases, and conversational bots.
- 1+ years of experience with Playbooks and Vertex AI.
- Hands-on exposure to ADK and Voice AI.

Good to have:
- Experience with LangChain and/or LangGraph.
- 4+ years of contact center industry experience, including design, development, testing, and integration with vendors, CRMs, and business applications.
- Proven knowledge of contact center subdomains such as IVR/IVA, NLU/NLP, real-time omni-channel agent experience, customer journey, and CX/AX experience optimization using AI/ML.
- Familiarity with Node.js, Java, Spring Boot, Kafka, distributed caches (GemFire, Redis), Elasticsearch, GraphQL, NoSQL databases (Cassandra or Mongo), graph databases, and public cloud marketplace services.
- At least 2 years of experience with deep Domain-Driven Design and cloud-native microservices designed and developed for massive scale and seamless resiliency, deployed on PCF/VMware Tanzu, K8s, or serverless cloud technologies.

NTT DATA is a trusted global innovator of business and technology services, with a commitment to helping clients innovate, optimize, and transform for long-term success. Being a part of NTT DATA means being part of a diverse team of experts in over 50 countries, with a robust partner ecosystem. Their services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation, and management of applications, infrastructure, and connectivity. NTT DATA is a leading provider of digital and AI infrastructure globally, and as part of the NTT Group, they invest significantly in R&D to help organizations and society move confidently and sustainably into the digital future. Visit their website for more information.

Posted 3 days ago

5.0 - 9.0 years

0 Lacs

Haryana

On-site

As a Senior AI Engineer specializing in Voice AI and Autonomous Agents at Spyne, you will own and build Spyne's in-house voice bot stack. The role draws on expertise in LLMs, ASR/TTS, and voice UX to develop immersive, human-like conversations between auto dealerships and their customers. The position is based in Gurugram, working from the office five days a week, and focuses on AI solutions for the automotive retail sector.

Your primary responsibilities will include:
- Voice AI Stack Ownership: Developing and managing the complete voice bot pipeline encompassing ASR, NLU, dialog state management, tool calling, and TTS to deliver a seamless conversation experience.
- LLM Orchestration & Tooling: Designing systems utilizing MCP to facilitate structured context exchange between real-time ASR, memory, APIs, and the LLM.
- RAG Integration: Implementing retrieval-augmented generation to support responses based on dealership knowledge bases, inventory data, recall lookups, and FAQs.
- Vector Store & Memory: Creating scalable vector-based search functionalities for dynamic FAQ handling, call recall, and user-specific memory embedding.
- Latency Optimization: Engineering low-latency, streaming ASR + TTS pipelines and optimizing turn-taking models for natural conversations.
- Model Tuning & Hallucination Control: Customizing tone, reducing hallucinations, and aligning responses with business objectives via fine-tuning, LoRA, or instruction tuning.
- Instrumentation & QA Looping: Establishing robust observability, conducting real-time call QA processes, and analyzing interruptions, hallucinations, and fallbacks.
- Cross-functional Collaboration: Collaborating closely with product, infra, and leadership teams to scale the voice bot solution to numerous US dealerships.

To excel in this role, you should possess:
- Architectural Thinking: Ability to understand how ASR, LLMs, memory, and tools integrate, and to design modular, observable, and resilient systems.
- LLM Tooling Mastery: Proficiency in implementing tool calling, retrieval pipelines, function calls, or prompt chaining across various workflows.
- Fluency in Vector Search & RAG: Expertise in chunking, embedding, indexing, and retrieval processes while avoiding prompt bloat and token overflow.
- Latency-First Mindset: Capability to identify and rectify token delays, optimize API round-trip time, and ensure human-like call interactions.
- Grounding > Hallucination: Skill in tracing hallucinations to weak prompts, lack of guardrails, or tool access deficiencies and addressing them effectively.
- Prototyping Skills: Comfort with building from scratch and rapid iteration using open-source or hosted tools as required.

Requirements for this role include:
- 5+ years of experience in AI/ML or voice/NLP systems with real-time exposure.
- Profound knowledge of LLM orchestration, RAG, vector search, and prompt engineering.
- Experience with MCP-style architectures and structured context pipelines between LLMs and APIs/tools.
- Familiarity with integrating ASR (Whisper/Deepgram), TTS (ElevenLabs/Coqui), and OpenAI/GPT-style models.
- Strong understanding of latency optimization, streaming inference, and real-time audio pipelines.
- Proficiency in Python, FastAPI, vector DBs (Pinecone, Weaviate, FAISS), and cloud infrastructures (AWS/GCP).
- Solid debugging, logging, and QA capabilities for hallucination, grounding, and UX analysis.

Join Spyne for real-world AI impact, a strong team that balances speed with technical depth, high autonomy and visibility from day one, accelerated career growth, a MacBook along with essential tools, and a flat structure focused on meaningful work without unnecessary bureaucracy.

Posted 5 days ago

5.0 - 9.0 years

0 Lacs

Haryana

On-site

As a Senior AI Engineer - Voice AI / Autonomous Agents at Spyne, you will have the opportunity to own and build Spyne's in-house voice bot stack. In this high-impact individual contributor role, you will be at the intersection of LLMs, ASR/TTS, and voice UX, focusing on creating deeply human, latency-optimized conversations between auto dealerships and their customers.

Your main responsibilities will include:
- Voice AI Stack Ownership: Building and owning the end-to-end voice bot pipeline, including ASR, NLU, dialog state management, tool calling, and TTS to deliver a natural, human-like conversation experience.
- LLM Orchestration & Tooling: Architecting systems using MCP (Model Context Protocol) to mediate structured context between real-time ASR, memory, APIs, and the LLM.
- RAG Integration: Implementing retrieval-augmented generation to ground responses using dealership knowledge bases, inventory data, recall lookups, and FAQs.
- Vector Store & Memory: Designing scalable vector-based search for dynamic FAQ handling, call recall, and user-specific memory embedding.
- Latency Optimization: Engineering low-latency, streaming ASR + TTS pipelines and fine-tuning turn-taking models for natural conversation.
- Model Tuning & Hallucination Control: Using fine-tuning, LoRA, or instruction tuning to customize tone, reduce hallucinations, and align responses to business goals.
- Instrumentation & QA Looping: Building robust observability, running real-time call QA pipelines, and analyzing interruptions, hallucinations, and fallbacks.
- Cross-functional Collaboration: Working closely with product, infra, and leadership to scale this bot to thousands of US dealerships.

To be successful in this role, you should possess:
- Architect-level thinking: Understanding how ASR, LLMs, memory, and tools fit together, and the ability to design modular, observable, and resilient systems.
- LLM Tooling Mastery: Implementing tool calling, retrieval pipelines, function calls, or prompt chaining across multiple workflows.
- Fluency in Vector Search & RAG: Knowing how to chunk, embed, index, and retrieve, while avoiding prompt bloat and token overflow.
- Latency-First Mindset: Debugging token delays, understanding the cost of each API hop, and optimizing round-trip time to maintain human-like interactions.
- Grounding > Hallucination: Tracing hallucinations back to weak prompts, missing guardrails, or lack of tool access, and addressing them effectively.
- Prototyper at heart: Being unafraid of building from scratch and iterating quickly, utilizing open-source or hosted tools as necessary.

The ideal candidate will have:
- 5+ years of experience in AI/ML or voice/NLP systems with real-time experience.
- Deep knowledge of LLM orchestration, RAG, vector search, and prompt engineering.
- Experience with MCP-style architectures or structured context pipelines between LLMs and APIs/tools.
- Experience integrating ASR (Whisper/Deepgram), TTS (ElevenLabs/Coqui), and OpenAI/GPT-style models.
- Solid understanding of latency optimization, streaming inference, and real-time audio pipelines.
- Hands-on experience with Python, FastAPI, vector DBs (Pinecone, Weaviate, FAISS), and cloud infra (AWS/GCP).
- Strong debugging, logging, and QA instincts for hallucination, grounding, and UX behavior.

Working at Spyne offers real-world AI impact at scale, a high-performing team that balances speed with technical depth, high autonomy and visibility from day one, rapid career acceleration, access to a MacBook and all necessary tools and compute, a flat structure with real work focus, and no BS. Join us in redefining how cars are marketed and sold with cutting-edge Generative AI.

Posted 1 week ago

2.0 - 5.0 years

6 - 9 Lacs

Kolkata

Work from Office

Role & Responsibilities:
- Design and implement AI workflows to automate CRM, trip quotations, lead follow-ups, and customer chat/voice queries.
- Build smart agents using GPT (chat + voice) for internal and customer use (OpenAI, Twilio, CallHippo).
- Integrate travel APIs (flights, hotels, activities) with our platform for live quotation generation.
- Automate repetitive tasks using Make.com, Zapier, and internal tools.
- Work with the Product and Creative teams to bring AI to media creation (photo/video sorting, customer albums, etc.).
- Develop performance dashboards, auto-suggestion tools, and smart seller assistants.
- Stay up to date with AI trends in TravelTech and test applicable models for use.

Preferred Candidate Profile:
- 2 to 5 years of real-world experience in applied AI, automation, or backend systems.
- Strong Python skills (FastAPI, LangChain, Pandas, etc.).
- Hands-on with OpenAI/GPT APIs, Twilio, WhatsApp integrations.
- Familiarity with automation tools like Make.com, Zapier.
- Passionate about travel, tech, and creating impact.
- Bonus: Experience in e-commerce, SaaS, or travel-tech products.

Posted 2 weeks ago

5.0 - 8.0 years

15 - 30 Lacs

Noida

Work from Office

Position Summary: We are looking for a dynamic Senior Voice AI Developer / Voice AI Architect to spearhead our AI initiatives, guiding the integration of artificial intelligence into various aspects of our business. You will design AI systems from the ground up, collaborate with multidisciplinary teams to tailor AI solutions to specific business needs, and ensure these solutions are scalable and sustainable. Your expertise will help us harness the power of AI to drive innovation, improve decision-making, and maintain competitive advantage in our industry.

Key Responsibilities:
- Design, develop, and oversee the implementation of end-to-end AI solutions.
- Collaborate with business and IT stakeholders to understand and fulfill the AI needs of the organization.
- Create architectural approaches for AI software and hardware integration.
- Define AI solution objectives and ensure alignment with business outcomes.
- Monitor AI industry trends and maintain state-of-the-art industry knowledge.
- Implement best practices for AI testing, deployment, and maintenance.

Qualifications:
- B.Tech/M.Tech/MCA (CSE/IT), preferably from premier institutes.
- 5+ years of experience in Node.js / JavaScript / TypeScript.
- 5+ years of relevant experience in AI frameworks.

Mandatory Skills: Google Dialogflow, OpenAI integration, Google STT/TTS, and Node.js
- Node.js: Competent in developing and maintaining applications.
- GCP Services: Proficient in GCP logs, services, and custom deployment configurations.
- Dialogflow Expertise: Strong understanding of conversational agent design and integration for voice applications.
- Speech-to-Text & Text-to-Speech: Functional knowledge of speech processing.
- Functional knowledge of audio streaming/processing applications.
- Functional knowledge of conversational AI.
- Generative AI: Foundational understanding of Gen AI concepts and applications.
- Proven understanding of scalable computing systems, microservices architectures, software architecture, data structures, and algorithms.
- Experience working with defined processes and policies (e.g., peer review, test-driven development, coding standards, and deployment process).
- Excellent, proven analytical problem-solving skills.
- Self-motivated high performer who can work with minimal supervision and lead by example.
- Excellent written and verbal communication skills.

Good To Have:
- Experience integrating with Azure OpenAI.
- Working knowledge of any CCAI framework/applications.
- Audio intelligence frameworks.
- Experience with back-end technologies like NoSQL, RDBMS, and cache management.
- Experience working with AWS technologies: Lambda, API Gateway, S3.
- Agile development methodology.
- Experience working with geographically distributed teams.
- Knowledge of any CRM (WFM) is a bonus; ServiceNow is a big plus.

Benefits:
- Flexible working hours.
- Hybrid working style.
- Personal accident insurance.
- Health insurance for self, spouse, and two children.
- 5-day working week.

Posted 1 month ago

2 - 7 years

7 - 15 Lacs

Bengaluru

Hybrid

As a Prompt Engineer, you will play a crucial role in designing and refining the conversational prompts that power our AI voice agents. You will work closely with AI researchers, product managers, and developers

Posted 2 months ago

