This role is for one of our clients
Industry: Technology, Information and MediaSeniority level: Associate levelMin Experience: 3 yearsJobType: full-time
About The Role
Are you ready to redefine how people interact with technology using natural language? We’re looking for a visionary Principal Engineer – Conversational AI & Voice Systems
to spearhead the development of intelligent, real-time voice-based agents. In this role, you'll leverage your deep expertise in machine learning, natural language processing, and speech technologies
to craft next-gen AI calling experiences that feel truly human.
You’ll join a high-performing team working at the intersection of AI, linguistics, and software engineering. Your innovations will empower millions of customer interactions, driving engagement, satisfaction, and business value through intelligent, lifelike conversations.What You’ll Do
🎯 Architect & Lead Voice AI Solutions
Design, prototype, and productionize end-to-end voice agent systems combining TTS, ASR, NLP, and ML models.
Drive decisions around system design, architecture, and performance optimization for low-latency, real-time interactions.🧠 Advance AI Conversation Models
Build and fine-tune neural TTS models to deliver expressive, human-like speech.
Develop context-aware dialogue systems using transformer-based NLP frameworks for robust understanding and dynamic response generation.Implement reinforcement learning or statistical techniques to optimize dialogue flow and personalization.📊 Data-Driven Improvement
Analyze structured and unstructured conversation data to identify model gaps and train higher-accuracy ML models.
Collaborate with linguistic, annotation, and product teams to continuously improve language coverage, tone, and content alignment.🛠️ Cross-Functional Collaboration
Partner with backend engineers, DevOps, linguists, and product teams to ensure smooth integration and deployment.
Work closely with voice UX designers to refine the end-user experience.🧪 Innovation & R&D
Stay ahead of the curve with advancements in speech synthesis (e.g., Tacotron, FastSpeech), NLP (e.g., BERT, GPT), and generative AI.
Lead proof-of-concept initiatives and experimental model development.What You Bring
✔️ Core Qualifications
4–7 years of hands-on experience in developing conversational AI, voice interfaces, or speech-driven systems.
Strong expertise in Python and ML/NLP libraries such as PyTorch, TensorFlow, HuggingFace Transformers, or spaCy.Deep understanding of speech synthesis, voice cloning, acoustic modeling, or signal processing.Track record of deploying production-grade NLP or speech AI pipelines.Solid grasp of dialogue management, context tracking, and natural language understanding.🌐 Bonus Experience
Experience with real-time voice agents, latency optimization, or telephony integrations.
Familiarity with cloud services (AWS/GCP/Azure) and APIs for voice platforms (Dialogflow, Amazon Lex, Twilio Voice, etc.).Background in computational linguistics or multi-lingual AI systems.🎓 Education
Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Computational Linguistics, or a related field. PhD is a plus.
Why Join Us
Help build the future of intelligent voice systems that feel intuitive, emotional, and human.
Collaborate with experts across AI, product, and design to shape a new category of human-tech interaction.Access cutting-edge resources, tools, and compute infrastructure for experimentation and scale.Be part of a mission-driven organization where your work directly transforms user experience at scale.