Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
5.0 - 9.0 years
0 Lacs
pune, maharashtra
On-site
You will be joining the engineering team at STAND 8 as a Senior AI Engineer / Data Engineer where you will play a key role in developing cutting-edge AI-powered business solutions. Your primary focus will be designing and optimizing AI systems that utilize advanced large language models, real-time AI interactions, and state-of-the-art retrieval architectures. Your contributions will directly impact products that are revolutionizing various business operations, especially in areas such as recruitment, data extraction, and decision-making processes. As a Senior AI Engineer / Data Engineer at STAND 8, your responsibilities will include designing, building, and enhancing AI-powered systems that incorporate multi-modal architectures encompassing text, voice, and visual elements. You will also be tasked with integrating and deploying large language model (LLM) APIs from providers like OpenAI, Anthropic, and AWS Bedrock, as well as constructing and managing Retrieval-Augmented Generation (RAG) systems with hybrid search capabilities, re-ranking functionalities, and knowledge graphs. Additionally, you will develop real-time AI features using streaming analytics and voice interaction tools, build APIs and pipelines to support AI workflows, process unstructured documents with layout and semantic understanding, implement predictive models, and deploy scalable solutions using various AWS services. Your role will also involve utilizing Docker for containerization, managing CI/CD workflows, version control through Git, debugging, monitoring, and optimizing performance for large-scale data pipelines, and collaborating with cross-functional teams consisting of product, data, and engineering professionals. To qualify for this role, you should possess at least 5 years of experience in AI/ML or data engineering with Python in production environments. Hands-on experience with LLM APIs and frameworks such as OpenAI, Anthropic, Bedrock, or LangChain is essential, along with expertise in vector databases like PGVector, Weaviate, FAISS, or Pinecone, a strong grasp of NLP, document extraction, and text processing, proficiency in AWS cloud services, experience with FastAPI or similar frameworks, familiarity with embedding models, prompt engineering, and RAG systems, knowledge of asynchronous programming, Docker, Git workflows, CI/CD pipelines, and testing best practices. Preferred qualifications include a background in HRTech or ATS integrations, knowledge of knowledge graphs for semantic relationships, experience with real-time AI systems and voice AI tools, advanced Python development skills, large-scale data processing experience, event-driven architecture knowledge using AWS services, and hands-on experience with fine-tuning, evaluating, and deploying foundation models.,
Posted 2 weeks ago
5.0 - 9.0 years
0 Lacs
pune, maharashtra
On-site
STAND 8 provides end-to-end IT solutions to enterprise partners across the United States and with offices in Los Angeles, New York, New Jersey, Atlanta, and more including internationally in Mexico and India. We are looking for a Senior AI Engineer / Data Engineer to be a part of our engineering team and contribute to building the future of AI-powered business solutions. In this role, you will be involved in developing intelligent systems that make use of advanced large language models (LLMs), real-time AI interactions, and cutting-edge retrieval architectures. Your efforts will have a direct impact on products that are revolutionizing business operations, particularly in the areas of recruitment, data extraction, and intelligent decision-making. This position offers an exciting opportunity for individuals who excel in constructing production-grade AI systems and are adept at working across the full spectrum of modern AI technologies. Responsibilities - Design, construct, and enhance AI-powered systems utilizing multi-modal architectures encompassing text, voice, and visual elements. - Incorporate and deploy LLM APIs sourced from providers like OpenAI, Anthropic, and AWS Bedrock. - Develop and manage RAG (Retrieval-Augmented Generation) systems featuring hybrid search, re-ranking, and knowledge graphs. - Create real-time AI functionalities through the utilization of streaming analytics and voice interaction tools such as ElevenLabs. - Establish APIs and pipelines using FastAPI or similar frameworks to facilitate AI workflows. - Process and evaluate unstructured documents with an understanding of layout and semantics. - Implement predictive models that drive intelligent business recommendations. - Deploy and sustain scalable solutions leveraging AWS services like EC2, S3, RDS, Lambda, Bedrock, among others. - Utilize Docker for containerization and oversee CI/CD workflows and version control via Git. - Debug, monitor, and optimize performance for large-scale data pipelines. - Collaborate seamlessly with product, data, and engineering teams across different functions. Qualifications - Possess 5+ years of experience in AI/ML or data engineering utilizing Python in production environments. - Hands-on familiarity with LLM APIs and frameworks such as OpenAI, Anthropic, Bedrock, or LangChain. - Previous experience in deploying vector databases like PGVector, Weaviate, FAISS, or Pinecone in production settings. - Solid grasp of NLP, document extraction, and text processing. - Proficiency in AWS cloud services including Bedrock, EC2, S3, Lambda, and monitoring tools. - Experience with FastAPI or similar frameworks for constructing AI/ML APIs. - Knowledge of embedding models, prompt engineering, and RAG systems. - Proficiency in asynchronous programming for high-throughput pipelines. - Proficiency in Docker, Git workflows, CI/CD pipelines, and adherence to testing best practices. Preferred - Background in HRTech or ATS integrations (e.g., Greenhouse, Workday, Bullhorn). - Experience working with knowledge graphs (e.g., Neo4j) for semantic relationships. - Exposure to real-time AI systems (e.g., WebRTC, OpenAI Realtime API) and voice AI tools (e.g., ElevenLabs). - Advanced Python development skills employing design patterns and clean architecture. - Experience in large-scale data processing (1-2M+ records) with cost optimization techniques for LLMs. - Proficiency in event-driven architecture utilizing AWS SQS, SNS, or EventBridge. - Hands-on experience with fine-tuning, evaluating, and deploying foundation models.,
Posted 2 weeks ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
39581 Jobs | Dublin
Wipro
19070 Jobs | Bengaluru
Accenture in India
14409 Jobs | Dublin 2
EY
14248 Jobs | London
Uplers
10536 Jobs | Ahmedabad
Amazon
10262 Jobs | Seattle,WA
IBM
9120 Jobs | Armonk
Oracle
8925 Jobs | Redwood City
Capgemini
7500 Jobs | Paris,France
Virtusa
7132 Jobs | Southborough