1230 Inference Jobs - Page 4

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

3.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Linkedin logo

Job Role: AI Engineer (WFO - Chennai)
Experience: 3 years
Job Type: Full Time, Permanent
Job Location: Chennai (5 days - work from office)
Notice Period: Immediate to 15 days

We are looking for a passionate AI Engineer with 1–3 years of hands-on experience in developing and deploying AI/ML models. You will contribute to designing scalable AI systems, training models, and supporting real-world deployment for autonomous vehicles and other AI-centric applications. If you're enthusiastic about applying your AI skills in impactful projects, join our innovative team.

Roles & Responsibilities:
Build and optimize AI/ML models for real-time autonomous driving systems.
Work closely with senior architects and product teams to translate business problems into AI solutions.
Implement and test deep learning models (e.g., CNNs, transformers) for tasks such as object detection, lane recognition, or natural language understanding.
Integrate AI models into production environments and contribute to continuous model improvement.
Collaborate with MLOps teams to ensure reliable deployment through CI/CD pipelines.
Write clean, well-documented, and efficient code.
Stay up-to-date with the latest research and best practices in AI and Machine Learning.

Requirements (Required Technical and Professional Expertise):
1–3 years of experience in AI/ML development and deployment.
Proficiency in Python and common ML frameworks (TensorFlow, PyTorch).
Experience with data preprocessing, model training, evaluation, and optimization.
Solid foundation in machine learning and deep learning algorithms.
Experience working with APIs, model versioning, and deployment workflows.
Familiarity with computer vision, NLP, or reinforcement learning.
Hands-on experience with Docker, Git, and cloud services (AWS, GCP, or Azure).
Strong problem-solving and analytical skills.
Good communication skills and ability to work in a collaborative environment.

Preferred Skills:
Experience with real-time inference and edge deployment.
Familiarity with autonomous systems or robotics applications.
Exposure to LLMs (e.g., GPT, BERT) or multimodal AI models.
Experience with RESTful APIs, microservices, and distributed systems.

Educational Qualifications:
Bachelor's or Master's degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.

Benefits
Innovative Projects: Work on cutting-edge AI technologies shaping the future of mobility.
Collaborative Culture: Join a passionate team pushing boundaries in AI and autonomy.
Career Growth: Be part of a fast-growing startup with plenty of growth opportunities.
Benefits: Competitive salary, health insurance (up to ₹20 lakhs), wellness programs, learning & development, mentorship, and performance-based increments.

Posted 3 days ago

6.0 - 10.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Linkedin logo

InCommon is hiring on behalf of an early-stage digital commerce startup.

Location: HSR Layout, Bangalore (In-office only)
Experience: 6–10 years

About the Role:
We’re looking for a Senior Product Manager to lead our Catalog Infrastructure & Distribution Platform. This is a foundational layer that powers product data across all touchpoints: D2C sites, marketplaces, offline retail, and more. You’ll build systems to collect, enrich, template, and distribute catalog content at scale, turning fragmented, unstructured product data into consistent, API-ready, multi-channel feeds. This role sits at the intersection of data modeling, workflow automation, API integration, and UI tooling, with high visibility across engineering, design, and GTM.

Responsibilities:
Define the vision for a single source of truth for catalog data across the ShopOS stack
Build tools to collect and enrich unstructured item data from various brand systems
Create smart templates for product attributes (code, design, specs, etc.)
Design and manage the catalog creation-to-distribution flow with click-to-publish functionality
Integrate with external APIs (e.g. marketplaces, ERP systems, offline retail PoS) for automated sync
Drive automation for SKU setup, versioning, updates, and distribution rules
Own the productization of catalog intelligence (attribute inference, bulk edits, Gen AI enrichments)

Requirements:
6–10 years of product management experience in eCommerce, supply chain, or SaaS platforms
Strong understanding of catalog management, PIM/DAM systems, or ERP integration
Experience working on data-heavy platforms or backend-heavy workflows
Ability to work with engineering on data pipelines, versioning logic, and API schemas
Sharp execution and MVP mindset: can break down complexity into simple workflows
Comfort working with brands and ops teams to understand real-world catalog challenges

Good to have:
Background in Gen AI or automation tooling
Experience with retailer onboarding, SKU mapping, and marketplace API integrations
Prior work on catalog enrichment (images, specs, variants, sizing, etc.)

Why This Role Matters:
Your work will power the foundational layer of commerce at ShopOS, enabling our AI agents to act on consistent, clean product data. This system will touch every brand we support and directly impact how fast and intelligently they go to market.

Our Values:
Extreme ownership and bias for action
Honest, high-velocity communication
Respect for craft and obsession with users
Fast, scrappy iteration over perfection
Low ego, high empathy, radical candor

What We Offer:
Competitive salary
Health & wellness benefits
Work from our vibrant HSR Layout (Bangalore) or Chennai (Nungambakkam) office
Direct mentorship from AI & commerce leaders
Zero-bureaucracy, high-ownership environment
Opportunity to shape the future of agentic commerce at internet scale

Posted 3 days ago

0.0 - 10.0 years

0 Lacs

Bengaluru, Karnataka

Remote

Indeed logo

Location: Bangalore - Karnataka, India - EOIZ Industrial Area
Job Family: Engineering
Worker Type Reference: Regular - Permanent
Pay Rate Type: Salary
Career Level: T3(B)
Job ID: R-44637-2025

Description & Requirements

Job Description

Introduction: A Career at HARMAN Digital Transformation Solutions (DTS)
We’re a global, multi-disciplinary team that’s putting the innovative power of technology to work and transforming tomorrow. At HARMAN DTS, you solve challenges by creating innovative solutions. Combine the physical and digital, making technology a more dynamic force to solve challenges and serve humanity’s needs.

Java Microservices
Java Developer with experience in microservices deployment, automation, and system lifecycle management (security and infrastructure management).

Required Skills:
Java, Hibernate, SAML/OpenSAML
REST APIs
Docker
PostgreSQL (PSQL)
Familiarity with the GitHub workflow

Good to Have:
Go (for automation and bootstrapping)
Raft consensus algorithm
HashiCorp Vault

Key Responsibilities:
Service Configuration & Automation: Configure and bootstrap services using the Go CLI. Develop and maintain Go workflow templates for automating Java-based microservices.
Deployment & Upgrade Management: Manage service upgrade workflows and apply Docker-based patches. Implement and manage OS-level patches as part of the system lifecycle. Enable controlled deployments and rollbacks to minimize downtime.
Network & Security Configuration: Configure and update FQDN, proxy settings, and SSL/TLS certificates. Set up and manage syslog servers for logging and monitoring. Manage appliance users, including root and SSH users, ensuring security compliance.
Scalability & Performance Optimization: Implement scale-up and scale-down mechanisms for resource optimization. Ensure high availability and performance through efficient resource management.
Lifecycle & Workflow Automation: Develop automated workflows to support service deployment, patching, and rollback. Ensure end-to-end lifecycle management of services and infrastructure.

What You Will Do
Perform in-depth analysis of data and machine learning models to identify insights and areas of improvement.
Develop and implement models using both classical machine learning techniques and modern deep learning approaches.
Deploy machine learning models into production, ensuring robust MLOps practices including CI/CD pipelines, model monitoring, and drift detection.
Fine-tune and integrate Large Language Models (LLMs) to meet specific business or product requirements.
Optimize models for performance and latency, including the implementation of caching strategies where appropriate.
Collaborate cross-functionally with data scientists, engineers, and product teams to deliver end-to-end ML solutions.

What You Need to Be Successful
Experience using statistical techniques to derive important insights and trends.
Proven experience in machine learning model development and analysis using classical and neural-network-based approaches.
Strong understanding of LLM architecture, usage, and fine-tuning techniques.
Solid understanding of statistics, data preprocessing, and feature engineering.
Proficiency in Python and popular ML libraries (scikit-learn, PyTorch, TensorFlow, etc.).
Strong debugging and optimization skills for both training and inference pipelines.
Familiarity with data formats and processing tools (Pandas, Spark, Dask).
Experience working with transformer-based models (e.g., BERT, GPT) and the Hugging Face ecosystem.

Bonus Points if You Have
Experience with MLOps tools (e.g., MLflow, Kubeflow, SageMaker, or similar).
Experience with monitoring tools (Prometheus, Grafana, or custom solutions for ML metrics).
Familiarity with cloud platforms (SageMaker, AWS, GCP, Azure) and containerization (Docker, Kubernetes).
Hands-on experience with MLOps practices and tools for deployment, monitoring, and drift detection.
Exposure to distributed training and model parallelism techniques.
Prior experience in A/B testing ML models in production.

What Makes You Eligible
Bachelor’s or master’s degree in Computer Science, Artificial Intelligence, or a related field.
5–10 years of relevant, proven experience in developing and deploying generative AI models and agents in a professional setting.

What We Offer
Flexible work environment, allowing for full-time remote work globally for positions that can be performed outside a HARMAN or customer location
Access to employee discounts on world-class HARMAN and Samsung products (JBL, HARMAN Kardon, AKG, etc.)
Extensive training opportunities through our own HARMAN University
Competitive wellness benefits
Tuition reimbursement
“Be Brilliant” employee recognition and rewards program
An inclusive and diverse work environment that fosters and encourages professional and personal development

You Belong Here
HARMAN is committed to making every employee feel welcomed, valued, and empowered. No matter what role you play, we encourage you to share your ideas, voice your distinct perspective, and bring your whole self with you – all within a support-minded culture that celebrates what makes each of us unique. We also recognize that learning is a lifelong pursuit and want you to flourish. We proudly offer added opportunities for training, development, and continuing education, further empowering you to live the career you want.

About HARMAN: Where Innovation Unleashes Next-Level Technology
Ever since the 1920s, we’ve been amplifying the sense of sound. Today, that legacy endures, with integrated technology platforms that make the world smarter, safer, and more connected. Across automotive, lifestyle, and digital transformation solutions, we create innovative technologies that turn ordinary moments into extraordinary experiences. Our renowned automotive and lifestyle solutions can be found everywhere, from the music we play in our cars and homes to venues that feature today’s most sought-after performers, while our digital transformation solutions serve humanity by addressing the world’s ever-evolving needs and demands. Marketing our award-winning portfolio under 16 iconic brands, such as JBL, Mark Levinson, and Revel, we set ourselves apart by exceeding the highest engineering and design standards for our customers, our partners and each other. If you’re ready to innovate and do work that makes a lasting impact, join our talent community today!

HARMAN is an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or Protected Veterans status. HARMAN offers a great work environment, challeng

Important Notice: Recruitment Scams
Please be aware that HARMAN recruiters will always communicate with you from an '@harman.com' email address. We will never ask for payments, banking, credit card, personal financial information or access to your LinkedIn/email account during the screening, interview, or recruitment process. If you are asked for such information or receive communication from an email address not ending in '@harman.com' about a job with HARMAN, please cease communication immediately and report the incident to us through: harmancareers@harman.com.

HARMAN is proud to be an Equal Opportunity / Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

Posted 3 days ago

5.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Linkedin logo

Our Company
Changing the world through digital experiences is what Adobe’s all about. We give everyone, from emerging artists to global brands, everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen. We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!

The Opportunity
The Identity and Administration team is looking for a motivated and versatile data analyst to drive Data & Insights for business decision making and growth testing. You will employ data to deeply understand customers through their administrative journey. The ideal candidate will have deep technical skills, a passion for delivering compelling customer analytics, curiosity, comfort with ambiguity, and the ability to influence partners to drive strategy.

What You'll Do
Lead strategic analytics projects, presenting findings that provide impactful insights for key collaborators.
Apply data from various sources to understand customer behavior, including in-product engagement, purchase patterns, and retention.
Build compelling narratives to drive action and find opportunities to improve the administration experiences.
Develop agile analysis frameworks to ensure quick and high-quality data insights that advise strategic decisions.
Collaborate closely with Product teams to successfully implement the right strategies for Identity-based experiences in a fast-paced, data-driven environment.
Design, implement, and analyze A/B experiments, delivering comprehensive reports and actionable insights to drive product improvements.
Partner with enablement and development teams to ensure accurate analytics measurement across existing and new features, supporting data systems.
Build and maintain dashboards to facilitate self-serve information access for teams.

What You Need to Succeed
A Bachelor's degree or equivalent experience in computer science, statistics, physics, or a related field. Technical experience of a similar level will also be taken into account.
5-7+ years of direct experience as a Data Analyst.
Deep understanding of statistics and expertise in building, productizing, maintaining, and improving predictive models is preferred.
Strong proficiency in querying, manipulating, validating, and analyzing large datasets using SQL or Hive.
Experience in ML modeling and data inference is highly desirable.
Proficiency with Python, R, or any other scripting language.
Experience with data visualization tools such as Tableau or Power BI.
Familiarity with Adobe Analytics and Amplitude is a plus.
Strong self-starter with a proven track record of delivering business insights that have shaped product strategy.
Outstanding communication and teamwork skills to work effectively with multidisciplinary teams.

Adobe is proud to be an Equal Employment Opportunity employer. We do not discriminate based on gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other applicable characteristics protected by law.

Adobe aims to make Adobe.com accessible to any and all users. If you have a disability or special need that requires accommodation to navigate our website or complete the application process, email accommodations@adobe.com or call (408) 536-3015.

Posted 3 days ago

5.0 years

0 Lacs

Ghaziabad, Uttar Pradesh, India

Remote

Linkedin logo

We're not your average tech company! RightCrowd is a global leader in keeping people safe and organizations secure. We build smart solutions that manage who's on-site, what access they have, and ensure everything is compliant. Think big names – Fortune 50 and ASX 10 companies rely on us to tackle their toughest security challenges. We're a passionate, global team with offices in places like Australia, the USA, Belgium, India, and the Philippines, and we're on a mission to make the world a safer place, one clever line of code at a time.

What We Offer
Competitive salary and benefits.
A collaborative, respectful, and high-accountability team culture.
Opportunities for growth within a global finance team.
Flexibility in a remote-friendly and dynamic work environment.

Key Responsibilities

Agentic AI Development & Code Generation:
Design, implement, and lead the development of an advanced agentic AI system that can interpret user stories and autonomously generate production-ready code
Research and implement state-of-the-art techniques in LLM prompting, fine-tuning, and orchestration to achieve optimal code generation results
Architect robust evaluation systems to test, validate, and improve AI-generated code
Seamlessly integrate AI-generated code into existing CI/CD pipelines
Collaborate with product and engineering teams to refine inputs and ensure alignment with development standards

AI Recommendation Engine Architecture:
Architect a high-performance, scalable recommendation engine for the RightCrowd SmartAccess platform
Design and implement AI-driven feature extraction pipelines
Develop hybrid recommendation algorithms leveraging both collaborative filtering and content-based approaches
Engineer systems for real-time inference and continuous model improvement
Ensure the architecture integrates seamlessly with existing product infrastructure

Natural Language Reporting System Development:
Lead the end-to-end development of an AI-powered reporting system that translates natural language queries to SQL
Engineer robust NLP components to accurately understand and parse user requests
Build a reliable translation layer between natural language and database queries
Implement data retrieval, processing, and presentation mechanisms
Design systems for generating insights and visualizations from retrieved data
Ensure appropriate data security and access controls

Technical Leadership & Innovation:
Provide architectural guidance across AI initiatives within the organization
Design scalable, maintainable AI systems with consideration for cloud infrastructure and MLOps best practices
Stay at the forefront of advancements in LLMs, agentic AI, and code generation technologies
Mentor junior engineers and foster a culture of AI innovation
Collaborate effectively with cross-functional teams

Required Qualifications & Skills:
Master's or Ph.D. in Computer Science, AI, or related field (or equivalent practical experience)
5+ years of experience building production-level AI systems with significant focus on LLMs and generative AI
Demonstrated expertise in Python and modern AI frameworks (PyTorch, TensorFlow, Hugging Face Transformers)
Extensive experience with LLM orchestration frameworks (LangChain, LangGraph, or similar)
Proven track record designing and implementing agentic AI systems
Strong proficiency in prompt engineering, fine-tuning, and optimization of LLMs
Expert-level SQL skills and experience with database systems
Experience with cloud platforms (AWS, Azure, GCP) and their AI/ML services
Solid understanding of software engineering principles and best practices
Experience with MLOps and CI/CD pipelines for AI systems

Preferred Qualifications:
Experience developing systems that generate production-ready code
Familiarity with retrieval-augmented generation (RAG) techniques
Experience with vector databases (Pinecone, Weaviate, etc.)
Knowledge of containerization technologies (Docker, Kubernetes)
Experience with model optimization, quantization, and efficient inference
Contributions to open-source AI projects or research publications
Familiarity with physical security or identity access management domains
Experience implementing AI systems with strong security and governance controls

This position offers the opportunity to work at the forefront of AI engineering, developing systems that not only generate code but also function autonomously to deliver real business

R&D Ghaziabad, India

Posted 3 days ago

3.0 years

0 Lacs

Chennai, Tamil Nadu, India

Remote

Linkedin logo

About Clarifai
Clarifai is a leading compute orchestration AI platform specializing in computer vision and generative AI. We empower organizations to transform unstructured image, video, text, and audio data into actionable insights, significantly faster and more accurately than manual processes. Founded in 2013 by Matt Zeiler, Ph.D., Clarifai has been at the forefront of AI innovation since achieving the top five placements in the 2013 ImageNet Challenge. Our diverse, globally distributed team operates across the United States, Canada, Estonia, Argentina, and India. We have secured $100M in funding, including a $60M Series C round, backed by industry leaders such as Menlo Ventures, Union Square Ventures, Lux Capital, NEA, LDV Capital, Corazon Capital, Google Ventures, NVIDIA, Qualcomm, and Osage. Clarifai is proud to be an equal-opportunity workplace committed to building and maintaining a diverse and inclusive team.

The Opportunity
As a Senior Research Scientist at Clarifai, you'll contribute to applied research initiatives, converting the latest academic insights into production-ready solutions. You'll collaborate closely with our MLOps, Engineering, Business Development, and Product teams to rapidly prototype and deliver innovative capabilities, particularly within the national security domain. Your deep expertise in Computer Vision, GenAI, and multi-modal AI will drive strategic advancements and customer success. We seek individuals passionate about impactful AI applications, committed to collaboration, and skilled in managing multi-phase projects from initial proof-of-concept through deployment. Continuous learning and active participation in academic and industry forums are core elements of our research environment.

Key Responsibilities
Train, evaluate, and optimize machine learning models for high performance, scalability, and robustness.
Contribute to R&D in object detection and multi-object tracking for remote sensing, including Synthetic Aperture Radar (SAR), and rapidly prototype proof-of-concept systems.
Leverage and build AI data engines (scalable feedback systems that integrate model inference, human-guided labeling, and automated evaluation) to accelerate dataset growth and model refinement.
Design and deliver production-grade, maintainable code while managing multi-phase development aligned to technical and customer objectives.
Collaborate across teams and stakeholders, especially in national security and defense, to ensure effective knowledge transfer and mission-aligned innovation.

Impact
Your work as a Senior Research Scientist will significantly influence Clarifai's capability to deliver innovative AI solutions to the national security and intelligence communities. You will directly contribute to strategic projects that enhance Clarifai's reputation and position as a market leader in AI-driven geospatial analysis.

Requirements
3+ years of hands-on experience developing neural networks, focusing particularly on Computer Vision and/or GenAI.
Expertise in Python, with strong proficiency in libraries such as PyTorch, TensorFlow, or Jax.
Advanced degree (Master's or PhD) in Computer Science, Mathematics, Engineering, or related fields.

Great to Have
Experience working with government, defense, or intelligence community R&D projects.
Familiarity with remote sensing data sources, including commercial satellite imagery, UAS video, and NTM.
Experience with LLMs, RAG, PEFT, and multi-modal applications (e.g., Captioning, VQA, cross-modal retrieval).
Familiarity with the Model Context Protocol (MCP) and its use in structured agent communication, task orchestration, and context management across multi-agent systems.
Published research in Computer Vision, NLP, or multi-modal AI.
PhD in Machine Learning or related disciplines.

Posted 3 days ago

0 years

0 Lacs

Hyderabad, Telangana, India

On-site

Linkedin logo

Job Description

💰 Compensation Note: The budget for this role is fixed at INR 50–55 lakhs per annum (non-negotiable). Please ensure this aligns with your expectations before applying.
📍 Work Setup: This is a hybrid role, requiring 3 days per week onsite at the office in Hyderabad, India.
📝 Interview Process: The process consists of 6 stages, including a technical assessment, code review, code discussion, and panel interviews.

Company Description:
Blend is a premier AI services provider, committed to co-creating meaningful impact for its clients through the power of data science, AI, technology, and people. With a mission to fuel bold visions, Blend tackles significant challenges by seamlessly aligning human expertise with artificial intelligence. The company is dedicated to unlocking value and fostering innovation for its clients by harnessing world-class people and data-driven strategy. We believe that the power of people and AI can have a meaningful impact on your world, creating more fulfilling work and projects for our people and clients.

Job Description:
We are looking for an AI Engineer with experience in Speech-to-text and Text Generation to solve a Conversational AI challenge for our client based in EMEA. The focus of this project is to transcribe conversations and leverage generative AI-powered text analytics to drive better engagement strategies and decision-making. The ideal candidate will have deep expertise in Speech-to-Text (STT), Natural Language Processing (NLP), Large Language Models (LLMs), and Conversational AI systems. This role involves working on real-time transcription, intent analysis, sentiment analysis, summarization, and decision-support tools.

Key Responsibilities:

Conversational AI & Call Transcription Development
Develop and fine-tune automatic speech recognition (ASR) models.
Implement language model fine-tuning for industry-specific language.
Develop speaker diarization techniques to distinguish speakers in multi-speaker conversations.

NLP & Generative AI Applications
Build summarization models to extract key insights from conversations.
Implement Named Entity Recognition (NER) to identify key topics.
Apply LLMs for conversation analytics and context-aware recommendations.
Design custom RAG (Retrieval-Augmented Generation) pipelines to enrich call summaries with external knowledge.

Sentiment Analysis & Decision Support
Develop sentiment and intent classification models.
Create predictive models that suggest next-best actions based on call content, engagement levels, and historical data.

AI Deployment & Scalability
Deploy AI models using tools like AWS, GCP, Azure AI, ensuring scalability and real-time processing.
Optimize inference pipelines using ONNX, TensorRT, or Triton for cost-effective model serving.
Implement MLOps workflows to continuously improve model performance with new call data.

Qualifications:

Technical Skills
Strong experience in Speech-to-Text (ASR), NLP, and Conversational AI.
Hands-on expertise with tools like Whisper, DeepSpeech, Kaldi, AWS Transcribe, Google Speech-to-Text.
Proficiency in Python, PyTorch, TensorFlow, Hugging Face Transformers.
Experience with LLM fine-tuning, RAG-based architectures, and LangChain.
Hands-on experience with Vector Databases (FAISS, Pinecone, Weaviate, ChromaDB) for knowledge retrieval.
Experience deploying AI models using Docker, Kubernetes, FastAPI, Flask.

Soft Skills
Ability to translate AI insights into business impact.
Strong problem-solving skills and ability to work in a fast-paced AI-first environment.
Excellent communication skills to collaborate with cross-functional teams, including data scientists, engineers, and client stakeholders.

Preferred Qualifications
Experience in healthcare, pharma, or life sciences NLP use cases.
Background in knowledge graphs, prompt engineering, and multimodal AI.
Experience with Reinforcement Learning from Human Feedback (RLHF) for improving conversation models.

Posted 4 days ago

4.0 years

0 Lacs

Pune, Maharashtra, India

On-site

Linkedin logo

Job Description

Job Title: Senior Python Developer – AI Architecture
Location: Onsite in Pune (NIBM, Kondhwa, Pune)
Experience: 4+ Years
Job Type: Full-Time
Shift: 11 AM - 8 PM IST
Skills: Flask, FastAPI, or Django, AI/ML frameworks (e.g., TensorFlow, PyTorch, scikit-learn)
Joining: Immediate (This is a backfill role, so we need someone who can join in a week)

About The Role
We are seeking a seasoned Python Developer to join our AI Engineering team. You will play a key role in designing and implementing scalable, AI-driven systems and microservices that power intelligent applications across our platform.

Key Responsibilities
Design and develop robust, scalable backend systems using Python.
Collaborate with ML/AI engineers to integrate AI/ML models into production.
Architect and implement APIs and microservices for AI workflows.
Optimize the performance of data pipelines and inference engines.
Ensure code quality through unit testing, code reviews, and CI/CD practices.
Work with cloud platforms (AWS/GCP/Azure) to deploy and monitor AI/MS services.

Requirements

Required Skills:
Startup culture fit and mindset.
Strong problem-solving skills.
Strong proficiency in Python (Flask, FastAPI, or Django).
Solid understanding of software architecture and design patterns.
Experience with AI/ML frameworks (e.g., TensorFlow, PyTorch, scikit-learn).
Familiarity with containerization (Docker, Kubernetes).
Hands-on experience with RESTful APIs and asynchronous programming.
Exposure to data engineering tools (Airflow, Spark, Kafka) is a plus.

Preferred Qualifications
Bachelor’s or Master’s in Computer Science, Engineering, or a related field.
Experience working in agile teams and DevOps environments.
Knowledge of MLOps practices and model lifecycle man.

Posted 4 days ago

Apply

0 years

0 Lacs

Gurugram, Haryana, India

On-site

Linkedin logo

Role : Applied AI Engineer (Gen-AI, Vision) Function : Applied AI Compensation : 40-60 LPA + ESOPs About the Company: A venture-backed, stealth-stage technology company building next-gen matchmaking and relationship platforms is hiring their founding AI/ML & Data Engineering Team. They are on a mission to reimagine how people connect, using AI, community, and content as the building blocks. They’re not building just another dating app — they’re creating an experience where users feel: “This app gets me.” At the core of the product are real-time, ML recommendation engines — similar to Spotify for song moods or TikTok for discovery. They are well funded and backed by marquee VCs in India and US. Company Philosophy: Core belief: Great data + Good models = Great recommendations Good data + Great models = Average recommendations That’s why they’re investing in data infrastructure from the inception and foundation. Position Overview: As an Applied AI Engineer, you'll lead the deployment of advanced vision-language models to power a personalised, photorealistic user experience. Your work will focus on fine-tuning and integrating models like Stable Diffusion, LoRA, and CLIP into real-time pipelines, optimising them for user delight, performance, accuracy, and eventually costs. You will collaborate closely with product and design teams to bring generative AI capabilities to deliver a novel dating experience. 
Role & Responsibilities: Build and deploy cutting-edge vision-language pipelines (e.g., Stable Diffusion, LoRA, CLIP, BLIP) for personalised and authentic user profile visuals Fine-tune LoRA models to reflect a user's best version while maintaining identity integrity Design scalable APIs for image generation, editing, and personalization Work with UX/design to integrate image-generation flows that feel intuitive and delightful Optimize models for mobile performance, fast inference, and cost efficiency Collaborate closely with product, backend, and ML researchers to productionize ideas Ideal Profile: They are looking for a hands-on applied AI/ML engineer with expertise in generative vision models like Stable Diffusion, and some track record of building real-time, personalised generative AI experiences. Experience with computer vision, generative models, or deep learning Experience working with Stable Diffusion, LoRA training, and prompt engineering Exposure to different vision-language architectures like Flux, CLIP, BLIP etc Proficient in PyTorch or equivalent ML frameworks Strong understanding of image pipelines, dataset curation, and captioning Experience deploying ML models on cloud-based services ( at minimum Replicate, Hugging Face Spaces and preferably on GPUs) Comfortable writing production-ready Python, working with versioned models and real-time inference stacks Passionate about building beautiful, human-centred AI experiences Nice to have: Prior experience with personalization, avatars, or face-preserving generative models Contributed to open-source vision models or fine-tuning libraries Experience in a startup or fast-paced product-focused team What the role offers: Join a founding team where your work is core to product experience Shape the future of how humans connect in the AI era Significant ESOPs and wealth creation + competitive cash compensation

Posted 4 days ago

Apply

4.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Linkedin logo

About the Role: We are looking for a forward-thinking LLMOps Engineer to join our team and help build the next generation of secure, scalable, and responsible Generative AI (GenAI) platforms. This role will focus on establishing governance, security, and operational best practices while enabling development teams to build high-performing GenAI applications. You will also work closely with GenAI agents and integrate LLMs from multiple providers to support diverse use cases. Key Responsibilities: Design and implement governance frameworks for GenAI platforms, ensuring compliance with internal policies and external regulations (e.g., GDPR, AI Act). Define and enforce responsible AI practices including fairness, transparency, explainability, and auditability. Implement robust security protocols including IAM, data encryption, secure API access, and model sandboxing. Collaborate with security teams to conduct risk assessments and ensure secure deployment of LLMs. Build and maintain scalable LLMOps pipelines for model training, fine-tuning, evaluation, deployment, and monitoring. Automate model lifecycle management with CI/CD, versioning, rollback, and observability. Develop and manage GenAI agents capable of reasoning, planning, and tool use. Integrate and orchestrate LLMs from multiple providers (e.g., OpenAI, Anthropic, Cohere, Google, Azure OpenAI) to support hybrid and fallback strategies. Optimize prompt engineering, context management, and agent memory for production use. Ensure high availability, low latency, and cost-efficiency of GenAI workloads across cloud and hybrid environments. Implement monitoring and alerting for model drift, hallucinations, and performance degradation. Partner with GenAI developers to embed best practices and reusable components (SDKs, templates, APIs). Provide technical guidance and documentation to accelerate development and ensure platform consistency. 
Qualifications: Bachelor’s or Master’s degree in Computer Science, Engineering, or related field. 4+ years of experience in MLOps, DevOps, or platform engineering, with 1–2 years in LLM/GenAI environments. Deep understanding of LLMs, GenAI agents, prompt engineering, and inference optimization. Experience with LangChain, LlamaIndex, LangGraph, or similar agent frameworks. Hands-on with MLflow or equivalent tools. Proficient in Python, containerization (Docker), and cloud platforms (AWS/GCP/Azure). Familiarity with AI governance frameworks and responsible AI principles. Experience with vector databases (e.g., FAISS, Pinecone), RAG pipelines, and model evaluation frameworks. Knowledge of Responsible AI, red-teaming, and OWASP security principles.

Posted 4 days ago

Apply

0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

Linkedin logo

Years of exp: 10 - 15 yrs Location: Noida Join us as Cloud Engineer Lead at Dailoqa, where you will be responsible for operationalizing cutting-edge machine learning and generative AI solutions, ensuring scalable, secure, and efficient deployment across infrastructure. You will work closely with data scientists, ML engineers, and business stakeholders to build and maintain robust MLOps pipelines, enabling rapid experimentation and reliable production implementation of AI models, including LLMs and real-time analytics systems. To be successful as Cloud Engineer you should have experience with: Cloud sourcing, networks, VMs, performance, scaling, availability, storage, security, access management. Deep expertise in one or more cloud platforms: AWS, Azure, GCP. Strong experience in containerization and orchestration (Docker, Kubernetes, Helm). Familiarity with CI/CD tools: GitHub Actions, Jenkins, Azure DevOps, ArgoCD, etc. Proficiency in scripting languages (Python, Bash, PowerShell). Knowledge of MLOps tools such as MLflow, Kubeflow, SageMaker, Vertex AI, or Azure ML. Strong understanding of DevOps principles applied to ML workflows. Key Responsibilities may include: Design and implement scalable, cost-optimized, and secure infrastructure for AI-driven platforms. Implement infrastructure as code using tools like Terraform, ARM, or CloudFormation. Automate infrastructure provisioning, CI/CD pipelines, and model deployment workflows. Ensure version control, repeatability, and compliance across all infrastructure components. Set up monitoring, logging, and alerting frameworks using tools like Prometheus, Grafana, ELK, or Azure Monitor. Optimize performance and resource utilization of AI workloads, including GPU-based training/inference. Experience with Snowflake and Databricks for collaborative ML development and scalable data processing. Understanding of model interpretability, responsible AI, and governance. Contributions to open-source MLOps tools or communities.
Strong leadership, communication, and cross-functional collaboration skills. Knowledge of data privacy, model governance, and regulatory compliance in AI systems. Exposure to LangChain, Vector DBs (e.g., FAISS, Pinecone), and retrieval-augmented generation (RAG) pipelines.

Posted 4 days ago

Apply

4.0 years

0 Lacs

Pune, Maharashtra, India

On-site

Linkedin logo

Job Title: Senior Python Developer – AI Architecture Location: Onsite in Pune (NIBM, Kondhwa) Experience: 4+ Years Job Type: Full-Time Shift: 11 AM - 8 PM IST Skills: Flask, FastAPI, or Django; AI/ML frameworks (e.g., TensorFlow, PyTorch, scikit-learn) Joining: Immediate (this is a backfill role, so we need someone who can join within a week) About the Role: We are seeking a seasoned Python Developer to join our AI Engineering team. You will play a key role in designing and implementing scalable, AI-driven systems and microservices that power intelligent applications across our platform. Key Responsibilities: Design and develop robust, scalable backend systems using Python. Collaborate with ML/AI engineers to integrate AI/ML models into production. Architect and implement APIs and microservices for AI workflows. Optimize the performance of data pipelines and inference engines. Ensure code quality through unit testing, code reviews, and CI/CD practices. Work with cloud platforms (AWS/GCP/Azure) to deploy and monitor AI/ML services. Required Skills: Startup culture fit and mindset. Strong problem-solving skills. Strong proficiency in Python (Flask, FastAPI, or Django). Solid understanding of software architecture and design patterns. Experience with AI/ML frameworks (e.g., TensorFlow, PyTorch, scikit-learn). Familiarity with containerization (Docker, Kubernetes). Hands-on experience with RESTful APIs and asynchronous programming. Exposure to data engineering tools (Airflow, Spark, Kafka) is a plus. Preferred Qualifications: Bachelor’s or Master’s in Computer Science, Engineering, or a related field. Experience working in agile teams and DevOps environments. Knowledge of MLOps practices and model lifecycle management.

Posted 4 days ago

Apply

5.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Linkedin logo

Thank you for considering the Backend Engineer position at Reveal Health Tech. We are an early-stage IT startup based in the US and India, focused on leveraging technology to deliver transformative healthcare solutions. About The Applied AI Lab The Applied AI Lab is an internal R&D team at Reveal dedicated to identifying high-impact problems in healthcare, life sciences, and adjacent verticals—and transforming those insights into repeatable AI solutions. We operate as a nimble product studio within the company: researching emerging technologies, rapidly prototyping AI and ML-powered tools, and building foundational infrastructure to support long-term product plays. Our output ranges from sandbox-ready MVPs to reusable components and SaaS-aligned platforms. Our team is multi-disciplinary—engineering, design, research, and business—and we work closely with client-facing and go-to-market teams to validate our ideas in the real world. About The Role We're looking for a Backend Engineer to join the Applied AI Lab and play a critical role in building robust, scalable, and flexible backend systems that power intelligent products and agentic workflows. You'll be responsible for developing APIs, integrating ML components, and helping stand up infrastructure that connects user interfaces with AI/ML logic and data services. You'll work closely with ML engineers, frontend developers, and designers to enable end-to-end functionality, rapid iteration, and long-term reliability. This role is ideal for someone who enjoys wearing multiple hats, thrives in a fast-moving environment, and wants to build backend systems that support real product innovation. 
Requirements Design and implement scalable APIs and service layers to support agentic AI workflows and prototypes Work closely with ML engineers to integrate models into backend infrastructure and orchestrate inference flows Build data access layers and connect to databases, vector stores, and external APIs Collaborate with frontend engineers to define data contracts and enable seamless UI integration Develop infrastructure for task orchestration, agent state tracking, and output management Contribute to DevOps efforts (e.g., CI/CD pipelines, deployment scripts, logging/monitoring) Optimize backend systems for performance, modularity, and reuse across Lab projects Support rapid prototyping and contribute to turning MVPs into stable, reusable assets Participate in roadmap planning, design sessions, and prioritization of Lab initiatives Desired Qualifications Please note that while you do not need to be an expert in every area, being familiar with most of the following is important. We are looking for someone who can effectively integrate everything, with team support to fill any gaps. 
5+ years of experience as a backend engineer Programming & Backend Skills Terraform: Proficient in writing, testing, and managing Terraform modules Primary Language: Python (3.x), with solid understanding of object-oriented design Secondary (Nice to Have): Go, Node.js, or Java API Development: Experience with RESTful APIs and/or GraphQL; FastAPI or Flask preferred, microservices, and service-oriented architecture Testing: Pytest, unit/integration testing best practices CI/CD Pipelines: Experience with GitHub Actions, GitLab CI, or similar Experienced with cloud platforms (AWS, Azure, GCP) Compute: Lambda, ECS/Fargate, EC2 Storage: S3, EFS Networking: VPC, Route53, API Gateway Databases: RDS (PostgreSQL/MySQL), DynamoDB Monitoring: CloudWatch, X-Ray IAM: Policies, roles, permissions model Data & Event Processing (Optional but valuable) Experience with: Message Brokers: SQS, Kafka Data Pipelines / ETL: AWS Step Functions, Airflow (especially on MWAA), Glue File Parsing: JSON, XML, CSV, Parquet, etc. Tooling & Environment Version Control: Git (GitHub or GitLab workflows) Secrets Management: AWS Secrets Manager or SSM Parameter Store Dev Tools: Docker Compose, Make, VS Code How you will enrich us? Energetic and enthusiastic Autonomous and self-motivated Growth mindset Embraces challenges Building new things gets your blood pumping Curiosity and deep interest in the world Challenges the status quo constructively Benefits What do you get in return? Be part of a high-impact team shaping the future of our IP and product innovation strategy Work with cutting-edge technologies in a focused but flexible lab environment Help define how applied AI can solve real-world problems in complex, high-stakes domains Grow with a small, mission-aligned team with executive support and long-term vision Industry best compensation and benefits Next Steps Send us your updated CV - if you can mention how you have enriched your previous organisation in a cover letter, that would be great! 
If we find your profile suitable, our Talent personnel will reach out to you to understand your profile and interests and how best we can align mutually. Finally, you would have a chat with our Leadership to understand more about us and see if this is the right next career move!

Posted 4 days ago

Apply

2.5 - 5.0 years

5 - 11 Lacs

India

On-site

GlassDoor logo

We are looking for an experienced AI Engineer to join our team. The ideal candidate will have a strong background in designing, deploying, and maintaining advanced AI/ML models with expertise in Natural Language Processing (NLP), Computer Vision, and architectures like Transformers and Diffusion Models. You will play a key role in developing AI-powered solutions, optimizing performance, and deploying and managing models in production environments. Key Responsibilities AI Model Development and Optimization: Design, train, and fine-tune AI models for NLP, Computer Vision, and other domains using frameworks like TensorFlow and PyTorch. Work on advanced architectures, including Transformer-based models (e.g., BERT, GPT, T5) for NLP tasks and CNN-based models (e.g., YOLO, VGG, ResNet) for Computer Vision applications. Utilize techniques like PEFT (Parameter-Efficient Fine-Tuning) and SFT (Supervised Fine-Tuning) to optimize models for specific tasks. Build and train RLHF (Reinforcement Learning from Human Feedback) and RL-based models to align AI behavior with real-world objectives. Explore multimodal AI solutions combining text, vision, and audio using generative deep learning architectures. Natural Language Processing (NLP): Develop and deploy NLP solutions, including language models, text generation, sentiment analysis, and text-to-speech systems. Leverage advanced Transformer architectures (e.g., BERT, GPT, T5) for NLP tasks. AI Model Deployment and Frameworks: Deploy AI models using frameworks like vLLM, Docker, and MLflow in production-grade environments. Create robust data pipelines for training, testing, and inference workflows. Implement CI/CD pipelines for seamless integration and deployment of AI solutions. Production Environment Management: Deploy, monitor, and manage AI models in production, ensuring performance, reliability, and scalability. Set up monitoring systems using Prometheus to track metrics like latency, throughput, and model drift.
Data Engineering and Pipelines: Design and implement efficient data pipelines for preprocessing, cleaning, and transformation of large datasets. Integrate with cloud-based data storage and retrieval systems for seamless AI workflows. Performance Monitoring and Optimization: Optimize AI model performance through hyperparameter tuning and algorithmic improvements. Monitor performance using tools like Prometheus, tracking key metrics (e.g., latency, accuracy, model drift, error rates). Solution Design and Architecture: Collaborate with cross-functional teams to understand business requirements and translate them into scalable, efficient AI/ML solutions. Design end-to-end AI systems, including data pipelines, model training workflows, and deployment architectures, ensuring alignment with business objectives and technical constraints. Conduct feasibility studies and proof-of-concepts (PoCs) for emerging technologies to evaluate their applicability to specific use cases. Stakeholder Engagement: Act as the technical point of contact for AI/ML projects, managing expectations and aligning deliverables with timelines. Participate in workshops, demos, and client discussions to showcase AI capabilities and align solutions with client needs. Experience: 2.5 - 5 years of experience Salary: 5-11 LPA Job Types: Full-time, Permanent Pay: ₹500,000.00 - ₹1,100,000.00 per year Schedule: Day shift Work Location: In person

Posted 4 days ago

Apply

3.0 - 5.0 years

6 - 11 Lacs

Thiruvananthapuram

On-site

GlassDoor logo

Experience Required: 3-5 years of hands-on experience in full-stack development, system design, and supporting AI/ML data-driven solutions in a production environment. Key Responsibilities Implementing Technical Designs: Collaborate with architects and senior stakeholders to understand high-level designs and break them down into detailed engineering tasks. Implement system modules and ensure alignment with architectural direction. Cross-Functional Collaboration: Work closely with software developers, data scientists, and UI/UX teams to translate system requirements into working code. Clearly communicate technical concepts and implementation plans to internal teams. Stakeholder Support: Participate in discussions with product and client teams to gather requirements. Provide regular updates on development progress and raise flags early to manage expectations. System Development & Integration: Develop, integrate, and maintain components of AI/ML platforms and data-driven applications. Contribute to scalable, secure, and efficient system components based on guidance from architectural leads. Issue Resolution: Identify and debug system-level issues, including deployment and performance challenges. Proactively collaborate with DevOps and QA to ensure resolution. Quality Assurance & Security Compliance: Ensure that implementations meet coding standards, performance benchmarks, and security requirements. Perform unit and integration testing to uphold quality standards. Agile Execution: Break features into technical tasks, estimate efforts, and deliver components in sprints. Participate in sprint planning, reviews, and retrospectives with a focus on delivering value. Tool & Framework Proficiency: Use modern tools and frameworks in your daily workflow, including AI/ML libraries, backend APIs, front-end frameworks, databases, and cloud services, contributing to robust, maintainable, and scalable systems. 
Continuous Learning & Contribution: Keep up with evolving tech stacks and suggest optimizations or refactoring opportunities. Bring learnings from the industry into internal knowledge-sharing sessions. Proficiency in using AI-copilots for Coding: Adaptation to emerging tools and knowledge of prompt engineering to effectively use AI for day-to-day coding needs. Technical Skills Hands-on experience with Python-based AI/ML development using libraries such as TensorFlow, PyTorch, scikit-learn, or Keras. Hands-on exposure to self-hosted or managed LLMs, supporting integration and fine-tuning workflows as per system needs while following architectural blueprints. Practical implementation of NLP/CV modules using tools like SpaCy, NLTK, Hugging Face Transformers, and OpenCV, contributing to feature extraction, preprocessing, and inference pipelines. Strong backend experience using Django, Flask, or Node.js, and API development (REST or GraphQL). Front-end development experience with React, Angular, or Vue.js, with a working understanding of responsive design and state management. Development and optimization of data storage solutions, using SQL (PostgreSQL, MySQL) and NoSQL (MongoDB, Cassandra), with hands-on experience configuring indexes, optimizing queries, and using caching tools like Redis and Memcached. Working knowledge of microservices and serverless patterns, participating in building modular services, integrating event-driven systems, and following best practices shared by architectural leads. Application of design patterns (e.g., Factory, Singleton, Observer) during implementation to ensure code reusability, scalability, and alignment with architectural standards. Exposure to big data tools like Apache Spark and Kafka for processing datasets. Familiarity with ETL workflows and cloud data warehouses, using tools such as Airflow, dbt, BigQuery, or Snowflake.
Understanding of CI/CD, containerization (Docker), IaC (Terraform), and cloud platforms (AWS, GCP, or Azure). Implementation of cloud security guidelines, including setting up IAM roles, configuring TLS/SSL, and working within secure VPC setups, with support from cloud architects. Exposure to MLOps practices, model versioning, and deployment pipelines using MLflow, FastAPI, or AWS SageMaker. Configuration and management of cloud services such as AWS EC2, RDS, S3, Load Balancers, and WAF, supporting scalable infrastructure deployment and reliability engineering efforts. Personal Attributes Proactive Execution and Communication: Able to take architectural direction and implement it independently with minimal rework, communicating regularly with stakeholders. Collaboration: Comfortable working across disciplines with designers, data engineers, and QA teams. Responsibility: Owns code quality and reliability, especially in production systems. Problem Solver: Demonstrated ability to debug complex systems and contribute to solutioning. Key: Python, Django, Django ORM, HTML, CSS, Bootstrap, JavaScript, jQuery, Multi-threading, Multi-processing, Database Design, Database Administration, Cloud Infrastructure, Data Science, self-hosted LLMs Qualifications Bachelor’s or Master’s degree in Computer Science, Information Technology, Data Science, or a related field. Relevant certifications in cloud or machine learning are a plus. Package: 6-11 LPA Job Types: Full-time, Permanent Pay: ₹600,000.00 - ₹1,100,000.00 per year Schedule: Day shift Monday to Friday

Posted 4 days ago

Apply

2.0 years

0 Lacs

Hyderabad, Telangana, India

On-site

Linkedin logo

Overview Do you want to work in a fun and supportive environment? At erwin by Quest we know that companies with a strong positive culture perform so much better. That is why every day we strive to create a collaborative and inclusive working environment where our people can feel empowered to succeed. erwin by Quest is an award-winning Data Modelling software provider offering a broad selection of solutions that solve some of the most common and most challenging Data Governance problems. We are currently looking for Software Dev Engineer to join us. Responsibilities Write clean, reusable, and efficient code following best practices. Takes ownership of their work and consistently delivers results in a fast-paced environment. Work closely with the team in an agile and collaborative environment. Troubleshoot and resolve software defects and performance issues. Qualifications A minimum of 2-5 Years of Full Stack Java Development experience. Strong knowledge of Data Structures and Algorithms, System Design. Expertise in Java 8+ and its modern features (eg, Streams, Lambda Expressions, Optional, Functional Interfaces) Hands-on experience building enterprise-grade applications using Java, Spring Framework (Spring Boot, Spring JDBC, Spring Security) Proficiency in Spring Boot for building microservices and RESTful APIs is a plus. Experience with Spring Core, Spring MVC, Spring Data, and Spring Security. Understanding of dependency injection Strong knowledge of SQL databases like Postgres, SQL Server. Experience with JPA/Hibernate for ORM and understanding of database optimization techniques, query performance tuning, and designing efficient models. Proficiency in designing RESTful APIs and working with API specifications and documentation tools like Swagger/OpenAPI Experience with OAuth 2.0, JWT for authentication and authorization mechanisms. 
Strong knowledge of React, Redux Toolkit (Optional) Expertise in building and optimizing applications with React functional components and leveraging React Hooks for state and side effects management Provider in React and Context API Strong hands-on experience with TypeScript for building type-safe React applications. Deep understanding of TypeScript features like interfaces, generics, type inference, etc Strong understanding of semantic HTML and modern CSS for responsive design Familiarity with Material UI and Tailwind CSS for building modern, user-friendly, and scalable UI components. Proficiency with Git and working with branching strategies Experience with optimizing application performance, including JVM tuning, caching strategies, and improving query performance in databases Strong understanding of security best practices for both frontend and backend, including secure coding and protecting APIs. Familiarity with cloud services (Azure, AWS, GCP) is a plus. Company Description At Quest, we create and manage the software that makes the benefits of new technology real. Companies turn to us to manage, modernize and secure their business, from on-prem to in-cloud, from the heart of the network to the vulnerable endpoints. From complex challenges like Active Directory management and Office 365 migration, to database and systems management, to redefining security, and hundreds of needs in between, we help you conquer your next challenge now. We’re not the company that makes big promises. We’re the company that fulfills them. We’re Quest: Where Next Meets Now. Why work with us! Life at Quest means collaborating with dedicated professionals with a passion for technology. When we see something that could be improved, we get to work inventing the solution. Our people demonstrate our winning culture through positive and meaningful relationship. We invest in our people and offer a series of programs that enables them to pursue a career that fulfills their potential. 
Our team members’ health and wellness are our priority, as is rewarding them for their hard work. Quest is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. Come join us. For more information, visit us on the web at http://www.quest.com/careers. Connect With Us! Not ready to apply? Connect with us for general consideration.

Posted 4 days ago

Apply

8.0 - 12.0 years

5 Lacs

Bengaluru

On-site

GlassDoor logo

About Us InMobi is the leading provider of content, monetization, and marketing technologies that fuel growth for industries around the world. Our end-to-end advertising software platform, connected content, and commerce experiences activate audiences, drive real connections, and diversify revenue for businesses everywhere. InMobi Advertising is an end-to-end advertising platform that helps advertisers drive real connections with consumers. We drive customer growth by helping businesses understand, engage, and acquire consumers effectively through data-driven media solutions. Learn more at advertising.inmobi.com. Glance is a consumer technology company that operates disruptive digital platforms, including Glance, Roposo, and Nostra. Glance's smart lockscreen and TV experience inspires consumers to make the most of every moment by surfing relevant content without the need for searching and downloading apps. Glance is currently available on over 450 million smartphones and televisions worldwide. Learn more at glance.com. Born in India, InMobi maintains a large presence in Bangalore and San Mateo, CA, and has operations in New York, Singapore, Delhi, Mumbai, Beijing, Shanghai, Jakarta, Manila, Kuala Lumpur, Sydney, Melbourne, Seoul, Tokyo, London, and Dubai. To learn more, visit inmobi.com. About InMobi Demand Side Platform (DSP) InMobi DSP is growing rapidly as the preferred platform for mobile app marketers globally, with investments in proprietary AI models, privacy-first architecture, and deep performance capabilities. As the Director of Product Management for AI/ML, you'll have the opportunity to shape the brain of the DSP — working on cutting-edge problems in advertising, optimization, and personalization. Overview of role We are looking for a visionary and execution-focused Director of Product Management to lead the evolution of the AI/ML stack that powers InMobi's Demand Side Platform (DSP) . 
This person will play a pivotal role in shaping the intelligence that drives campaign performance, optimization, and decisioning at scale across billions of ad requests per day. This is a high-impact leadership role that sits at the heart of our DSP growth strategy. You will lead the product charter for areas such as bid prediction models, budget pacing, ad engagement models, targeting engines, A/B experimentation platforms, real-time inference infrastructure and underlying data and intelligence stacks. You will directly influence revenue by shaping how DSP Platform delivers on advertisers and agency performance marketing objectives and grow our share of wallet in their growth budgets. You will define the InMobi's competitive edge in mobile performance advertising through a differentiated AI/ML platform. This role reports to the VP of Product for InMobi DSP and works cross-functionally with Data Science, engineering, Trading, Product marketing, and our GTM teams. You'll be a key driver of strategy and execution as we build the platform for next orbit of growth across mobile app and web. This role is on-site based in our Bangalore office and may include quarterly travel to our Bay Area office. Key Responsibilities Own and drive the product roadmap for all AI/ML engines powering InMobi DSP's performance and efficiency. Partner closely with Data Science, Engineering, Trading, Sales, and GTM teams to define, prioritize, and deliver features that improve advertiser outcomes (CPI, ROAS, LTV). Define clear product requirements and success metrics for machine learning models (e.g., CTR prediction, user scoring, conversion models). Guide the evolution of our model training pipelines, feature stores, user identities, real-time inference, and feedback loops for continual learning. 
Evangelize the data and signals collection, standardization, retention across the InMobi group in a Privacy-first manner Translate business strategy into ML product strategy that balances innovation with operational rigor Drive adoption of AI/ML models across the business stakeholders Build a strong culture of experimentation and data-driven product development What We're Looking For Experience 8-12 years of experience in Product Management, with at least 5 years building AI/ML-based products (preferably in the AdTech or performance marketing domain) Proven experience working with large-scale distributed systems, ML pipelines, or real-time decision engines Good understanding of programmatic advertising ecosystem Prior experience with ML infrastructure products is a plus. Technical Fluency Solid grasp of machine learning concepts, including supervised learning, reinforcement learning, data and model drifts, experimentation frameworks, etc. Experience working with data science teams to convert models into production systems. Familiarity with MLOps, real-time inference, A/B testing platforms, and feature engineering pipelines. Functional & Leadership Skills Strategic thinking and ability to align AI/ML roadmap with business goals. Strong analytical mindset and data fluency to define and evaluate product success. Excellent written and verbal communication skills; ability to influence across functions and seniority levels. Passion for mentoring and building high-performance product teams Bias for action, and a passion for building in fast-paced, cross-functional environments The InMobi Culture At InMobi, culture isn't a buzzword; it's an ethos woven by every InMobian, reflecting our diverse backgrounds and experiences. We thrive on challenges and seize every opportunity for growth. Our core values of thinking big, being passionate, showing accountability, and taking ownership with freedom – guide us in every decision we make. 
We believe in nurturing and investing in your development through continuous learning and career progression with our InMobi Live Your Potential program. InMobi is proud to be an Equal Employment Opportunity employer, and we make reasonable accommodations for qualified individuals with disabilities. Visit https://www.inmobi.com/company/careers to better understand our benefits, values, and more!

Posted 4 days ago

4.0 years

10 - 16 Lacs

Bengaluru

On-site

GlassDoor logo

Job Title: Python Developer – Generative AI Location: Bangalore (Night Shift) Experience: 4+ Years Shift: Night Shift Employment Type: Full-Time About the Role We are seeking an experienced and innovative Python Developer with expertise in Generative AI to work in a night shift capacity. You will design, develop, and deploy intelligent AI-powered systems using cutting-edge LLMs and generative models. The ideal candidate thrives in fast-paced environments and is passionate about leveraging AI to solve real-world problems. Key Responsibilities Build and maintain Python-based APIs and backends integrated with Generative AI models. Work with large language models (e.g., GPT, Claude, LLaMA) and image/audio generation tools (e.g., DALL·E, Stable Diffusion). Implement prompt engineering, fine-tuning, and model deployment pipelines. Collaborate with global teams during night shift hours to develop scalable AI features. Deploy models using FastAPI, Flask, Docker, or cloud platforms. Optimize model performance for latency, accuracy, and scalability. Ensure testing, monitoring, and documentation of AI integrations. Required Skills 4+ years of Python development experience. 1+ years of hands-on experience with Generative AI tools and models. Strong knowledge of PyTorch, TensorFlow, Hugging Face, LangChain, or OpenAI API. Experience with deployment (Docker, FastAPI), and model inference in production. Familiarity with vector databases (FAISS, Pinecone, Weaviate). Preferred Skills Experience with GPU-based training or inference. Exposure to MLOps tools like MLflow, Airflow, or Kubeflow. Understanding of AI ethics, model safety, and bias mitigation. Contributions to open-source GenAI or ML projects. Job Types: Full-time, Permanent Pay: ₹1,089,300.86 - ₹1,660,283.26 per year Benefits: Health insurance Provident Fund Schedule: Night shift Work Location: In person

Posted 4 days ago

0 years

1 - 2 Lacs

Noida

On-site

GlassDoor logo

Job Information Date Opened 06/23/2025 Job Type Full time Industry Consulting City Noida State/Province Uttar Pradesh Country India Zip/Postal Code 201301 Job Description What impact will you make? As the GenAI Solution Architect, you will lead the organization’s strategic vision for GenAI and Agentic AI solutions, driving innovation through advanced AI technologies. Your leadership will shape the development of AI-driven solutions that deliver high value to the business and its customers, positioning the organization as a leader in leveraging Gen AI Solutions. Work you’ll do Primary Responsibilities and Daily Tasks Design and Implement Agentic AI Architectures: Develop modular, event-driven systems for autonomous agents using modern orchestration tools. Engineer Agent Workflows and Prompt Strategies: Build multi-step agent workflows using contextual memory, prompt chaining, and tool-based reasoning. Integrate AI Agents with Enterprise Platforms: Develop and maintain secure integrations with IAM/ITSM tools like ServiceNow, SailPoint, Saviynt, Okta, and Ping. Optimize Production AI Systems: Tune latency, caching, and inference efficiency in real-world deployments, using tools like LangSmith. Lead Technical Initiatives and Evaluate Tools: Assess new GenAI tooling, mentor developers, enforce standards, and guide system scalability and fault tolerance. Short-term and Long-term Goals Short-term: Rollout and enhance tailored AI Solution implementations Long-term: Develop a scalable AI architecture for continuous improvement in AI methodologies, and position the organization as an innovator in domain-specific AI solutions. Enough about us, let’s talk about you We are looking for a visionary leader with an advanced understanding of Gen AI, Agent Architectures and cloud technologies, capable of defining AI solutions at an enterprise level. 
Essential Skills and Qualifications Technical Expertise: GenAI Tools & Infrastructure: Hands-on with LangGraph, Semantic Kernel, embedding models, fallback handling, and human-in-loop validation. Agentic AI Frameworks: Expertise in LangGraph, AutoGen, CrewAI, and Python-based orchestration. Production Deployment: Experience in deploying AI agents with real-time inference, within containerized deployments. Security & Access Management: Strong understanding of JWT, OAuth, RBAC, and secure API communication. Performance Monitoring: Skills in using observability tools like LangSmith and implementing tuning mechanisms. LLMs, Vector Stores & Retrieval Frameworks: Proficiency with Chroma, Weaviate, RAG pipelines, and memory modules. Leadership and Communication Skills: Demonstrated ability to lead, mentor, and inspire an engineering team. Excellent communication skills, with the ability to align technical initiatives with organizational objectives.

Posted 4 days ago

3.0 - 5.0 years

0 Lacs

Gurugram, Haryana, India

On-site

Linkedin logo

Job Description We aim to bring about a new paradigm in medical image diagnostics; providing intelligent, holistic, ethical, explainable and patient centric care. We are looking for innovative problem solvers who love solving problems. We want people who can empathize with the consumer, understand business problems, and design and deliver intelligent products. People who are looking to extend artificial intelligence into unexplored areas. Your primary focus will be in applying deep learning and artificial intelligence techniques to the domain of medical image analysis. Responsibilities Selecting features, building and optimizing classifier engines using deep learning techniques. Understanding the problem and applying the suitable image processing techniques Use techniques from artificial intelligence/deep learning to solve supervised and unsupervised learning problems. Understanding and designing solutions for complex problems related to medical image analysis by using Deep Learning/Object Detection/Image Segmentation. Recommend and implement best practices around the application of statistical modeling. Create, train, test, and deploy various neural networks to solve complex problems. Develop and implement solutions to fit business problems which may include applying algorithms from a standard statistical tool, deep learning or custom algorithm development. Understanding the requirements and designing solutions and architecture in accordance with them. Participate in code reviews, sprint planning, and Agile ceremonies to drive high-quality deliverables. Design and implement scalable data science architectures for training, inference, and deployment pipelines. Ensure code quality, readability, and maintainability by enforcing software engineering best practices within the data science team. Optimize models for production, including quantization, pruning, and latency reduction for real-time inference. 
Drive the adoption of versioning strategies for models, datasets, and experiments (e.g., using MLflow, DVC). Contribute to the architectural design of data platforms to support large-scale experimentation and production workloads. Skills and Qualifications Strong software engineering skills in Python (or other languages used in data science) with emphasis on clean code, modularity, and testability. Excellent understanding of and hands-on experience with Deep Learning techniques such as ANN, CNN, RNN, LSTM, Transformers, VAEs etc. Must have experience with the TensorFlow or PyTorch framework in building, training, testing, and deploying neural networks. Experience in solving problems in the domain of Computer Vision. Knowledge of data, data augmentation, data curation, and synthetic data generation. Ability to understand the complete problem and design the solutions that best fit all the constraints. Knowledge of the common data science and deep learning libraries and toolkits such as Keras, Pandas, Scikit-learn, NumPy, SciPy, OpenCV etc. Good applied statistical skills, such as distributions, statistical testing, regression, etc. Exposure to Agile/Scrum methodologies and collaborative development practices. Experience with the development of RESTful APIs. Knowledge of libraries like FastAPI and the ability to apply them to deep learning architectures is essential. Excellent analytical and problem-solving skills, a good attitude, and eagerness to adapt to evolving technologies. Experience with medical image analysis will be an advantage. Experience designing and building ML architecture components (e.g., feature stores, model registries, inference servers). Solid understanding of software design patterns, microservices, and cloud-native architectures. Expertise in model optimization techniques (e.g., ONNX conversion, TensorRT, model distillation). Education: BE/B Tech; MS/M Tech will be a bonus. Experience: 3-5 Years

Posted 4 days ago

1.0 years

0 Lacs

India

Remote

Linkedin logo

🚀 Job Title: AI Engineer (Full Stack / Model Deployment Specialist) Location: Remote (India preferred) Type: Full-Time (6-Month Fixed Contract) Experience Level: 1+ Years Salary: Competitive (based on experience) Potential: High-performing candidates may be offered a permanent role after the contract 🧩 About Us We are a dynamic collaboration between Funding Bay , Effer Ventures , and FBX Capital Partners, three industry leaders combining forces to deliver financial, compliance, and strategic growth solutions to businesses across the UK. We’re looking for an AI Engineer who can bridge the gap between machine learning and production-ready applications. If you love optimizing models, deploying them in real-world environments, and know your way around modern web stacks, this role is for you. 🔧 What You’ll Do End-to-End Ownership of ML Models: From training and evaluation to optimization and deployment. Deploy ML Models using AWS services (EC2, Lambda, S3, SageMaker, or custom Docker setups). Optimize Model Performance: Ensure fast inference, low memory usage, and high-quality results. Integrate AI into MERN Stack Applications: Build APIs and interfaces to expose your models to the frontend. Collaborate Cross-Functionally with frontend, product, and design teams. Build scalable and secure pipelines for data ingestion, model serving, and monitoring. Optimize for Speed & Usability : Ensure both backend inference and frontend UI are responsive and seamless. ✅ What We’re Looking For Proficient in MERN Stack (MongoDB, Express.js, React, Node.js) Strong Python skills , especially for AI/ML (NumPy, Pandas, scikit-learn, TensorFlow or PyTorch etc) Hands-on with Model Optimization : Quantization, pruning, distillation, or ONNX deployment is a plus Solid AWS Experience: EC2, S3, IAM, Lambda, API Gateway, CloudWatch, etc. 
Experience with Docker & CI/CD pipelines (e.g., GitHub Actions, Jenkins) Comfortable building and consuming REST/GraphQL APIs Familiar with ML deployment tools like FastAPI, Flask, TorchServe, or SageMaker endpoints Good understanding of performance profiling, logging, and model monitoring ⭐ Nice to Have Experience with LangChain, LLMs, or NLP pipelines Startup or fast-paced team background Open-source contributions or live-deployed AI projects 🌱 Why Join Us? Build & deploy real AI products that go live Work in a growth-focused, high-ownership environment 6-month contract with the potential for a permanent full-time role Flexible work culture & flat hierarchy Learn fast and build faster with founders and builders Take ownership of core parts of the AI stack Competitive compensation based on experience 📬 To Apply Send us: Your resume A link to your GitHub or portfolio A short paragraph about a project where you deployed an optimized AI model 📧 Email: developer@fundingbay.co.uk or apply directly

Posted 4 days ago

2.0 years

0 Lacs

Bhilai, Chhattisgarh, India

On-site

Linkedin logo

BIMCAP Private Limited BIMCAP is an international BIM outsourcing company with an expanding portfolio of cultural, infrastructure, and commercial projects around the world. We are leaders in BIM innovation and the growth of BIM talent, and unique in our supportive, family-like culture, which has expanded from Hong Kong to the Netherlands, England, Germany, Hungary, Spain, and the Philippines. This role is based out of Bhilai, Chhattisgarh, for BIMCAP’s subsidiary company registered as Ecliptiko Private Limited. Job Summary We are looking for a driven and curious AI Engineer with hands-on experience in Large Language Models (LLMs). In this role, you’ll work on end-to-end development and deployment of LLM-based solutions, including prompt engineering, model fine-tuning, and cloud-based deployment. You’ll collaborate closely with product and engineering teams to build intelligent, scalable, and secure AI-powered systems. Key Responsibilities Design, develop, and optimize prompt strategies for LLM-based applications. Fine-tune pre-trained models (e.g., OpenAI, Hugging Face, etc.) using custom datasets. Build and deploy LLM-powered APIs and services in cloud environments (AWS, GCP, or Azure). Integrate LLMs into applications with efficient inference and cost-aware strategies. Conduct evaluations, benchmarking, and A/B testing for LLM outputs. Collaborate on data collection, preprocessing, and feature engineering tasks. Stay up to date with the latest in GenAI research and toolchains. Requirements 2+ years of industry experience in AI/ML or related fields. Strong grasp of NLP concepts, transformers, and recent LLM developments. Proficiency in Python and ML frameworks (PyTorch, TensorFlow, or similar). Experience with prompt engineering and prompt evaluation. Hands-on experience with cloud platforms (AWS/GCP/Azure), Docker, and CI/CD. Familiarity with APIs and SDKs of major LLM providers (e.g., OpenAI, Cohere, Anthropic).
Understanding of data privacy, security, and ethical considerations in AI. Preferred Qualifications Experience with tools like LangChain, LlamaIndex, or Vector DBs (e.g., FAISS, Pinecone). Exposure to Retrieval-Augmented Generation (RAG) systems. Knowledge of MLOps best practices and ML model lifecycle management

Posted 4 days ago

6.0 - 7.0 years

0 Lacs

Bengaluru, Karnataka, India

Remote

Linkedin logo

JD – AI/ML Engineer This is a full-time position with D Square Consulting Services Pvt Ltd Required Experience - 6-7 years Location – Bangalore Work mode - Onsite The candidate who can join within 30 days Job Summary The AI/ML Engineer will lead end-to-end design, development, and deployment of advanced, value-driven AI/ML solutions for digital marketing and analytics. Drive innovation, leverage cutting-edge techniques, and standardize multi-cloud model deployment through collaboration, delivering profound data insights. Required Qualifications- Bachelor’s degree (or higher preferred) in Computer Science, Data Science, ML, Mathematics, Statistics, Economics, or related fields with emphasis on quantitative methods. 6-7 years’ experience in software engineering with deep, hands-on expertise in the full lifecycle of ML model development, deployment, and operationalization. Demonstrated ability to write highly robust, efficient, and scalable Python, Java, Spark, and SQL code, adhering to industry best practices. Extensive experience with major ML frameworks (TensorFlow, PyTorch, Scikit-learn) and advanced deep learning libraries, including optimization. Strong, in-depth understanding of diverse ML algorithms (e.g., advanced regression, classification, clustering, RNNs, CNNs, transformers, time series, reinforcement learning), sophisticated data structures, and enterprise software design. Significant experience deploying and managing AI/ML models on major cloud platforms (Azure, AWS, GCP). Proven experience with LLMs and generative AI (fine-tuning, prompt engineering, deployment) is highly desirable. Exceptional problem-solving skills in a fast-paced, collaborative remote environment. Excellent communication and interpersonal skills, with the ability to effectively collaborate and influence diverse global teams remotely. 
Experience with classification, time series forecasting, customer lifetime value models, LLMs, and generative AI, preferably from the Retail, e-commerce, or CPG industry. Responsibilities- Lead cross-functional teams to deliver and scale complex AI/ML solutions (DL, NLP, optimization). Architect, design, develop, train, and evaluate high-performance, production-ready AI/ML models, ensuring scalability and robustness. Drive implementation, deployment, and maintenance of AI/ML solutions, optimizing inference, and documenting processes. Oversee data exploration, advanced preprocessing, complex feature engineering, and robust data pipeline development. Establish strategies for continuous testing, validation, and monitoring of deployed AI/ML models to ensure accuracy and reliability. Partner with senior stakeholders to translate business requirements into scalable AI solutions that deliver measurable value. Act as a primary SME on AI/ML model development, MLOps, and deployment, influencing global data science platforms. Continuously research and champion the adoption of the latest AI/ML technologies, algorithms, and best practices to maximize business value. Foster innovative thinking and continuous improvement, seeking superior ways of working for teams and partners.

Posted 4 days ago

9.0 years

0 Lacs

Hyderabad, Telangana, India

On-site

Linkedin logo

Company Qualcomm India Private Limited Job Area Engineering Group, Engineering Group > Software Engineering General Summary As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm Software Engineer, you will design, develop, create, modify, and validate embedded and cloud edge software, applications, and/or specialized utility programs that launch cutting-edge, world class products that meet and exceed customer needs. Qualcomm Software Engineers collaborate with systems, hardware, architecture, test engineers, and other teams to design system-level software solutions and obtain information on performance requirements and interfaces. Minimum Qualifications Bachelor's degree in Engineering, Information Systems, Computer Science, or related field. Job Location: Hyderabad Staff Engineer (AISW) Job Location: Hyderabad More Details Below About the team: Join the growing team at Qualcomm focused on advancing state-of-the-art in Machine Learning. The team uses Qualcomm chips’ extensive heterogeneous computing capabilities to allow inference of trained neural networks on-device without a need for connection to the cloud. Our inference engine is designed to help developers run neural network models trained in a variety of frameworks on Snapdragon platforms at blazing speeds while still sipping the smallest amount of power. See your work directly impact billions of devices around the world. Responsibilities In this position, you will be responsible for the development and commercialization of ML solutions like Snapdragon Neural Processing Engine (SNPE) SDK on Qualcomm SoCs. You will be developing various SW features in our ML stack. You would be porting AI/ML solutions to various platforms and optimize the performance on multiple hardware accelerators (like CPU/GPU/NPU). 
You will have expert knowledge in deployment aspects of large software C/C++ dependency stacks using best practices. You will also have to keep up with the fast-paced development happening in the industry and academia to continuously enhance our solution from a software engineering as well as a machine learning standpoint. Work Experience 9+ years of relevant work experience in software development. Live and breathe quality software development with excellent analytical and debugging skills. Strong understanding of processor architecture and system design fundamentals. Strong development & programming skills in C and C++. Experience with embedded systems development or equivalent. Excellent communication skills (verbal, presentation, written). Ability to collaborate across a globally diverse team and multiple interests. Preferred Qualifications Experience in embedded system development. Experience in C, C++, OOPS and Design patterns. Experience in Linux kernel or driver development is a plus. Strong OS concepts. Applicants: Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able to participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).
Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law. To all Staffing and Recruiting Agencies : Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications. If you would like more information about this role, please contact Qualcomm Careers. 3072736

Posted 4 days ago

5.0 years

0 Lacs

Gurugram, Haryana, India

On-site

Linkedin logo

We are seeking a passionate AI/ML Engineer to join our team in building the core AI-driven functionality of an intelligent visual data encryption system. The role involves designing, training, and deploying AI models (e.g., CLIP, DCGANs, Decision Trees), integrating them into a secure backend, and operationalizing the solution via AWS cloud services and Python-based APIs. Key Responsibilities: AI/ML Development Design and train deep learning models for image classification and sensitivity tagging using CLIP, DCGANs, and Decision Trees. Build synthetic datasets using DCGANs for class balancing. Fine-tune pre-trained models for customized encryption logic. Implement explainable classification logic for model outputs. Validate model performance using custom metrics and datasets. API Development Design and develop Python RESTful APIs using FastAPI or Flask for: Image upload and classification Model inference endpoints Encryption trigger calls Integrate APIs with AWS Lambda and Amazon API Gateway. AWS Integration Deploy and manage AI models on Amazon SageMaker for training and real-time inference. Use AWS Lambda for serverless backend compute. Store encrypted image data on Amazon S3 and metadata on Amazon RDS (PostgreSQL). Use AWS Cognito for secure user authentication and KMS for key management. Monitor job status via CloudWatch and enable secure, scalable API access. Required Skills & Experience: Must-Have 3–5 years of experience in AI/ML (especially vision-based systems). Strong experience with PyTorch or TensorFlow for model development. Proficient in Python with experience building RESTful APIs. Hands-on experience with Amazon SageMaker, Lambda, API Gateway, and S3. Knowledge of OpenSSL/PyCryptodome or basic cryptographic concepts. Understanding of model deployment, serialization, and performance tuning. Nice-to-Have Experience with CLIP model fine-tuning. Familiarity with Docker, GitHub Actions, or CI/CD pipelines.
Experience in data classification under compliance regimes (e.g., GDPR, HIPAA). Familiarity with multi-tenant SaaS design patterns. Tools & Technologies: Python, PyTorch, TensorFlow FastAPI, Flask AWS: SageMaker, Lambda, S3, RDS, Cognito, API Gateway, KMS Git, Docker, Postgres, OpenCV, OpenSSL

Posted 4 days ago

Featured Companies