AI/ML Architect

6 years

0 Lacs

Posted: 4 days ago | Platform: SimplyHired

Work Mode

On-site

Job Type

Full Time

Job Description

Job Summary


We are seeking an experienced AI/ML Architect to lead the design and development of scalable, real-time AI systems. You will work closely with product, data, and engineering teams to architect end-to-end solutions, from model development and deployment to system integration and production monitoring.


Key Responsibilities


  • Design and architect AI/ML systems that are scalable, low-latency, and production-ready

  • Lead development of real-time inference pipelines for use cases such as voice, vision, or NLP

  • Select and integrate appropriate tools, frameworks, and infrastructure (e.g., Kubernetes, Kafka, TensorFlow, PyTorch, ONNX, Triton, vLLM)

  • Collaborate with data scientists and ML engineers to productionize models

  • Ensure reliability, observability, and performance of deployed systems

  • Conduct architecture reviews, POCs, and system optimizations

  • Mentor engineers and help set best practices for the ML lifecycle (MLOps)


Requirements


  • 6+ years of experience building and deploying ML systems in production

  • Proven expertise in real-time, low-latency system design (e.g., streaming inference, event-driven pipelines)

  • Strong understanding of scalable architectures: microservices, message queues, and distributed training/inference

  • Proficient in Python and popular ML/DL frameworks (scikit-learn, TensorFlow, PyTorch)

  • Hands-on experience with LLM inference optimization using frameworks like vLLM, TensorRT-LLM, and SGLang

  • Familiarity with vector databases, embedding-based retrieval, and RAG pipelines

  • Experience with containerized environments (Docker, Kubernetes) and managing multi-container applications

  • Working knowledge of cloud platforms (AWS, GCP, or Azure) and CI/CD practices for ML workflows

  • Exposure to edge deployments and model compression/optimization techniques

  • Strong foundation in software engineering principles and system design


Nice to Haves


  • Experience with Linux (Ubuntu)

  • Terminal and Bash scripting

Date Opened: 07/08/2025

Job Type: Full time

Years of Experience: 10 - 12 Years

Domain: Chemicals

City: Chennai

State/Province: Tamil Nadu

Country: India

Zip/Postal Code: 600001

Recode Solutions

Information Technology

Techville
