AI Model Architect

10 - 15 years

25 - 30 Lacs

Posted:1 week ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Meet the Team

We are an innovation team on a mission to transform how enterprises harness AI. Operating with the agility of a startup and the focus of an incubator, were building a tight-knit group of AI and infrastructure experts driven by >

We thrive in a fast-paced, experimentation-rich environment where new technologies arent just welcome theyre expected. Here, you'll work side-by-side with seasoned engineers, architects, and thinkers to craft the kind of iconic products that can reshape industries and unlock entirely new models of operation for the enterprise.

If you're energized by the challenge of solving hard problems, love working at the edge of what's possible, and want to help shape the future of AI infrastructure we'd love to meet you.

Your Impact

AI Model Architect

As an Architect, you will be responsible for both mentoring a high-caliber team as well as hands-on design to deliver robust AI models and workflows that learn from, recommend, and optimize the uptime, quality, and performance of customer infrastructure. Youll also guide strategic direction on resource utilization in generative AI systems, working cross-functionally to align product direction with infrastructure capabilities.

Key Responsibilities:

  • Architect and design datasets from infrastructure and operational telemetry.
  • Architect, select, and fine-tune/train AI and Generative AI model that can detect patterns on time series datasets.
  • Fine-tune and train AI and Generative AI models that can interreact with tools and take actions.
  • Understanding k8s and other infrastructure components and their usages.
  • Build Model Context Protocol (MCP) tools to support Agentic Workflows
  • Demonstrate a deep understanding of AI frameworks that support Nvidia and AMD GPUs.
  • Plan and coordinate software engineering work, map tasks to releases, conduct code reviews, and resolve technical challenges to unblock releases.
  • Generate architecture specifications and build proof-of-concept (POC) solutions for clarity when needed.
  • Collaborate with product management to understand customer requirements and build architecturally sound solutions. Work closely with engineers on implementation and track progress to ensure alignment with architectural requirements.

Minimum Qualifications:

  • Demonstrable experience in following
    • AI and Generative AI model training and fine-tuning.
    • KV Cache management and context length impact in LLM inferencing.
    • Python, PyTorch, TensorRT and other AI frameworks
    • CUDA, Nsight and other nvidia tools.
    • vLLM, LLM-D and other runtime for LLMs.
  • Comprehensive understanding of software release processes
  • Proficiency in using agents building pipelines.
  • Bachelors degree or equivalent with 10+ years of engineering experience.

Preferred Qualifications:

  • Demonstrable technical leadership through publish papers in industry conferences and publications, and issued patents
  • Proven leadership experience in architecture and design of Retrieval Augmented Generation workflows.
  • Demonstrable experience collecting and using system metrics in AI training/fine-tuning and inference
  • Masters degree or equivalent.

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Cisco logo
Cisco

Software Development

San Jose CA

RecommendedJobs for You