
5 TensorRT Jobs

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

8.0 - 13.0 years

10 - 14 Lacs

Bengaluru

Work from Office


General Summary: As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm Systems Engineer, you will research, design, develop, simulate, and/or validate systems-level software, hardware, architecture, algorithms, and solutions that enable the development of cutting-edge technology. Qualcomm Systems Engineers collaborate across functional teams to meet and exceed system-level requirements and standards.

Minimum Qualifications: Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 8+ years of Systems Engineering or related work experience; OR Master's degree in Engineering, Information Systems, Computer Science, or related field and 7+ years of Systems Engineering or related work experience; OR PhD in Engineering, Information Systems, Computer Science, or related field and 6+ years of Systems Engineering or related work experience.

Principal Engineer, Machine Learning

We are looking for a Principal AI/ML Engineer with expertise in model inference, optimization, debugging, and hardware acceleration. This role will focus on building efficient AI inference systems, debugging deep learning models, optimizing AI workloads for low latency, and accelerating deployment across diverse hardware platforms. In addition to hands-on engineering, this role involves cutting-edge research in efficient deep learning, model compression, quantization, and AI hardware-aware optimization techniques. You will explore and implement state-of-the-art AI acceleration methods while collaborating with researchers, industry experts, and open-source communities to push the boundaries of AI performance.

This is an exciting opportunity for someone passionate about both applied AI development and AI research, with a strong focus on real-world deployment, model interpretability, and high-performance inference.

Education & Experience: 20+ years of experience in AI/ML development, with at least 5 years in model inference, optimization, debugging, and Python-based AI deployment. Master's or Ph.D. in Computer Science, Machine Learning, or AI.

Leadership & Collaboration: Lead a team of AI engineers in Python-based AI inference development. Collaborate with ML researchers, software engineers, and DevOps teams to deploy optimized AI solutions. Define and enforce best practices for debugging and optimizing AI models.

Key Responsibilities

Model Optimization & Quantization: Optimize deep learning models using quantization (INT8, INT4, mixed precision, etc.), pruning, and knowledge distillation. Implement Post-Training Quantization (PTQ) and Quantization-Aware Training (QAT) for deployment. Familiarity with TensorRT, ONNX Runtime, OpenVINO, and TVM.

AI Hardware Acceleration & Deployment: Optimize AI workloads for Qualcomm Hexagon DSP, GPUs (CUDA, Tensor Cores), TPUs, NPUs, FPGAs, Habana Gaudi, and Apple Neural Engine. Leverage Python APIs for hardware-specific acceleration, including cuDNN, XLA, and MLIR. Benchmark models on AI hardware architectures and debug performance issues.

AI Research & Innovation: Conduct state-of-the-art research on AI inference efficiency, model compression, low-bit precision, sparse computing, and algorithmic acceleration. Explore new deep learning architectures (Sparse Transformers, Mixture of Experts, Flash Attention) for better inference performance. Contribute to open-source AI projects and publish findings in top-tier ML conferences (NeurIPS, ICML, CVPR). Collaborate with hardware vendors and AI research teams to optimize deep learning models for next-gen AI accelerators.

Details of Expertise: Experience optimizing LLMs, LVMs, and LMMs for inference. Experience with deep learning frameworks: TensorFlow, PyTorch, JAX, ONNX. Advanced skills in model quantization, pruning, and compression. Proficiency in CUDA programming and Python GPU acceleration using CuPy, Numba, and TensorRT. Hands-on experience with ML inference runtimes (TensorRT, TVM, ONNX Runtime, OpenVINO). Experience working with runtime delegates (TFLite, ONNX, Qualcomm). Strong expertise in Python programming, writing optimized and scalable AI code. Experience with debugging AI models, including examining computation graphs using Netron Viewer, TensorBoard, and ONNX Runtime Debugger. Strong debugging skills using profiling tools (PyTorch Profiler, TensorFlow Profiler, cProfile, Nsight Systems, perf, Py-Spy). Expertise in cloud-based AI inference (AWS Inferentia, Azure ML, GCP AI Platform, Habana Gaudi). Knowledge of hardware-aware optimizations (oneDNN, XLA, cuDNN, ROCm, MLIR, SparseML). Contributions to the open-source community. Publications in international forums, conferences, and journals.

Posted 2 weeks ago


15.0 - 24.0 years

60 - 65 Lacs

Noida, Chennai, Bengaluru

Work from Office


We are seeking a highly skilled Generative AI Consulting Director to join our dynamic team. The director will lead our consulting team, manage the delivery of consulting services, guide clients through the implementation of our Gen AI platform, and ensure the successful adoption of the platform across industries.

Key Responsibilities: Lead or mentor a global team of AI consultants, solution architects, and professional services teams. Develop and execute the strategy for consulting and professional services for the Gen AI platform. Manage the end-to-end implementation of our platform in client environments, ensuring high-quality implementations, on-time delivery, and alignment with customer expectations. Work closely with clients to understand their business challenges and design tailored solutions using the Gen AI platform. Lead the development of solution architectures, ensuring that proposed solutions are scalable, innovative, and aligned with the client's objectives. Collaborate with product development, GTM, and engineering teams to ensure successful implementation and integrations. Provide feedback to product teams based on client needs and market trends to continuously improve the platform's offerings. Drive client success by ensuring that Gen AI platform implementations deliver measurable value and return on investment (ROI). Work closely with clients to define success metrics, track project outcomes, and guide the optimization of AI models and systems post-implementation. Manage P&L for the Consulting and Professional Services division, ensuring profitability through effective project management, cost control, and client retention. Develop and implement strategies to drive revenue growth within the professional services arm.

Ethical and Responsible AI: Adhere to ethical AI practices, such as fairness, transparency, and accountability. Address biases and potential risks associated with AI systems to ensure responsible deployment and usage.

Research and Innovation: Stay updated with the latest advancements in AI technologies, frameworks, and algorithms. Conduct research and experimentation to explore innovative approaches and techniques that can enhance AI capabilities.

Mandatory Qualifications/Skills: A bachelor's or master's degree, or equivalent, in Computer Science, Artificial Intelligence, or a related field. 15+ years of experience in consulting or professional services, with at least 5 years in a leadership role overseeing a team of AI consultants or solution architects. Extensive experience in delivering Generative AI solutions and familiarity with AI platforms, including knowledge of NLP, deep learning, and reinforcement learning. Experience with large language models (LLMs) and prompt engineering. Solid understanding of fine-tuning techniques, such as full fine-tuning and PEFT techniques like LoRA and QLoRA, and of which strategy to adopt for various use cases. Proficiency in languages such as Python, Scala, or Java. In-depth knowledge of both relational databases (e.g., MySQL, PostgreSQL) and NoSQL databases (e.g., vector databases, MongoDB, Cassandra). Expertise in Gen AI/AI libraries and frameworks, including but not limited to LangChain, LangGraph, LangSmith, TensorFlow, PyTorch, scikit-learn, and Keras. Proven understanding of cloud computing platforms (e.g., AWS, Azure, Google Cloud) and experience deploying AI models on these platforms. Proven experience in managing client relationships and understanding their business needs to deliver successful AI solutions. Strong understanding of AI systems architecture and the ability to design and implement complex AI solutions for clients across various industries. Experience with project management methodologies, and a proven ability to manage large, complex projects to successful completion. Excellent leadership, mentoring, and team-building skills, with a track record of developing high-performing teams. Strong business acumen, with the ability to balance technical expertise with client-centric decision making. Outstanding communication and presentation skills, capable of engaging with senior executives and non-technical stakeholders. Strong problem-solving and analytical skills, with the ability to think creatively and provide innovative solutions.

Preferred Skills: Knowledge of NVIDIA CUDA, cuDNN, TensorRT, and experience with NVIDIA GPU hardware and the software stack. Familiarity with High Performance Computing (HPC) and its integration with AI workloads. Familiarity with Big Data platforms and technologies, such as Hadoop or Apache Spark, and their integration with AI solutions.

Posted 3 weeks ago


5.0 - 7.0 years

45 - 50 Lacs

Mumbai, New Delhi, Bengaluru

Work from Office


Job Overview: We are looking for a Senior Computer Vision Machine Learning Engineer to lead the development of real-time CV/ML systems, with an emphasis on deploying models on edge platforms like the NVIDIA IGX Orin. The ideal candidate will have experience in designing robust vision pipelines, training and optimizing deep learning models, and working closely with hardware platforms for deployment.

Responsibilities: Lead the design, development, and deployment of end-to-end computer vision and deep learning models. Optimize and deploy CV/ML pipelines on edge platforms, particularly NVIDIA IGX (Orin preferred). Work with cross-functional teams to integrate models into real-time applications (e.g., robotics, safety systems, industrial inspection). Develop and maintain datasets, perform data augmentation, and ensure quality training inputs. Leverage NVIDIA SDKs (e.g., DeepStream, TensorRT, TAO Toolkit, CUDA) for performance and acceleration. Collaborate with hardware engineers to fine-tune models for power, latency, and throughput constraints. Stay up to date with the latest research and techniques in computer vision, edge AI, and embedded ML.

Requirements: Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field. 5+ years of experience in Computer Vision and Machine Learning (deep learning emphasis). Proficiency in Python, C++, TensorFlow, and PyTorch. Strong understanding of model optimization techniques for edge deployment. Hands-on experience with NVIDIA platforms: IGX, Jetson, or Xavier (IGX Orin highly preferred). Experience with NVIDIA SDKs (e.g., DeepStream, TensorRT, CUDA, TAO Toolkit). Solid knowledge of vision tasks: object detection, tracking, classification, segmentation. Familiarity with containerization (Docker), CI/CD pipelines, and version control (Git).

Preferred Qualifications: Experience in industrial AI, medical imaging, or robotics. Exposure to RTOS, safety-critical systems, or IEC 61508/ISO 26262 environments. Familiarity with ONNX, OpenCV, ROS, or GStreamer.

What We Offer: Opportunity to work on cutting-edge AI/edge technology with real-world impact. Collaborative and fast-paced engineering culture. Flexible working hours and remote work options. Competitive salary and benefits package.

Location: Remote, Delhi NCR, Bangalore, Chennai, Pune, Kolkata, Ahmedabad, Mumbai, Hyderabad

Posted 4 weeks ago


3.0 - 8.0 years

5 - 10 Lacs

Noida

Work from Office


About The Role: We're building an agentic AI platform that turns one line of text and a video feed into end-to-end, real-time computer-vision solutions: think semantic video search, object / action recognition, and task-oriented visual agents deployable with a single click. As a Gen AI ML Engineer, you'll architect the core vision & multimodal-reasoning stack and pave the road from prototype to production.

Roles And Responsibilities

Semantic video search: Ship a pipeline that allows users to type "show every forklift near aisle 5 in the last 30 minutes" and get keyed-off clips in [...]. Wire embeddings to a hybrid FAISS/HNSW index; surface results through a simple REST & React playground.

Create agentic pipelines: Chain vision language models and zero/few-shot vision models with LLM planners (Gemini, GPT-4o, AutoGen, etc.) so a single prompt becomes a multi-step perception workflow. Profile and accelerate inference (TensorRT, ONNX, quantization, batching) to meet latency / throughput targets on GPU and CPU fleets.

Rapid prototyping loops: Run weekly paper-to-prototype spikes: reproduce a fresh arXiv idea, benchmark, and decide go/no-go in [...]. Hand successful Python scripts & checkpoints to MLOps for productionization; no plumbing marathons.

Data & Evaluation: Spin up scalable pipelines for video ingestion, labeling (active learning, weak supervision), experiment tracking, and continuous evaluation.

Collaborate & Lead: Partner with product and MLOps engineers; set research direction, mentor future hires, and establish best practices.

Must-have Skill Set: 1-3 years deep-learning research experience (internships & grad work count). Fluency in Python + PyTorch; comfortable hacking large vision/LLM repos. Proof you ship ideas: first-author paper, OSS repo, Kaggle medal, or faithful reproduction of a cutting-edge model. Hands-on with LLM prompting/fine-tuning and at least one agent framework. Able to turn fuzzy product asks into measurable experiments and explain results clearly.

Bonus Cred: Large-scale video retrieval or temporal grounding experience. Prior work building agentic-AI pipelines that combine perception models with LLM reasoning. Open-source contributions to GenAI/vision libs (OpenCLIP, Vid2Seq, ViperGPT, etc.).

What can you expect? Ability to shape the future of manufacturing by leveraging best-in-class AI and software; we are a unique organization with a niche skill set that you would also develop while working with us. World-class work culture, coaching, and development. Mentoring from highly experienced leadership from world-class companies (refer to the Ripik.AI website for details). International exposure.

Work Location: NOIDA (Work from Office)

Posted 1 month ago


17 - 27 years

100 - 200 Lacs

Bengaluru

Work from Office


Senior Software Technical Director / Software Technical Director, Bangalore

Founded in 2023 by industry veterans, with HQ in California, US. We are revolutionizing sustainable AI compute through intuitive software with composable silicon.

We are looking for a Software Technical Director with a strong technical foundation in systems software, Linux platforms, or machine learning compiler stacks to lead and grow a high-impact engineering team in Bangalore. You will be responsible for shaping the architecture, contributing to codebases, and managing execution across projects that sit at the intersection of systems programming, AI runtimes, and performance-critical software.

Key Responsibilities

Technical Leadership: Lead the design and development of Linux platform software, firmware, or ML compilers and runtimes. Drive architecture decisions across compiler, runtime, or low-level platform components. Write production-grade C++ code and perform detailed code reviews. Guide performance analysis and debugging across the full stack, from firmware and drivers to user-level runtime libraries. Collaborate with architects, silicon teams, and ML researchers to build future-proof software stacks.

Team & Project Management: Mentor and coach junior and senior engineers to grow technical depth and autonomy. Own end-to-end project planning, execution, and delivery, ensuring high-quality output across sprints/releases. Facilitate strong cross-functional communication with hardware, product, and other software teams globally. Recruit and grow a top-tier engineering team in Bangalore, contributing to the hiring strategy and team culture.

Required Qualifications: Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field. 18+ years of experience in systems software development with significant time spent in C++, including architectural and hands-on roles. Proven experience in either Linux kernel, bootloaders, firmware, or low-level platform software, or Machine Learning compilers (e.g., MLIR, TVM, Glow) or runtimes (e.g., ONNX Runtime, TensorRT, vLLM). Excellent communication skills, both written and verbal. Prior experience in project leadership or engineering management with direct reports.

Highly Desirable: Understanding of AI/ML compute workloads, particularly Large Language Models (LLMs). Familiarity with performance profiling, bottleneck analysis, and compiler-level optimizations. Exposure to AI accelerators, systolic arrays, or vector SIMD programming.

Why Join Us? Work at the forefront of AI systems software, shaping the future of ML compilers and runtimes. Collaborate with globally distributed teams in a fast-paced, innovation-driven environment. Build and lead a technically elite team from the ground up in a growth-stage organization.

Contact: Uday, Mulya Technologies, muday_bhaskar@yahoo.com. "Mining The Knowledge Community"

Posted 1 month ago
