Jobs

Interviews
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home

Jobs

Home
>
Jobs in Hyderabad
>
Advanced Micro Devices, Inc
>
HPC Software Optimization Engineer - C++

HPC Software Optimization Engineer - C++

Advanced Micro Devices, Inc

1 - 6 years

11 - 12 Lacs

Hyderabad

Posted:6 hours ago| Platform:

Apply

Skills Required

Performance tuning C++ Automation Software design Debugging Open source Gaming Testing Python Recruitment

Work Mode

Work from Office

Job Type

Full Time

Job Description

Join AMD s high-impact team at the heart of innovation in AI, ML, and high-performance computing (HPC). We re a collaborative group of software architects and GPU engineers focused on pushing the boundaries of AI model performance across distributed, GPU-accelerated platforms. Our work drives the next generation of AMD s AI software stack, enabling large-scale machine learning training and inference workloads in data centers and enterprise environments.

THE ROLE:

As a Senior Software Developer, you will develop both GPU kernel-level optimization and distributed software efforts for large-scale AI workloads. This is a technical leadership role with direct influence over critical software components in AMD s AI stack. You ll architect and implement optimized compute kernels, guide software teams through the full product lifecycle, and work closely with internal and external partners to deploy scalable, high-performance solutions.

THE PERSON:

We re looking for a highly skilled, deep systems thinker who thrives in complex problem domains involving parallel computing, GPU architecture, and AI model execution. You are confident leading software architecture decisions and know how to translate business goals into robust, optimized software solutions. You re just as comfortable writing performance-critical code as you are guiding agile development teams across product lifecycles. Ideal candidates have a strong balance of low-level programming, distributed systems knowledge, and leadership experience paired with a passion for AI performance at scale.

KEY RESPONSIBILITIES:

GPU Kernel Optimization
:
Develop and optimize GPU kernels to accelerate inference and training of large machine learning models while ensuring numerical accuracy and runtime efficiency.
Multi-GPU and Multi-Node Scaling: Architect and implement strategies for distributed training/inference across multi-GPU/multi-node environments using model/data parallelism techniques.
Performance Profiling: Identify bottlenecks and performance limitations using profiling tools; propose and implement optimizations to improve hardware utilization.
Parallel Computing
:
Design and implement multi-threaded and synchronized compute techniques for scalable execution on modern GPU architectures.
Benchmarking & Testing: Build robust benchmarking and validation infrastructure to assess performance, reliability, and scalability of deployed software.
Documentation & Best Practices: Produce technical documentation and share architectural patterns, code optimization tips, and reusable components.

PREFERRED EXPERIENCE:

Software Team Leadership

Collaboration with customers and business units to define deliverables and roadmaps.
Interfacing with executive leadership on program progress and strategic planning.
Experience in production-level software deployment (e.g., upstreaming to open source, commercial rollouts).

Software Architecture

Deep experience with GPU kernel optimization in
C++12/17/20
.
Working knowledge of frameworks such as
PyTorch, vLLM, Cutlass, Kokkos
.
Practical expertise in CPU/GPU architecture and system-level performance tuning.
Proficiency in
Python
scripting and infrastructure automation.
Application of software design patterns and industry-standard engineering practices.

GPU & Low-Level Optimization

Hands-on experience with
CUDA
and low-level GPU programming.
Kernel optimization in
assembly
and tight loops for latency-sensitive code.
Proficiency with performance profiling tools (Nsight, VTune, Perf, etc.).
Experience with distributed computing strategies in AI environments (multi-GPU, NCCL, MPI).
Strong debugging, problem-solving, and performance tuning skills in complex systems.

ACADEMIC CREDENTIALS:

Bachelor s or Master s degree
in Computer Engineering, Electrical Engineering, Computer Science, or a related technical field.
Advanced degrees or published work in HPC, GPU computing, or AI systems is a plus.

#LI-NR1

Benefits offered are described:

AMD benefits at a glance .

More Jobs at Advanced Micro Devices, Inc

PMTS Silicon Design Engineer

Bengaluru

Experience: Not specified

INR 15 - 20 Lacs

Sr. Commercial Sales Rep

Bengaluru

10 - 12 yrs

INR 7 - 11 Lacs

MTS Software Development Engineer

Bengaluru

10 - 15 yrs

INR 9 - 13 Lacs

SMTS Silicon Design Engineer

Bengaluru

12 - 15 yrs

INR 15 - 20 Lacs

Hardware Development Lead (High-Speed Digital Design / Board Design)

Hyderabad

12 - 15 yrs

INR 15 - 20 Lacs

Mock Interview

Practice Video Interview with JobPe AI

Start Performance Tuning Interview Now

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

Advanced Micro Devices, Inc

Semiconductors

Sunnyvale

15,500 Employees

206 Jobs

Key People

Dr. Lisa Su

President and CEO
Devinder Kumar

Executive Vice President, Finance and CFO
Mark Papermaster

Executive Vice President and Chief Technology Officer
Ruth Cotter

Senior Vice President, Human Resources and Global Communications

Before You Leave... Find Your Perfect Job!

Login to

Please Verify Your Phone or Email

Confirm Action

Contact Us

Search

Profile

Upskill and Grow with AI

Personal Settings

HPC Software Optimization Engineer - C++

Experience & Salary

Skills Required

Work Mode

Job Type

Job Description

THE ROLE:

THE PERSON:

KEY RESPONSIBILITIES:

:

:

PREFERRED EXPERIENCE:

C++12/17/20

PyTorch, vLLM, Cutlass, Kokkos

Python

CUDA

assembly

ACADEMIC CREDENTIALS:

Bachelor s or Master s degree

More Jobs at Advanced Micro Devices, Inc