Jobs.ca
Jobs.ca
Language
AMD logo

Software Development Engineer - GPU Debugger

AMDabout 7 hours ago
Hybrid
Markham, Ontario
$126,160 - $189,240/yearly
Mid Level
full_time

About the role

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. The Role AMD is seeking a Software Development Engineer to join the AI Developer Tools team. In this role, you will design and develop advanced GPU Debugging tools that enable developers to debug HPC, ML, and AI workloads. You will contribute to the ROCm ecosystem by building robust, scalable debugging solutions that empower developers to maximize AMD GPU capabilities.

THE PERSON

You are passionate about software engineering and performance optimization. You have a strong foundation in C++ and computer architecture, and you thrive in collaborative environments. You are detail-oriented, proactive in solving complex technical challenges, and able to communicate effectively across teams.

Key Responsibilities

  • Design, develop, and maintain ROCm Debugger components for GPU debugging
  • Collaborate with architecture, driver, and runtime teams to enable profiling for next-generation AMD GPUs involved in Pre Silicon and Post Silicon activities.
  • Implement new features and APIs to enhance debugging capabilities for AI and HPC workloads
  • Optimize profiling tools for accuracy, scalability, and minimal overhead
  • Debug and resolve issues in profiling workflows and improve tool reliability
  • Participate in hardware bring-up and ensure profiling support for new ASICs
  • Stay current with GPU architecture advancements and integrate them into ROCm profiling tools
  • Contribute to documentation and developer resources for ROCm Debugger

Preferred Experience

  • Strong proficiency in C++ and object-oriented programming
  • Experience with GDB, LLDB and other debug tools
  • Familiarity with GPU programming models (HIP, OpenCL, or CUDA)
  • Understanding of GPU architecture and system-level performance concepts
  • Experience with multithreading and concurrency in modern C++
  • Knowledge of Linux development environments; Windows experience is a plus
  • Familiarity with ROCm ecosystem and tools is highly desirable
  • Experience with Git-based workflows and debugging tools
  • Strong problem-solving skills and ability to work independently and in a team

Academic Credentials

  • Bachelor’s or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent

LOCATION: Markham, Canada

Benefits offered are described: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

About AMD

Semiconductor Manufacturing
10,000+

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. AMD together we advance_