CUDA Kernel Performance Engineer, Amazon AI

Amazon Development Centre Canada ULC • vancouver, metro vancouver regional district, Canada • Posted June 27, 2026

Location vancouver, metro vancouver regional district
Job Type Full-time
Category IT & Technology
Posted June 27, 2026
Join Amazon Devices as a CUDA Kernel Performance Engineer focused on high-performance GPU optimization for edge AI technologies. Deliver peak efficiency in model training and inference workflows.
In the AI Platform team, your role will be crucial in designing and implementing cutting-edge CUDA and Triton kernels. You will work alongside top-tier scientists and engineers to enhance compression algorithms and resolve performance bottlenecks. This position emphasizes your impact on the productivity of the entire team in deploying efficient AI models.
Key Responsibilities:
• Craft efficient CUDA and Triton kernels for edge AI tasks
• Conduct performance optimizations to expedite training processes
• Design profiling and testing infrastructure for kernel efficiency
• Maintain and enhance the training kernels library for ease of use
• Collaborate to unify software and hardware for deployment
Requirements:
• 3+ years of software development experience
• 2+ years ...

Interested in this role?

Click the button below to start your application.

Apply Now