Senior LLM Inference Performance Engineer

AMD • helsinki, uusimaa, Finland • Posted June 06, 2026

Location helsinki, uusimaa

Job Type Full-time

Category Quality Engineering

Posted June 06, 2026

Role Overview Position focused on performance analysis and optimization of production‑grade AI services within the AMD Inference Microservice (AIM) ecosystem. Part of a diverse team responsible for ensuring reliable performance of AI micro‑services on varied hardware configurations. Requires deep understanding of large language models (LLMs) and hands‑on knowledge of AI tooling such as inference servers. 
Key Responsibilities Measure, analyze, and optimize LLM and AI service performance across metrics like latency and throughput for various training and inference use cases. 
Design and implement methodologies for measuring model performance and automate optimization strategies to identify optimal configurations. 
Stay on top of current advances in AI, models, APIs, and open‑source ecosystems, and translate them into scalable solutions. 
LLM and AI Tooling Design and develop tooling to measure and an...
            

Interested in this role?

Click the button below to start your application.

Apply Now