Senior LLM Inference Performance Engineer
Location
Helsinki, Uusimaa
Job Type
Full-time
Category
Quality Engineering
Posted
June 06, 2026
Role Overview
This position focuses on performance analysis and optimization of production‑grade AI services within the AMD Inference Microservice (AIM) ecosystem. You will be part of a diverse team responsible for ensuring reliable performance of AI microservices across varied hardware configurations. The role requires a deep understanding of large language models (LLMs) and hands‑on knowledge of AI tooling such as inference servers.
Key Responsibilities
- Measure, analyze, and optimize LLM and AI service performance across metrics like latency and throughput for various training and inference use cases.
- Design and implement methodologies for measuring model performance and automate optimization strategies to identify optimal configurations.
- Stay on top of current advances in AI, models, APIs, and open‑source ecosystems, and translate them into scalable solutions.
LLM and AI Tooling
- Design and develop tooling to measure and an...