Senior AI Research Engineer, Model Inference (Remote)
Location
Gasteiz, Kingdom of Spain
Job Type
Full-time
Category
Engineering
Posted
June 02, 2026
About the Job
Improve your chances of getting an interview by reading the following overview of this role before you apply.
We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine‑tuning, and GPU acceleration. The engineer will extend the inference framework to support both inference and fine‑tuning of language models, with a strong focus on mobile and integrated GPU acceleration (Vulkan).
Responsibilities
Implement and optimize custom inference and fine‑tuning kernels for small and large language models across multiple hardware backends.
Implement and optimize full and LoRA fine‑tuning for small and large language models across multiple hardware backends.
Design and extend datatype and precision support (int, float, mixed precision, ternary QTypes, etc.).
Design, customize, and optimize Vulkan compute shaders for quantized operators and fine‑tuning workflows...