Machine Learning Engineer

Insight Global • San Jose, CA, United States • Posted June 03, 2026

Location San Jose, CA
Job Type Full-time
Category other-general
Posted June 03, 2026
Job Description
Insight Global is seeking a team of experienced, driven Machine Learning Engineer to join an established health technology company sitting in San Jose, CA. This is a full-time, permanent role with competitive salary, bonus, and comprehensive benefits.

In this role you'll need:
Deep Learning Frameworks: Hands-on experience with PyTorch (main focus) and familiarity with TensorFlow.

Large-Scale Model Training: Exposure to advanced training techniques like Distributed Data Parallel (DDP), Fully Sharded Data Parallel (FSDP), ZeRO, and model parallelism (pipeline/tensor). Experience with distributed training is a strong plus.

Model Optimization: Skilled in improving model performance through techniques like quantization (PTQ, QAT, AWQ, GPTQ), pruning, knowledge distillation, KV-cache tuning, and using efficient attention mechanisms like Flash Attention.

Scalable Model Serving: Understanding of how to deploy models at scale, including ...

Interested in this role?

Click the button below to start your application.

Apply Now