Skip to main content

Saviynt

AI Platform Engineer, Training and Inference

Milpitas, California$240k–$260kfull-timemidAdded 2 days ago

About this role

Saviynt is seeking an AI Platform Engineer to manage and optimize distributed training and inference systems for their AI-powered identity platform. This role focuses on ensuring the efficient deployment and scaling of AI models, enhancing operational effectiveness while maintaining security and compliance.

What you'll do

  • Manage the Ray ecosystem, including KubeRay on GKE
  • Operate distributed training with Ray Train and H100 clusters
  • Build and manage the LLM inference mesh using Ray Serve
  • Optimize inference performance and autoscaling features
  • Design model routing layers for efficient deployment
  • Oversee the full model promotion lifecycle and retrain pipelines

What they're looking for

  • Experience in ML engineering and MLOps
  • Proficiency in Ray Train, Serve, Core, and Data
  • Hands-on knowledge with LLM serving engines
  • Familiarity with distributed training and RL concepts
  • Experience with model lifecycle operations
  • Strong programming skills in Python and PyTorch
  • Understanding of vector databases and ANN indexing
  • Knowledge of quantization techniques (nice to have)

Benefits

  • Competitive total rewards package
  • Opportunities for learning and career advancement
  • Potential for discretionary bonus participation
  • [unknown]
  • [unknown]
  • [unknown]
Apply on the employer's site

Opens the official application on the employer’s site. No login required.