Skip to main content

Harmonic

Research Engineer, Training & Inference

Palo Alto$200k–$450kfulltimemidAdded today

About this role

Harmonic seeks a Research Engineer to optimize and maintain their custom reinforcement learning and inference stack, from Python APIs to CUDA kernels. You'll focus on maximizing throughput for training and serving high-performance AI systems that solve complex mathematical reasoning problems.

What you'll do

  • Maintain and optimize proprietary RL training and inference infrastructure across all layers
  • Maximize reinforcement learning throughput through sharded multi-node training and inference optimization
  • Optimize inference serving stack for high-throughput RL and low-latency LLM production traffic
  • Identify and resolve performance bottlenecks in distributed clusters for multi-billion parameter models
  • Write or improve custom kernels to resolve low-level compute bottlenecks
  • Design systems balancing memory efficiency with compute-intensive training cycles

What they're looking for

  • Python proficiency
  • ML framework experience (PyTorch, JAX, or TensorFlow)
  • Distributed training concepts and collective communication (NCCL)
  • GPU-accelerated infrastructure deployment and profiling
  • C++ (preferred)
  • Custom kernel development (Triton, CUDA, CUTLASS)
  • Multi-node GPU cluster scaling (FSDP, Tensor Parallelism)
  • Reinforcement learning system design

Benefits

  • Unlimited PTO
  • 401(k) matching
  • 100% employer-paid health, vision, and dental for employees; 50% for dependents
  • Health Savings Account (HSA) available
  • Work on cutting-edge mathematical reasoning AI
  • Elite technical team environment
Apply on the employer's site

Opens the official application on the employer’s site. No login required.