CTGT

Machine Learning Engineer: LLM Interpretability & Systems

San Francisco | $175k–$250k | Full-time | Mid-level | Added 2 days ago

About this role

Join CTGT as a Senior Machine Learning Engineer to build production systems that make large language models reliable and controllable for enterprise use. You'll work directly with model internals using mechanistic interpretability techniques to operationalize research and enable deterministic AI governance.

What you'll do

  • Translate mechanistic interpretability research into production-grade code that improves model behavior
  • Develop and implement activation patching, control vectors, and feature extraction techniques for targeted model improvements
  • Build evaluation and deployment systems to reliably ship model modifications to enterprise environments
  • Design feature-level intervention systems for real-time policy enforcement at inference time
  • Work with commercial and open-source model internals across weights, activations, and architectures
  • Probe model mechanics to identify fundamental improvements beyond prompt engineering

What they're looking for

  • Transformer architectures and PyTorch internals
  • Deep learning mathematical foundations
  • Model training, fine-tuning, and optimization experience
  • Python, Rust, Node/TypeScript programming
  • Research paper implementation and evaluation
  • Machine learning systems design and deployment
  • Problem ownership and debugging complex systems
  • Mechanistic interpretability techniques

Benefits

  • Competitive base compensation with significant equity in a venture-backed company
  • Work on core systems with direct real-world impact in high-stakes environments
  • High autonomy and trust to form and execute technical opinions
  • Access to frontier ML infrastructure and models
  • Backing from prestigious investors including Google Gradient Ventures and Y Combinator