Skip to main content

Meshy

AI Infrastructure Engineer

Bay Area OfficefulltimemidAdded today

About this role

Meshy, a leading 3D generative AI company, seeks an AI Infrastructure Engineer to design and optimize the inference platform that powers their model serving stack. You'll work on GPU resource management, service orchestration, and production reliability while supporting rapid growth in a well-funded startup.

What you'll do

  • Design and optimize core inference platform capabilities including services, task scheduling, orchestration, and elastic scaling
  • Develop CPU/GPU resource management systems to balance stability, utilization, and cost efficiency across inference and training workloads
  • Implement unified GPU resource scheduling and explore technologies like MIG, MPS, and virtualization in production
  • Optimize throughput, latency, and availability across complex inference pipelines and high-concurrency scenarios
  • Drive R&D efficiency, cost management, and disaster recovery architecture to support company scaling
  • Research AI-native infrastructure and automated operations to improve system reliability and usability

What they're looking for

  • Go or Python programming with strong software engineering practices
  • Kubernetes, Docker, and container orchestration
  • Distributed systems and microservices architecture
  • Linux, operating systems, computer networks fundamentals
  • GPU inference platforms and resource scheduling
  • CI/CD, build systems, and deployment infrastructure
  • Model serving and task orchestration frameworks
  • Problem-solving and debugging complex production systems
Apply on the employer's site

Opens the official application on the employer’s site. No login required.