Meshy

AI Infrastructure Engineer

Bay Area OfficefulltimemidAdded today

About this role

Meshy, a leading 3D generative AI company, seeks an AI Infrastructure Engineer to design and optimize the inference platform that powers their model serving stack. You'll work on GPU resource management, service orchestration, and production reliability while supporting rapid growth in a well-funded startup.

What you'll do

Design and optimize core inference platform capabilities including services, task scheduling, orchestration, and elastic scaling
Develop CPU/GPU resource management systems to balance stability, utilization, and cost efficiency across inference and training workloads
Implement unified GPU resource scheduling and explore technologies like MIG, MPS, and virtualization in production
Optimize throughput, latency, and availability across complex inference pipelines and high-concurrency scenarios
Drive R&D efficiency, cost management, and disaster recovery architecture to support company scaling
Research AI-native infrastructure and automated operations to improve system reliability and usability

What they're looking for

Go or Python programming with strong software engineering practices
Kubernetes, Docker, and container orchestration
Distributed systems and microservices architecture
Linux, operating systems, computer networks fundamentals
GPU inference platforms and resource scheduling
CI/CD, build systems, and deployment infrastructure
Model serving and task orchestration frameworks
Problem-solving and debugging complex production systems

Apply on the employer's site →

Opens the official application on the employer’s site. No login required.