Meshy
AI Infrastructure Engineer
Bay Area OfficefulltimemidAdded today
About this role
Meshy, a leading 3D generative AI company, seeks an AI Infrastructure Engineer to design and optimize the inference platform that powers their model serving stack. You'll work on GPU resource management, service orchestration, and production reliability while supporting rapid growth in a well-funded startup.
What you'll do
- Design and optimize core inference platform capabilities including services, task scheduling, orchestration, and elastic scaling
- Develop CPU/GPU resource management systems to balance stability, utilization, and cost efficiency across inference and training workloads
- Implement unified GPU resource scheduling and explore technologies like MIG, MPS, and virtualization in production
- Optimize throughput, latency, and availability across complex inference pipelines and high-concurrency scenarios
- Drive R&D efficiency, cost management, and disaster recovery architecture to support company scaling
- Research AI-native infrastructure and automated operations to improve system reliability and usability
What they're looking for
- Go or Python programming with strong software engineering practices
- Kubernetes, Docker, and container orchestration
- Distributed systems and microservices architecture
- Linux, operating systems, computer networks fundamentals
- GPU inference platforms and resource scheduling
- CI/CD, build systems, and deployment infrastructure
- Model serving and task orchestration frameworks
- Problem-solving and debugging complex production systems
Opens the official application on the employer’s site. No login required.