xAI
Software Engineer: Network (C++)
Palo Alto, CA; Seattle, WA
Mid-level
About this role
Join xAI as a Software Engineer working on the high-performance datacenter network for Colossus. You'll develop software that improves the efficiency and reliability of AI training across a massive GPU infrastructure.
What you'll do
- Create routing and traffic-engineering algorithms for the network.
- Develop reliable real-time software for network switches.
- Participate in architecture, design, and code reviews.
- Conduct experiments to validate design decisions.
- Build development and testing tools across various environments.
- Deploy software updates with rigorous monitoring.
What they're looking for
- Proficiency in C or C++ programming.
- Experience in high-performance software development.
- Knowledge of networking protocols (UDP, TCP/IP, RDMA).
- Familiarity with distributed systems and datacenter fabrics.
- Background in real-time and high-performance computing.
- Strong analytical and problem-solving skills.
- Excellent communication abilities.
- Ability to adapt in a dynamic environment.