openai
Applied AI Engineer, Codex Core Agent
San Franciscofulltimemid
About this role
Join OpenAI's Codex Core Agent team to transform AI research into production-ready agent systems that solve real software engineering tasks. You'll bridge research and product by improving agent performance, reliability, and user value through evaluation, experimentation, and systematic failure analysis.
What you'll do
- Design and iterate on agent behaviors for real-world coding tasks and long-horizon workflows
- Develop evaluations to measure agent performance, identify failures, and detect regressions
- Optimize agent performance through prompting, tool-use strategies, and context construction
- Analyze production failures and improve robustness and reliability systematically
- Build feedback loops and data systems for better real-task data in evaluation
- Collaborate with product teams on user-facing agent experiences and interfaces
What they're looking for
- Python programming
- Machine learning and LLM product development
- Model evaluation and prompt design
- Agent frameworks and tool-using LLM systems
- Systems thinking and debugging complex failures
- Data analysis and handling large datasets
- Fine-tuning and model experimentation
- Code generation or developer tooling experience
Opens the official application on the employer’s site. No login required.