42dot
LLM Engineer (Data Generation)
Pangyo (Software Dream Center), South Korea (Remote) | Full-time | Mid-level
About this role
The LLM Engineer (Data Generation) focuses on enhancing model performance through the design, generation, and evaluation of training data from a Data-Centric AI perspective. The role involves addressing performance bottlenecks, building data generation pipelines, and establishing metrics for data quality assessment.
What you'll do
- Design and generate data to improve model performance
- Define data requirements and analyze model performance issues
- Build and maintain data generation pipelines
- Experimentally analyze the impact of generated data on model performance
- Establish data quality evaluation metrics
- Develop iterative strategies for data generation and filtering
What they're looking for
- 3+ years of experience with LLMs, machine learning, or data generation
- Understanding of deep learning and natural language processing
- Proficiency in Python for data processing and automation
- Experience managing large training datasets
- Knowledge of data evaluation frameworks
- Strong problem-solving abilities
- Collaboration and communication skills
- Familiarity with LLM APIs and generation strategies