Skip to main content

42dot

LLM Engineer (Data Generation)

Pangyo (Software Dream Center), South Korea (Remote)fulltimemidAdded today

About this role

The LLM Engineer (Data Generation) focuses on enhancing model performance through the design, generation, and evaluation of training data from a Data-Centric AI perspective. The role involves addressing performance bottlenecks, building data generation pipelines, and establishing metrics for data quality assessment.

What you'll do

  • Design and generate data to improve model performance
  • Define data requirements and analyze model performance issues
  • Build and maintain data generation pipelines
  • Experimentally analyze the impact of generated data on model performance
  • Establish data quality evaluation metrics
  • Develop iterative strategies for data generation and filtering

What they're looking for

  • 3+ years of experience in LLM, Machine Learning, or Data Generation
  • Understanding of deep learning and natural language processing
  • Proficiency in Python for data processing and automation
  • Experience with large training dataset management
  • Knowledge of data evaluation frameworks
  • Strong problem-solving abilities
  • Collaboration and communication skills
  • Familiarity with LLM APIs and generation strategies
Apply on the employer's site

Opens the official application on the employer’s site. No login required.