AI DataSkolkovo / remoteFull-time

AI Data & Evaluation Engineer

About the role

YappiX is hiring an AI Data & Evaluation Engineer to work on datasets, labeling, benchmark design, LLM evaluation, and quality control for AI-first products and new AI architectures.

We need someone who understands that models do not improve because of slogans — they improve because of better data, honest tests, and rigorous evaluation.

Responsibilities

collect, clean, normalize, and structure datasets
build evaluation pipelines and benchmark suites
design private test sets, adversarial tests, and quality metrics
analyze model failures and identify architectural weak points
work with synthetic data, filtering, deduplication, and quality control
support data workflows for research and product experiments
collaborate with research and engineering teams on model quality and measurable outcomes

Requirements

Python
experience with data workflows, ML datasets, and data pipelines
understanding of LLM evaluation, quality metrics, and benchmarking
attention to detail and strong data discipline
ability to detect system-level errors rather than only local issues
ability to propose metrics and validation schemes independently
understanding of reproducibility and data quality

Nice to have

experience with NLP, LLMs, prompt evaluation, or red teaming
experience with synthetic data generation and dataset curation
experience in labeling, QA, and research analytics
experience with SQL, DuckDB, Pandas, Arrow, or Hugging Face Datasets

You may not be a fit if

you treat data work as a secondary support task
you cannot design honest and reproducible tests
you do not distinguish between “the model sounds good” and “the model is correct.”

What we offer

work on AI-first systems and new AI architectures
a strong role in model quality and measurable results
the opportunity to build benchmark and evaluation systems from scratch
a compact team and fast experimental cycles
remote / Skolkovo / remote

How to apply

Send your CV, examples of data or evaluation work, and a short note on how you designed test sets or quality metrics to hr@yappix.ru or via https://yappix.ru/en/contact