ML InfrastructureSkolkovo / remoteFull-time

ML Systems / Infrastructure Engineer (AI / LLM)

About the role

YappiX is hiring an ML Systems / Infrastructure Engineer to build and operate infrastructure for AI systems, LLMs, new AI architectures, and high-performance training and inference workflows.

We need someone who understands not only code, but also GPU behavior, memory limits, latency, batching, reproducibility, and distributed systems.

This role is for an engineer who can turn research ideas into reliable engineering systems.

Responsibilities

build infrastructure for model training, inference, and evaluation
work on GPU performance, memory efficiency, latency, and throughput
create reproducible environments for research and production
maintain containers, CI/CD, deployment workflows, and internal pipelines
support distributed training, distributed inference, and systems-level optimization
help the research team move fast without creating infrastructure chaos
improve observability, reliability, and engineering reproducibility

Requirements

strong Python
solid Linux
Docker, Git, CI/CD
understanding of GPU memory, inference optimization, and distributed systems
experience with ML infrastructure, AI/LLM pipelines, or systems engineering
strong engineering discipline and attention to detail
ability to find bottlenecks independently and solve them

Nice to have

CUDA
Triton
DeepSpeed
Ray
vLLM
Kubernetes
Prometheus / Grafana
experience with GPU clusters and orchestration

You may not be a fit if

you only know standard backend deployment patterns
you do not understand the difference between a research prototype and a production system
you wait for perfect specs instead of solving the real problem

What we offer

a chance to build the AI-first infrastructure layer of a serious technical team
work at the intersection of research, infrastructure, and new AI systems
meaningful ownership and technical influence
a compact team and fast iteration cycles
remote / Skolkovo / remote

How to apply

Send your CV, GitHub, and a short note about infrastructure problems you solved to hr@yappix.ru or via https://yappix.ru/en/contact