We invite you to join our team Lead Data Scientist/Machine Learning EngineerResponsibilities:lead the ML direction and roadmap; ensure the transition from ideas to stable production and scaling; demonstrate measurable impact on P&L.Areas of responsibility: ML initiative priorities, architectural principles, metrics/validation standard; Operational outline: dataset/feature standards, SRM control, roll-out/rollback policies; Reliability/observability of services (SLO/SLI, alerting, cost-aware infe
We invite you to join our team Lead Data Scientist/Machine Learning Engineer
Responsibilities:lead the ML direction and roadmap; ensure the transition from ideas to stable production and scaling; demonstrate measurable impact on P&L.
Areas of responsibility:
- ML initiative priorities, architectural principles, metrics/validation standard;
- Operational outline: dataset/feature standards, SRM control, roll-out/rollback policies;
- Reliability/observability of services (SLO/SLI, alerting, cost-aware inference), including edge scenarios;
- Technical leadership: hiring/mentoring, review, research and knowledge culture;
- Data governance: PII, access, lineage, model maps/documentation;
- Experimental platform: events, stratification, incrementality;
- Pricing/promo models with elasticities, cannibalization, shelf/stock limits; mission personalization;
- Synchronization of ML goals with budget/plan, transparent impact reporting;
Expected results (OKR examples):
- NDCG@K > 2 v.p. in personalization (A/B, statistical significance);
- Reduction of wMAPE in daily TS-forecasts at SKU store level by 10-15%;
- Reduction of OOS/OSA losses by 5% and reduction of write-offs/losses (waste/shrink) by 5%;
- Maintenance growth due to a relevant promo mix within control corridors.
Requirements:
- 5+ years in ML/DS, 2+ years as a lead/tech lead;
- Proven experience in building and bringing to production on-prem ML services with store/region replication;
- Cases in CV/Recsys/TS with a proven business effect; production-Python/SQL; MLOps practices, testing, monitoring;
- Events (streaming) and batch, model monitoring (drifts/stability/degradations), design of experiments, business communications.
Will be a plus:
- Multimodal features/LLM-signals for cold-start; inventory-aware recsys; promotion optimization;
- Feature contracts/lineage/metadata management; inference cost optimization (ONNX/TensorRT/quantization);
- Edge inference in hall/SCO, anti-fraud.
Technical stack (on-prem):
Roles/models
- CV: Python, PyTorch, OpenCV, Albumentations, YOLOv8–v10 or Detectron2, TrOCR or Tesseract.
- Recsys: NVIDIA Merlin/Transformers4Rec, implicit (ALS), LightFM, TS reordering/Forecast: LightGBM, CatBoost, XGBoost, N-BEATS, N-HiTS, TFT
- Vectors: FAISS, pgvector, Milvus | Qdrant. Pandas/Polars for local processing.
NLP/LLM platform
- NLP core: Hugging Face Transformers, Datasets, Tokenizers, SentencePiece, spaCy|Stanza (uk), Sacremoses.
- LLM serving: vLLM|Hugging Face TGI; TensorRT-LLM|llama.cpp/gguf (by resource profile).
- RAG: OpenSearch (BM25) plus re-ranker, chunking and ingest, hybrid search with FAISS|pgvector or Milvus|Qdrant.
- Evaluation: ROUGE, BLEU, METEOR, BERTScore, MTEB, Recall@K, MRR, NDCG. Security/PII: Microsoft Presidio.
MLOps/Serving/experiments
- MLflow (Tracking/Registry/Serving)
- Serving: NVIDIA Triton | KServe | Seldon Core | Ray Serve
- Feature Store: Feast (self-host)
Data and Processing Platform
- Streaming: Kafka | Redpanda
- Compute: Spark| Flink
- Orchestration: Airflow | Dagster
- Transformations: dbt Core
- SQL/Storefronts: PostgreSQL, ClickHouse
Data Storage and Architecture
- Lakehouse: Apache Iceberg | Delta Lake
- Formats: Parquet, ORC
- Object storage: MinIO | CEPH
Observability/quality
- Service observability: Prometheus, Grafana, Loki
- ML observability: Evidently, whylogs
- Lineage/directory: OpenLineage, OpenMetadata or DataHub
Infrastructure and security
- Containerization/cluster: Docker, Kubernetes | OpenShift
- Security: policy-as-code, Vault Secret Management | Sealed Secrets
gig contract or in the state (reservation is possible);
paid annual leave of 24 calendar days, paid sick leave;regular payment of wages without delays and in the stipulated volumes, regular salary review;possibility professional and career development;training courses.
Contact person: Kateryna, tel.0984567857 (t.me/KaterynaB_HR)