Next job

Data Engineer / Data Platform Engineer in KOHANSTUDIO

Posted more than 30 days ago

3 views

KOHANSTUDIO

KOHANSTUDIO

0
0 reviews
Without experience
Kyiv
Full-time work
We are a team working on anti-corruption analytics and digitalization of government services in Ukraine, private developments and more — we are looking for an experienced Data Engineer who will become a key technical figure in the development of a data processing platform.We build a scalable data platform from open-source components, which allows you to create pipelines of any complexity, visualize them and integrate them with other systems.About the roleThis is an engineering role with an incre

We are a team working on anti-corruption analytics and digitalization of government services in Ukraine, private developments and more — we are looking for an experienced Data Engineer who will become a key technical figure in the development of a data processing platform.

We build a scalable data platform from open-source components, which allows you to create pipelines of any complexity, visualize them and integrate them with other systems.

About the role

This is an engineering role with an increased level of responsibility.

You don't just implement tasks, but influence architecture, approaches and technical solutions, help the team move faster and better.

Minimum of formal management, but:

  • participation in technical solutions,
  • architecture and code reviews

    expected

Main responsibilities

  • Design and development of a scalable and secure data-platform
  • Implementation and optimization of data-pipelines (ETL/ELT). style="font-weight: 700">cyber security and access control.
  • Evaluation of technical solutions in terms of complexity, deadlines and risks.
  • Technical support and mentoring of other developers.

Technology stack (mandatory)

  • Python - confident level, experience from 3 years.
  • SQL (PostgreSQL) - complex queries, optimization.
  • Redis.
  • Elasticsearch and Elasticsearch ELK stack.
  • REST API.
  • Git, Bash, CI/CD.
  • Docker, Kubernetes, Nginx.
  • Basic understanding ML / data pipelines.

Data Engineering & Warehousing

  • Building DWH:
    • star schema, fact & dimension tables
    • slowly changing dimensions (SCD)
  • Advanced SQL: CTE, window functions, procedures.
  • Working with Presto / Trino.
  • Understanding indexes, incl. geoindexes (H3).
  • Working with spatial data (GeoJSON, Point, Polygon).

The platform we work on

Main script:

  1. Ingestion of data (Dagster).
  2. Saving raw data in S3 compatible (bronze layer).
  3. Parsing into structured data(silver layer).
  4. Transformations for analytics (gold layer) 700">S3-compatible storage.
  5. Apache Iceberg — tabular format + time travel.
  6. Project Nessie — data catalog.
  7. Trino — SQL engine.
  8. Apache Superset - BI and dashboards.
  9. Keycloak / Authentik - authorization.
  10. Kubernetes (k3s) + Terraform + Ansible.

  11. Would be an advantage

    • Experience with Kafka, Spark / PySpark.
    • Iceberg / Delta tables in production.
    • Prometheus, Grafana, Jenkins.
    • Experience with KEP, Trembita.
    • Experience in projects for the public sector.
    • Experience of informal leadership or mentoring of a team (5+ people).

    Soft skills that are important to us

    • Ability to work with unclear requirements.
    • Independence and responsibility for result.
    • Ability to explain complex technical things simply.
    • Understanding when to complicate and when not.
    • Proactivity and healthy engineering skepticism.

    We offer

    • Work on socially significant projects.
    • Real influence on architecture and technical solutions.
    • Flexible work format.
    • The possibility of booking for key employees.
    • Adequate team without "corporate theater".
Without experience
Kyiv
Full-time work
Want to get related jobs?
New job openings in your Telegram
Subscribe
We use cookies
accept