Main duties:Development, testing and optimization of ETL processes for new and existing custom projects.Interaction with analysts and business customers to accurately gather technical requirements. li>Monitoring data processing processes, identifying and eliminating problems with data speed and accuracy.Development and implementation of standards and best practices to ensure data quality and reliability.< p style="font-style: normal; font-weight: 400">Requirements for the candidate:In-depth know
Main duties:
- Development, testing and optimization of ETL processes for new and existing custom projects.
- Interaction with analysts and business customers to accurately gather technical requirements.
li>- Monitoring data processing processes, identifying and eliminating problems with data speed and accuracy.
- Development and implementation of standards and best practices to ensure data quality and reliability.
< p style="font-style: normal; font-weight: 400">
Requirements for the candidate:- In-depth knowledge of statistics and mathematical analysis.
- Minimum 3 years of experience in the development and optimization of ETL processes.
- In-depth knowledge of SQL and Python, experience with relational and non-relational databases.
- Experience with Python libraries for data manipulation such as Pandas, NumPy, SciPy and data visualization using Looker Studio.
- Knowledge of frameworks for working with big data in Python such as PySpark or Dask.
- Experience with ETL tools such as Informatica, Talend, DataStage, or SSIS.
- Ability to communicate effectively both with both technical and non-technical aspects.
It will be useful:
- Experience with data warehouses and big data processing technologies (e.g. Hadoop, Spark, Kafka).
- Setting up and maintaining version control infrastructure (e.g. Git).
- Configuring and supporting containerization (Docker).