Experience : Medior with 3+ years of relevant experience
Responsibilities: Pipeline Development: Design, build, and optimize data pipelines for model training and inference systems. Collaborate with data scientists and machine learning engineers to ensure efficient data preparation and feature engineering. Scaled System Development: Architect and implement scalable systems for model inference, data retrieval, and acquisition to support high-performance AI applications.. Optimize performa
Experience : Medior with 3+ years of relevant experience
Responsibilities:
Pipeline Development:
- Design, build, and optimize data pipelines for model training and inference systems.
- Collaborate with data scientists and machine learning engineers to ensure efficient data preparation and feature engineering.
Scaled System Development:
- Architect and implement scalable systems for model inference, data retrieval, and acquisition to support high-performance AI applications..
- Optimize performance and reliability to handle large-scale data processing.
Observability:
- Develop robust logging and monitoring solutions for AI systems.
- Ensure traceability, debugging, and performance monitoring across the AI platform.
Data Management:
- Work seamlessly with both structured and unstructured data sources to support diverse AI initiatives.
- Ensure all data engineering practices comply with company policies and industry regulations for data security and privacy.
Innovation :
- Stay abreast of the latest advancements in data engineering and AI technologies to continuously improve our systems and processes.
Requirements:
- Bachelor’s or Master’s degree in Computer Science, Data Science, or a related field.
- Advanced Python knowledge for data processing and scripting.
- Hands-on experience with one or more cloud computing platforms (Azure, AWS, GCP).
- Hands-on experience with big data technologies and distributed computing frameworks.
- Proficiency in RDBMS/NoSQL data stores and appropriate use cases.
- Experience with Data as Code ; version control, small and regular commits, unit tests, CI/CD, packaging, familiarity with containerization tools such as Docker and Kubernetes is a plus.
- Understanding of AI/ML principles and practices, including model training, inference, and deployment.
- Experience with Infrastructure as Code is a plus.
- Strong problem-solving skills and attention to detail.
- Good communication skills, fluent English.
Expected start date: September 1st, 2024
Remote vs Onsite: Remote, with possible occasional in person team sessions / workshops / gatherings (i.e. 1x quarter) likely to take place in Prague
HackerRank challenge: Yes
Working overlap needed: 9-6/10-7 CET possibility of a wider overlap (flexibility) appreciated
Salary : 100-150/h B2B
- Bachelor’s or Master’s degree in Computer Science, Data Science, or a related field.
- Advanced Python knowledge for data processing and scripting.
- Hands-on experience with one or more cloud computing platforms (Azure, AWS, GCP).
- Hands-on experience with big data technologies and distributed computing frameworks.
- Proficiency in RDBMS/NoSQL data stores and appropriate use cases.
- Experience with Data as Code ; version control, small and regular commits, unit tests, CI/CD, packaging, familiarity with containerization tools such as Docker and Kubernetes is a plus.
- Understanding of AI/ML principles and practices, including model training, inference, and deployment.
- Experience with Infrastructure as Code is a plus.
- Strong problem-solving skills and attention to detail.
- Good communication skills, fluent English.
Experience : Medior with 3+ years of relevant experience
Responsibilities:
Pipeline Development:
- Design, build, and optimize data pipelines for model training and inference systems.
- Collaborate with data scientists and machine learning engineers to ensure efficient data preparation and feature engineering.
Scaled System Development:
- Architect and implement scalable systems for model inference, data retrieval, and acquisition to support high-performance AI applications..
- Optimize performance and reliability to handle large-scale data processing.
Observability:
- Develop robust logging and monitoring solutions for AI systems.
- Ensure traceability, debugging, and performance monitoring across the AI platform.
Data Management:
- Work seamlessly with both structured and unstructured data sources to support diverse AI initiatives.
- Ensure all data engineering practices comply with company policies and industry regulations for data security and privacy.
Innovation :
- Stay abreast of the latest advancements in data engineering and AI technologies to continuously improve our systems and processes.
Requirements:
- Bachelor’s or Master’s degree in Computer Science, Data Science, or a related field.
- Advanced Python knowledge for data processing and scripting.
- Hands-on experience with one or more cloud computing platforms (Azure, AWS, GCP).
- Hands-on experience with big data technologies and distributed computing frameworks.
- Proficiency in RDBMS/NoSQL data stores and appropriate use cases.
- Experience with Data as Code ; version control, small and regular commits, unit tests, CI/CD, packaging, familiarity with containerization tools such as Docker and Kubernetes is a plus.
- Understanding of AI/ML principles and practices, including model training, inference, and deployment.
- Experience with Infrastructure as Code is a plus.
- Strong problem-solving skills and attention to detail.
- Good communication skills, fluent English.
Expected start date: September 1st, 2024
Remote vs Onsite: Remote, with possible occasional in person team sessions / workshops / gatherings (i.e. 1x quarter) likely to take place in Prague
HackerRank challenge: Yes
Working overlap needed: 9-6/10-7 CET possibilityof a wider overlap (flexibility) appreciated
Salary : 100-150/h B2B
,[Pipeline Development: Design, build, and optimize data pipelines for model training and inference systems. Collaborate with data scientists and machine learning engineers to ensure efficient data preparation and feature engineering. , Scaled System Development: Architect and implement scalable systems for model inference, data retrieval, and acquisition to support high-performance AI applications. Optimize performance and reliability to handle large-scale data processing. , Observability: Develop robust logging and monitoring solutions for AI systems. Ensure traceability, debugging, and performance monitoring across the AI platform. , Data Management: Work seamlessly with both structured and unstructured data sources to support diverse AI initiatives. Ensure all data engineering practices comply with company policies and industry regulations for data security and privacy. , Innovation: Stay abreast of the latest advancements in data engineering and AI technologies to continuously improve our systems and processes.] Вимоги: Python, Degree, Cloud computing, Azure, AWS, GCP, Big Data, Distributed computing, RDBMS, NoSQL, Use cases, Docker, Kubernetes, AI, Infrastructure as Code, Communication skills Бонуси та переваги: Sport subscription, Small teams, International projects, Free coffee, Shower, Modern office.