Data Engineer in ML

Remote: Full Remote

Offer summary

Qualifications:

  • Strong skills in Python (PySpark, pandas, Ray Data) and SQL.
  • Experience in data warehousing (Snowflake, BigQuery, Redshift) and ETL tools (Spark, DBT, Google Dataflow, AWS Glue).
  • Familiarity with pipeline management (Apache Airflow, Dagster) and Infrastructure as Code (Terraform, AWS CDK).
  • Fluent in Polish and English, with proven experience in Machine Learning projects.

Key responsibilities:

  • Deliver comprehensive and verified answers to clients’ questions and document findings clearly.
  • Build a reliable data processing infrastructure and follow good engineering practices.
  • Communicate with clients to gather requirements and solve business problems as they arise.
  • Empower developers and business teams to effectively use data in their workflows.

Tooploox SME http://tooploox.com/
51 - 200 Employees

Job description

Hi there!

We are Tooploox, an AI software development company offering custom AI solutions and services. We help innovative companies and startups design and build digital products with generative AI, mobile, and web technologies.

Our team of nearly 200 experts, including an R&D team of over 40 engineers, many with PhDs, has pioneered AI solutions across industries like healthcare, fashion, and e-commerce. We’ve published over 15 research papers at top conferences such as NeurIPS and ICML.

We're on the lookout for a Data Engineer in ML to take on a pivotal role in our team. You'll have a big impact on a new product that builds on data gathered in 46 countries and you'll scale data operations from thousands to millions of users.
If you want to create insights about all aspects of the product (financial, behavioral, and domain-specific), this role is tailor-made for you.

You're invited to apply!

What you will do:
  • Deliver comprehensive and thoroughly verified answers to clients’ questions, document your thought process, and share findings clearly.
  • Focus on building a reliable data processing infrastructure.
  • Follow good engineering practices such as testing, documentation, infrastructure as code, and automation.
  • Solve business problems as they come and communicate with the client to gather requirements and explain data.
  • Empower developers and business teams to use data in their workflows.
Experience and skills you need to join us:
  • Strong Python (PySpark, pandas, Ray Data) and SQL skills.
  • Experience in data warehousing (Snowflake, BigQuery, Redshift).
  • Experience in ETL tools (Spark, DBT, Google Dataflow, AWS Glue).
  • Experience with pipeline management (Apache Airflow, Dagster).
  • Proven experience working on projects utilizing Machine Learning, including data preprocessing, model development, evaluation, and deployment to solve real-world problems.
  • Familiarity with IaC (Terraform, AWS CDK).
  • Familiarity with Docker.
  • Fluency in Polish and English (you will attend meetings with English-speaking clients).
It would be great if you also have:
  • Familiarity with CI/CD tools (Jenkins, Tekton, GitHub Actions).
  • Familiarity with LLM/LMM architectures, training processes, and data requirements.
  • Ability to design and deploy data-processing cloud infrastructure.
How we work:

At Tooploox, you have the flexibility to choose your working hours and location. While we value remote work, we also believe in building relationships and invite you to join us in our Warsaw and Wrocław offices. Enjoy a relaxed atmosphere and try some “home-made” pizza from our office pizza oven. We love having pets in the office, so feel free to bring yours along.

Join us and shape the future of AI while working the way you like!

Required profile

Spoken language(s):
Polish, English

Other Skills

  • Communication
