Rethinking the existing data loading and processing process as an ETL example with pandas [ukr]

Talk presentation

ETL stands for extract, transform, load. It's a process that combines data from different sources into a single repository for further processing, analysis, and utilization.

This talk provides an example of how pandas can be used to solve ETL tasks as a stage in the evolution of the data intake component. This involves preliminary validation, filtering, and conversion of data according to a set of business rules and internal representation, with intermediate combination with other sources.

Yehor Nazarkin
Healthjoy Inc., Engineering Manager
  • He supports the experience gained as a developer and architect in working with engineering teams and product development.
  • Works with people, business, and technology to solve problems when they come together.
  • GitHub.
Sign in
Or by mail
Sign in
Or by mail
Register with email
Register with email
Forgot password?