Rethinking the existing data loading and processing process as an ETL example with pandas [ukr]
Talk presentation
ETL stands for extract, transform, load. It's a process that combines data from different sources into a single repository for further processing, analysis, and utilization.
This talk provides an example of how pandas can be used to solve ETL tasks as a stage in the evolution of the data intake component. This involves preliminary validation, filtering, and conversion of data according to a set of business rules and internal representation, with intermediate combination with other sources.
Yehor Nazarkin
Healthjoy Inc., Engineering Manager
- He supports the experience gained as a developer and architect in working with engineering teams and product development.
- Works with people, business, and technology to solve problems when they come together.
- GitHub.