The art of data engineering

Talk video

Talk presentation

As the data space has grown, data engineering has emerged as a separate but related role that works alongside data science. Typically, data scientists focus on finding new insights in a data set, while data engineers are concerned with the production readiness of that data.

In this talk, I’ll show you how to collect huge amounts of data, store it, process it in batches or in real time, and build a data pipeline with Airflow that processes billions of records per table.

We will also discuss what big data is and why it’s important to be able to process it so quickly.

Andrii Soldatenko
Dynatrace, Senior Software Engineer
  • Python developer by day, Go developer (gopher) under the hood. Big fan of full-text search and graph databases.
  • Speaker at KCD Austria 2023, FOSDEM 2020 and 2023 (Go and Rust dev rooms), GoDays 2020, PyCaribbean, PyCon Israel, PyCon Italia 2017 and 2022, EuroPython 2016 and 2022, PyCon Ukraine 2014, OdessaPy, and lots of local meetups.
  • Contributor to various Python and Go open source projects: pyhelm, aiohttp-swagger, mezzanine; chalice, requests, aiohttp tutorial; sendgrid-python and sendgrid-django; OpenAPI v3 specification; fixes to the Go docs.
  • Blogger, Twitter, LinkedIn, GitHub