BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//pretalx//programme.europython.eu//europython-2026//talk//H7KGU3
BEGIN:VTIMEZONE
TZID:CET
BEGIN:STANDARD
DTSTART:20001029T040000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=10
TZNAME:CET
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
END:STANDARD
BEGIN:DAYLIGHT
DTSTART:20000326T030000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=3
TZNAME:CEST
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
END:DAYLIGHT
END:VTIMEZONE
BEGIN:VEVENT
UID:pretalx-europython-2026-H7KGU3@programme.europython.eu
DTSTART;TZID=CET:20260715T152500
DTEND;TZID=CET:20260715T155500
DESCRIPTION:*Pitch*\n\nWith DuckDB and DuckLake\, managing and analyzing hu
 ge data sets is no longer limited to complex cloud infrastructure setups. 
 You can literally run these tasks on your notebook now and at comparable s
 peeds. This talk will show you how.\n\n*Description*\n\n**DuckDB** is an e
 mbedded relational analytics database (OLAP) which can be added to a Pytho
 n project with a simple `uv add duckdb` or `pip install duckdb`. It is bot
 h fast and powerful for processing analytical data warehouse workloads\, u
 sing the well-known PostgreSQL SQL dialect. Data can be stored in memory a
 nd persisted on disk. DuckDB is well integrated with Polars via zero copy 
 Apache Arrow data structures\, making it a great choice for complex data s
 cience and engineering tasks.\n\n**DuckLake** is a extension which comes w
 ith DuckDB to add data lake features\, meaning that huge data sets can be 
 managed using Parquet files stored on disk or in an object store such as S
 3. It uses a novel approach to data lakes in that the management structure
 s are stored in a database (DuckDB)\, instead of complex file and director
 y structures\, as many other data lake systems do. This provides great adv
 antages for implementing smart features such as snapshots\, schema evoluti
 on or time travel.\n\nAgain\, installation of the extension is just a simp
 le `INSTALL ducklake` command away\, making this a really easy way to conf
 igure your own personal "lake house" - the ideal combination of a data war
 ehouse with a data lake.\n\nThe talk will give a short introduction to the
  database terminology\, explain what is novel about the DuckLake approach 
 and then showcase a typical use case for lake houses: storing historical w
 eather data and making this available for analytics to Python applications
 .\n\nBoth DuckDB and DuckLake are MIT licensed.\n\n*Resources:*\n- [Python
 .org](https://www.python.org/)\n- [DuckDB – An in-process SQL OLAP datab
 ase management system](https://duckdb.org/)\n- [DuckLake is an integrated 
 data lake and catalog format – DuckLake](https://ducklake.select/)
DTSTAMP:20260524T130809Z
LOCATION:Conference Hall Complex (S4)
SUMMARY:DuckLake - Take Python and DuckDB for a swim in your data lake - Ma
 rc-André Lemburg
URL:https://programme.europython.eu/europython-2026/talk/H7KGU3/
END:VEVENT
END:VCALENDAR
