Search Shortcut cmd + k | ctrl + k

DuckDB in Science

SQL Engines Excel at the Execution of Imperative Programs
Paper

SQL Engines Excel at the Execution of Imperative Programs

2025-09-01
Tim Fischer, Denis Hirn, Torsten Grust
Saving Private Hash Join
Paper

Saving Private Hash Join

2025-09-01
Laurens Kuiper, Paul Groß, Peter Boncz, Hannes Mühleisen
Parachute: Single-Pass Bi-Directional Information Passing
Paper

Parachute: Single-Pass Bi-Directional Information Passing

2025-09-01
Mihail Stoian, Andreas Zimmerer, Skander Krid, Amadou Latyr Ngom, Jialin Ding, Tim Kraska, Andreas Kipf
The FastLanes File Format
Paper

The FastLanes File Format

2025-09-01
Azim Afroozeh, Peter Boncz
Environmental Footprints of Query Processing: A Vision for Sustainable Database Architectures
Paper

Environmental Footprints of Query Processing: A Vision for Sustainable Database Architectures

2025-09-01
Michail Bachras, Hans-Arno Jacobsen
Democratize MATCH_RECOGNIZE!
Paper

Democratize MATCH_RECOGNIZE!

2025-09-01
Louisa Lambrecht, Tim Findling, Samuel Heid, Marcel Knüdeler, Torsten Grust
CUBIT: Concurrent Updatable Bitmap Indexing
Paper

CUBIT: Concurrent Updatable Bitmap Indexing

2025-09-01
Junchang Wang, Manos Athanassoulis
Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB
Paper

Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB

2025-09-01
Anas Dorbani, Sunny Yasser, Jimmy Lin, Amine Mhedhbi
Anarchy in the Database: A Survey and Evaluation of Database Management System Extensibility
Paper

Anarchy in the Database: A Survey and Evaluation of Database Management System Extensibility

2025-09-01
Abigale Kim, Marco Slot, David Andersen, Andrew Pavlo
BIRNE: Mixed-paradigm Workload Execution in SQL Engines
Paper

BIRNE: Mixed-paradigm Workload Execution in SQL Engines

2025-06-27
Tim Fischer, Denis Hirn
Yannakakis+: Practical Acyclic Query Evaluation with Theoretical Guarantees
Paper

Yannakakis+: Practical Acyclic Query Evaluation with Theoretical Guarantees

2025-06-22
Qichen Wang, Bingnan Chen, Binyang Dai, Ke Yi, Feifei Li, Liang Lin
Debunking the Myth of Join Ordering: Toward Robust SQL Analytics
Paper

Debunking the Myth of Join Ordering: Toward Robust SQL Analytics

2025-06-22
Junyi Zhao, Kai Su, Yifei Yang, Xiangyao Yu, Paraschos Koutris, Huanchen Zhang
Towards a Converged Relational-Graph Optimization Framework
Paper

Towards a Converged Relational-Graph Optimization Framework

2025-06-22
Yunkai Lou, Longbin Lai, Bingqing Lyu, Yufan Yang, Xiaoli Zhou, Wenyuan Yu, Ying Zhang, Jingren Zhou
Data Chunk Compaction in Vectorized Execution
Paper

Data Chunk Compaction in Vectorized Execution

2025-06-22
Yiming Qiao, Huanchen Zhang
How DuckDB is USING KEY to Unlock Recursive Query Performance
Paper

How DuckDB is USING KEY to Unlock Recursive Query Performance

2025-06-22
Björn Bamberg, Denis Hirn, Torsten Grust
DuckDB in Research S01E05: Haralampos Gavriilidis
Podcast 40 min

DuckDB in Research S01E05: Haralampos Gavriilidis

2025-04-17
Disseminate Podcast Series by Jack Waudby
DuckDB in Research S01E04: Arjen P. de Vries
Podcast 45 min

DuckDB in Research S01E04: Arjen P. de Vries

2025-04-10
Disseminate Podcast Series by Jack Waudby
DuckDB in Research S01E03: David Justen
Podcast 50 min

DuckDB in Research S01E03: David Justen

2025-04-03
Disseminate Podcast Series by Jack Waudby
DuckDB in Research S01E02: Daniël ten Wolde
Podcast 50 min

DuckDB in Research S01E02: Daniël ten Wolde

2025-03-21
Disseminate Podcast Series by Jack Waudby
DuckDB in Research S01E01: Till Döhmen
Podcast 60 min

DuckDB in Research S01E01: Till Döhmen

2025-03-13
Disseminate Podcast Series by Jack Waudby
Runtime-Extensible Parsers
Paper

Runtime-Extensible Parsers

2025-01-19
Hannes Mühleisen, Mark Raasveldt"
Adaptive Factorization Using Linear-Chained Hash Tables
Paper

Adaptive Factorization Using Linear-Chained Hash Tables

2025-01-19
Paul Groß, Daniel ten Wolde, Peter Boncz
POLAR: Adaptive and Non-invasive Join Order Selection via Plans of Least Resistance
Paper

POLAR: Adaptive and Non-invasive Join Order Selection via Plans of Least Resistance

2024-08-26
David Justen, Daniel Ritter, Campbell Fraser, Andrew Lamb, Allison Lee, Thomas Bodner, Mhd Yamen Haddad, Steffen Zeuch, Volker Markl, Matthias Boehm
ALP: Adaptive Lossless Floating-Point Compression
Paper

ALP: Adaptive Lossless Floating-Point Compression

2024-06-09
Azim Afroozeh, Leonardo X. Kuffo, Peter A. Boncz
Robust External Hash Aggregation in the Solid State Age
Paper

Robust External Hash Aggregation in the Solid State Age

2024-04-13
Laurens Kuiper, Peter A. Boncz, Hannes Mühleisen
How to Make your Duck Fly: Advanced Floating Point Compression to the Rescue
Paper

How to Make your Duck Fly: Advanced Floating Point Compression to the Rescue

2024-03-25
Panagiotis Liakos, Katia Papakonstantinopoulou, Thijs Bruineman, Mark Raasveldt, Yannis Kotidis
MotherDuck: DuckDB in the Cloud and in the Client
Paper

MotherDuck: DuckDB in the Cloud and in the Client

2024-01-17
RJ Atwal, Peter Boncz, Ryan Boyd, Antony Courtney, Till Döhmen, Florian Gerlinghoff, Jeff Huang, Joseph Hwang, Raphael Hyde, Elena Felder, Jacob Lacouture, Yves LeMaout, Boaz Leskes, Yao Liu, Alex Monahan, Dan Perkins, Tino Tereshko, Jordan Tigani, Nick Ursa, Stephanie Wang, Yannick Welsch
Dear User-Defined Functions, Inlining isn't working out so great for us. Let's try batching to make our relationship work. Sincerely, SQL
Paper

Dear User-Defined Functions, Inlining isn't working out so great for us. Let's try batching to make our relationship work. Sincerely, SQL

2024-01-14
Kai Franz, Sam Arch, Denis Hirn, Torsten Grust, Todd C. Mowry, Andy Pavlo
DuckPGQ: Bringing SQL/PGQ to DuckDB
Paper

DuckPGQ: Bringing SQL/PGQ to DuckDB

2023-08-28
Daniel ten Wolde, Gábor Szárnyas, Peter Boncz
These Rows Are Made for Sorting and That's Just What We'll Do
Paper

These Rows Are Made for Sorting and That's Just What We'll Do

2023-04-03
Laurens Kuiper, Hannes Muhleisen ¨
DuckPGQ: Efficient Property Graph Queries in an analytical RDBMS
Paper

DuckPGQ: Efficient Property Graph Queries in an analytical RDBMS

2023-01-08
Daniel ten Wolde, Tavneet Singh, Gabor Szarnyas, Peter Boncz
DuckDB-Wasm: Fast Analytical Processing for the Web
Paper

DuckDB-Wasm: Fast Analytical Processing for the Web

2022-09-05
André Kohn, Dominik Moritz, Mark Raasveldt, Hannes Mühleisen, Thomas Neumann
Data Management for Data Science Towards Embedded Analytics
Paper

Data Management for Data Science Towards Embedded Analytics

2020-01-12
Hannes Mühleisen, Mark Raasveldt
DuckDB: An Embeddable Analytical Database
Paper

DuckDB: An Embeddable Analytical Database

2019-06-30
Hannes Mühleisen, Mark Raasveldt

Submission Guidelines

Submissions are welcome in the form of pull requests in the duckdb-web repository.
You are welcome to submit both your own work and also the work of others that fits the guidelines. When submitting research papers, please follow these guidelines:

  • The paper should be about an algorithm, data structure, etc. used in DuckDB, built on top of DuckDB. Papers that study DuckDB in depth also qualify. However, papers where the use of DuckDB is limited to e.g., being the baseline for comparison or as part of their data processing toolchain, are out of scope.
  • The filename should start with the conference's first day (e.g., 2025-09-01) or the journal issue's publication date. If these are not easily obtainable, just use an estimated date.
  • Include a link to an open-access version of the paper.
  • Include the abstract. Make sure it doesn't have extra newlines, hyphens, or special characters.
  • If applicable, please add an “Implementation” section to describe the availability of the implementation (core DuckDB, community extension, fork, etc.)
  • If available, please link the presentation slide deck.