dan1elt0m

Daniël Tom dan1elt0m

9 followers · 8 following

Xebia Data
Amsterdam
https://www.linkedin.com/in/daniel-tom-data-engineer/

Sponsoring

Achievements

Stars

NVIDIA / spark-rapids-examples

A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.

Jupyter Notebook 163 61 Updated Sep 23, 2025

dan1elt0m / binarycookies

Binary Cookies CLI and Python library

Python 21 3 Updated Oct 6, 2025

duckdb / duckdb-delta

DuckDB extension for Delta Lake

C++ 203 26 Updated Oct 8, 2025

NVIDIA / spark-rapids

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Scala 935 259 Updated Oct 12, 2025

apache / datafusion

Apache DataFusion SQL Query Engine

Rust 7,863 1,678 Updated Oct 11, 2025

marimo-team / marimo

A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. All in a modern, AI-native editor.

Python 16,376 718 Updated Oct 12, 2025

mwouts / itables

Pandas DataFrames as Interactive DataTables

Python 915 61 Updated Sep 28, 2025

delta-io / delta-rs

A native Rust library for Delta Lake, with bindings into Python

Rust 2,983 530 Updated Oct 12, 2025

geekwhocodes / pyspark-custom-datasource-template

The PySpark Custom Data Source Template makes it easy to build and test custom data sources for Apache PySpark. It simplifies environment setup, debugging, and test data management while providing …

Python 2 Updated Feb 21, 2025

allisonwang-db / pyspark-data-sources

Custom PySpark Data Sources

Python 66 17 Updated Oct 3, 2025

apache / spark

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,072 28,881 Updated Oct 12, 2025

kaiko-ai / typedspark

Column-wise type annotations for pyspark DataFrames

Python 87 13 Updated Oct 12, 2025

duckdb / dbt-duckdb

dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)

Python 1,155 122 Updated Oct 8, 2025

delta-io / delta-kernel-rs

A native Delta implementation for integration with any query engine

Rust 271 113 Updated Oct 10, 2025

traefik / traefik

The Cloud Native Application Proxy

Go 57,097 5,515 Updated Oct 10, 2025

hugovk / pypistats

Command-line interface to PyPI Stats API to get download stats for Python packages

Python 223 32 Updated Oct 8, 2025

google / pseudo-identity-provider

Go 9 5 Updated Jun 6, 2025

godatadriven / ducklake-blog-1

Example files used in the DuckDB - Unity Catalog blog

Jupyter Notebook 10 1 Updated Dec 6, 2024

godatadriven-dockerhub / unity-catalog

Dockerfile for Unity Catalog image

Dockerfile 10 Updated Dec 23, 2024

awesome-selfhosted / awesome-selfhosted

A list of Free Software network services and web applications which can be hosted on your own servers

252,265 11,696 Updated Oct 11, 2025

testcontainers / testcontainers-python

Testcontainers is a Python library that providing a friendly API to run Docker container. It is designed to create runtime environment to use during your automatic tests.

Python 1,986 339 Updated Oct 7, 2025

JetBrains / ideavim

IdeaVim – A Vim engine for JetBrains IDEs

Kotlin 9,979 799 Updated Oct 12, 2025

jupyterlab / jupyterlab

JupyterLab computational environment.

TypeScript 14,827 3,752 Updated Oct 10, 2025

duckdb / pg_duckdb

DuckDB-powered Postgres for high performance apps & analytics.

C++ 2,610 133 Updated Oct 3, 2025

exelban / stats

macOS system monitor in your menu bar

Swift 34,190 1,085 Updated Oct 12, 2025

ramonvermeulen / dbt-toolkit

The dbt-toolkit is an early-stage plugin designed to enhance your experience working with dbt-core projects in JetBrains IDEs.

Kotlin 32 Updated Oct 2, 2025

apache / polaris

Apache Polaris, the interoperable, open source catalog for Apache Iceberg

Java 1,690 314 Updated Oct 12, 2025

simw / pydantic-to-pyarrow

A library to convert a pydantic model to a pyarrow schema

Python 44 6 Updated May 10, 2025

unitycatalog / unitycatalog-python

Python 18 6 Updated Jul 8, 2024

EvalBench / cdc

Evaluation Matrix for Change Data Capture

HTML 26 1 Updated Aug 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Daniël Tom dan1elt0m

Sponsoring

Achievements

Achievements

Block or report dan1elt0m

Stars

NVIDIA / spark-rapids-examples

dan1elt0m / binarycookies

duckdb / duckdb-delta

NVIDIA / spark-rapids

apache / datafusion

marimo-team / marimo

mwouts / itables

delta-io / delta-rs

geekwhocodes / pyspark-custom-datasource-template

allisonwang-db / pyspark-data-sources

apache / spark

kaiko-ai / typedspark

duckdb / dbt-duckdb

delta-io / delta-kernel-rs

traefik / traefik

hugovk / pypistats

google / pseudo-identity-provider

godatadriven / ducklake-blog-1

godatadriven-dockerhub / unity-catalog

awesome-selfhosted / awesome-selfhosted

testcontainers / testcontainers-python

JetBrains / ideavim

jupyterlab / jupyterlab

duckdb / pg_duckdb

exelban / stats

ramonvermeulen / dbt-toolkit

apache / polaris

simw / pydantic-to-pyarrow

unitycatalog / unitycatalog-python

EvalBench / cdc