Starred repositories

An Apache Iceberg REST Catalog explorer - view namespaces, tables, stats, metadata, schema evolution, and more.

TypeScript 2 Updated Nov 5, 2025

pg_lake: Postgres with Iceberg and data lake access

C 1,259 52 Updated Nov 25, 2025

Claude Code-generated Parquet metadata visualizer that runs in your browser

HTML 14 2 Updated Nov 11, 2025

Apache Parquet Testing

Python 75 69 Updated Aug 21, 2025

dbc is a command-line tool for installing and managing ADBC drivers

Go 62 6 Updated Nov 25, 2025

Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust

Rust 14,049 823 Updated Nov 25, 2025

IndexTables is an experimental open-table format for Apache Spark that enables fast retrieval and full-text search across large-scale data. It integrates seamlessly with Spark SQL, allowing you to …

Scala 22 2 Updated Nov 25, 2025

WIP (out of tree) Rust implementation of TPC-DS generators.

Rust 10 Updated Nov 15, 2025

[SIGMOD 2026] F3: The Open-Source Data File Format for the Future

Rust 283 15 Updated Nov 3, 2025

Chronon is a data platform for serving AI/ML applications.

Scala 942 86 Updated Nov 25, 2025

Apache Iceberg C++

C++ 158 69 Updated Nov 25, 2025

[VLDB 2023 Vol 17] "An Empirical Evaluation of Columnar Storage Formats"

67 9 Updated Oct 15, 2025

An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.

Rust 2,268 95 Updated Nov 25, 2025

Azure extension for DuckDB

C++ 67 29 Updated Nov 17, 2025

Build reliable AI and agentic applications with DataFrames

Python 404 27 Updated Nov 25, 2025

The Feldera Incremental Computation Engine

Rust 1,690 83 Updated Nov 25, 2025

Protocol and libraries for sending and receiving OpenTelemetry data using Apache Arrow

Rust 259 59 Updated Nov 25, 2025

DuckLake is an integrated data lake and catalog format

C++ 2,268 112 Updated Nov 25, 2025

Native Rust TPC-H support for DataFusion using tpchgen

Rust 3 3 Updated Jun 8, 2025
Python 165 6 Updated May 21, 2025

Icebird: JavaScript Iceberg Client

JavaScript 110 3 Updated Oct 6, 2025

Spark integrations for working with Lance datasets

Java 31 25 Updated Nov 19, 2025

Lance Namespace is an open specification on top of the storage-based Lance table and file format to standardize access to a collection of Lance tables

Java 37 21 Updated Nov 25, 2025

TPC-H benchmark data generation in pure Rust

Rust 210 47 Updated Nov 24, 2025

Olympia is a storage-only open catalog format for big data analytics, ML & AI.

Java 15 3 Updated May 5, 2025

Code used to create text embeddings of all Magic: The Gathering cards.

Jupyter Notebook 57 3 Updated Feb 24, 2025

DataFusion TableProviders for reading data from other systems

Rust 160 57 Updated Nov 25, 2025

Apache Iceberg

Rust 1,144 356 Updated Nov 25, 2025