-
calcite Public
Forked from apache/calciteApache Calcite
-
incubator-seata-go Public
Forked from apache/incubator-seata-goGo Implementation For Seata
Go Apache License 2.0 UpdatedNov 17, 2025 -
amoro Public
Forked from apache/amoroApache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
Java Apache License 2.0 UpdatedNov 11, 2025 -
kvrocks Public
Forked from apache/kvrocksApache Kvrocks is a distributed key value NoSQL database that uses RocksDB as storage engine and is compatible with Redis protocol.
C++ Apache License 2.0 UpdatedOct 10, 2025 -
fluss Public
Forked from apache/flussFluss is a streaming storage built for real-time analytics.
Java Apache License 2.0 UpdatedOct 10, 2025 -
incubator-paimon Public
Forked from apache/paimonApache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
Java Apache License 2.0 UpdatedSep 17, 2025 -
auron Public
Forked from apache/auronBlazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
Rust Apache License 2.0 UpdatedSep 11, 2025 -
lance Public
Forked from lance-format/lanceModern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…
Rust Apache License 2.0 UpdatedSep 11, 2025 -
seatunnel Public
Forked from apache/seatunnelSeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.
Java Apache License 2.0 UpdatedAug 21, 2025 -
kvrocks-website Public
Forked from apache/kvrocks-websiteApache Kvrocks Website
TypeScript Apache License 2.0 UpdatedAug 19, 2025 -
duckdb Public
Forked from duckdb/duckdbDuckDB is an analytical in-process SQL database management system
C++ MIT License UpdatedAug 18, 2025 -
iceberg Public
Forked from apache/icebergApache Iceberg
Java Apache License 2.0 UpdatedAug 15, 2025 -
-
dubbo-go Public
Forked from apache/dubbo-goGo Implementation For Apache Dubbo .
Go Apache License 2.0 UpdatedJul 30, 2025 -
lancedb Public
Forked from lancedb/lancedbDeveloper-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.
Python Apache License 2.0 UpdatedJul 1, 2025 -
incubator-gluten Public
Forked from apache/incubator-glutenGluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Scala Apache License 2.0 UpdatedMay 15, 2025 -
presto Public
Forked from prestodb/prestoThe official home of the Presto distributed SQL query engine for big data
Java Apache License 2.0 UpdatedFeb 19, 2025 -
spark-1 Public
Forked from apache/sparkApache Spark - A unified analytics engine for large-scale data processing
Scala Apache License 2.0 UpdatedDec 29, 2024 -
delta Public
Forked from delta-io/deltaAn open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Scala Apache License 2.0 UpdatedDec 13, 2024 -
flink-cdc-connectors Public
Forked from apache/flink-cdcCDC Connectors for Apache Flink®
Java Apache License 2.0 UpdatedOct 23, 2024 -
-
parquet-format Public
Forked from apache/parquet-formatApache Parquet Format
Thrift Apache License 2.0 UpdatedAug 16, 2024 -
parquet-java Public
Forked from apache/parquet-javaApache Parquet Java
Java Apache License 2.0 UpdatedAug 16, 2024 -
celeborn Public
Forked from apache/celebornApache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Java Apache License 2.0 UpdatedJul 30, 2024 -
paimon-rust Public
Forked from apache/paimon-rustApache Paimon Rust The rust implementation of Apache Paimon.
Rust Apache License 2.0 UpdatedJul 8, 2024 -
dolphinscheduler Public
Forked from apache/dolphinschedulerApache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Java Apache License 2.0 UpdatedJul 2, 2024 -
hudi Public
Forked from apache/hudiUpserts, Deletes And Incremental Processing on Big Data.
-
starrocks Public
Forked from StarRocks/starrocksStarRocks is a next-gen sub-second MPP database for full analysis scenarios, including multi-dimensional analytics, real-time analytics and ad-hoc query.
Java Apache License 2.0 UpdatedMay 20, 2024 -
-
beam Public
Forked from apache/beamApache Beam is a unified programming model for Batch and Streaming data processing.
Java Apache License 2.0 UpdatedMar 1, 2024