-
amoro Public
Forked from apache/amoro
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
Java Apache License 2.0 Updated Oct 16, 2025 -
incubator-uniffle Public
Forked from apache/uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
Java Apache License 2.0 Updated Sep 25, 2025 -
kyuubi Public
Forked from apache/kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Scala Apache License 2.0 Updated May 29, 2025 -
presto Public
Forked from prestodb/presto
The official home of the Presto distributed SQL query engine for big data
Java Apache License 2.0 Updated Feb 13, 2025 -
spark Public
Forked from apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
Scala Apache License 2.0 Updated Dec 9, 2024 -
iceberg Public
Forked from apache/iceberg
Apache Iceberg
Java Apache License 2.0 Updated Oct 28, 2024 -
zstd Public
Forked from facebook/zstd
Zstandard - Fast real-time compression algorithm
C Other Updated Oct 12, 2024 -
incubator-gluten Public
Forked from apache/incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Scala Apache License 2.0 Updated Jun 19, 2024 -
incubator-celeborn Public
Forked from apache/celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Java Apache License 2.0 Updated Jun 14, 2024 -
ray Public
Forked from ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Python Apache License 2.0 Updated Apr 20, 2024 -
IcebergMetadataRewrite Public
Forked from ksmatharoo/IcebergMetadataRewrite
Project to rewrite Apache Iceberg metadata when tables are moved from one location to another
Java Apache License 2.0 Updated Apr 4, 2024 -
kuberay Public
Forked from ray-project/kuberay
A toolkit to run Ray applications on Kubernetes
Go Apache License 2.0 Updated Mar 3, 2024 -
arrow Public
Forked from apache/arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
C++ Apache License 2.0 Updated Mar 1, 2024 -
volcano Public
Forked from volcano-sh/volcano
A Cloud Native Batch System (Project under CNCF)
Go Apache License 2.0 Updated Nov 21, 2023 -
parquet-mr Public
Forked from apache/parquet-java
Apache Parquet
Java Apache License 2.0 Updated Jun 18, 2022 -
trino Public
Forked from trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Java Apache License 2.0 Updated Dec 4, 2021 -
Kubernetes_Doc Public
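Forked from Jack-lizhiXin/Kubernetes_Doc
A guide to deploying a Kubernetes development environment from source code. Because I am still exploring this myself, the document will be updated as I go.
Updated Oct 8, 2019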