Skip to content
View jiaoew1991's full-sized avatar
🤞
🤞

Block or report jiaoew1991

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

High-performance distributed multi-level cache system. Built by Rust.

Rust 431 59 Updated Nov 28, 2025

A cloud native embedded storage engine built on object storage.

Rust 2,493 161 Updated Nov 28, 2025

LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads.

Rust 1,079 68 Updated Nov 28, 2025

Spark integrations for working with Lance datasets

Java 32 25 Updated Nov 28, 2025

Integration between Lance and Ray for distributed data processing

Python 15 16 Updated Nov 20, 2025

Build reliable AI and agentic applications with DataFrames

Python 407 27 Updated Nov 25, 2025

The observability platform for Iceberg lakehouses.

TypeScript 392 22 Updated Nov 28, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,096 238 Updated Nov 28, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,489 967 Updated Oct 24, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,848 431 Updated Mar 5, 2025

Perforator is a cluster-wide continuous profiling tool designed for large data centers

C++ 3,357 147 Updated Nov 28, 2025

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 3,251 313 Updated Jul 7, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. AntRay is forked from ray, offering incremental new features on top …

Python 158 25 Updated Nov 28, 2025

Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.

Rust 1,059 102 Updated Nov 28, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,570 291 Updated Nov 28, 2025

Apache DataFusion Ray

Python 223 26 Updated Oct 5, 2025

A collection of RBIR projects and posts for anyone interested in joining this journey.

Rust 296 11 Updated Nov 28, 2025

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…

Rust 5,766 490 Updated Nov 28, 2025

Use your Neovim like using Cursor AI IDE!

Lua 16,555 750 Updated Nov 28, 2025

Eclipse Theia is a cloud & desktop IDE framework implemented in TypeScript.

TypeScript 21,184 2,741 Updated Nov 28, 2025

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java 2,360 672 Updated Nov 28, 2025

New file format for storage of large columnar datasets.

C++ 647 54 Updated Nov 27, 2025

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

Rust 4,864 353 Updated Nov 27, 2025

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,474 545 Updated Nov 28, 2025

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 16,200 3,917 Updated Nov 28, 2025

Apache OpenDAL: One Layer, All Storage.

Rust 4,620 664 Updated Nov 27, 2025

Alluxio, data orchestration for analytics and machine learning in the cloud

Java 7,114 2,958 Updated Apr 29, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,056 6,955 Updated Nov 28, 2025

Apache Doris is an easy-to-use, high performance and unified analytics database.

Java 14,653 3,619 Updated Nov 28, 2025
Next