Stars
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Duβ¦
Extremely fast Query Engine for DataFrames, written in Rust
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing
Lightweight, efficient, binary serialization and deserialization codec
Apache Kvrocks is a distributed key value NoSQL database that uses RocksDB as storage engine and is compatible with Redis protocol.
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
A high-throughput and memory-efficient inference and serving engine for LLMs
A package containing all IBC Token Data, across the whole Cosmos Ecosystem.
"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
Opensource,Database,AI,Business,Minds. git clone --depth 1 https://github.com/digoal/blog
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events
This is proof of solvency tool for Centralized exchanges built by Binance. Please raise bugs and security issues to https://bugcrowd.com/binance
a high-performance, POSIX-ish Amazon S3 file system written in Go
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data.
A modern and intuitive command line client for Kafka Connect
Maven plugin for running and creating Docker images
A process for collecting metrics using JMX MBeans for Prometheus consumption
πππA faster, better and more stable Redis desktop manager [GUI client], compatible with Linux, Windows, Mac.
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance β¦