Change the repository type filter
All
Repositories list
134 repositories
- Multi-hop declarative data pipelines
cruise-control
PublicCruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of Kafka clusters.- Efficient Triton Kernels for LLM Training
helix
Public- Open Control Plane for Tables in Data Lakehouse
rest.li
Publicambry
Publiciceberg
Public- An extensible distributed system for reliable nearline data streaming at scale
ghc25-ds-workshop
Publiclinkedin.github.com
Publictransport
Publicavro-util
Publicfmchisel
Publicgoavro
Publiccoral
PublicBurrow
Publicshaky-android
Publicforthic
Publicisolation-forest
PublicA distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scalable training and ONNX export for easy cross-platform inference.robustInfer
Publicluminol
PublicAnomaly Detection and Correlation library