Stars
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond
Open Source, Google Zanzibar-inspired database for scalably storing and querying fine-grained authorization data
Open Policy Agent (OPA) is an open source, general-purpose policy engine.
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.
Data validation using Python type hints
Open, Multi-modal Catalog for Data & AI
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
Algorithm and data structure articles for https://cp-algorithms.com (based on http://e-maxx.ru)
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
Kubernetes/OpenShift operator for Debezium Server. Please log issues at https://github.com/debezium/dbz/issues.
A platform for building proxies to bypass network restrictions.
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
The Metadata Platform for your Data and AI Stack
Python library providing function decorators for configurable backoff and retry
Brave browser for Android, iOS, Linux, macOS, Windows.
An open source framework for building data analytic applications.
Convert Machine Learning Code Between Frameworks
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors