Stars
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
Breeze is/was a numerical processing library for Scala.
Rapid development of self-documenting APIs
State of the Art Natural Language Processing
Uplift modeling and causal inference with machine learning algorithms
Reactive data-binding for Scala
A fault tolerant, protocol-agnostic RPC system
A better build tool for Java, Scala and Kotlin: 3-6x faster than Maven or Gradle, less fiddling with plugins, and more easily explorable in your IDE
Node Version Manager - POSIX-compliant bash script to manage multiple active node.js versions
The Hypothesis web-based annotation client.
Scala types for your library to represent HTML tags, attributes, properties and CSS styles
An open-source AI agent that brings the power of Gemini directly into your terminal.
Java binary serialization and cloning: fast, efficient, automatic
A game theoretic approach to explain the output of any machine learning model.
Vim-fork focused on extensibility and usability
π LunarVim is an IDE layer for Neovim. Completely free and community driven.
Apache Spark - A unified analytics engine for large-scale data processing
JDK main-line development https://openjdk.org/projects/jdk
Bring data to life with SVG, Canvas and HTML. πππ