Stars
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
Code to accompany our paper Chen and Zimmermann (2020), "Open source cross-sectional asset pricing"
An extensible, state-of-the-art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
Library for building stateful property tests using the proptest crate
The financial transactions database designed for mission-critical safety and performance.
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches t…
Scriptable database and system performance benchmark
Turn YouTube or Vimeo channels, users, or playlists into podcast feeds
Example multi-region AWS Terraform application
A Virtual Private Cloud networking solution based on the P4 language
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
Fast web applications through dynamic, partially-stateful dataflow
Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
A safe and fast multi-producer, multi-consumer channel.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Collaborative Machine-Learning-Centric Data Analytics Using Workflows
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
A system for quickly generating training data with weak supervision