Skip to content
View pbailis's full-sized avatar

Organizations

@stanford-futuredata @sisudata

Block or report pbailis

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!)

Python 771 22 Updated Jul 12, 2023

🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓

TypeScript 4,720 449 Updated Nov 12, 2025

Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure.

Go 521 20 Updated Jun 7, 2023

This project contains the code for running experiments with the DIRECT and SKETCHREFINE algorithms presented in the paper: Matteo Brucato, Juan Felipe Beltran, Azza Abouzied, Alexandra Meliou: Scal…

Python 8 3 Updated Dec 18, 2020

Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.

Go 3,195 369 Updated Mar 20, 2025

Graph Coloring to Accelerate Analysis

C 7 Updated Jan 25, 2020

Accelerating network inference over video

Python 435 120 Updated Mar 6, 2020

presentations for busy messy hackers

JavaScript 3,304 161 Updated Nov 7, 2025

Sparser: Raw Filtering for Faster Analytics over Raw Data

C 433 53 Updated Sep 18, 2018

MacroBase: A Search Engine for Fast Data

Java 670 126 Updated Dec 14, 2022

A framework for formally verifying distributed systems implementations in Coq

Rocq Prover 609 56 Updated Jun 27, 2025

Alluxio, data orchestration for analytics and machine learning in the cloud

Java 7,107 2,953 Updated Apr 29, 2025

Twemcache is the Twitter Memcached

C 936 153 Updated Nov 1, 2021

Benchmark for measuring PBS overhead in Cassandra

Python 4 Updated Jul 10, 2012

Adding logging information to Voldemort for analysis ("profiling" branch)

Java 2 Updated Aug 9, 2012

Changing DynamoDB put interface to control timestamps

Java 4 3 Updated Apr 14, 2012

Development in Shark has been ended.

Scala 994 325 Updated Aug 11, 2015

Tiny Transactions on Computer Systems (TinyToCS) Site

TeX 32 5 Updated Mar 8, 2016

RoboBees Colony Swarm Simulator

Java 19 10 Updated Oct 4, 2022

Peter Bailis's Blog

Python 1 Updated Apr 6, 2014

Sparrow scheduling platform (U.C. Berkeley).

Python 329 91 Updated Jul 25, 2020

Old Probabilistically Bounded Staleness (PBS) analysis for Cassandra (see http://www.bailis.org/blog/using-pbs-in-cassandra-1.2.0/)

Java 29 2 Updated Jul 10, 2012

Work-in-progress sample code related to Bud

Ruby 68 9 Updated Nov 22, 2016

Prototype Bud runtime (Bloom Under Development)

Ruby 866 60 Updated Sep 1, 2020

The official AWS SDK for Java 1.x (In Maintenance Mode, End-of-Life on 12/31/2025). The AWS SDK for Java 2.x is available here: https://github.com/aws/aws-sdk-java-v2/

Java 4,186 2,820 Updated Oct 28, 2025

An open source clone of Amazon's Dynamo.

Java 2,676 586 Updated Jul 24, 2023

Apache Cassandra®

Java 9,472 3,793 Updated Nov 12, 2025