Skip to content
View pbailis's full-sized avatar

Organizations

@stanford-futuredata @sisudata

Block or report pbailis

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!)

Python 771 22 Updated Jul 12, 2023

🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓

TypeScript 4,906 465 Updated Jan 7, 2026

Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure.

Go 521 20 Updated Jun 7, 2023

This project contains the code for running experiments with the DIRECT and SKETCHREFINE algorithms presented in the paper: Matteo Brucato, Juan Felipe Beltran, Azza Abouzied, Alexandra Meliou: Scal…

Python 8 3 Updated Dec 18, 2020

Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.

Go 3,208 369 Updated Mar 20, 2025

Graph Coloring to Accelerate Analysis

C 7 Updated Jan 25, 2020

Accelerating network inference over video

Python 436 121 Updated Mar 6, 2020

presentations for busy messy hackers

JavaScript 3,310 162 Updated Nov 7, 2025

Sparser: Raw Filtering for Faster Analytics over Raw Data

C 434 54 Updated Sep 18, 2018

MacroBase: A Search Engine for Fast Data

Java 671 126 Updated Dec 14, 2022

A framework for formally verifying distributed systems implementations in Coq

Rocq Prover 613 57 Updated Jun 27, 2025

Alluxio, data orchestration for analytics and machine learning in the cloud

Java 7,139 2,955 Updated Apr 29, 2025

Twemcache is the Twitter Memcached

C 935 152 Updated Nov 1, 2021

Benchmark for measuring PBS overhead in Cassandra

Python 4 Updated Jul 10, 2012

Adding logging information to Voldemort for analysis ("profiling" branch)

Java 2 Updated Aug 9, 2012

Changing DynamoDB put interface to control timestamps

Java 4 3 Updated Apr 14, 2012

Development in Shark has been ended.

Scala 994 324 Updated Aug 11, 2015

Tiny Transactions on Computer Systems (TinyToCS) Site

TeX 32 5 Updated Mar 8, 2016

RoboBees Colony Swarm Simulator

Java 18 10 Updated Oct 4, 2022

Peter Bailis's Blog

Python 1 Updated Apr 6, 2014

Sparrow scheduling platform (U.C. Berkeley).

Python 328 90 Updated Jul 25, 2020

Old Probabilistically Bounded Staleness (PBS) analysis for Cassandra (see http://www.bailis.org/blog/using-pbs-in-cassandra-1.2.0/)

Java 29 2 Updated Jul 10, 2012

Work-in-progress sample code related to Bud

Ruby 69 9 Updated Nov 22, 2016

Prototype Bud runtime (Bloom Under Development)

Ruby 869 60 Updated Sep 1, 2020

The official AWS SDK for Java 1.x (In Maintenance Mode, End-of-Life on 12/31/2025). The AWS SDK for Java 2.x is available here: https://github.com/aws/aws-sdk-java-v2/

Java 4,194 2,812 Updated Jan 5, 2026

An open source clone of Amazon's Dynamo.

Java 2,682 584 Updated Jul 24, 2023

Apache Cassandra®

Java 9,567 3,818 Updated Dec 31, 2025