- San Francisco, CA
Stars
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Causal Inference for the Brave and True. A light-hearted yet rigorous approach to learning about impact estimation and causality.
Streamlit component that allows Plotly events to bubble back up to streamlit. Makes Plotly charts interactive!
Official Code for DragGAN (SIGGRAPH 2023)
An open-source screen recorder built with web technology
Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
A game theoretic approach to explain the output of any machine learning model.
⚡ A Fast, Extensible Progress Bar for Python and CLI
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Fast ISO8601 date time parser for Python written in C
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
Docker image for Airbnb's Superset
Cubism.js: A JavaScript library for time series visualization.
A library of extension and helper modules for Python's data analysis and machine learning libraries.
A Python implementation of global optimization with gaussian processes.
Jsmn is a world fastest JSON parser/tokenizer. This is the official repo replacing the old one at Bitbucket
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
A curated list of awesome big data frameworks, ressources and other awesomeness.
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
A fast PostgreSQL Database Client Library for Python/asyncio.