Stars
Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with co…
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Search Request Processor: pipeline for transformation of queries and results inline with a search request.
A Python Library for Graph Outlier Detection (Anomaly Detection)
RedBeat is a Celery Beat Scheduler that stores the scheduled tasks and runtime metadata in Redis.
Fast web applications through dynamic, partially-stateful dataflow
UpliftML: A Python Package for Scalable Uplift Modeling
GeoLift is an end-to-end geo-experimental methodology based on Synthetic Control Methods, used to measure the true incremental effect (Lift) of an ad campaign.
aws-es-proxy is a small web server application sitting between your HTTP client (browser, curl, etc.) and Amazon Elasticsearch service.
🔎 Open source distributed and RESTful search engine.
Singer.io Target for Snowflake - PipelineWise compatible
NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms (e.g. Elasticsearch)
State-of-the-Art Text Embeddings
Robyn is an experimental, AI/ML-powered, open-source Marketing Mix Modeling (MMM) package from Meta Marketing Science. Our mission is to democratise modeling knowledge, inspire the industry thr…
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Python Implementation of Apriori Algorithm for finding Frequent sets and Association Rules
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Unsupervised text tokenizer for Neural Network-based text generation.
Convert scikit-learn models and pipelines to ONNX
Production infrastructure for machine learning at scale
An open-source implementation of the geo experiment analysis methodology developed at Google. Disclaimer: This is not an official Google product.
Streamlit — A faster way to build and share data apps.
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its go…
Uplift modeling and causal inference with machine learning algorithms
Implementation of statistical models to analyze time lagged conversions
An index of algorithms for learning causality with data