Starred repositories
PostgreSQL extension for BM25 relevance-ranked full-text search. Postgres OSS licensed.
OCR model that handles complex tables, forms, handwriting with full layout.
Conformal Prediction for Time-series Forecasting with Change Points
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
⚡ TabPFN: Foundation Model for Tabular Data ⚡
Grist is the evolution of spreadsheets.
Visualize large time series data with plotly.py
A terminal spreadsheet multitool for discovering and arranging data
Creating beautiful plots of data maps
Command line artificial intelligence - Your local LLM context-feeder
Fast, stateless LLM for your shell: qq answers; qa runs commands
Production-ready K-Means clustering for Apache Spark with pluggable Bregman divergences (KL, Itakura-Saito, L1, etc). 6 algorithms, 740 tests, cross-version persistence. Drop-in replacement for MLl…
Curate, Annotate, and Manage Your Data in LightlyStudio.
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Simplifying reinforcement learning for complex game environments
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
🪄 Create rich visualizations with AI
DeepDiff: Deep Difference and search of any Python object/data. DeepHash: Hash of any object based on its contents. Delta: Use deltas to reconstruct objects by adding deltas together.
Get your documents ready for gen AI
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
An adversarial example library for constructing attacks, building defenses, and benchmarking both
Data Analysis with Bootstrapped ESTimation