Stars
A simple, fast streaming JSON parser built on standards.
A powerful and modular toolkit for record linkage and duplicate detection in Python
jellyjoin Python package for soft joins with embedding vectors
Bake a cake with care, follow steps the recipe gives, that’s an algorithm.
Community-contributed instructions, prompts, and configurations to help you make the most of GitHub Copilot.
An open-source Python library for Reinforcement Learning (RL), designed to model, optimize, and control dynamic systems.
Python library for Applied Computational Supply Chain & Logistics. Unlock Neural Nets, Bayesian EOQ, Optimization, Time Series, and more for smarter decisions.
Bringing semantic search to Django. Integrates seemlessly with Django ORM.
A declarative, 🐻❄️-native data frame validation library.
Manipulation and analysis of geometric objects on the sphere.
Resources and notebooks to accompany the Duplicate Detection using GenAI paper
This repository contains the code for the O'Reilly book Reinforcement Learning for Finance.
High performance Python GLMs with all the features!
Highly Performant, Modular, Memory Safe and Production-ready Inference, Ingestion and Indexing built in Rust 🦀
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
Grep Python Abstract Syntax Trees (AST) using XPath
Distributed query engine providing simple and reliable data processing for any modality and scale
Aggregating Large-Scale Databases for PubMed Author Name Disambiguation
A library that enables you to easily parse and transform ORCID metadata between XML, JSON and Java objects
A Python concurrency scheduling library, compatible with asyncio and trio.