Stars
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
A Collection of Helper Functions for the great-tables Package.
Python interface to Ledger Investing's analytics infrastructure
Apache Spark - A unified analytics engine for large-scale data processing
DuckDB is an analytical in-process SQL database management system
Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!
A module for the representation and manipulation of insurance loss triangles.
Data validation toolkit for assessing and monitoring data quality.
Documentation that simply works
A book on DevOps for Data Scientists with CRC Press.
Python tool for converting files and office documents to Markdown.
A curated list of Polars talks, tools, examples & articles. Contributions welcome !
WebAssembly powered code blocks and exercises for both the R and Python languages in Quarto documents
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Lightweight and extensible compatibility layer between dataframe libraries!
A light-weight, flexible, and expressive statistical data testing library
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.
The pytest framework makes it easy to write small tests, yet scales to support complex functional testing