Highlights
- Pro
Lists (13)
Sort Name ascending (A-Z)
Starred repositories
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
Open-Source AI Presentation Generator and API (Gamma, Beautiful AI, Decktopus Alternative)
Hands-on examples and exercises from the book "Databricks Certified Data Engineer Associate Study Guide" published by O'Reilly Media.
end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence
There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable an…
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.
⚡️ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-powered business intelligence in seconds.
The official Python SDK for Model Context Protocol servers and clients
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Model Context Protocol Servers
DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe)
Examples of Databricks Asset Bundles
An extremely fast Python package and project manager, written in Rust.
A very small, very simple, yet very secure encryption tool.
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
A pure Rust Excel/OpenDocument SpreadSheets file reader: rust on metal sheets
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Various Jupyter notebooks about Common Crawl data
Free and Open Source, Distributed, RESTful Search Engine
Prompt, run, edit, and deploy full-stack web applications. -- bolt.new -- Help Center: https://support.bolt.new/ -- Community Support: https://discord.com/invite/stackblitz
This repository helps teach people how to correctly define and create cumulative tables!