Stars
Get your documents ready for gen AI
An orchestration platform for the development, production, and observation of data assets.
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Lightweight and fast library to convert PDF to markdown format.
The absolute trainer to light up AI agents.
Robust Speech Recognition via Large-Scale Weak Supervision
Python tool for converting files and office documents to Markdown.
OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving
SQL databases in Python, designed for simplicity, compatibility, and robustness.
(Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models.
[NeurIPS 2025] 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability
AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents
An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl
Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.
verl: Volcano Engine Reinforcement Learning for LLMs
FastAPI Template with Docker, Postgres
Build Real-Time Knowledge Graphs for AI Agents
A simple, easy-to-hack GraphRAG implementation
pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tidb.ai
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
A collection of examples that show how to use CrewAI framework to automate workflows.
Task-Aware Agent-driven Prompt Optimization Framework
OpenSPG is a Knowledge Graph Engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic-enhanced Programmable Graph) framework. Core Capabilities: 1) domain model constr…
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
An open-source RAG-based tool for chatting with your documents.
Code for explaining and evaluating late chunking (chunked pooling)