Highlights
- Pro
Starred repositories
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
Elegant reading of real-time and hottest news
Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"
Visualizer for neural network, deep learning and machine learning models
Tongyi Deep Research, the Leading Open-source Deep Research Agent
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
SafeLine is a self-hosted WAF(Web Application Firewall) / reverse proxy to protect your web apps from attacks and exploits.
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
Python tool for converting files and office documents to Markdown.
Supercharge Your LLM Application Evaluations 🚀
🍃 JavaScript library for mobile-friendly interactive maps 🇺🇦
A library for efficient similarity search and clustering of dense vectors.
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL
A computer algebra system written in pure Python
A library for building fast, reliable and evolvable network services.
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
Google Drive Public File Downloader when Curl/Wget Fails
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!