Stars
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Awesome Knowledge Distillation
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Perplexica is an AI-powered answering engine. It is an Open source alternative to Perplexity AI
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
推荐系统入门教程,在线阅读地址:https://datawhalechina.github.io/fun-rec/
Eigent: The World's First Multi-agent Workforce to Unlock Your Exceptional Productivity.
A collection of research and survey papers of real-time bidding (RTB) based display advertising techniques.
A List of Recommender Systems and Resources
Python tool for converting files and office documents to Markdown.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
This is a multi agent tutorial based on the CAMEL framework, aimed at understanding how to build an Agent Society from the ground up!
SGLang is a fast serving framework for large language models and vision language models.
An extremely fast Python package and project manager, written in Rust.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
From expressive code to powerful GUIs in no time: a fast, feature-rich, cross-platform toolkit for C++ & Python.
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Production-ready platform for agentic workflow development.
Vision infrastructure to turn complex documents into RAG/LLM-ready data
「PyTorch」A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors which can be used for ANN search.