Highlights
- Pro
Lists (2)
Sort Name ascending (A-Z)
Stars
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets
🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
slime is an LLM post-training framework for RL Scaling.
[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
MTVCraft: An Open Veo3-style Audio-Video Generation Demo
High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Ongoing research training transformer models at scale
TextPy: Collaborative Agent Workflow through Programming and Prompting
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
Survey on LLM Agents (Published on CoLing 2025)
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
The official Python SDK for Model Context Protocol servers and clients
🚀 The fast, Pythonic way to build MCP servers and clients
official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives