Stars
Complete Claude Code configuration collection - agents, skills, hooks, commands, rules, MCPs. Battle-tested configs from an Anthropic hackathon winner.
[updating] Chinese Medical Dataset 致力于详细整理所有现有中文医学数据集,包括详细的数据汇总、数据示例、下载链接等。
A cross-platform desktop All-in-One assistant tool for Claude Code, Codex, OpenCode & Gemini CLI.
A third-party launcher for Genshin Impact that supports game injection, automatic check-in, and some useful small features.
A feature-rich command-line audio/video downloader
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
DSPy: The framework for programming—not prompting—language models
DeerSheep0314 / Re4-Learning-to-Re-contrast-Re-attend-Re-construct-for-Multi-interest-Recommendation
🔍大模型应用开发实战一:RAG 技术全栈指南,在线阅读地址:https://datawhalechina.github.io/all-in-rag/
A Unified Framework for High-Performance and Extensible LLM Steering
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
A framework for few-shot evaluation of language models.
Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
[ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
Qihoo360 / 360-LLaMA-Factory
Forked from hiyouga/LlamaFactoryadds Sequence Parallelism into LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Code, Data and Model for Paper "Learning from Peers in Reasoning Models"