yechaoying

Ye Chaoying yechaoying

1 follower · 3 following

Achievements

Highlights

Starred repositories

hanshen95 / SEAL

An implementation of SEAL: Safety-Enhanced Aligned LLM fine-tuning via bilevel data selection.

Python 22 4 Updated Feb 20, 2025

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 10,773 2,879 Updated Nov 27, 2025

bboylyg / BackdoorLLM

[NeurIPS 2025] BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models

Python 247 26 Updated Oct 24, 2025

cloudwego / eino

The ultimate LLM/AI application development framework in Golang.

Go 8,366 639 Updated Nov 27, 2025

BytedTsinghua-SIA / MemAgent

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 799 58 Updated Jul 31, 2025

Sumandora / remove-refusals-with-transformers

Implements harmful/harmless refusal removal using pure HF Transformers

Python 1,314 209 Updated Nov 27, 2025

Steven-Luo / MasteringRAG

企业级RAG系统从入门到精通

Jupyter Notebook 587 86 Updated Jun 25, 2025

yxlHuster / news-duplicated

文本去重算法，研究自推荐系统中新闻的去重，采用了雅虎的Near-duplicates and shingling算法，服务端用c实现，客户端用java实现，利用thrift框架进行通信，为了提高扩展性，去重可以在服务端实现，服务器也提供了计算的接口，方便客户端自己扩展

Java 24 22 Updated Feb 25, 2014

tuber0613 / hot_news_daily_push

这是一个自动收集各大平台热点新闻（更关注 AI热点）、RSS订阅源以及特定Twitter Feed，进行处理、去重、总结，并通过多种渠道推送热点摘要的工具。该项目完全由Cursor和Trae接力编写

Python 93 18 Updated Jul 29, 2025

Unispac / shallow-vs-deep-alignment

Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep

Python 165 13 Updated Apr 23, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,474 820 Updated Nov 9, 2025

HqWu-HITCS / Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

21,775 2,068 Updated May 19, 2025

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 50,217 8,396 Updated Nov 12, 2025

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,280 4,775 Updated Jun 2, 2025

vibrantlabsai / ragas

Supercharge Your LLM Application Evaluations 🚀

Python 11,557 1,155 Updated Nov 27, 2025

truera / trulens

Evaluation and Tracking for LLM Experiments and AI Agents

Python 2,941 232 Updated Nov 25, 2025

deepseek-ai / DeepSeek-V3

Python 100,414 16,368 Updated Aug 28, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 63,198 7,642 Updated Nov 27, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,543 1,787 Updated Oct 13, 2025

zai-org / GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 6,937 597 Updated Jul 4, 2025

JailbreakBench / jailbreakbench

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]

Python 469 52 Updated Apr 4, 2025

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,841 4,650 Updated Nov 26, 2025

git-disl / awesome_LLM-harmful-fine-tuning-papers

A survey on harmful fine-tuning attack for large language model

222 6 Updated Nov 20, 2025

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 73,760 2,265 Updated Nov 27, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,627 1,396 Updated Oct 1, 2025