Starred repositories
A series of technical reports on Slow Thinking with LLMs
Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.
PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons to help you sort out the algorithm theory, straighten out the code logic, and master practical decision-AI applications)
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]
Secrets of RLHF in Large Language Models Part I: PPO
Latest Advances on System-2 Reasoning
Exploring Applications of GRPO
Fully open reproduction of DeepSeek-R1
Integrate the DeepSeek API into popular software
A collection of industry classics and cutting-edge papers in the fields of recommendation, advertising, and search.
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
A collection of research and application papers about mechanism and strategy in computational (Internet) advertising.
The official implementation of Self-Play Fine-Tuning (SPIN)
A collection of industry practice articles on search, recommendation, advertising, user growth, and related topics (sources: Zhihu, DataFunTalk, and tech WeChat official accounts)
All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
https://acl2023-retrieval-lm.github.io/
Official Code for Stable Cascade
Empower Large Language Models (LLMs) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge-intensive tasks
The official repo of Qwen-VL (通义千问-VL), the chat and pretrained large vision-language model proposed by Alibaba Cloud.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
✨✨Latest Advances on Multimodal Large Language Models
Official repo for consistency models.