stay hungry, stay healthy.
-
Tsinghua University
- Shenzhen, China
-
21:37
(UTC -12:00) - louieworth.github.io
- @louieworth
-
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedNov 1, 2025 -
-
OpenRLHF Public
Forked from OpenRLHF/OpenRLHFAn Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)
Python Apache License 2.0 UpdatedOct 12, 2025 -
Field-Experiment-AI-Agent Public
Forked from Six-Persimmon/Field-Experiment-AI-AgentAI Agent framework for conducting consumer behavior experiments based on CrewAI.
Python MIT License UpdatedSep 14, 2025 -
-
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedMay 16, 2024 -
awesome-rlhf Public
An index of algorithms for reinforcement learning from human feedback (rlhf))
-
jaxrl Public template
Forked from ikostrikov/jaxrlJAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Jupyter Notebook MIT License UpdatedDec 30, 2023