🎯
Focusing
Hi there! I'm a 2nd year CS PhD student at Princeton University.
Previously at Amazon AGI, PKU/CMU Alumni
- Princeton, NJ
- https://sijial430.github.io/
- @letti_liu
Pinned Loading
-
HALOs
HALOs PublicForked from ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
Python
-
ContextualAI/HALOs
ContextualAI/HALOs PublicA library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
-
trl-fork
trl-fork PublicForked from Muennighoff/trl-fork
Train transformer language models with reinforcement learning.
Python
-
verl-fork
verl-fork PublicForked from volcengine/verl
verl: Volcano Engine Reinforcement Learning for LLMs
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.