Skip to content

wilfrid51/llm-container

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Complete docs

Install envs

first install pytorch2.0 https://pytorch.org/get-started/locally/ then install others pip install -r requirements.txt

deploy necessary settings

run train SFT model

bash run.sh

run train Reward model

bash run-reward.sh

run train RLHF model

bash run-rlhf.sh

Prepare data

SFT data

refer sft-data-construction

reward data and RLHF data

refer rlhf-ppo

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published