mirror of
https://github.com/hiyouga/LlamaFactory.git
synced 2026-03-17 22:53:08 +00:00
Usage:
- pretrain.sh
- sft.sh -> reward.sh -> ppo.sh
- sft.sh -> dpo.sh -> predict.sh
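The chains above can be run by hand, one script after another. As a minimal sketch, the helper below runs a list of scripts in order and stops at the first one that fails. `run_chain` is a hypothetical name and is not part of the repository. The sketch assumes that `->` means "run the next script only after the previous one succeeds" and that the scripts sit in the current directory.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): run the given scripts
# in order and stop at the first failure.
# Assumption: "a.sh -> b.sh" means "run b.sh only after a.sh succeeds".
run_chain() {
    local script
    for script in "$@"; do
        echo "==> ${script}"
        bash "${script}" || {
            echo "stage failed: ${script}" >&2
            return 1
        }
    done
}

# Example calls (assume the scripts are in the current directory):
# run_chain pretrain.sh
# run_chain sft.sh reward.sh ppo.sh
# run_chain sft.sh dpo.sh predict.sh
```

Stopping on the first failure keeps a later stage from starting on a checkpoint that an earlier stage did not finish writing.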