This website requires JavaScript.
Explore
Help
Register
Sign In
ros
/
LLaMA-Factory
Watch
1
Star
0
Fork
0
You've already forked LLaMA-Factory
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
180a05a44625995dcf92e0596351843039140b3b
LLaMA-Factory
/
src
/
llmtuner
/
tuner
/
ppo
History
hiyouga
5549f35939
fix ppo trainer
#551
...
Former-commit-id: 050a5447c191b8c50a0826a0f03bae499bff8b48
2023-08-20 14:07:11 +08:00
..
__init__.py
modity code structure
2023-07-15 16:54:28 +08:00
trainer.py
fix ppo trainer
#551
2023-08-20 14:07:11 +08:00
utils.py
fix ppo trainer
#551
2023-08-20 14:07:11 +08:00
workflow.py
support ppo score norm (trl 0.5.1.dev required)
2023-08-18 12:02:42 +08:00