This website requires JavaScript.
Explore
Help
Register
Sign In
ros
/
LlamaFactory
Watch
1
Star
0
Fork
0
You've already forked LlamaFactory
mirror of
https://github.com/hiyouga/LlamaFactory.git
synced
2026-02-03 21:03:10 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
692b132dbf0124885f8f499c702b6a81de494ca2
LlamaFactory
/
src
/
llmtuner
/
tuner
History
hiyouga
692b132dbf
fix bug in DPO data collator
...
Former-commit-id: 4fc262cdf1347691e253bdfbd96568db5a49c086
2023-09-08 20:45:07 +08:00
..
core
fix
#761
2023-09-08 20:22:18 +08:00
dpo
fix bug in DPO data collator
2023-09-08 20:45:07 +08:00
ppo
fix
#761
2023-09-08 20:22:18 +08:00
pt
update training resuming
2023-08-18 01:41:17 +08:00
rm
change to right-padding, update reward score
#803
2023-09-08 20:04:31 +08:00
sft
change to right-padding, update reward score
#803
2023-09-08 20:04:31 +08:00
__init__.py
modify code structure
2023-08-02 23:17:36 +08:00
tune.py
support rope scaling,
fix
#475
#476
#478
2023-08-12 20:46:27 +08:00