hiyouga | ca719a8697 | support DPO training (2305.18290) (Former-commit-id: 6d98de148e4af63a7028dfaeb6cf86eb56a4488f) | 2023-08-11 03:02:53 +08:00
hiyouga | e0c7e944fc | update trainer (Former-commit-id: 0d39b53a5164e34d22fe0a492eaa0d7ac63102fe) | 2023-08-07 13:34:35 +08:00
hiyouga | e4d0b8ee6e | update ppo trainer (Former-commit-id: c27136a83e167465d3f825e40f10c7b9fcfbf97a) | 2023-08-02 18:46:41 +08:00
hiyouga | 8e26eb374e | fix RM save model (Former-commit-id: 8104cc2425431eb1cddccf3909855296116f922b) | 2023-08-01 11:56:17 +08:00
hiyouga | 772ad4ec6b | fix inference (Former-commit-id: 55dc2bdd3eaa552c655e584fc3cbbf017c7bc3e7) | 2023-08-01 00:06:48 +08:00
hiyouga | dd3f3e9749 | support streaming data, fix #284 #274 #268 (Former-commit-id: 819cc1353599e5fa45658bc56dd0dbe4b258b197) | 2023-07-31 23:33:00 +08:00
hiyouga | a1468139a5 | fix save function (Former-commit-id: 1d6beb0c8490a7531ffdf7a2819410597b200d12) | 2023-07-21 14:09:07 +08:00
hiyouga | 0f7cdac207 | update web UI, support rm predict #210 (Former-commit-id: 92cc6b655dc91b94d5bf9d8618c3b57d5cf94333) | 2023-07-21 13:27:27 +08:00
hiyouga | a31a609377 | fix callback (Former-commit-id: 065680cd2a410d7ceab10a4a76588df43e286117) | 2023-07-15 17:18:16 +08:00
hiyouga | 6261fb362a | modify code structure (Former-commit-id: 0682ed357210897e0b67c4a6eb31a94b3eb929f1) | 2023-07-15 16:54:28 +08:00