hiyouga | f2e139f5cd | fix #1452; Former-commit-id: 4d16214467715df458e24d03bb7d303d62b8bdcd | 2023-11-09 16:41:32 +08:00
hiyouga | f5ba2190fb | fix ppo train and dpo eval; Former-commit-id: ced863031836632cb5920e22ae6991f251372118 | 2023-11-07 22:48:51 +08:00
hiyouga | 14a38b5069 | fix #1422; Former-commit-id: 25d7bbd0a5142f001bd2ff498df07b24137050a9 | 2023-11-07 19:42:01 +08:00
hiyouga | f23e5b602a | fix reward model loading; Former-commit-id: 9709ca501180a1afce32e9043aedb359762b437d | 2023-11-07 17:20:51 +08:00
hiyouga | 857696ed9c | fix args; Former-commit-id: 44d0fa2ac6a6423c7ddaf91eb8998c1b9248c04e | 2023-11-07 16:36:06 +08:00
hiyouga | 2eb65d21ac | upgrade peft, fix #1088 #1411; Former-commit-id: aa7d104f8e050d12cb8f585bc8a52c850995500f | 2023-11-07 16:13:36 +08:00
hiyouga | 4e40f5b62b | fix #1383; Former-commit-id: 9b8a782aa80f27c3e2a2e2621f9be17cae1a27e8 | 2023-11-06 11:42:23 +08:00
hiyouga | 217fde0918 | fix bug in data loader, support dpo eval; Former-commit-id: f4f3dcff990468a2fa864b7176adcebbcf16dac9 | 2023-11-03 00:34:26 +08:00
hiyouga | 67a46e553f | fix #1287; Former-commit-id: d885aca472c6448bbf9a9e8d16bead92038825e3 | 2023-10-26 17:49:41 +08:00
hiyouga | e387a50475 | fix shift short attention; Former-commit-id: 9a49cce8e6f6b222f74a07bdab40efee6a77b0f1 | 2023-10-09 17:07:46 +08:00
hiyouga | 42e0b30476 | update flashattn, fix ppo save model; Former-commit-id: 0b08bc3dac246d4aa3f89afb7172529dcad9c39f | 2023-09-11 17:25:36 +08:00
hiyouga | a09a7b650d | remove PeftTrainer; Former-commit-id: cc0cff3e991f194732d278e627648e528118a719 | 2023-09-10 22:23:23 +08:00
hiyouga | 692b132dbf | fix bug in DPO data collator; Former-commit-id: 4fc262cdf1347691e253bdfbd96568db5a49c086 | 2023-09-08 20:45:07 +08:00
hiyouga | 86d835878c | fix #809; Former-commit-id: 2783ca75365d7c373cefba039788a48f0b8f35fc | 2023-09-07 19:04:32 +08:00
hiyouga | e5b72c6a77 | refactor dataset_attr, add eos in pt, fix #757; Former-commit-id: 0feec9a830b917b36686b61938a66e842eccf930 | 2023-09-01 19:00:45 +08:00
hiyouga | 180a05a446 | fix import error; Former-commit-id: b3207a974a45038591b8cbbcf20d1ca1142d6679 | 2023-08-23 20:45:03 +08:00
hiyouga | eb9ac9ee1f | fix #649; Former-commit-id: e6120a937ddb4f3c0b9bcb2466742f5cf4f77f8c | 2023-08-23 20:21:15 +08:00
hiyouga | d6be98cda6 | fix #617; Former-commit-id: a7bdaf1c92c7d798caf8438dc42a8972632ec584 | 2023-08-21 18:16:11 +08:00
hiyouga | c2644f939a | update training resuming; Former-commit-id: 2ec75c31f609e65116ac3b621eeb7d8ccbf69135 | 2023-08-18 01:41:17 +08:00
hiyouga | bceaba551d | fix ChatGLM lm_head #494; Former-commit-id: bf0048abdaeb2b9592d38ac991704ad014370b47 | 2023-08-14 14:14:48 +08:00
hiyouga | 4933ab5956 | fix #480; Former-commit-id: ec15ca8fffacba2c34e1849c5ce90ca9989d66a2 | 2023-08-14 00:23:56 +08:00
hiyouga | ca719a8697 | support DPO training (2305.18290); Former-commit-id: 6d98de148e4af63a7028dfaeb6cf86eb56a4488f | 2023-08-11 03:02:53 +08:00