hiyouga
|
538c79fd8f
|
fix #3694
Former-commit-id: 3d1b818cb6a77b7603724fbeb756b468aa74e7ea
|
2024-05-16 00:35:28 +08:00 |
|
hiyouga
|
dd0b85580e
|
fix badam configs
Former-commit-id: 8a4e6a4c65a9a42e6501b0d3ce81d6220c287454
|
2024-05-02 02:47:04 +08:00 |
|
hiyouga
|
233e167f68
|
fix optimizers
Former-commit-id: f811eee2fa12a89a55a9c5d3a05a1521b4347727
|
2024-04-21 20:40:54 +08:00 |
|
hoshi-hiyouga
|
ff4f587dd9
|
Update finetuning_args.py
Former-commit-id: 3a23d900aea74078f0bc8cf73fac860a4ce3df67
|
2024-04-16 17:26:30 +08:00 |
|
Jonery
|
d4d471450f
|
Feature BAdam
Former-commit-id: d8d2807fbcf587c37f7fd34a23e9397d2775ceed
|
2024-04-15 23:15:27 +08:00 |
|
hiyouga
|
d764cd8736
|
support ORPO
Former-commit-id: f44a4c27e2461cdaa1b16865f597a31033c0e6d9
|
2024-03-31 18:29:50 +08:00 |
|
hiyouga
|
9408366a36
|
fix #2982
Former-commit-id: e5e6a0c50c7a1c0052ed6b459450b9735ff2c9a1
|
2024-03-28 20:22:31 +08:00 |
|
hiyouga
|
a916688723
|
fix bug
Former-commit-id: f513e1415cc3fe87f600318fba855d1286b6d007
|
2024-03-26 17:30:12 +08:00 |
|
hiyouga
|
3336422760
|
fix #2961
Former-commit-id: 616917bb3be7f71073b56ad8c7bc4e164b08b9b5
|
2024-03-26 17:26:14 +08:00 |
|
hiyouga
|
46f99ff277
|
improve lora+ impl.
Former-commit-id: 332bad25455a70ad9204e7dd384bb086d789aa39
|
2024-03-13 23:32:51 +08:00 |
|
齐保元
|
3c91e86268
|
[FEATURE]: ADD LORA+ ALGORITHM
Former-commit-id: c35b3c3b1e27171f8a703f88ede1dc8a84c80a56
|
2024-03-13 19:43:27 +08:00 |
|
hiyouga
|
7ff8a064f3
|
support layerwise galore
Former-commit-id: d43a4da0947897d0be3f62fad3107754d4c89f2b
|
2024-03-10 00:24:11 +08:00 |
|
hiyouga
|
5d7d8bd55c
|
update hardware requirements
Former-commit-id: 604b3d10fc1448f702943114b66b97bded21e080
|
2024-03-09 03:58:18 +08:00 |
|
hiyouga
|
48d4364586
|
fix chat engine, update webui
Former-commit-id: 8b32dddd7d883bae07735796a517927c79d1c33b
|
2024-03-08 03:01:53 +08:00 |
|
hiyouga
|
3879d79b89
|
update galore args
Former-commit-id: c7479a7976f773feb36aab4fdb0500be53d83b6a
|
2024-03-08 01:17:32 +08:00 |
|
hiyouga
|
e416cecf62
|
fix galore
Former-commit-id: 62a3ceeef8f60caef43ccc7f971a0c9184e21296
|
2024-03-08 00:44:51 +08:00 |
|
hiyouga
|
1e6fb6c8aa
|
support galore
Former-commit-id: b67a4a46a88d83bb2a3459b3317b66cda15e0171
|
2024-03-07 22:41:36 +08:00 |
|
hiyouga
|
056d2d956a
|
support vllm
Former-commit-id: 889f6e910e654d8ec3922c2185042d737ffbf1c3
|
2024-03-07 20:26:31 +08:00 |
|
hiyouga
|
73d9dfc7ab
|
fix version checking
Former-commit-id: 5780da8d640609cca388f55983d0251e5547209a
|
2024-03-06 14:51:51 +08:00 |
|
hiyouga
|
b392e6cfb9
|
support DoRA, AWQ, AQLM #2512
Former-commit-id: 6614cc1f08aa944db083e27e451bbdd733f7dd97
|
2024-02-28 19:53:28 +08:00 |
|
hiyouga
|
a274900188
|
fix #2532
Former-commit-id: 23a8e64f1c47cd473c627effbe271233c136369c
|
2024-02-21 21:55:14 +08:00 |
|
hiyouga
|
bc16c9a54a
|
support lora for llama pro
Former-commit-id: f74c78ba95f0545aae89e603e466f494705ad024
|
2024-02-21 02:17:22 +08:00 |
|
hiyouga
|
5ccf8fcd6b
|
update webui
Former-commit-id: 9e0f7c362d40b78d57e77d52eaa96e678cebadcd
|
2024-02-19 16:49:58 +08:00 |
|
hiyouga
|
596b6828cb
|
support llama pro #2338 , add rslora
Former-commit-id: 40d659b7f30dd5a004703c176ec1f22dc864e505
|
2024-02-15 02:27:36 +08:00 |
|
hiyouga
|
5f83860aa1
|
add option to disable version check
Former-commit-id: fd769cb2de696aee3c5e882237e16eace6a9d675
|
2024-02-10 22:31:23 +08:00 |
|
hiyouga
|
34bc0c22b1
|
lint
Former-commit-id: 6b1f89b6494e9b6b087fe90600617a3024e014e5
|
2024-02-07 01:10:04 +08:00 |
|
hiyouga
|
a2ae5bd867
|
add hint for freeze #2412
Former-commit-id: 9600c93633629605573d908019563fa3870ad6f8
|
2024-02-03 23:38:56 +08:00 |
|
hiyouga
|
66e0e651b9
|
format style
Former-commit-id: 53b683531b83cd1d19de97c6565f16c1eca6f5e1
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
384f0e7678
|
add bf16 lora option
Former-commit-id: 58e7d7ff0cf9bf30e53b3eb12576f38d31976413
|
2024-01-19 16:29:03 +08:00 |
|
hiyouga
|
d1ec884e75
|
fix #2195
Former-commit-id: 801f7279693a0c785480ea67d663d99f4ca653da
|
2024-01-16 23:53:50 +08:00 |
|
hiyouga
|
921f593632
|
update loader
Former-commit-id: 080d8eab858217ca58bffe719d5ffde7579c5bda
|
2023-12-24 19:10:23 +08:00 |
|
hiyouga
|
6faf9c35a9
|
support unsloth
Former-commit-id: b857f00234b90b785d82ca7cdb29af3d948b1a7b
|
2023-12-23 00:14:33 +08:00 |
|
hiyouga
|
67f7034a21
|
fix param type
Former-commit-id: 11b99f344416ade1cdac52e11ba7f36fcf689221
|
2023-12-21 17:33:01 +08:00 |
|
hiyouga
|
d81ad2d4bc
|
support dpo-ftx
Former-commit-id: 86dfa04f9821556019fa777106787f73eb70b452
|
2023-12-16 19:21:41 +08:00 |
|
hiyouga
|
296711d502
|
support quantization in export model
Former-commit-id: f32500ae6edccab7d14df4c92467e15986866def
|
2023-12-15 23:44:50 +08:00 |
|
hiyouga
|
f902b0d420
|
refactor adapter hparam
Former-commit-id: f82aece9ebd6df83a7a005cc7cbbcec07fa6e14d
|
2023-12-15 20:53:11 +08:00 |
|
hiyouga
|
2542b62d77
|
remove loftq
Former-commit-id: e175c0a1c631296117abda2403a4b87bbdd35a66
|
2023-12-13 01:53:46 +08:00 |
|
hiyouga
|
e39bbdd287
|
support loftq
Former-commit-id: e7ac2eb7f7daae17525a278ffbe2f82c0fbd8093
|
2023-12-12 22:47:06 +08:00 |
|
hiyouga
|
9e2cc21d04
|
update readme
Former-commit-id: 42e042a4206aeb5177ddde56386e9655b0c06460
|
2023-12-12 11:44:30 +08:00 |
|
hiyouga
|
29545d0e5e
|
implement rm server #1543
Former-commit-id: 2e5bb6888c86079493456c2ddd525f8c52b9963e
|
2023-12-03 20:52:54 +08:00 |
|
hiyouga
|
670ee3934f
|
fix #1659
Former-commit-id: e4123129aae59f4123d53c1f5320e3d5e09ae26d
|
2023-11-28 20:52:28 +08:00 |
|
hiyouga
|
569860d7ac
|
support export size setting
Former-commit-id: 1a4de54586c21cdbbc89f8a716ca5a54c87a6120
|
2023-11-26 18:34:09 +08:00 |
|
hiyouga
|
28258aecd2
|
update ppo trainer
Former-commit-id: caa525a5c6f228b9ad71387d1fe4f1c2ffa2479e
|
2023-11-20 21:39:15 +08:00 |
|
Yuchen Han
|
bcd31cf245
|
Update finetuning_args.py
Former-commit-id: 30e3430553f1f7e09cd57ef2c9843b549746c618
|
2023-11-17 00:15:51 -08:00 |
|
hiyouga
|
de3a84ac59
|
fix rlhf callback
Former-commit-id: f5485452d660caef56474cb7dc37abbe4f34599e
|
2023-11-16 03:26:19 +08:00 |
|
hiyouga
|
e017266b98
|
fix bug in PPO training
Former-commit-id: 2e99f0e53ce6de0acbcab85dd50aef874e8c6336
|
2023-11-16 02:32:54 +08:00 |
|
hiyouga
|
7a3a0144a5
|
support full-parameter PPO
Former-commit-id: 4af967d69475e1c9fdf1a7983cd6b83bd431abff
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
b2ac8376e1
|
support multiple modules in freeze training #1514
Former-commit-id: 60abac70dfd778df2ae8b3a2e960ed8b607d7ab6
|
2023-11-15 17:08:18 +08:00 |
|
hiyouga
|
64fc9ba678
|
refactor evaluation, upgrade trl to 074
Former-commit-id: ed09ebe2c1926ffdb0520b3866f7fd03a9aed046
|
2023-11-13 22:20:35 +08:00 |
|
hiyouga
|
68dd1ef121
|
tiny fix
Former-commit-id: 97ba2027bb1ddc01a3c824c40d5a180828810c2c
|
2023-11-09 17:20:49 +08:00 |
|