Commit Graph

  • bff8b02543 update gradio, support multiple resp in api hiyouga 2023-11-01 23:02:16 +08:00
  • 2406200914 fix SFT trainer hiyouga 2023-10-31 21:52:52 +08:00
  • db06fcfc84 fix #1316 hiyouga 2023-10-31 11:32:08 +08:00
  • 93b9f74e9f update projects hiyouga 2023-10-29 22:53:47 +08:00
  • 33ec844f76 add projects hiyouga 2023-10-29 22:07:13 +08:00
  • 0f727b393e update constants hiyouga 2023-10-29 13:30:20 +08:00
  • 7da2aad6ee fix vicuna template hiyouga 2023-10-27 22:15:25 +08:00
  • 6f09f50d02 fix chatglm3 template hiyouga 2023-10-27 21:12:06 +08:00
  • 5919832059 update readme hiyouga 2023-10-27 19:19:03 +08:00
  • f7635c1afc support chatglm3 hiyouga 2023-10-27 19:16:28 +08:00
  • c762168ed0 support dataset cache hiyouga 2023-10-26 21:48:45 +08:00
  • 67a46e553f fix #1287 hiyouga 2023-10-26 17:49:41 +08:00
  • e406f37b54 fix #1285 hiyouga 2023-10-26 16:34:52 +08:00
  • 62fe877124 remove filter in preprocess hiyouga 2023-10-23 23:46:02 +08:00
  • a0e682ba79 update neftune logic hiyouga 2023-10-22 17:42:13 +08:00
  • 49e8a87383 fix webui hiyouga 2023-10-22 17:24:56 +08:00
  • b2764b49ca add new options in webui hiyouga 2023-10-22 17:17:58 +08:00
  • 06b810de8f fix recursion error hiyouga 2023-10-22 16:28:37 +08:00
  • 6da51565f5 reimplement neftune hiyouga 2023-10-22 16:15:08 +08:00
  • 1f69965239 Merge pull request #1252 from anvie/neftune hoshi-hiyouga 2023-10-22 15:59:20 +08:00
  • af2d61178d add NEFTune optimization anvie 2023-10-21 13:24:10 +07:00
  • 6a955ccf4f fix openchat template hiyouga 2023-10-21 01:25:42 +08:00
  • c0658711ca fix tokenizer padding side in evaluate.py hiyouga 2023-10-21 00:30:04 +08:00
  • d602f06882 fix #1232 hiyouga 2023-10-20 23:28:52 +08:00
  • 1cb9a38ac2 fix #1215 hiyouga 2023-10-19 16:19:21 +08:00
  • 47a1f73d0f fix #1218 hiyouga 2023-10-19 16:17:41 +08:00
  • 142dd63b47 fix #1228 hiyouga 2023-10-19 15:54:10 +08:00
  • b1bd8370c2 fix #1217 hiyouga 2023-10-19 15:52:24 +08:00
  • 215660c8da rename webui hiyouga 2023-10-16 15:16:24 +08:00
  • 0cafe67efe fix #1197 hiyouga 2023-10-16 15:13:46 +08:00
  • ea83b3222b Update README_zh.md hoshi-hiyouga 2023-10-16 00:28:27 +08:00
  • 725087a04f Update README.md hoshi-hiyouga 2023-10-16 00:23:37 +08:00
  • d627ab4855 release v0.2.0 v0.2.0 hiyouga 2023-10-15 20:49:43 +08:00
  • 7d867e8df4 update readme hiyouga 2023-10-15 20:28:14 +08:00
  • 3d34d44497 Update README.md hoshi-hiyouga 2023-10-15 20:23:22 +08:00
  • a6f800b741 fix config, #1191 hiyouga 2023-10-15 18:28:45 +08:00
  • a003d1fa1e disable tqdm in webui mode hiyouga 2023-10-15 16:18:25 +08:00
  • c2e84d4558 refactor export, fix #1190 hiyouga 2023-10-15 16:01:48 +08:00
  • 68330eab2a fix eval resuming in webui hiyouga 2023-10-15 15:45:38 +08:00
  • 7070f3969d tiny fix hiyouga 2023-10-15 05:02:48 +08:00
  • e4727ab155 fix callback hiyouga 2023-10-15 04:59:44 +08:00
  • 280e7d97ad Merge pull request #1186 from hiyouga/dev hoshi-hiyouga 2023-10-15 04:53:14 +08:00
  • 31e3805fb8 implement webui resuming training hiyouga 2023-10-15 04:52:19 +08:00
  • ef248dbe15 fix bugs in webui hiyouga 2023-10-15 03:41:58 +08:00
  • 6a61b4b638 refactor webui hiyouga 2023-10-15 03:06:21 +08:00
  • 4b1473502f fix loading dtype hiyouga 2023-10-14 20:15:24 +08:00
  • bf211d818d fix #1176 #1177 hiyouga 2023-10-14 20:00:17 +08:00
  • 27dd87c890 fix #1184 hiyouga 2023-10-14 19:20:11 +08:00
  • 8659084ab0 fix webui hiyouga 2023-10-13 16:27:59 +08:00
  • e1c9dcea93 update readme hiyouga 2023-10-13 13:53:43 +08:00
  • 171339ab17 update discord link hiyouga 2023-10-12 21:44:28 +08:00
  • 8542ba5c69 rename repository hiyouga 2023-10-12 21:42:29 +08:00
  • 97b74d328b fix ppo args hiyouga 2023-10-11 23:40:50 +08:00
  • 3198a7e5f4 refactor model_dtype, fix PPO trainer hiyouga 2023-10-11 23:16:01 +08:00
  • a2d08ce961 add averaging in evaluation hiyouga 2023-10-10 23:16:31 +08:00
  • bd8ea09479 fix aquila template, repair sft packing mechanism hiyouga 2023-10-10 18:49:55 +08:00
  • 6d0d46c7fb tiny fix hiyouga 2023-10-10 17:41:13 +08:00
  • 820540780a update readme hiyouga 2023-10-09 20:02:50 +08:00
  • f74d600497 fix flash shift short attention hiyouga 2023-10-09 17:54:48 +08:00
  • 94fec9f50e fix webui args hiyouga 2023-10-09 17:13:57 +08:00
  • e387a50475 fix shift short attention hiyouga 2023-10-09 17:07:46 +08:00
  • 5c4248a29c update webui #1086 hiyouga 2023-10-09 14:50:14 +08:00
  • f22886e2b6 fix #1097 hiyouga 2023-10-08 22:29:26 +08:00
  • 33af3cbf37 add llamafy_qwen.py hiyouga 2023-10-08 22:05:36 +08:00
  • 728dfb1be7 fix #1068 #1074 hiyouga 2023-09-28 14:39:16 +08:00
  • e49f7f1afe fix bug in packed sft dataset hiyouga 2023-09-28 01:16:46 +08:00
  • 21a454fa6c tiny fix hiyouga 2023-09-28 01:03:04 +08:00
  • 22c6c27f78 tiny fix hiyouga 2023-09-28 01:02:11 +08:00
  • aecbb43096 fix #1064 hiyouga 2023-09-28 00:53:29 +08:00
  • fa53fd2db2 fix bug in pretraining hiyouga 2023-09-28 00:45:20 +08:00
  • 1c150995ae fix layer norm dtype hiyouga 2023-09-28 00:25:55 +08:00
  • 6c5d8f089e fix #1026 hiyouga 2023-09-27 22:57:09 +08:00
  • dd623325e8 fix #424 hiyouga 2023-09-27 22:49:43 +08:00
  • e8a375c8f2 fix #1032 hiyouga 2023-09-27 22:42:16 +08:00
  • 386d85ae72 refactor finetuning Args hiyouga 2023-09-27 22:28:06 +08:00
  • ebb3901b05 update readme hiyouga 2023-09-27 21:57:47 +08:00
  • 20130b486c support LongLoRA hiyouga 2023-09-27 21:55:50 +08:00
  • 73c48d0463 add CMMLU, update eval script hiyouga 2023-09-23 21:10:17 +08:00
  • f7cecd20e3 update evaluate hiyouga 2023-09-23 11:55:31 +08:00
  • 2bc64a7636 move file hiyouga 2023-09-23 11:52:12 +08:00
  • 9564ddbb48 shuffle few shot examples hiyouga 2023-09-23 00:53:20 +08:00
  • 28062c71b5 fix MMLU hiyouga 2023-09-23 00:42:23 +08:00
  • 35d1921081 add MMLU and C-Eval script hiyouga 2023-09-23 00:34:17 +08:00
  • 4fbdf18c70 fix #1000 hiyouga 2023-09-22 15:00:48 +08:00
  • 5e07ab01f0 update readme hiyouga 2023-09-22 14:34:13 +08:00
  • fac465a21e fix webui hiyouga 2023-09-21 19:55:38 +08:00
  • e145a2ce0c tiny fix hiyouga 2023-09-21 19:52:06 +08:00
  • dc68c313ee fix #944 hiyouga 2023-09-21 19:51:02 +08:00
  • 95c0d9ab24 tiny fix hiyouga 2023-09-21 15:25:29 +08:00
  • 46a718f339 Merge pull request #975 from statelesshz/npu-support hoshi-hiyouga 2023-09-20 14:56:50 +08:00
  • 496ba46960 support export model on Ascend NPU statelesshz 2023-09-20 10:15:59 +08:00
  • 43ae0aca1d fix webui hiyouga 2023-09-19 18:35:21 +08:00
  • b8574c1b82 fix error info hiyouga 2023-09-19 18:30:23 +08:00
  • 32f8b1082b add tests.cal_flops.py hiyouga 2023-09-16 23:40:41 +08:00
  • 6443fef31a update readme hiyouga 2023-09-16 17:33:01 +08:00
  • 14c3795a7d fix #913 hiyouga 2023-09-15 20:58:28 +08:00
  • 3d9e2de573 fix #896 hiyouga 2023-09-14 18:37:34 +08:00
  • 0ca36a0f8d fix #887 hiyouga 2023-09-14 17:56:58 +08:00
  • 3e5555502a Update utils.py mmbwf 2023-09-14 15:38:04 +08:00
  • fbf5b5e0a9 add MathInstruct dataset hiyouga 2023-09-13 22:30:14 +08:00