Commit Graph

  • bcd661afa6 fix value head model resuming hiyouga 2023-11-20 19:01:37 +08:00
  • adf2730d1d fix #1567 hiyouga 2023-11-20 18:46:36 +08:00
  • ba2be6371d better data streaming hiyouga 2023-11-19 23:32:47 +08:00
  • d2ff09a404 fix model card network issue hiyouga 2023-11-19 23:03:19 +08:00
  • 9f364d3880 fix Mistral template hiyouga 2023-11-19 16:29:30 +08:00
  • cfad41b901 fix #1263 hiyouga 2023-11-19 16:05:18 +08:00
  • 6889f044fb fix #1558 hiyouga 2023-11-19 14:15:47 +08:00
  • 3d1ee27ccd fix evaluator and cached_file in 4.31.0 hiyouga 2023-11-18 19:39:23 +08:00
  • 775ce62950 update benchmark hiyouga 2023-11-18 11:30:01 +08:00
  • 821a6f2fa6 update readme hiyouga 2023-11-18 11:15:56 +08:00
  • 5197fb2fad add benchmark hiyouga 2023-11-18 11:09:52 +08:00
  • 92abe91d22 update dataset hiyouga 2023-11-17 23:19:12 +08:00
  • a7bf0b85d7 fix quantization hiyouga 2023-11-17 22:21:29 +08:00
  • 5ce5ea84a9 fix #1550 hiyouga 2023-11-17 17:23:13 +08:00
  • 992be39f90 Update README_zh.md Yuchen Han 2023-11-17 00:18:07 -08:00
  • cab80a3c56 Update README.md Yuchen Han 2023-11-17 00:17:36 -08:00
  • 6af7107938 Update workflow.py Yuchen Han 2023-11-17 00:16:27 -08:00
  • bcd31cf245 Update finetuning_args.py Yuchen Han 2023-11-17 00:15:51 -08:00
  • 85c4ccfef9 fix packages hiyouga 2023-11-17 16:11:48 +08:00
  • dc0f81aabc Merge #1544 from Outsider565/main, fix #1548 hoshi-hiyouga 2023-11-17 16:09:42 +08:00
  • 07f934566a Fix: Change rouge-chinese package name to rouge_chinese Shaowen Wang 2023-11-16 20:12:35 -06:00
  • 77cb18e9e3 fix chatglm template hiyouga 2023-11-16 22:54:15 +08:00
  • fccaecf730 Update bug-report.yml hiyouga 2023-11-16 19:37:35 +08:00
  • 53cdfe8f73 add issue template hiyouga 2023-11-16 19:35:30 +08:00
  • ea03523c6a Update issue templates hoshi-hiyouga 2023-11-16 18:56:30 +08:00
  • caf3cbf8d7 fix web ui demo hiyouga 2023-11-16 18:41:55 +08:00
  • da411066c9 fix web ui demo hiyouga 2023-11-16 17:12:23 +08:00
  • 95d0f77fc2 release v0.3.0 v0.3.0 hiyouga 2023-11-16 16:00:11 +08:00
  • 9b2654277b update readme hiyouga 2023-11-16 15:58:37 +08:00
  • f1b3bdac3f Merge #1525 from hiyouga/dev, fix #224 #336 #931 #936 #1011 hoshi-hiyouga 2023-11-16 15:47:13 +08:00
  • 595fdbd95d fix css hiyouga 2023-11-16 15:45:38 +08:00
  • dab9385297 fix bug in web ui hiyouga 2023-11-16 15:21:24 +08:00
  • df83def566 update ppo and demo in webui hiyouga 2023-11-16 14:55:26 +08:00
  • f9d4e37b3c fix bug in freeze tuning hiyouga 2023-11-16 14:25:11 +08:00
  • e59a3d71e0 tiny fix hiyouga 2023-11-16 03:27:19 +08:00
  • de3a84ac59 fix rlhf callback hiyouga 2023-11-16 03:26:19 +08:00
  • e017266b98 fix bug in PPO training hiyouga 2023-11-16 02:32:54 +08:00
  • f81a8a5e5c fix import bug hiyouga 2023-11-16 02:27:03 +08:00
  • 7a3a0144a5 support full-parameter PPO hiyouga 2023-11-16 02:08:04 +08:00
  • 8263b2d32d add demo mode for web UI hiyouga 2023-11-15 23:51:26 +08:00
  • 833cd490b8 Create CODE_OF_CONDUCT.md hoshi-hiyouga 2023-11-15 20:42:15 +08:00
  • 2162c37e41 update readme and constants hiyouga 2023-11-15 18:04:37 +08:00
  • b2ac8376e1 support multiple modules in freeze training #1514 hiyouga 2023-11-15 17:08:18 +08:00
  • 8079584143 fix imports hiyouga 2023-11-15 16:47:45 +08:00
  • 09a4474e7f disentangle model from tuner and rename modules hiyouga 2023-11-15 16:29:09 +08:00
  • 81530133ff fix #1507 hiyouga 2023-11-15 16:22:32 +08:00
  • cc4b384ac3 Update cal_lr.py hiyouga 2023-11-14 21:14:42 +08:00
  • 3852daf447 Update cal_lr.py hiyouga 2023-11-14 21:13:01 +08:00
  • 5c97111f9d Update cal_lr.py hiyouga 2023-11-14 21:09:30 +08:00
  • 75dd1f0f7e add cal_lr.py hiyouga 2023-11-14 20:58:37 +08:00
  • c9a4551012 fix #1494 hiyouga 2023-11-14 18:07:20 +08:00
  • 87197ba91d fix #1489 hiyouga 2023-11-14 15:27:05 +08:00
  • 7461bf84e5 support eval remote dataset hiyouga 2023-11-14 02:42:30 +08:00
  • fbc0357b2e fix dc link hiyouga 2023-11-13 23:22:56 +08:00
  • ec334f5891 release v0.2.2, fix #1478 #1466 v0.2.2 hiyouga 2023-11-13 23:09:05 +08:00
  • 885efe772e fix #424 hiyouga 2023-11-13 22:42:23 +08:00
  • 64fc9ba678 refactor evaluation, upgrade trl to 074 hiyouga 2023-11-13 22:20:35 +08:00
  • 989eccd286 fix flashattn warning hiyouga 2023-11-10 18:34:54 +08:00
  • f0766a2ab0 add todo hiyouga 2023-11-10 14:38:18 +08:00
  • 178b85ff9a refactor constants hiyouga 2023-11-10 14:16:10 +08:00
  • 68dd1ef121 tiny fix hiyouga 2023-11-09 17:20:49 +08:00
  • b222cffe98 Merge pull request #1454 from yyq/main hoshi-hiyouga 2023-11-09 17:12:18 +08:00
  • b4f1ab93d1 Update finetuning_args.py Yanqing 2023-11-09 17:04:40 +08:00
  • f2e139f5cd fix #1452 hiyouga 2023-11-09 16:41:32 +08:00
  • a9cbca1604 update readme v0.2.1 hiyouga 2023-11-09 16:00:24 +08:00
  • 3a30ce6c16 release v0.2.1 hiyouga 2023-11-09 15:54:16 +08:00
  • 48ec5355f9 add template, modify datasets hiyouga 2023-11-09 15:53:23 +08:00
  • 11859bc322 Merge pull request #1436 from lvzii/main hoshi-hiyouga 2023-11-09 14:30:50 +08:00
  • 28c67a5be8 support parquet format #1446 hiyouga 2023-11-09 14:17:40 +08:00
  • 44fe93e9b0 fix #1438 #1439 hiyouga 2023-11-09 13:45:10 +08:00
  • 09a1681b63 fix tokenizer config changed after pretrain lvzi 2023-11-08 15:50:46 +08:00
  • f5ba2190fb fix ppo train and dpo eval hiyouga 2023-11-07 22:48:51 +08:00
  • 14a38b5069 fix #1422 hiyouga 2023-11-07 19:42:01 +08:00
  • f23e5b602a fix reward model loading hiyouga 2023-11-07 17:20:51 +08:00
  • 857696ed9c fix args hiyouga 2023-11-07 16:36:06 +08:00
  • 2084133058 update info hiyouga 2023-11-07 16:28:21 +08:00
  • f7f0c3070e delete file hiyouga 2023-11-07 16:20:12 +08:00
  • 46235aa514 fix #1418 hiyouga 2023-11-07 16:17:22 +08:00
  • 2eb65d21ac upgrade peft, fix #1088 #1411 hiyouga 2023-11-07 16:13:36 +08:00
  • 37a0d62a82 update requirements hiyouga 2023-11-06 19:01:21 +08:00
  • 21ac46e439 use seed in evaluate.py hiyouga 2023-11-06 18:17:51 +08:00
  • ba3e8ba20c update readme (list in alphabetical order) hiyouga 2023-11-06 17:18:12 +08:00
  • 2c48e798ca update templates hiyouga 2023-11-06 12:25:47 +08:00
  • 4e40f5b62b fix #1383 hiyouga 2023-11-06 11:42:23 +08:00
  • 2a8892b785 fix deepseek template hiyouga 2023-11-05 13:08:46 +08:00
  • ee3b33ff03 support deepseek coder #1378 hiyouga 2023-11-05 12:51:03 +08:00
  • b2c3001f8e fix #1365 hiyouga 2023-11-05 12:21:07 +08:00
  • 6cfe1e1ac2 tiny fix hiyouga 2023-11-03 01:26:06 +08:00
  • 52326870e4 fix #1290 hiyouga 2023-11-03 00:44:53 +08:00
  • 217fde0918 fix bug in data loader, support dpo eval hiyouga 2023-11-03 00:34:26 +08:00
  • 065021d82a update data readme hiyouga 2023-11-03 00:15:23 +08:00
  • 4bb643e685 update data readme (zh) hiyouga 2023-11-02 23:42:49 +08:00
  • b77c745b1a support sharegpt format, add datasets hiyouga 2023-11-02 23:10:04 +08:00
  • 7d13501b94 support pagination in webui preview hiyouga 2023-11-02 21:21:45 +08:00
  • ac74639b32 fix webui hiyouga 2023-11-02 18:03:14 +08:00
  • 12fa56ae68 support warning in webui hiyouga 2023-11-02 17:57:04 +08:00
  • f11b863f4b fix #1349 hiyouga 2023-11-02 17:02:44 +08:00
  • f3e4b72957 fix #1356 hiyouga 2023-11-02 16:51:52 +08:00
  • 8d52fb46ca fix #1325 hiyouga 2023-11-01 23:38:49 +08:00
  • dab8f45033 fix chat hiyouga 2023-11-01 23:07:58 +08:00