Commit Graph

  • be566a15a5 fix unusual output of 8bit models #278 #391 hiyouga 2023-08-12 00:25:29 +08:00
  • d5f1b99ac4 Release v0.1.6 v0.1.6 hiyouga 2023-08-11 23:25:57 +08:00
  • 2144bb0e27 Update README_zh.md hiyouga 2023-08-11 14:06:02 +08:00
  • bc665bacc7 add defaults hiyouga 2023-08-11 13:56:26 +08:00
  • 52bfcf4883 fix stop word in baichuan template hiyouga 2023-08-11 13:51:46 +08:00
  • 06df3d6fb6 fix baichuan template hiyouga 2023-08-11 13:45:47 +08:00
  • ca719a8697 support DPO training (2305.18290) hiyouga 2023-08-11 03:02:53 +08:00
  • 72dfd74005 Merge pull request #451 from jovialchen/main hoshi-hiyouga 2023-08-10 17:25:38 +08:00
  • 69302c4420 fix webui val size hiyouga 2023-08-10 15:20:44 +08:00
  • 42d7019b2e huggingface login for projects must login while running jiongxuc 2023-08-10 14:57:12 +08:00
  • 5f0d0d6b9b fix template hiyouga 2023-08-09 23:14:27 +08:00
  • 76cb63e4f6 fix template hiyouga 2023-08-09 23:10:20 +08:00
  • 467d571206 support val set in streaming mode hiyouga 2023-08-09 23:00:26 +08:00
  • 972bfa700a fix tokenizer hiyouga 2023-08-09 17:52:15 +08:00
  • 458955d0fb add last_checkpoint support niuba 2023-08-09 16:39:27 +08:00
  • 990eeccf45 fix sft trainer hiyouga 2023-08-09 16:35:03 +08:00
  • a3a7465f00 fix rm #420, fix template #426, fix #423 hiyouga 2023-08-09 16:23:31 +08:00
  • 031a819257 fix llama2 template hoshi-hiyouga 2023-08-09 00:58:27 +08:00
  • eb4b4e3c8c fix tokenizer hoshi-hiyouga 2023-08-09 00:54:54 +08:00
  • d2e1fe9b1d update webui hiyouga 2023-08-09 00:26:11 +08:00
  • 6e27a9e39a fix tokenizer #417 hiyouga 2023-08-08 23:59:41 +08:00
  • 805478c911 fix bug hiyouga 2023-08-08 21:28:28 +08:00
  • a281cdeb89 fix bug hiyouga 2023-08-08 17:55:55 +08:00
  • cda698a67f fix chatml template #408 hiyouga 2023-08-08 17:44:39 +08:00
  • 15acd17716 update args spec hiyouga 2023-08-07 15:23:35 +08:00
  • 34a2bddfcd update readme hiyouga 2023-08-07 15:02:02 +08:00
  • 370f817549 Merge branch 'main' of https://github.com/hiyouga/LLaMA-Efficient-Tuning hiyouga 2023-08-07 13:59:16 +08:00
  • 041390c37e fix #376 hiyouga 2023-08-07 13:58:59 +08:00
  • d9fe4bf500 Merge pull request #382 from hiyouga/feature-updateReadme hoshi-hiyouga 2023-08-07 13:43:38 +08:00
  • e0c7e944fc update trainer hiyouga 2023-08-07 13:34:35 +08:00
  • 0845fe67db add detailed model configs codemayq 2023-08-07 09:30:23 +08:00
  • fe3b12d900 fix qwen eos token hiyouga 2023-08-06 13:31:17 +08:00
  • a70d56864e fix qwen tokenizer #361 hiyouga 2023-08-05 17:06:05 +08:00
  • fdbb2c5378 fix template for tiktoken hiyouga 2023-08-05 13:42:42 +08:00
  • 3c0aaf42af remove redundant code hiyouga 2023-08-05 00:27:27 +08:00
  • 438e19160a fix template hiyouga 2023-08-05 00:25:00 +08:00
  • f2b2ff6950 fix llama2 template hiyouga 2023-08-05 00:07:54 +08:00
  • 86cef96305 Support safe ChatML template, fix qwen tok #351 #354 hoshi-hiyouga 2023-08-05 00:00:23 +08:00
  • 5f50944baf fix bos and eos token hiyouga 2023-08-04 23:55:57 +08:00
  • 0804fd2353 fix encode hiyouga 2023-08-04 23:27:55 +08:00
  • 86419eb457 support chatml safe encoding hiyouga 2023-08-04 23:14:28 +08:00
  • 76f3ae7bf3 support interleave probs hiyouga 2023-08-04 21:27:35 +08:00
  • aaa85190eb fix webui export model hiyouga 2023-08-04 14:20:27 +08:00
  • e2a4e926b9 fix mtloader hiyouga 2023-08-03 19:29:02 +08:00
  • d6e922dc1c tiny fix hiyouga 2023-08-03 17:42:28 +08:00
  • 27f4317ec6 fix qwen inference hiyouga 2023-08-03 16:31:55 +08:00
  • e434348216 fix qwen inference hiyouga 2023-08-03 16:15:38 +08:00
  • 2e19afedb8 support Qwen-7B, fix InternLM-7B inference hiyouga 2023-08-03 15:53:32 +08:00
  • da08fa7c63 update web demo hiyouga 2023-08-03 13:28:28 +08:00
  • 9c96b97dc7 fix webui hiyouga 2023-08-03 12:43:12 +08:00
  • 28a51b622b modify code structure hiyouga 2023-08-02 23:17:36 +08:00
  • 8bd1da7144 fix PPO trainer hiyouga 2023-08-02 19:10:23 +08:00
  • e4d0b8ee6e update ppo trainer hiyouga 2023-08-02 18:46:41 +08:00
  • 1dfb28b362 fix memory leak of PPO trainer hiyouga 2023-08-02 17:41:34 +08:00
  • ba618947e7 release v0.1.5 v0.1.5 hiyouga 2023-08-02 16:10:31 +08:00
  • f81041b502 Merge pull request #307 from GitYCC/feature/fix-llama2-prompt-template hoshi-hiyouga 2023-08-02 15:51:28 +08:00
  • f2533a2800 [fix] Remove useless code YC Chen 2023-08-02 14:35:35 +08:00
  • bb5b4a7f26 [feature] Fix template of Llama2 to match the offical template YC Chen 2023-08-02 14:05:43 +08:00
  • 20bff87021 fix bug in preprocessing hiyouga 2023-08-02 01:10:28 +08:00
  • 722b954800 update readme hiyouga 2023-08-01 18:48:27 +08:00
  • 19256086c7 fix #296 hiyouga 2023-08-01 18:43:53 +08:00
  • 250fecfcd4 Fix #294 hiyouga 2023-08-01 18:13:03 +08:00
  • cb4d1d5ebb restore from git lfs hiyouga 2023-08-01 16:33:25 +08:00
  • d7d557fb2e Update .gitattributes hiyouga 2023-08-01 16:28:54 +08:00
  • 0b8e19b6a6 fix webui v0.1.4 hiyouga 2023-08-01 12:11:37 +08:00
  • 8e26eb374e fix RM save model hiyouga 2023-08-01 11:56:17 +08:00
  • 9bba01a033 use git lfs hiyouga 2023-08-01 10:14:08 +08:00
  • 661890b8a1 release v0.1.4 hiyouga 2023-08-01 10:08:47 +08:00
  • 772ad4ec6b fix inference hiyouga 2023-08-01 00:06:48 +08:00
  • 6f65f8cb3b fix arg check hiyouga 2023-07-31 23:48:57 +08:00
  • 43e83548b9 update readme hiyouga 2023-07-31 23:42:32 +08:00
  • dd3f3e9749 support streaming data, fix #284 #274 #268 hiyouga 2023-07-31 23:33:00 +08:00
  • 124f61b404 Update data_args.py hiyouga 2023-07-28 17:42:41 +08:00
  • e8748cc6f3 update readme hiyouga 2023-07-28 17:36:00 +08:00
  • fafec8b7a5 fix #268 hiyouga 2023-07-28 17:02:26 +08:00
  • 030daca686 update dataset hiyouga 2023-07-26 17:05:12 +08:00
  • ac587438f8 fix #242 hiyouga 2023-07-25 17:04:02 +08:00
  • c145bbef3c update dataset hiyouga 2023-07-23 20:01:43 +08:00
  • 745c46ee04 Update README_zh.md hiyouga 2023-07-22 14:31:16 +08:00
  • a707f5b502 update readme, fix web ui postprocess hiyouga 2023-07-22 14:29:22 +08:00
  • dc2e801077 Merge pull request #221 from mrhan1993/main hoshi-hiyouga 2023-07-22 13:04:25 +08:00
  • b56d5108b2 Merge branch 'hiyouga:main' into main NULL 2023-07-21 17:00:26 +08:00
  • 8e6b7034fe 根据GLM Efficient Tuning添加中文README,web添加了server_port mrhan1993 2023-07-21 16:57:58 +08:00
  • dad7ca6633 release v0.1.3 v0.1.3 hiyouga 2023-07-21 16:48:34 +08:00
  • a1468139a5 fix save function hiyouga 2023-07-21 14:09:07 +08:00
  • 49c90044ce Update runner.py hiyouga 2023-07-21 13:35:19 +08:00
  • 0f7cdac207 update web UI, support rm predict #210 hiyouga 2023-07-21 13:27:27 +08:00
  • c4e9694c6e release v0.1.2 v0.1.2 hiyouga 2023-07-20 22:33:59 +08:00
  • 2006a96570 fix api hiyouga 2023-07-20 22:14:54 +08:00
  • 5dcd95645f Merge pull request #213 from Ehco1996/patch-1 hoshi-hiyouga 2023-07-20 22:12:07 +08:00
  • 9b3304b054 update UI, fix #212 hiyouga 2023-07-20 22:09:06 +08:00
  • e580d4ef41 feat: support pass args before init web app Ehco 2023-07-20 21:49:26 +08:00
  • 64db4abc68 Update README.md hiyouga 2023-07-20 17:23:16 +08:00
  • 5ba0b80e5c simplify code hiyouga 2023-07-20 15:08:57 +08:00
  • 7a43ff3d89 tiny fix hiyouga 2023-07-19 22:53:46 +08:00
  • 7e1a1d141a fix #199 hiyouga 2023-07-19 22:51:29 +08:00
  • 6d881f161b add datasets hiyouga 2023-07-19 20:59:15 +08:00
  • a02b3e6192 fix #196 hiyouga 2023-07-19 17:35:38 +08:00
  • bcdee9fc19 fix #194 hiyouga 2023-07-19 17:07:33 +08:00
  • 8b688251be support LLaMA-2 hiyouga 2023-07-19 16:42:14 +08:00