Commit Graph

  • 718f3382ad add LLaMA2 template hiyouga 2023-07-19 00:44:49 +08:00
  • dc8283d3d7 fix API hiyouga 2023-07-19 00:01:14 +08:00
  • 35e76879f5 support dev set in web ui v0.1.1 hiyouga 2023-07-18 20:40:49 +08:00
  • 8e4ae0aaac add web demo hiyouga 2023-07-18 17:21:16 +08:00
  • 5ed2a97056 update baichuan template hiyouga 2023-07-18 16:43:51 +08:00
  • 03eba6f041 fix template hiyouga 2023-07-18 16:37:23 +08:00
  • ec166e736a fix #176 hiyouga 2023-07-18 16:36:24 +08:00
  • c85a6b83b3 fix webUI, fix #171 #177 hiyouga 2023-07-18 15:51:48 +08:00
  • a864a7b395 update webUI, fix #179 hiyouga 2023-07-18 15:35:17 +08:00
  • fd8c2d4aac tiny fix hiyouga 2023-07-18 00:52:31 +08:00
  • baf2e4e825 a monkey patch for lora_target v0.1.0 hiyouga 2023-07-18 00:31:40 +08:00
  • eac7f97337 release v0.1.0 hiyouga 2023-07-18 00:18:25 +08:00
  • c08ff734a7 fix #175 hiyouga 2023-07-17 18:07:17 +08:00
  • e9736b2ba0 fix saving custom code hiyouga 2023-07-16 18:04:41 +08:00
  • c61de6f669 add custom baichuan-13B code supports left-padding v0.0.9 hiyouga 2023-07-15 22:37:17 +08:00
  • f8831cb1ea fix callback hiyouga 2023-07-15 22:01:43 +08:00
  • 6a0499ef40 update stream_chat hiyouga 2023-07-15 19:51:02 +08:00
  • a8deee27f8 create chat model hiyouga 2023-07-15 19:26:20 +08:00
  • e9fe48150c Update callbacks.py hiyouga 2023-07-15 17:39:16 +08:00
  • 51368f3453 Update README.md hiyouga 2023-07-15 17:20:39 +08:00
  • a31a609377 fix callback hiyouga 2023-07-15 17:18:16 +08:00
  • 6261fb362a modity code structure hiyouga 2023-07-15 16:54:28 +08:00
  • fa06b168ab fix eval and pred loss hiyouga 2023-07-14 13:11:57 +08:00
  • 961e6a9ba4 fix pretrain hiyouga 2023-07-13 23:41:54 +08:00
  • 316a02696f fix Baichuan-13B hiyouga 2023-07-13 23:08:45 +08:00
  • d57e0a7006 Merge pull request #156 from ZhengJun-AI/main hoshi-hiyouga 2023-07-12 20:11:19 +08:00
  • 994b21b092 Support for WebNovel dataset zxbsmk 2023-07-12 17:29:47 +08:00
  • 6ef45d311a Merge pull request #145 from elicassion/patch-1 hoshi-hiyouga 2023-07-12 13:50:39 +08:00
  • 30b2092294 Fix typo in common.py Jinghuan Shang 2023-07-11 18:03:53 -04:00
  • 8de7a01887 fix sft encode hiyouga 2023-07-11 19:50:33 +08:00
  • cc290a41e6 add baichuan template hiyouga 2023-07-11 18:57:50 +08:00
  • 1aa0997391 support Baichuan-13B hiyouga 2023-07-11 16:16:14 +08:00
  • 61988225a8 Update README.md hiyouga 2023-07-10 23:09:11 +08:00
  • 62e775cb75 Update README.md hiyouga 2023-07-09 14:57:13 +08:00
  • bc436066c8 update api to match langchain hiyouga 2023-07-07 20:35:39 +08:00
  • 9da7840005 Update README.md hiyouga 2023-07-07 12:06:28 +08:00
  • 113cdaf1cb support InternLM hiyouga 2023-07-07 11:02:28 +08:00
  • 601b1747d1 fix rouge score hiyouga 2023-07-06 14:28:34 +08:00
  • e3b779fcb2 update readme hiyouga 2023-07-05 23:03:58 +08:00
  • 982e76978b fix streaming response in API hiyouga 2023-07-05 22:42:31 +08:00
  • d659907f34 fix freeze tuning hiyouga 2023-07-05 21:18:28 +08:00
  • df71d98b37 fix bug in PPO stage hiyouga 2023-07-05 19:14:10 +08:00
  • 4de9ef568a fix compute dtype hiyouga 2023-07-05 15:13:00 +08:00
  • f1de82f08e support falcon model #72 hiyouga 2023-07-05 15:00:06 +08:00
  • 4b093996a7 fix bleu score hiyouga 2023-07-05 00:11:21 +08:00
  • e4e36a2d74 set use_cache before saving model hiyouga 2023-07-04 23:18:20 +08:00
  • 6df5c4ccef fix seq2seq predictions hiyouga 2023-07-04 22:56:51 +08:00
  • ecae079e56 Merge pull request #119 from codemayq/main hoshi-hiyouga 2023-07-03 19:51:46 +08:00
  • 77a2b60bc6 add the pre-built version of bitsandbytes library for windows user codemayq 2023-07-03 13:58:10 +08:00
  • 6a83f4f793 Update auto_gptq.py hiyouga 2023-07-02 20:56:11 +08:00
  • 2537481c34 add autogptq hiyouga 2023-07-02 20:36:37 +08:00
  • d720f67e6c fix typo hiyouga 2023-06-30 10:09:59 +08:00
  • 40ab36456c Update README.md hiyouga 2023-06-29 19:36:22 +08:00
  • 52652257c9 rename evaluate.py hiyouga 2023-06-29 15:40:39 +08:00
  • 0dd38a41b6 Update evaluate.py hiyouga 2023-06-29 15:40:03 +08:00
  • f39c71b02d Update README.md hiyouga 2023-06-29 15:37:19 +08:00
  • 90fa2dd935 add open assistant dataset hiyouga 2023-06-28 23:09:33 +08:00
  • 6290955e84 update loading logic hiyouga 2023-06-28 12:07:16 +08:00
  • 6b6430489a fix loading best model hiyouga 2023-06-28 01:55:12 +08:00
  • 4ae8a20e1d fix RM accuracy hiyouga 2023-06-28 01:40:13 +08:00
  • eca15bf252 add star history hiyouga 2023-06-27 23:56:29 +08:00
  • e19dcc13e3 tiny fix hiyouga 2023-06-27 23:54:24 +08:00
  • 2d22961c7d fix initializing data arguments hiyouga 2023-06-27 22:50:23 +08:00
  • 640f774d30 support save full model, replace BOS token hiyouga 2023-06-27 21:40:11 +08:00
  • 33c2b063c6 fix decoding in seq2seq hiyouga 2023-06-27 19:33:08 +08:00
  • a7e53dcfef Update evaluate.py hiyouga 2023-06-26 23:41:33 +08:00
  • fe7ca5cb63 Create evaluate.py hiyouga 2023-06-26 23:30:18 +08:00
  • 0ff82b1304 Merge pull request #86 from Jingsong-Yan/main hoshi-hiyouga 2023-06-26 20:14:40 +08:00
  • d2de3f9e41 Update README.md with baichuan-7b-rtx3090 Jingsong-Yan 2023-06-26 19:45:41 +08:00
  • d5260ea860 Merge branch 'main' of https://github.com/hiyouga/LLaMA-Efficient-Tuning hiyouga 2023-06-26 18:07:09 +08:00
  • a8f580d753 fix generation in seq2seq.py hiyouga 2023-06-26 18:07:06 +08:00
  • 20c1b25ad9 Merge pull request #84 from wu-yy/patch-1 hoshi-hiyouga 2023-06-26 15:39:08 +08:00
  • 88840c4f2b Update requirements.txt 蓝鲸123 2023-06-26 15:36:19 +08:00
  • 3aa1ca66e0 support prefixes, loading multiple local files hiyouga 2023-06-26 15:32:40 +08:00
  • 83346e86af update api hiyouga 2023-06-26 13:39:57 +08:00
  • f9332bc329 update readme hiyouga 2023-06-23 00:17:05 +08:00
  • 7daf6c8b8e update API hiyouga 2023-06-22 20:46:24 +08:00
  • 391bf1c699 match api with OpenAI format hiyouga 2023-06-22 20:27:00 +08:00
  • 84b66010a3 Merge pull request #68 from mMrBun/main hoshi-hiyouga 2023-06-22 15:52:34 +08:00
  • 810d9e36ea Compatible with OpenAI API. Bun 2023-06-21 14:45:04 +08:00
  • de2c418637 add default template hiyouga 2023-06-16 21:12:17 +08:00
  • 7dc1f06a97 add belle multiturn dataset hiyouga 2023-06-16 20:01:16 +08:00
  • ee22b80ad0 fix freeze layers hiyouga 2023-06-16 17:38:21 +08:00
  • de9da40b18 add source prefix hiyouga 2023-06-16 16:32:17 +08:00
  • 3836aadacf support loading lora from hub hiyouga 2023-06-16 00:02:17 +08:00
  • 194c5d2bee support baichuan model hiyouga 2023-06-15 16:02:01 +08:00
  • 496846e819 fix bug in template vanilla hiyouga 2023-06-15 14:36:55 +08:00
  • c42562d7ae add BOS token in pre-training hiyouga 2023-06-15 01:46:17 +08:00
  • aa1bb8a9a2 support multiturn training like FastChat hiyouga 2023-06-14 22:27:39 +08:00
  • 6f655e3916 fix loading valuehead hiyouga 2023-06-13 11:13:06 +08:00
  • 6828f07d54 fix generating args hiyouga 2023-06-13 01:33:56 +08:00
  • 4724ae3492 support RM metrics, add generating Args hiyouga 2023-06-12 15:48:48 +08:00
  • 4c5cad9722 Merge pull request #26 from BUAADreamer/main hoshi-hiyouga 2023-06-11 19:06:29 +08:00
  • 4adbb95b03 add some BUAADreamer 2023-06-11 18:55:53 +08:00
  • 5b93ca6c39 add code for reading from multi files in one directory BUAADreamer 2023-06-10 16:27:30 +08:00
  • ef6c5ae18a add code for reading from multi files in one directory BUAADreamer 2023-06-10 15:53:47 +08:00
  • 03c92c79ff tiny fix hiyouga 2023-06-07 16:42:31 +08:00
  • fc6091e118 tiny fix hiyouga 2023-06-07 16:02:07 +08:00
  • 025670b4f6 tiny fix hiyouga 2023-06-07 12:58:14 +08:00
  • d6b32dd9ea add templates hiyouga 2023-06-07 12:40:44 +08:00