Commit Graph

  • 025670b4f6 tiny fix hiyouga 2023-06-07 12:58:14 +08:00
  • d6b32dd9ea add templates hiyouga 2023-06-07 12:40:44 +08:00
  • f57dae4a1a add belle template hiyouga 2023-06-07 12:30:11 +08:00
  • 5e2ec2d104 tiny fix hiyouga 2023-06-07 12:08:39 +08:00
  • b9feb82e4e add prompt template class hiyouga 2023-06-07 11:55:25 +08:00
  • 3da427a665 fix inference, add prompt template hiyouga 2023-06-07 10:52:35 +08:00
  • 12094c1db5 recover logging hiyouga 2023-06-06 21:36:37 +08:00
  • bf5ad34196 support distributed quantized training hiyouga 2023-06-06 17:39:41 +08:00
  • ac6f50dedf add API demo from #1 hiyouga 2023-06-05 21:32:18 +08:00
  • 8fd9ef924d Merge pull request #11 from hiyouga/api hoshi-hiyouga 2023-06-05 20:58:02 +08:00
  • a409e1f42c fix bug in web demo hiyouga 2023-06-05 17:58:29 +08:00
  • 3f5869111b increase max length in cli demo hiyouga 2023-06-05 16:49:14 +08:00
  • f9c51a8340 implement stream generating hiyouga 2023-06-05 16:43:44 +08:00
  • a817801c0f tiny fix hiyouga 2023-06-05 15:25:22 +08:00
  • 063a83ab4e tiny fix hiyouga 2023-06-04 16:35:50 +08:00
  • eebe71699b tiny fix hiyouga 2023-06-04 12:55:40 +08:00
  • 5f44112cf5 support QLoRA hiyouga 2023-06-04 00:08:56 +08:00
  • 2308d5a179 fix int8 inference hiyouga 2023-06-03 23:22:05 +08:00
  • 7d6542115c reduce repetition penalty hiyouga 2023-06-03 21:57:39 +08:00
  • c68f9ec3a9 fix int8 inference hiyouga 2023-06-03 21:17:47 +08:00
  • fa850ae6e5 add ziya prompt template hiyouga 2023-06-03 19:05:51 +08:00
  • 5eef8d5d98 use low_cpu_mem_usage to speed up loading hiyouga 2023-06-03 18:19:01 +08:00
  • 9b8b6623ac add logits processor hiyouga 2023-06-03 16:34:54 +08:00
  • ec48d06b9e remove unused code hiyouga 2023-06-03 00:10:54 +08:00
  • 217b89cf7e add wechat hiyouga 2023-06-02 21:47:10 +08:00
  • 382afc3822 tiny fix hiyouga 2023-06-02 19:02:25 +08:00
  • 09997a25d3 fix layer norm name in PPO hiyouga 2023-06-02 17:30:01 +08:00
  • 58c8b29913 fix #1 hiyouga 2023-06-02 14:25:00 +08:00
  • e9ab06678f alter rewards data type hiyouga 2023-06-02 14:19:51 +08:00
  • 896dbfec16 fix possibly OOM error hiyouga 2023-06-01 23:54:44 +08:00
  • 1512711ca2 fix bug at inference hiyouga 2023-05-31 18:11:53 +08:00
  • a79df3500b update readme hiyouga 2023-05-31 16:57:43 +08:00
  • 693c049eac support BLOOM models hiyouga 2023-05-31 16:54:06 +08:00
  • 7492e8f208 Merge pull request #1 from mMrBun/main hoshi-hiyouga 2023-05-30 16:34:00 +08:00
  • 181c776b58 remove dummy code hiyouga 2023-05-30 16:28:00 +08:00
  • ef0aceaa50 Support conversation via API. mMrBun 2023-05-30 15:00:28 +08:00
  • a18c6c0560 Support conversation via API. mMrBun 2023-05-30 14:46:22 +08:00
  • b6ed5176e1 update readme hiyouga 2023-05-29 21:54:01 +08:00
  • bda71e579b update readme hiyouga 2023-05-29 21:53:02 +08:00
  • 33fee45217 add pre-training script hiyouga 2023-05-29 21:37:22 +08:00
  • 304be6dc28 fix checkpoint loading hiyouga 2023-05-29 17:43:16 +08:00
  • 35d04a2c05 tiny fix hiyouga 2023-05-29 09:42:29 +08:00
  • 83fc73c580 tiny fix hiyouga 2023-05-28 21:48:33 +08:00
  • 1fc551e1be use fp16 model, add logcallback hiyouga 2023-05-28 21:30:28 +08:00
  • 17024ebc1a Initial commit hiyouga 2023-05-28 18:09:04 +08:00