Commit Graph

  • c0c387e4db release v0.8.0 v0.8.0 hiyouga 2024-06-08 05:20:54 +08:00
  • ae60ea15da add ultrafeedback and fineweb #4085 #4132 hiyouga 2024-06-08 02:42:34 +08:00
  • 72cd1123a8 fix ci hiyouga 2024-06-08 02:00:44 +08:00
  • 1364190a66 fix ci hiyouga 2024-06-08 01:57:36 +08:00
  • 6d17c59090 add ci hiyouga 2024-06-08 01:48:30 +08:00
  • e0f2c0b5dc init unittest hiyouga 2024-06-08 01:35:58 +08:00
  • 073e34855d Delete .readthedocs.yaml hiyouga 2024-06-08 00:58:10 +08:00
  • ff9ba70bb8 reorganize adapter code hiyouga 2024-06-08 00:47:23 +08:00
  • adbebb0e3f fix #4139 hoshi-hiyouga 2024-06-08 00:45:02 +08:00
  • 3f6b3eed98 add resume args in webui hiyouga 2024-06-08 00:22:16 +08:00
  • f45e81e186 fix #4137 hiyouga 2024-06-07 19:16:06 +08:00
  • ba648fd003 tiny fix hiyouga 2024-06-07 05:19:21 +08:00
  • b0e5a76f4c fix ppo trainer save zero3 model hiyouga 2024-06-07 05:14:19 +08:00
  • 8692796c9b fix ppo in trl 0.8.6 hiyouga 2024-06-07 04:48:29 +08:00
  • d0edcde4ea fix #4120 hiyouga 2024-06-07 04:18:05 +08:00
  • 8c4c2e580c update data processors hiyouga 2024-06-07 04:15:40 +08:00
  • 07f33e7641 Merge pull request #4009 from AlongWY/main hoshi-hiyouga 2024-06-07 03:48:46 +08:00
  • 1998c641af Update supervised.py hoshi-hiyouga 2024-06-07 03:42:08 +08:00
  • be1e5f9d62 Update supervised.py hoshi-hiyouga 2024-06-07 03:38:23 +08:00
  • fdeec6db52 Update supervised.py hoshi-hiyouga 2024-06-07 03:38:04 +08:00
  • a4d335b42f add qwen2 models hiyouga 2024-06-07 00:22:57 +08:00
  • fcb134e144 rename files hiyouga 2024-06-07 00:09:06 +08:00
  • a47e24222a add DISABLE_TORCHRUN option hiyouga 2024-06-06 23:44:58 +08:00
  • b96b995620 Merge pull request #4082 from MengqingCao/bugfix hoshi-hiyouga 2024-06-06 23:38:40 +08:00
  • c231706aa5 Update cli.py hoshi-hiyouga 2024-06-06 23:38:09 +08:00
  • 35b5117a59 fix ppo+zero3 #3108 hiyouga 2024-06-06 23:30:07 +08:00
  • 80f716bc10 fix torch gc hiyouga 2024-06-06 20:30:25 +08:00
  • ca95e98ca0 fix ppo dataset bug #4012 hiyouga 2024-06-06 19:03:20 +08:00
  • d5559461c1 update trainers hiyouga 2024-06-06 18:45:49 +08:00
  • f4acd81e2f fix base64 image read #4061 hiyouga 2024-06-06 17:29:19 +08:00
  • 31feb6e26c update readme hiyouga 2024-06-06 16:59:18 +08:00
  • 7d5c0a069c update readme hiyouga 2024-06-06 16:25:42 +08:00
  • 937f49ec3d lora modules: all by default hiyouga 2024-06-06 03:53:28 +08:00
  • abc2a73a33 add codestral 22B hiyouga 2024-06-06 03:42:50 +08:00
  • 5e1bf7572c lint hiyouga 2024-06-06 03:33:44 +08:00
  • 8fdb32d0a3 Merge pull request #4066 from injet-zhou/main hoshi-hiyouga 2024-06-06 03:32:04 +08:00
  • c709d5f7db Merge pull request #4080 from MengqingCao/npu hoshi-hiyouga 2024-06-06 03:15:44 +08:00
  • f5b2749ec2 Update export.py hoshi-hiyouga 2024-06-06 03:14:46 +08:00
  • ee5853c565 Update model_args.py hoshi-hiyouga 2024-06-06 03:14:23 +08:00
  • 6ec6df8a5f Merge pull request #4053 from hzhaoy/feature/add_select_config_file hoshi-hiyouga 2024-06-06 03:06:03 +08:00
  • fc95800840 add vllm_dtype arg #3387 #3717 hiyouga 2024-06-06 02:53:27 +08:00
  • 765715af21 support train from scratch #4033 #4075 hiyouga 2024-06-06 02:43:19 +08:00
  • 639a7f6796 support image input in api #3971 #4061 hiyouga 2024-06-06 02:29:55 +08:00
  • 35379c7c0e update train hparams hiyouga 2024-06-06 01:49:20 +08:00
  • d992f5353f fix setup hiyouga 2024-06-06 01:39:02 +08:00
  • 875eef45f3 add llamafactory-cli env hiyouga 2024-06-06 01:28:14 +08:00
  • 556a4aa972 fix #4090 hiyouga 2024-06-06 00:50:32 +08:00
  • 8dc1969111 modify export_device option MengqingCao 2024-06-05 09:37:36 +00:00
  • b74c229498 fix #4079 hiyouga 2024-06-05 16:56:54 +08:00
  • 3dbca466fd update readme hiyouga 2024-06-05 16:32:32 +08:00
  • ce6f7fdb82 fix #4077 MengqingCao 2024-06-05 08:03:30 +00:00
  • 7528bc1bc0 support glm-4 hiyouga 2024-06-05 15:16:38 +08:00
  • 9dd5f7d642 add npu for model export MengqingCao 2024-06-05 07:06:40 +00:00
  • 99ecb0daaf add throughput entry to log faddddeout 2024-06-04 11:04:29 +00:00
  • 39d8d7995a add: support selecting saved configuration files and loading training parameters hzhaoy 2024-06-04 10:33:43 +08:00
  • 2ac2cde03e tiny fix hiyouga 2024-06-04 00:31:10 +08:00
  • aa6c3766de fix #3873 hiyouga 2024-06-04 00:21:50 +08:00
  • f4f5d7e3ce fix #3992 hiyouga 2024-06-04 00:17:36 +08:00
  • efbf6018d3 fix abort in webui DDP mode hiyouga 2024-06-04 00:10:24 +08:00
  • 1090bb8bf3 Merge pull request #3987 from injet-zhou/main hoshi-hiyouga 2024-06-04 00:04:07 +08:00
  • 26bc79f971 fix #4043 hiyouga 2024-06-03 23:30:37 +08:00
  • 4c1f015eca remove gc warnings in DPO&KTO hiyouga 2024-06-03 22:53:54 +08:00
  • 0655a183d3 Merge pull request #4045 from enji-zhou/feature/add_kto hoshi-hiyouga 2024-06-03 22:09:25 +08:00
  • 7754024e9b Update trainer.py hoshi-hiyouga 2024-06-03 22:08:38 +08:00
  • b4913569a8 fix KTO Trainer Sampler enji.zhou 2024-06-03 21:32:38 +08:00
  • eae9f09ca8 Merge pull request #4006 from Uminosachi/scheduler-kwargs hoshi-hiyouga 2024-06-03 19:27:53 +08:00
  • 8264e5ceaa update placeholder in issue template hiyouga 2024-06-03 19:24:10 +08:00
  • b76f319e45 Merge pull request #4011 from statelesshz/issue-template hoshi-hiyouga 2024-06-03 19:20:43 +08:00
  • 82d744716a fix #4005 #4013 hiyouga 2024-06-03 19:12:29 +08:00
  • 1a3764ab8f Merge pull request #4007 from xu-song/patch-3 hoshi-hiyouga 2024-06-03 18:54:37 +08:00
  • d2ede9d393 fix #4022 hiyouga 2024-06-03 18:38:36 +08:00
  • 5690f513fc bump versions hiyouga 2024-06-03 18:29:38 +08:00
  • 123a845209 fix data loader hint hiyouga 2024-06-03 18:28:27 +08:00
  • b1b7d735b3 remove empty line ylfeng 2024-05-31 21:43:08 +08:00
  • 230c69f7ce fix eos ylfeng 2024-05-31 21:40:41 +08:00
  • bfc43558ef supervised packing with greedy knapsack algorithm ylfeng 2024-05-31 15:33:54 +08:00
  • f2ae2cc04d Update model_args.py Xu Song 2024-05-31 14:35:48 +08:00
  • 6e9c03f958 Update bug-report.yml statelesshz 2024-05-31 13:18:18 +08:00
  • 2696f614a7 Set scheduler_specific_kwargs to get_scheduler Uminosachi 2024-05-31 13:45:39 +09:00
  • 070b944895 update readme hiyouga 2024-05-30 16:40:17 +08:00
  • f5f091d390 fix cann't interrupt training when using multi GPUs in webui faddddeout 2024-05-30 08:39:21 +00:00
  • 14ab14a0e6 fix #3837 hiyouga 2024-05-30 00:52:26 +08:00
  • 4f7c850115 Merge pull request #3829 from seanzhang-zhichen/add_dataset_sample_num hoshi-hiyouga 2024-05-30 00:25:45 +08:00
  • 391eca66cf Update loader.py hoshi-hiyouga 2024-05-30 00:20:20 +08:00
  • a67199246d Update loader.py hoshi-hiyouga 2024-05-30 00:17:21 +08:00
  • 5f67fdaac9 Update loader.py hoshi-hiyouga 2024-05-30 00:12:12 +08:00
  • 05e6fe4287 Update parser.py hoshi-hiyouga 2024-05-30 00:05:20 +08:00
  • 91cc571e6e Update README_zh.md hoshi-hiyouga 2024-05-30 00:04:47 +08:00
  • 890926e60c Update README.md hoshi-hiyouga 2024-05-30 00:04:26 +08:00
  • 87aa332583 better llamaboard hiyouga 2024-05-29 23:55:38 +08:00
  • f90c4ca672 fix cohere system hiyouga 2024-05-29 20:58:23 +08:00
  • a922e85a5c fix #3965 hiyouga 2024-05-29 20:55:51 +08:00
  • 9a65820592 update readme hiyouga 2024-05-29 18:39:11 +08:00
  • f4e16ae373 Merge pull request #3930 from MengqingCao/npu hoshi-hiyouga 2024-05-29 18:33:38 +08:00
  • e2cfd34da0 update torch-npu version MengqingCao 2024-05-29 10:05:11 +00:00
  • 668dea9706 update cann kernels url MengqingCao 2024-05-29 09:53:31 +00:00
  • 084be442f2 Merge pull request #3958 from hzhaoy/add_telechat_12b_support hoshi-hiyouga 2024-05-29 17:20:53 +08:00
  • 29cb4a1327 add TeleChat-12B/TeleChat-12B-v2 models hzhaoy 2024-05-29 15:00:37 +08:00
  • 81a61134b8 fix hf chat engine hiyouga 2024-05-29 01:20:07 +08:00
  • cb1a49aa02 add ds config to webui hiyouga 2024-05-29 01:13:17 +08:00