Commit Graph

  • 5bb447b118 [misc] update workflows (#6787) hoshi-hiyouga 2025-02-01 04:54:42 +08:00
  • a28261a866 [model] add mistral small models (#6786) hoshi-hiyouga 2025-02-01 04:31:38 +08:00
  • 800de98dc8 [model] add qwen2.5 vl models (#6779) hoshi-hiyouga 2025-01-31 03:00:29 +08:00
  • 222423bcef [breaking] support transformers 4.48 (#6628) hoshi-hiyouga 2025-01-31 01:36:33 +08:00
  • e71737351f [webui] improve webui & reasoning mode (#6778) hoshi-hiyouga 2025-01-31 00:09:21 +08:00
  • 4f298894da [model] add deepseek-R1 & show think process (#6767) qvlehao 2025-01-29 12:16:26 +08:00
  • a8fae3869d fix: avoid redundant normalization in DPO's SFT loss calculation (#6722) yinpu 2025-01-21 13:38:02 +08:00
  • db9b977e4f [webui] support ja (#6698) engchina 2025-01-20 19:46:38 +08:00
  • 87d685b59f [model] support yarn (#6693) hoshi-hiyouga 2025-01-18 13:56:09 +08:00
  • e4046bdd1f [assets] update wechat (#6692) hoshi-hiyouga 2025-01-18 12:35:03 +08:00
  • 5baa3add8c [misc] update mm plugin (#6691) hoshi-hiyouga 2025-01-17 23:04:26 +08:00
  • 332f637592 disable valset by default (#6690) hoshi-hiyouga 2025-01-17 21:09:30 +08:00
  • 31daa6570b [webui] upgrade to gradio 5 (#6688) hoshi-hiyouga 2025-01-17 20:15:42 +08:00
  • 33525a34b6 fix qwen2 moe (#6684) hoshi-hiyouga 2025-01-17 13:46:09 +08:00
  • 3607caa2ad [data] Fix minicpmv/o dpo training (#6657) Zhangchi Feng 2025-01-15 17:30:37 +08:00
  • 0fc2e19279 Update val_size english description (#6653) steveepreston 2025-01-15 11:30:20 +03:30
  • ef994600db update readme (#6648) hoshi-hiyouga 2025-01-15 11:06:19 +08:00
  • 7638f1070e [optim] clean apollo (#6645) hoshi-hiyouga 2025-01-15 01:42:50 +08:00
  • c2120432db [optim] add support to APOLLO (#6617) zhuHQ 2025-01-14 10:24:56 -06:00
  • 66184762e8 update readme of MiniCPM-o (#6642) Zhangchi Feng 2025-01-14 21:22:35 +08:00
  • 41a9e231cb lint (#6641) hoshi-hiyouga 2025-01-14 18:40:07 +08:00
  • 1bb06e06df Support InternLM3 Dense 8B Model (#6640) Haian Huang(深度眸) 2025-01-14 18:07:27 +08:00
  • 381f7120e6 Fix tokenizer max length (#6632) Xiaosu Zhu 2025-01-14 17:35:54 +08:00
  • f7857c83e1 Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 (#6631) Zhangchi Feng 2025-01-14 17:34:58 +08:00
  • d0da6f40b0 [model] fix mllama any image (#6637) hoshi-hiyouga 2025-01-14 16:47:58 +08:00
  • 28d145a066 pin vllm version to 0.6.5 (#6629) hoshi-hiyouga 2025-01-14 02:44:02 +08:00
  • ae32c148d1 Support new features of MiniCPM-V (#6626) Zhangchi Feng 2025-01-14 00:26:19 +08:00
  • 2a05941b14 [inference] fix stop token for object detection (#6624) hoshi-hiyouga 2025-01-13 21:34:20 +08:00
  • 11c38b9173 add nf4 qlora support on Ascend NPU (#6601) codingma 2025-01-13 19:43:36 +08:00
  • 73c1c15b62 Fix template name of MiniCPM-V (#6620) Zhangchi Feng 2025-01-13 16:46:48 +08:00
  • 7f58bf984f Merge pull request #6598 from BUAADreamer/minicpmv hoshi-hiyouga 2025-01-13 15:24:02 +08:00
  • ec552372ba remove tests fzc8578 2025-01-13 15:08:35 +08:00
  • 17d32fb5c7 fix tests fzc8578 2025-01-13 15:01:39 +08:00
  • 4b61610b12 fix style fzc8578 2025-01-13 14:19:38 +08:00
  • 07798e4aad fix system prompt and tests fzc8578 2025-01-13 14:18:06 +08:00
  • 6d6acd0213 add some fzc8578 2025-01-11 15:03:20 +08:00
  • a789e0f263 add cpm_o test fzc8578 2025-01-11 11:55:30 +08:00
  • f9ee00b6b6 add cpm_o test fzc8578 2025-01-11 11:49:03 +08:00
  • 31bfdb08cd fix format fzc8578 2025-01-11 01:27:40 +08:00
  • 12c83e00fc add some fzc8578 2025-01-11 01:10:24 +08:00
  • 9dc7b6c7ac adapt to new mllm_param fzc8578 2025-01-11 00:16:34 +08:00
  • 627548bf7f Merge branch 'main' into minicpmv Zhangchi Feng 2025-01-11 00:01:36 +08:00
  • dc65ecdf09 refactor mllm param logic hiyouga 2025-01-10 15:41:54 +00:00
  • e577990eb2 add minicpmv2.6 fzc8578 2025-01-10 23:45:44 +08:00
  • 1f3b729a4b add some fzc8578 2025-01-10 23:29:06 +08:00
  • 0aa7ac210f add some fzc8578 2025-01-10 21:25:32 +08:00
  • 40382f1387 fix some fzc8578 2025-01-10 20:55:52 +08:00
  • 75b3819e43 fix version fzc8578 2025-01-10 20:31:04 +08:00
  • e63c2df0b1 fix some fzc8578 2025-01-10 20:27:06 +08:00
  • 25d4889789 tiny fix fzc8578 2025-01-10 20:15:39 +08:00
  • 8c0a721c4c Merge branch 'main' into minicpmv Zhangchi Feng 2025-01-10 20:12:07 +08:00
  • 9e972bc9ec add some fzc8578 2025-01-10 20:01:22 +08:00
  • 1675712a4c Merge pull request #6588 from hiyouga/hiyouga/upd_issue_temp hoshi-hiyouga 2025-01-10 03:03:48 +08:00
  • e0c9012f7f update issue template hiyouga 2025-01-09 18:56:49 +00:00
  • a25024bd0c Merge pull request #6585 from hiyouga/hiyouga/add_phi4 hoshi-hiyouga 2025-01-10 02:39:17 +08:00
  • 867980196e improve template, add phi4 model hiyouga 2025-01-09 18:27:20 +00:00
  • 4e25d037c8 Merge pull request #6564 from stephen-nju/fix_ray hoshi-hiyouga 2025-01-08 18:14:18 +08:00
  • 6ba6926221 Merge pull request #6565 from hiyouga/hiyouga/improve_log hoshi-hiyouga 2025-01-08 18:08:21 +08:00
  • b6b53b61f7 fix –get ray args when args not a dict zhubin 2025-01-08 17:18:41 +08:00
  • 647c51a772 imporve log hiyouga 2025-01-08 09:56:10 +00:00
  • 3b843ac9d4 Merge pull request #6542 from erictang000/et/ray-integration hoshi-hiyouga 2025-01-08 11:46:03 +08:00
  • 0ef1f981da fix llamaboard with ray hiyouga 2025-01-07 09:59:24 +00:00
  • 944a2aec4d refactor ray integration, support save ckpt hiyouga 2025-01-07 08:54:41 +00:00
  • 4f31ad997c run style check Eric Tang 2025-01-06 23:55:56 +00:00
  • 8683582300 drafting ray integration Kourosh Hakhamaneshi 2024-12-30 16:48:52 -08:00
  • 5ccc607222 Merge pull request #6547 from hiyouga/hiyouga/fix_pixtral_dpo hoshi-hiyouga 2025-01-07 14:38:55 +08:00
  • d8bd46f1bf fix #6546 hiyouga 2025-01-07 06:30:44 +00:00
  • 8c2a712247 add some fzc8578 2025-01-06 19:32:39 +08:00
  • 53e41bf2c7 Merge pull request #6528 from hiyouga/hiyouga/upd_wechat hoshi-hiyouga 2025-01-04 16:01:21 +08:00
  • 0eeae9061c update wechat hiyouga 2025-01-04 07:25:19 +00:00
  • 08729dbefc Merge branch 'hiyouga:main' into minicpmv Zhangchi Feng 2025-01-04 11:20:33 +08:00
  • 2c120aa0df add some fzc8578 2025-01-04 11:11:15 +08:00
  • cca6286b6f Merge pull request #6524 from hiyouga/hiyouga/upd_scripts hoshi-hiyouga 2025-01-03 23:52:26 +08:00
  • 8516054e4d update scripts hiyouga 2025-01-03 10:50:32 +00:00
  • d1a8cd67d2 Merge pull request #6515 from hiyouga/hiyouga/misc hoshi-hiyouga 2025-01-02 20:20:02 +08:00
  • 8a5b4bdfd4 update model name hiyouga 2025-01-02 12:19:21 +00:00
  • 3bceef02ee Merge pull request #6514 from hiyouga/hiyouga/add_project hoshi-hiyouga 2025-01-02 20:16:15 +08:00
  • 166a830938 Merge pull request #6513 from hiyouga/hiyouga/add_gpt2 hoshi-hiyouga 2025-01-02 20:15:55 +08:00
  • 18767fe026 add project hiyouga 2025-01-02 12:15:41 +00:00
  • 18a1a4b9da add gpt2 model hiyouga 2025-01-02 12:07:38 +00:00
  • 6015fe700e Merge pull request #6512 from hiyouga/hiyouga/fix_gen_logic hoshi-hiyouga 2025-01-02 19:36:54 +08:00
  • 369dae8dd3 Merge pull request #6462 from shibingli/main hoshi-hiyouga 2025-01-02 19:34:17 +08:00
  • 2aaf3697d7 fix #6499 hiyouga 2025-01-02 11:17:29 +00:00
  • 5504b5254c Merge pull request #6492 from hiyouga/hiyouga/add_deepseek3 hoshi-hiyouga 2024-12-30 21:50:13 +08:00
  • b2e4f11602 add deepseek3 model hiyouga 2024-12-30 13:38:30 +00:00
  • e3f95abca7 Merge pull request #5507 from piamo/main hoshi-hiyouga 2024-12-30 21:08:25 +08:00
  • 2f44f70c2c Merge pull request #6483 from hiyouga/hiyouga/fix_paligemma_infer hoshi-hiyouga 2024-12-30 16:34:32 +08:00
  • f8f05a883b fix #6482 hiyouga 2024-12-30 05:55:15 +00:00
  • 5f473e2696 Merge pull request #6465 from hiyouga/hiyouga/fix_eval_loss hoshi-hiyouga 2024-12-28 01:02:56 +08:00
  • 88b1874c04 fix #6448 hiyouga 2024-12-27 16:54:39 +00:00
  • 58bc6943dc Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building. shibingli@yeah.net 2024-12-27 18:31:14 +08:00
  • 2dedf7b401 Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building.This commit introduces an ARG parameter named HTTP_PROXY in the Dockerfile. This addition allows for the configuration of an HTTP proxy, facilitating image building in environments with network restrictions. shibingli@yeah.net 2024-12-27 18:17:17 +08:00
  • 5769a553d2 Merge pull request #6457 from youkaichao/module-run hoshi-hiyouga 2024-12-26 23:41:37 +08:00
  • 552816e04b Update cli.py youkaichao 2024-12-26 23:22:09 +08:00
  • b5fa1044b8 Merge pull request #6443 from hiyouga/hiyouga/add_qvq hoshi-hiyouga 2024-12-25 15:53:19 +08:00
  • 3c55976a0e add qvq #6439 hiyouga 2024-12-25 07:52:41 +00:00
  • 4611f67fae Merge pull request #6426 from hiyouga/hiyouga/update_readme hoshi-hiyouga 2024-12-23 22:17:19 +08:00
  • a5346041bb update readme hiyouga 2024-12-23 14:08:59 +00:00
  • df42e438c1 Merge pull request #5922 from Tuyohai/main hoshi-hiyouga 2024-12-23 16:46:02 +08:00
  • 7dbfd7dff6 Merge pull request #6418 from hiyouga/hiyouga/add_report hoshi-hiyouga 2024-12-22 05:47:55 +08:00