| Author | Commit | Message | Date |
|---|---|---|---|
| Yaowei Zheng | 122cd46084 | [model] update constants (#10220) | 2026-02-26 21:13:56 +08:00 |
| P. Clawmogorov | 50599c719b | [misc] remove safe_serialization arg for transformers v5 compatibility (#10208); Co-authored-by: P. Clawmogorov <262173731+Alm0stSurely@users.noreply.github.com> | 2026-02-24 11:14:19 +08:00 |
| Kingsley | a0f3ad0cee | [mca] update supported models (#10196) | 2026-02-20 22:02:49 +08:00 |
| Junyou Su | 675ce8cc7f | [algo] add ASFT (#10174) | 2026-02-12 13:12:14 +08:00 |
| Username_Full | 92fa3df4c4 | [trainer] add dpo/kto fsdp fsdp2 support (#10127) | 2026-02-04 23:27:12 +08:00 |
| 浮梦 | bf04ca6af8 | [deps] adapt to transformers v5 (#10147); Co-authored-by: frozenleaves <frozen@Mac.local>; Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> | 2026-02-02 12:07:19 +08:00 |
| xvxuopop | 762b480131 | [feature] support using ray.remote to start distributed training. (#10109) | 2026-01-28 16:05:29 +08:00 |
| jiaqiw09 | 7ef19eea00 | [v0] Fix reward model training safetensors saving (#10137) | 2026-01-27 16:27:14 +08:00 |
| Yaowei Zheng | d22de0d4bf | [v1] add renderer ut (#9722) | 2026-01-07 02:06:07 +08:00 |
| yanglele | e944dc442c | [feature] add support for EAFT loss (#9720); Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>; Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> | 2026-01-06 23:07:12 +08:00 |
| Yaowei Zheng | 8600530002 | [misc] lint (#9710) | 2026-01-04 13:47:56 +08:00 |
| Santosh Bhavani | 355d5c5e5a | [fix] fp8: add Transformer Engine backend support (#9705); Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn> | 2026-01-01 10:18:02 +08:00 |
| Username_Full | 000526908a | [core deps] upgrade TRL to be between 0.18 and 0.24 (#9617); Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn> | 2025-12-31 20:54:27 +08:00 |
| Kingsley | bb1ba31005 | [misc] lint mca code (#9692) | 2025-12-29 11:44:38 +08:00 |
| Copilot | eceec8ab69 | [deps] goodbye python 3.9 (#9677); Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>; Co-authored-by: hiyouga <16256802+hiyouga@users.noreply.github.com>; Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> | 2025-12-27 02:50:44 +08:00 |
| Yaowei Zheng | 55590f5ece | [misc] fix ci with uv (#9676) | 2025-12-27 01:39:13 +08:00 |
| Xunpeng Xiao | 3c17f2722c | [model] Update ernie_vl to adapt new version (#9665) | 2025-12-26 19:57:49 +08:00 |
| Yaowei Zheng | 0894b4f37e | [misc] lint (#9636) | 2025-12-20 16:19:39 +08:00 |
| mrhaoxx | a769fb94b9 | [feat] support ktransformers for dpo (#9621); Co-authored-by: poryfly <porykid@gmail.com> | 2025-12-18 21:26:25 +08:00 |
| mrhaoxx | 964569751f | [kt] refactor ktransformers integration (#9632) | 2025-12-18 21:26:04 +08:00 |
| 浮梦 | 18c21bce5a | [test] add allreduce test on npu (#9619); Co-authored-by: frozenleaves <frozen@Mac.local> | 2025-12-16 21:33:30 +08:00 |
| tangefly | 4fd94141a4 | [model] Add Ministral3 (#9582); Co-authored-by: kingsley <kingsleydodonow@gmail.com> | 2025-12-10 15:57:24 +08:00 |
| Yaowei Zheng | 5d56817e2b | [misc] lint (#9593); Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> | 2025-12-09 18:00:35 +08:00 |
| Hertz | c1f5f8fff6 | [model] support GLM4.6v (#9586) | 2025-12-09 11:06:42 +08:00 |
| tangefly | 739954910a | [deps] Update for Transformers v5 (#9569) | 2025-12-08 01:13:32 +08:00 |
| Peilin Li | bd30c0003b | [train] fix denominator of ga in ksft loss (#9409) | 2025-11-05 20:53:23 +08:00 |
| Yaowei Zheng | eaf963f67f | [model] update kt code (#9406) | 2025-11-05 15:27:22 +08:00 |
| Kingsley | 56f45e826f | [train] fix MPO re-weight (#9405) | 2025-11-04 21:10:41 +08:00 |
| Peilin Li | 934b3084ee | [train] KTransformers SFT as backend engine for LLaMA-Factory (#9400); Co-authored-by: jimmy128 <jimmy128@noreply.gitcode.com>; Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn> | 2025-11-04 15:54:12 +08:00 |
| Yaowei Zheng | 3ae15da9c0 | [misc] lint code (#9395) | 2025-11-03 22:08:59 +08:00 |
| Kingsley | 13170577b2 | [feat] support megatron-LM training by mcore_adapter (#9237); Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>; Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn> | 2025-10-26 16:21:30 +08:00 |
| Yaowei Zheng | d9d67ba62d | [misc] fix import error (#9299) | 2025-10-17 17:46:27 +08:00 |
| Ben Feuer | 1c44b60e3e | [feat] fp8 training (#8960); Co-authored-by: Benjamin Feuer <penfever@gmail.com>; Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn> | 2025-10-01 14:32:53 +08:00 |
| Yaowei Zheng | 52488ac974 | [deps] upgrade transformers to 4.56.1 (#9128) | 2025-09-14 02:26:39 +08:00 |
| Yaowei Zheng | 2c31279316 | [assets] update wechat (#8962) | 2025-08-19 02:55:09 +08:00 |
| Zeju Qiu | 003a2acb1a | [feature] adding orthogononal finetuning (OFT) to llama factory (#8623); Co-authored-by: Zeju <zqiu@g003.internal.cluster.is.localnet>; Co-authored-by: Zeju <zqiu@login2.is.localnet>; Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn> | 2025-08-18 18:22:47 +08:00 |
| XLXW | 1ada15981a | [feature] add support for dft loss (#8917) | 2025-08-15 23:29:57 +08:00 |
| Kingsley | 936f4fd78e | [feature] Support MPO (#8930) | 2025-08-15 15:09:59 +08:00 |
| golangboy | ef507ae8e0 | [file] Resolve file lock issue when deleting safetensors on Windows (#8839) | 2025-08-08 14:59:54 +08:00 |
| Yaowei Zheng | 2c26ce6ac4 | Merge commit from fork | 2025-06-26 13:55:42 +08:00 |
| Yaowei Zheng | 9a2d1dec62 | [assets] update wechat (#8385) | 2025-06-16 18:23:22 +08:00 |
| Aman Gupta | 8e4ac78607 | [trainer] Add LD-DPO objective (#8362) | 2025-06-12 16:10:38 +08:00 |
| Ze-Yi LIN | c4e51d40e0 | [tracking] swanlab add llamafactory tag (#8258) | 2025-06-03 18:42:29 +08:00 |
| hoshi-hiyouga | 9ae17cd173 | [deps] update to transformers 4.52 (#8125) | 2025-05-21 05:16:18 +08:00 |
| hoshi-hiyouga | beae231af6 | [doc] add no build isolation (#8103) | 2025-05-19 19:25:13 +08:00 |
| Ma, Xiaochen | a0b4b91577 | [trainer] fix KeyError at end of pretrain (#8099) | 2025-05-19 18:01:26 +08:00 |
| Eric Tang | ef03832cd4 | [ray] add storage filesystem to ray config (#7854) | 2025-04-27 22:12:40 +08:00 |
| hoshi-hiyouga | fddcd43c88 | [trainer] support early stop (#7797) | 2025-04-22 01:59:33 +08:00 |
| hoshi-hiyouga | b07628dea5 | [example] add bash usage (#7794) | 2025-04-22 00:25:51 +08:00 |
| Juanxi Tian | 12ada72ed4 | [trainer] Add Muon Optimizer (#7749); Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> | 2025-04-21 23:38:37 +08:00 |