ros/LLaMA-Factory
LLaMA-Factory/src/llamafactory @ a13b1bb49aa6ce8b178f343c1e06fdbbd8cd7a83
Latest commit: Yu Shi Jie, [model] fix use_cache patching for gemma3 multimodal (#7500), 2025-04-01 16:06:48 +08:00
api          [misc] upgrade format to py39 (#7256)                                                     2025-03-12 00:08:41 +08:00
chat         [inference] support sglang backend (#7278)                                                2025-03-15 04:37:58 +08:00
data         [data] specify position_ids in PackedSupervisedDatasetProcessor for neat_packing (#7318)  2025-04-01 16:03:13 +08:00
eval         [misc] update format (#7277)                                                              2025-03-13 02:53:08 +08:00
extras       [model] add Qwen2.5-Omni model (#7537)                                                    2025-03-31 20:39:35 +08:00
hparams      [data] shard the dataset to allow multiprocessing when streaming is enabled (#7530)       2025-04-01 15:36:23 +08:00
model        [model] fix use_cache patching for gemma3 multimodal (#7500)                              2025-04-01 16:06:48 +08:00
train        [3rdparty] support swanlab lark notification (#7481)                                      2025-03-27 01:52:01 +08:00
webui        [webui] fix launch with proxy (#7332)                                                     2025-04-01 15:52:56 +08:00
__init__.py  [model] add qwen2vl 32b & upgrade peft (#7469)                                            2025-03-25 12:15:58 +08:00
cli.py       [deps] upgrade vllm to 0.8 (#7436)                                                        2025-03-23 14:32:22 +08:00
launcher.py  [misc] update license year & fix llama pro (#6814)                                        2025-02-05 01:53:33 +08:00