[data] fix minicpmv plugin (#6890)

* fix template name

* tiny fix

* support minicpm-o-2.6

* support inference of minicpmv

* update readme

* support dpo of minicpmv

* update init audio

* update init audio

* [model] fix image processing in minicpmo

* fix no mm inputs

Former-commit-id: cdd19ccd8cec460606b4545e886e932c1c5c5fe1
Zhangchi Feng
2025-02-11 13:30:44 +08:00
committed by GitHub
parent e11b40c344
commit 2047eab723
5 changed files with 117 additions and 98 deletions


@@ -645,6 +645,12 @@ class MiniCPMVPlugin(BasePlugin):
     chunk_input=True,
     sampling_rate=16000,
 )
+audio_feature_lens = [
+    torch.tensor(audio_feature_len)
+    if not isinstance(audio_feature_len, torch.Tensor)
+    else audio_feature_len
+    for audio_feature_len in audio_feature_lens
+]
 mm_inputs.update({"audio_features": audio_features, "audio_feature_lens": audio_feature_lens})
 if kwargs.get("ret_phs", False):
     mm_inputs.update({"audio_phs": audio_phs})
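
The hunk above normalizes `audio_feature_lens` so that every entry is a `torch.Tensor`, leaving entries that are already tensors untouched. A minimal standalone sketch of that pattern follows. The helper name `as_tensor_list` and the sample values are hypothetical and are not part of the plugin.

```python
import torch


def as_tensor_list(lens):
    """Return a list in which every entry is a torch.Tensor.

    Hypothetical helper for illustration only: entries that are already
    tensors are passed through as the same object, everything else is
    wrapped with torch.tensor().
    """
    return [
        item if isinstance(item, torch.Tensor) else torch.tensor(item)
        for item in lens
    ]


# Made-up input mixing a plain list, an existing tensor, and a bare int.
mixed = [[3, 5], torch.tensor([7]), 4]
normalized = as_tensor_list(mixed)
```

The `isinstance` check matters because calling `torch.tensor()` on an existing tensor makes a copy and emits a `UserWarning` that recommends `clone().detach()` instead. Whether that warning motivated the change is an assumption, since the commit message does not say.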