语音识别阶段出错:[阿里FunASR中文(本地)] 程序内部错误:expected Tensor as element 1 in argument 0, but got str:
Traceback (most recent call last):
File "videotrans\task\job.py", line 113, in run
File "videotrans\task\_speech2text.py", line 146, in recogn
File "videotrans\recognition\__init__.py", line 227, in run
File "videotrans\recognition\_base.py", line 80, in run
File "videotrans\recognition\_funasr.py", line 60, in _exec
File "funasr\auto\auto_model.py", line 306, in generate
return self.inference_with_vad(input, input_len=input_len, **cfg)File "funasr\auto\auto_model.py", line 383, in inference_with_vad
res = self.inference(File "funasr\auto\auto_model.py", line 345, in inference
res = model.inference(**batch, **kwargs)File "C:\AI\win-pyvideotrans-3.90\_internal\funasr\models\fsmn_vad_streaming\model.py", line 690, in inference
audio_sample = torch.cat((cache["prev_samples"], audio_sample_list[0]))TypeError: expected Tensor as element 1 in argument 0, but got str
=
system:Windows-10-10.0.22621-SP0
version:v3.90
frozen:True
language:zh
root_dir:C:/AI/win-pyvideotrans-3.90