语音识别阶段出错:[阿里FunASR中文(本地)] 程序内部错误:expected Tensor as element 1 in argument 0, but got str:
Traceback (most recent call last):
File "videotrans\task\job.py", line 113, in run
File "videotrans\task\_speech2text.py", line 146, in recogn
File "videotrans\recognition\__init__.py", line 231, in run
File "videotrans\recognition\_base.py", line 78, in run
File "videotrans\recognition\_funasr.py", line 57, in _exec
File "funasr\auto\auto_model.py", line 306, in generate
return self.inference_with_vad(input, input_len=input_len, **cfg)File "funasr\auto\auto_model.py", line 383, in inference_with_vad
res = self.inference(File "funasr\auto\auto_model.py", line 345, in inference
res = model.inference(**batch, **kwargs)File "C:\SOFT\pyvideotrans\_internal\funasr\models\fsmn_vad_streaming\model.py", line 690, in inference
audio_sample = torch.cat((cache["prev_samples"], audio_sample_list[0]))TypeError: expected Tensor as element 1 in argument 0, but got str
TaskCfg(cache_folder='C:/SOFT/pyvideotrans/tmp/20604/speech2text', target_dir='c:/soft/pyvideotrans/output/recogn', remove_noise=False, is_separate=False, detect_language='zh-cn', subtitle_language=None, source_language=None, target_language=None, source_language_code=None, target_language_code=None, source_sub=None, target_sub='c:/soft/pyvideotrans/output/recogn/R_MIC_251126-061350.srt', source_wav=None, source_wav_output=None, target_wav=None, target_wav_output=None, subtitles=None, novoice_mp4=None, noextname='R_MIC_251126-061350', shibie_audio='C:/SOFT/pyvideotrans/tmp/20604/speech2text/R_MIC_251126-061350-1764909243.6723275.wav', targetdir_mp4=None, instrument=None, vocal=None, back_audio=None, background_music=None, app_mode='biaozhun', subtitle_type=0, volume='+0%', pitch='+0Hz', voice_rate='+0%', voice_role=None, copysrt_rawvideo=False, clear_cache=False, translate_type=None, tts_type=None, recogn_type=2, model_name='paraformer-zh', split_type=0, voice_autorate=False, video_autorate=False, cuda=True, name='D:/recode/R_MIC_251126-061350.mp3', basename='R_MIC_251126-061350.mp3', ext='mp3', dirname='D:/recode', shound_del_name=None, uuid='f1edccf8d1', remove_silent_mid=False, align_sub_audio=True, enable_diariz=True, nums_diariz=0, rephrase=0)
=
system:Windows-10-10.0.22631-SP0
version:v3.87
frozen:True
language:zh
root_dir:C:/SOFT/pyvideotrans