#3706 TaskCfgSTT(is_cuda=True, uuid='dfa5fa4648', cache_folder='D:/翻译文件/win-pyvideotrans-v3.98-312/tmp/5264/dfa5fa4648', targe

151.242* Posted at: 3 hours ago 👁12

语音识别阶段出错 [阿里FunASR(本地)] Traceback (most recent call last):
File "videotrans\process\stt_fun.py", line 898, in funasr_mlt
File "funasr\auto\auto_model.py", line 135, in init

model, kwargs = self.build_model(**kwargs)

File "funasr\auto\auto_model.py", line 281, in build_model

assert model_class is not None, f'{kwargs["model"]} is not registered'

AssertionError: FunASRNano is not registered

Traceback (most recent call last):
File "videotrans\task\job.py", line 105, in run
File "videotrans\task\_speech2text.py", line 152, in recogn
File "videotrans\recognition\__init__.py", line 242, in run
File "videotrans\recognition\_base.py", line 143, in run
File "videotrans\recognition\_funasr.py", line 64, in _exec
File "videotrans\configure\_base.py", line 288, in _new_process
RuntimeError: Traceback (most recent call last):
File "videotrans\process\stt_fun.py", line 898, in funasr_mlt
File "funasr\auto\auto_model.py", line 135, in init

model, kwargs = self.build_model(**kwargs)

File "funasr\auto\auto_model.py", line 281, in build_model

assert model_class is not None, f'{kwargs["model"]} is not registered'

AssertionError: FunASRNano is not registered
TaskCfgSTT(is_cuda=True, uuid='dfa5fa4648', cache_folder='D:/翻译文件/win-pyvideotrans-v3.98-312/tmp/5264/dfa5fa4648', target_dir='C:/Users/24202/Desktop/win-pyvideotrans-v3.98-312/output/recogn', source_language=None, source_language_code=None, source_sub=None, source_wav=None, source_wav_output=None, target_language=None, target_language_code=None, target_sub='C:/Users/24202/Desktop/win-pyvideotrans-v3.98-312/output/recogn/16-1.watermark.srt', target_wav=None, target_wav_output=None, name='F:/FastCampus水分插画系统课/视频/用所选项目新建的文件夹/16-1.watermark.mp4', noextname='16-1.watermark', basename='16-1.watermark.mp4', ext='mp4', dirname='F:/FastCampus水分插画系统课/视频/用所选项目新建的文件夹', shound_del_name=None, detect_language='ko', recogn_type=3, model_name='Fun-ASR-Nano-2512', shibie_audio='D:/翻译文件/win-pyvideotrans-v3.98-312/tmp/5264/dfa5fa4648/16-1.watermark-1773981080.0240843.wav', remove_noise=False, enable_diariz=False, nums_diariz=0, rephrase=0, fix_punc=True)


=
system:Windows-10-10.0.26200-SP0
version:v3.98
frozen:True
language:zh
root_dir:D:/翻译文件/win-pyvideotrans-v3.98-312

1 Replies

这是由于阿里FunASR(本地)语音识别渠道的模型名称配置错误导致的。软件尝试加载名为“FunASRNano”的模型,但该名称在FunASR框架中未注册。

解决方案:

  1. 在软件主界面,点击顶部菜单栏的 工具/选项 -> 高级选项
  2. 在弹出的高级选项窗口中,找到 【语音识别参数】 区域。
  3. 修改 faster/whisper模型 这一项。其默认值可能包含 Fun-ASR-Nano-2512 等模型名称。
  4. 将该值修改为FunASR框架中已注册的、正确的模型名称。例如,可以尝试修改为:
    iic/SenseVoiceSmall, iic/SenseVoiceMedium, iic/funasr_paraformer-zh, iic/ct-punc_c, iic/punc_ct-transformer_zh-cn, damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch
  5. 修改后,点击 保存 并关闭高级选项窗口。
  6. 重新启动软件,再次尝试语音识别。

如果问题依旧,请检查网络连接或代理设置,确保软件能正常从模型源(如HuggingFace或ModelScope)下载所需的模型文件。

请查阅相关文档:

Post Your Reply
Open source and free maintenance is not easy. If this project is helpful to you, please consider making a small donation to help the project continue to maintain and update.

Related resource