#2736 TaskCfg(cache_folder='D:/视频音频识别IE/win-videotrans-v3.67-tiny/tmp/20816/996f7784e4', target_dir='C:/Users/Administrator/Vi

2401:b60* Posted at: 29 days ago 👁47

Error during the speech recognition stage [Ali FunASR (Local)] Internal program error: undefined variable 'None'
Traceback (most recent call last):
File "videotrans\task\job.py", line 113, in run
File "videotrans\task\_speech2text.py", line 146, in recogn
File "videotrans\recognition\__init__.py", line 226, in run
File "videotrans\recognition\_base.py", line 80, in run
File "videotrans\recognition\_funasr.py", line 37, in _exec
File "videotrans\recognition\_funasr.py", line 144, in _exec1
File "funasr\auto\auto_model.py", line 125, in init

model, kwargs = self.build_model(**kwargs)

File "funasr\auto\auto_model.py", line 228, in build_model

tokenizer = tokenizer_class(**tokenizer_conf)

File "D:\视频音频识别IE\win-videotrans-v3.67-tiny\_internal\funasr\tokenizer\hf_tokenizer.py", line 11, in HuggingfaceTokenizer

tokenizer = AutoTokenizer.from_pretrained(init_param_path)

UnboundLocalError: local variable 'AutoTokenizer' referenced before assignment

TaskCfg(cache_folder='D:/视频音频识别IE/win-videotrans-v3.67-tiny/tmp/20816/996f7784e4', target_dir='C:/Users/Administrator/Videos/pyvideotrans/recogn', remove_noise=False, is_separate=False, detect_language='zh-cn', subtitle_language=None, source_language=None, target_language=None, source_language_code=None, target_language_code=None, source_sub=None, target_sub='C:/Users/Administrator/Videos/pyvideotrans/recogn/[2026-01-19 15-50-15].srt', source_wav=None, source_wav_output=None, target_wav=None, target_wav_output=None, subtitles=None, novoice_mp4=None, noextname='[2026-01-19 15-50-15]]', shibie_audio='D:/视频音频识别IE/win-videotrans-v3.67-tiny/tmp/20816/996f7784e4/[2026-01-19 15-50-15]-1768841831.7916567.wav', targetdir_mp4=None, instrument=None, vocal=None, back_audio=None, background_music=None, app_mode='biaozhun', subtitle_type=0, volume='+0%', pitch='+0Hz', voice_rate='+0%', voice_role=None, copysrt_rawvideo=False, clear_cache=False, translate_type=None, tts_type=None, recogn_type=2, model_name='Fun-ASR-Nano-2512', split_type=0, voice_autorate=False, video_autorate=False, cuda=True, name='D:/切片/[2026-01-19 15-50-15].mp4', basename='[2026-01-19 15-50-15].mp4', ext='mp4', dirname='D:/切片', shound_del_name=None, uuid='996f7784e4', remove_silent_mid=False, align_sub_audio=True, enable_diariz=True, nums_diariz=1, rephrase=2, only_out_mp4=False)


=
system:Windows-10-10.0.26200-SP0
version:v3.92
frozen:True
language:zh
root_dir:D:/视频音频识别IE/win-videotrans-v3.67-tiny

1 Replies

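This error is an internal dependency problem that occurs while the Ali FunASR local speech recognition module is initializing. Specifically, the name AutoTokenizer was never assigned before it was used, which is usually related to an incomplete download of the FunASR model files or to a dependency conflict in the environment.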
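
For readers unfamiliar with this exception, the sketch below reproduces the same kind of UnboundLocalError in plain Python. It is an illustration only and is not FunASR's actual source: the function `load_tokenizer` and the module `missing_tokenizer_lib` are hypothetical. It assumes the failing code imports a name inside a function under a try/except that swallows the ImportError, which is one common way this error arises.

```python
def load_tokenizer(path):
    """Hypothetical loader that imports its dependency lazily."""
    try:
        # If this import fails, AutoTokenizer is never bound, yet Python
        # still treats it as a local variable of this function.
        from missing_tokenizer_lib import AutoTokenizer  # hypothetical module
    except ImportError:
        pass  # the real cause is silently discarded here
    return AutoTokenizer.from_pretrained(path)


def describe_failure():
    """Call the loader and return the exception type name and message."""
    try:
        load_tokenizer("some/model/dir")
    except UnboundLocalError as exc:
        return type(exc).__name__, str(exc)
    return None
```

If that is the mechanism at work here, the message about the variable is only a symptom, and the underlying problem would be whatever prevented the import from succeeding.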

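Please try the following steps:

  1. Clear the cache and download the model again

    • Close the software.
    • Go to the models/funasr folder under the software directory.
    • Delete the Fun-ASR-Nano-2512 folder (or the folder of whichever model you selected).
    • Restart the software and run speech recognition again. The software will download the model again automatically.
  2. If the problem persists, switch the speech recognition channel

    • In the "Speech Recognition" area of the main window, temporarily switch the channel from Ali FunASR (Local) to faster-whisper (Local) and pick a suitable model (such as medium or large-v2).
    • This is the most direct temporary workaround, since it bypasses the current FunASR environment problem.
  3. Check the file path

    • Your video file name contains square brackets [], which can sometimes cause parsing problems. Rename the video file to a simple name made of English letters or digits (such as test.mp4) and try again.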
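
Steps 1 and 3 can also be done with a short script. The following is a minimal sketch, assuming the model lives at models/funasr/<model name> under the software root as described in step 1. The helper names `remove_model_dir` and `simple_name` are made up for this example and are not part of pyvideotrans.

```python
import re
import shutil
from pathlib import Path


def remove_model_dir(root_dir, model_name="Fun-ASR-Nano-2512"):
    """Delete <root_dir>/models/funasr/<model_name>; return True if it existed."""
    model_dir = Path(root_dir) / "models" / "funasr" / model_name
    if not model_dir.is_dir():
        return False
    shutil.rmtree(model_dir)
    return True


def simple_name(filename, fallback="video"):
    """Return filename with its stem reduced to ASCII letters, digits, '-' and '_'."""
    path = Path(filename)
    # Collapse every run of other characters (brackets, spaces, CJK) into "_".
    stem = re.sub(r"[^A-Za-z0-9_-]+", "_", path.stem).strip("_")
    return (stem or fallback) + path.suffix
```

For example, `simple_name("[2026-01-19 15-50-15].mp4")` returns `2026-01-19_15-50-15.mp4`. Close the software before calling `remove_model_dir`, and pass the software directory as `root_dir`.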

If none of the steps above help, please provide the latest log file from the videotrans/logs/ directory so the problem can be analyzed further.
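
To locate the newest log file quickly, a small helper like the one below may be useful. It is a sketch under assumptions: the `*.log` pattern is a guess at how the log files are named, and `latest_log` is a hypothetical helper, not something the software ships.

```python
from pathlib import Path


def latest_log(log_dir, pattern="*.log"):
    """Return the most recently modified file matching pattern, or None."""
    files = [p for p in Path(log_dir).glob(pattern) if p.is_file()]
    return max(files, key=lambda p: p.stat().st_mtime, default=None)
```

Call it with the path to the logs directory; if the files use a different extension, pass another pattern such as `"*"`.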

