#4961 Error during speech recognition [faster-whisper (local)] No transcription results returned. Please check the original audio/video or model and try a

2409:8a55* Posted at: 1 hour ago

Error during speech recognition [faster-whisper (local)] No transcription results returned. Please check the original audio/video or model and try again.
Traceback (most recent call last):
  File "videotrans\task\job.py", line 35, in run
  File "videotrans\task\job.py", line 100, in process_task
  File "videotrans\task\trans_create.py", line 322, in recogn
  File "videotrans\recognition\__init__.py", line 190, in run
  File "videotrans\recognition\_base.py", line 94, in run
  File "videotrans\recognition\_whisper.py", line 36, in _exec
  File "videotrans\recognition\_whisper.py", line 109, in _faster
  File "videotrans\configure\base.py", line 253, in _new_process
videotrans.configure.excepts.VideoTransError: No transcription results returned. Please check the original audio/video or model and try again.
TaskCfgVTT(uuid='da982d8d06', name='F:/02_Godot/video/02/Godot Engine, NPC - AIPlayer or Bots Free Walking Navigation System #godot #godotengine-[84ZsmrJEJXU]-[1920x1080].mp4', dirname='F:/02_Godot/video/02', noextname='Godot Engine, NPC - AIPlayer or Bots Free Walking Navigation System #godot #godotengine-[84ZsmrJEJXU]-[1920x1080]', basename='Godot Engine, NPC - AIPlayer or Bots Free Walking Navigation System #godot #godotengine-[84ZsmrJEJXU]-[1920x1080].mp4', ext='mp4', target_dir='F:/02_Godot/_video_out/video/Godot Engine, NPC - AIPlayer or Bots Free Walking Navigation System #godot #godotengine-[84ZsmrJEJXU]-[1920x1080]-mp4', cache_folder='D:/win-pyvideotrans-v4.01-0602/tmp/32384/da982d8d06', is_cuda=True, source_language='英语', source_language_code='en', source_sub='F:/02_Godot/_video_out/video/Godot Engine, NPC - AIPlayer or Bots Free Walking Navigation System #godot #godotengine-[84ZsmrJEJXU]-[1920x1080]-mp4/en.srt', source_wav='D:/win-pyvideotrans-v4.01-0602/tmp/32384/da982d8d06/en.wav', source_wav_output='F:/02_Godot/_video_out/video/Godot Engine, NPC - AIPlayer or Bots Free Walking Navigation System #godot #godotengine-[84ZsmrJEJXU]-[1920x1080]-mp4/en.m4a', target_language='简体中文', target_language_code='zh-cn', target_sub='F:/02_Godot/_video_out/video/Godot Engine, NPC - AIPlayer or Bots Free Walking Navigation System #godot #godotengine-[84ZsmrJEJXU]-[1920x1080]-mp4/zh-cn.srt', target_wav='D:/win-pyvideotrans-v4.01-0602/tmp/32384/da982d8d06/target.wav', target_wav_output='F:/02_Godot/_video_out/video/Godot Engine, NPC - AIPlayer or Bots Free Walking Navigation System #godot #godotengine-[84ZsmrJEJXU]-[1920x1080]-mp4/zh-cn.m4a', translate_type=4, tts_type=9, volume='+0%', pitch='+0Hz', voice_rate='+0%', voice_role='Sound_005_90.m4a', voice_autorate=True, video_autorate=False, remove_silent_mid=False, align_sub_audio=True, detect_language='en', recogn_type=0, model_name='large-v3-turbo', shibie_audio=None, remove_noise=False, enable_diariz=False, 
nums_diariz=0, rephrase=1, fix_punc=True, subtitle_language=None, app_mode='biaozhun', subtitles='', targetdir_mp4='F:/02_Godot/_video_out/video/Godot Engine, NPC - AIPlayer or Bots Free Walking Navigation System #godot #godotengine-[84ZsmrJEJXU]-[1920x1080]-mp4/Godot Engine, NPC - AIPlayer or Bots Free Walking Navigation System #godot #godotengine-[84ZsmrJEJXU]-[1920x1080].mp4', novoice_mp4='D:/win-pyvideotrans-v4.01-0602/tmp/32384/da982d8d06/novoice.mp4', is_separate=True, embed_bgm=True, instrument='D:/win-pyvideotrans-v4.01-0602/tmp/32384/da982d8d06/instrument.wav', vocal='D:/win-pyvideotrans-v4.01-0602/tmp/32384/da982d8d06/vocal.wav', clear_cache=False, background_music='', subtitle_type=3, only_out_mp4=False, recogn2pass=False, output_srt=2, copysrt_rawvideo=False, loop_backaudio=1, backaudio_volume=0.8)

1 Replies

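The root cause of this error is that faster-whisper could not extract any usable speech from the audio. Judging from the log you posted, it is most likely due to one of the following three factors (in order of likelihood):

  1. The video's path and file name contain many special characters and spaces (such as #, -, [ ]), and the path is long. When the software calls ffmpeg under the hood, the Windows command line is sensitive to path length and special characters, which can easily make audio extraction fail or cause the model to read the file incorrectly.
  2. Insufficient VRAM. The large-v3-turbo model needs at least 6-8 GB of VRAM. Your NVIDIA card may not have enough, or other running programs may be occupying it, so the model crashes silently after loading and returns no results.
  3. The audio track really contains no usable speech (for example, only background music or very loud ambient noise), or the source language is set incorrectly (English is selected, but the video may be in another language).

Please try the following solutions in order: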

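1. Simplify the file path and name

  • Copy the video file to the root of a drive (for example D:\test.mp4) and give it a short name made up only of English letters and digits (remove # [ ], Chinese characters, spaces, and other special symbols).
  • Also make sure the software itself is extracted to a short path (for example D:\pyVideoTrans), not under Program Files or a path containing Chinese characters.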
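
If you have many files to rename, the renaming step above can be scripted. The following is only a minimal sketch using the Python standard library. The helper names `safe_name` and `copy_to_short_path` and the 40-character limit are illustrative assumptions, not part of pyVideoTrans.

```python
import re
import shutil
from pathlib import Path


def safe_name(original: str, max_stem: int = 40) -> str:
    """Return a short file name made only of ASCII letters, digits and '_'."""
    path = Path(original)
    # Replace every run of characters outside [A-Za-z0-9] with one underscore.
    stem = re.sub(r"[^A-Za-z0-9]+", "_", path.stem).strip("_")
    # Truncate long names; fall back to "video" if nothing usable is left.
    stem = stem[:max_stem].rstrip("_") or "video"
    return stem + path.suffix.lower()


def copy_to_short_path(original: str, dest_dir: str) -> Path:
    """Copy the video into dest_dir under its simplified name."""
    target = Path(dest_dir) / safe_name(original)
    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.copy2(original, target)
    return target
```

For example, `copy_to_short_path(r"F:\some\long [name] #tag.mp4", r"D:\v")` would leave the original untouched and create a copy with a simplified name under `D:\v`, which you can then select in the software.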

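2. Reduce the model size and VRAM usage

  • In the speech recognition drop-down, change the model from large-v3-turbo to medium or small.
  • If the error persists, go to Menu → Tools/Options → Advanced Options → Speech Recognition Parameters and make the following changes:

    • Change CUDA data type to int8 (saves VRAM)
    • Change beam_size to 1
    • Change best_of to 1
  • Then restart the software and try again.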
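
To check whether these settings help independently of the GUI, you could run the faster-whisper library directly with the same values. This is a hedged sketch: it assumes the `faster-whisper` package is installed in your own Python environment, and it is not how pyVideoTrans itself invokes the model. The helper names are hypothetical, and the language code `en` is taken from the log above.

```python
def low_memory_settings(model_name: str = "small") -> dict:
    """Collect the reduced settings suggested above in one place."""
    return {
        "model": {
            "model_size_or_path": model_name,
            "device": "cuda",
            "compute_type": "int8",  # lower-precision weights to save VRAM
        },
        "transcribe": {"beam_size": 1, "best_of": 1, "language": "en"},
    }


def transcribe_low_memory(audio_path: str, model_name: str = "small") -> list[str]:
    """Run faster-whisper with the reduced settings and return segment texts."""
    # Imported here so the settings helper works without faster-whisper installed.
    from faster_whisper import WhisperModel

    cfg = low_memory_settings(model_name)
    model = WhisperModel(**cfg["model"])
    segments, _info = model.transcribe(audio_path, **cfg["transcribe"])
    # `segments` is a lazy generator; iterating it performs the decoding.
    return [segment.text.strip() for segment in segments]
```

If `transcribe_low_memory` returns an empty list for your audio even with a small model, the problem is more likely in the audio itself than in VRAM.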

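3. Check the video's audio content

  • First open the video in a media player and confirm that it contains clear human speech. If the video has only background music or a noisy environment, enable the "Noise reduction" option on the main screen (to the right of the speech recognition channel), or enable "Set more parameters → Separate vocals and background".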
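
Besides listening to the video, you can check whether the extracted audio is nearly silent. Your log lists `en.wav` and `vocal.wav` in the cache folder, and `clear_cache=False` suggests they may still be on disk. The sketch below assumes those files are 16-bit PCM WAV, which the log does not confirm. It measures loudness only and cannot tell speech from music.

```python
import array
import sys
import wave


def peak_level(wav_path: str) -> float:
    """Return the peak amplitude of a 16-bit PCM WAV as a fraction of full scale."""
    with wave.open(wav_path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM WAV files")
        frames = wav.readframes(wav.getnframes())
    samples = array.array("h")
    samples.frombytes(frames)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV sample data is little-endian
    if not samples:
        return 0.0
    return max(abs(s) for s in samples) / 32768.0
```

A value close to 0.0 for `vocal.wav` would suggest that vocal separation or audio extraction produced an almost empty track, so the recognizer has nothing to transcribe.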

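4. Temporarily disable GPU acceleration

  • If none of the above helps, uncheck "CUDA acceleration" on the main screen and run on the CPU instead (slower, but it verifies whether the GPU is the problem). If recognition works on the CPU, the problem lies in the CUDA environment or VRAM.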
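
To tell a VRAM shortage apart from a broken CUDA setup, it can help to look at how much GPU memory is in use before starting a task. This is a rough sketch that assumes `nvidia-smi` (shipped with the NVIDIA driver) is on your PATH. The function names are illustrative.

```python
import subprocess


def parse_gpu_memory(csv_text: str) -> list[tuple[int, int]]:
    """Parse 'used, total' MiB pairs, one GPU per line."""
    pairs = []
    for line in csv_text.strip().splitlines():
        used, total = (int(field.strip()) for field in line.split(","))
        pairs.append((used, total))
    return pairs


def query_gpu_memory() -> list[tuple[int, int]]:
    """Ask nvidia-smi for used/total memory of every GPU, in MiB."""
    out = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=memory.used,memory.total",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    ).stdout
    return parse_gpu_memory(out)
```

If `query_gpu_memory()` shows most of the memory already used before the task starts, close other GPU-heavy programs and retry before concluding that the CUDA environment is at fault.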

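If none of the above resolves the issue, please provide today's log file from the logs/ folder in the software's installation directory (copy roughly the last 30 lines containing the error) so the exact source of the exception can be analyzed further.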
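
If you prefer not to open the log by hand, a few lines of Python can pull out its tail. This is a minimal sketch. It assumes the newest file in logs/ is the one for today and that it is readable as UTF-8. The function name is hypothetical.

```python
from pathlib import Path


def tail_latest_log(logs_dir: str, lines: int = 30) -> str:
    """Return the last `lines` lines of the most recently modified file."""
    candidates = [p for p in Path(logs_dir).iterdir() if p.is_file()]
    if not candidates:
        raise FileNotFoundError(f"no log files found in {logs_dir}")
    latest = max(candidates, key=lambda p: p.stat().st_mtime)
    # errors="replace" keeps the sketch working if the log has mixed encodings.
    text = latest.read_text(encoding="utf-8", errors="replace")
    return "\n".join(text.splitlines()[-lines:])
```

Calling `print(tail_latest_log(r"D:\win-pyvideotrans-v4.01-0602\logs"))` would print the text to paste into your reply. That path is inferred from the cache folder in your log, so adjust it to your actual installation directory.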
