#4980 TaskCfgVTT(uuid='724416a535', name='C:/Users/yue/Desktop/1000.mp4', dirname='C:/Users/yue/Desktop', noextname='1000', ba

240e:3a1* Posted at: 1 month ago

No transcription results returned. Please check the original audio/video or model and try again.语音识别阶段出错:faster-whisper(本地), 翻译字幕阶段出错: Google(免费), 配音阶段出错:Edge-TTS(免费)
Traceback (most recent call last):
File "videotrans\task\only_one.py", line 47, in run
File "videotrans\task\trans_create.py", line 322, in recogn
File "videotrans\recognition\__init__.py", line 190, in run
File "videotrans\recognition\_base.py", line 94, in run
File "videotrans\recognition\_whisper.py", line 36, in _exec
File "videotrans\recognition\_whisper.py", line 109, in _faster
File "videotrans\configure\base.py", line 253, in _new_process
videotrans.configure.excepts.VideoTransError: No transcription results returned. Please check the original audio/video or model and try again.
TaskCfgVTT(uuid='724416a535', name='C:/Users/yue/Desktop/1000.mp4', dirname='C:/Users/yue/Desktop', noextname='1000', basename='1000.mp4', ext='mp4', target_dir='C:/Users/yue/Desktop/_video_out/1000-mp4', cache_folder='D:/pyVideoTrans/tmp/52584/724416a535', is_cuda=False, source_language='简体中文', source_language_code='zh-cn', source_sub='C:/Users/yue/Desktop/_video_out/1000-mp4/zh-cn.srt', source_wav='D:/pyVideoTrans/tmp/52584/724416a535/zh-cn.wav', source_wav_output='C:/Users/yue/Desktop/_video_out/1000-mp4/zh-cn.m4a', target_language='英语', target_language_code='en', target_sub='C:/Users/yue/Desktop/_video_out/1000-mp4/en.srt', target_wav='D:/pyVideoTrans/tmp/52584/724416a535/target.wav', target_wav_output='C:/Users/yue/Desktop/_video_out/1000-mp4/en.m4a', translate_type=0, tts_type=0, volume='+0%', pitch='+0Hz', voice_rate='+0%', voice_role='No', voice_autorate=True, video_autorate=False, remove_silent_mid=False, align_sub_audio=True, detect_language='zh-cn', recogn_type=0, model_name='large-v3-turbo', shibie_audio=None, remove_noise=False, enable_diariz=False, nums_diariz=0, rephrase=0, fix_punc=False, subtitle_language=None, app_mode='biaozhun', subtitles='', targetdir_mp4='C:/Users/yue/Desktop/_video_out/1000-mp4/1000.mp4', novoice_mp4='D:/pyVideoTrans/tmp/52584/724416a535/novoice.mp4', is_separate=False, embed_bgm=True, instrument=None, vocal='D:/pyVideoTrans/tmp/52584/724416a535/vocal.wav', clear_cache=True, background_music='', subtitle_type=1, only_out_mp4=False, recogn2pass=False, output_srt=0, copysrt_rawvideo=False, loop_backaudio=1, backaudio_volume=0.8)

=
system:Windows-10-10.0.26200-SP0
version:v4.01
frozen:True
language:zh
root_dir:D:/pyVideoTrans

Floor #1 DeepSeek reply at 1 month ago

语音识别阶段返回空结果，说明 faster-whisper 未从视频中提取到任何有效语音片段。常见原因及解决方案如下：

检查视频是否包含人声且语言正确
- 确认视频确实有清晰的说话声，而非纯音乐或静音。
- 源语言选择“简体中文”必须与视频实际语言一致，否则模型无法识别。
降低模型大小或调整识别参数
- 当前使用 large-v3-turbo 模型较大，在 CPU 上可能因内存不足或耗时过长导致子进程崩溃返回空。
- 建议先切换到 base 或 small 模型测试能否转出字幕。
- 如果仍需使用 large 模型，请确保系统有足够内存（建议 16GB+）。
调整 VAD 门槛
- 在菜单栏 工具/选项 → 高级选项 → 语音识别参数 中：
  - 将 语音阈值 从默认 0.5 降至 0.3（更灵敏）。
  - 将 最短语音持续毫秒 设为 500 或更低。
  - 将 静音分割持续毫秒 设为 300。
- 重新执行任务（需勾选 清理已生成 以忽略缓存）。
开启降噪
- 在主界面勾选降噪或点击 设置更多参数 勾选 分离人声背景声，可排除背景噪音干扰。
验证模型文件完整性
- 首次使用 large-v3-turbo 需在线下载。进入软件目录下的 models/ 文件夹，检查是否存在 models--Systran--faster-whisper-large-v3-turbo 文件夹及其内部文件。若缺失，请配置网络代理后重启软件重新下载。
优先启用 GPU 加速
- 如果拥有 NVIDIA 显卡且已安装 CUDA 12.8 + cuDNN 9.x，请务必在主界面勾选 CUDA加速，这能大幅提高 large 模型的稳定性与速度。

若以上步骤仍无效，请提供软件安装目录下 logs/ 文件夹内最新的 .log 文件（复制底部约30行内容），以便进一步分析。

#4980 TaskCfgVTT(uuid='724416a535', name='C:/Users/yue/Desktop/1000.mp4', dirname='C:/Users/yue/Desktop', noextname='1000', ba

1 Replies

请查阅相关文档：

Post Your Reply

Related resource