#2282 TaskCfg(cache_folder='E:/AIfanyi/tmp/28316/ca66ce197b', target_dir='E:/video/_video_out/The Most Powerful DJ App You’re

38.75* Posted at: 6 months ago

语音识别阶段出错 [faster-whisper(本地)] The Most Powerful DJ App You’re Not Using Yet.mp4没有识别出字幕，请检查是否包含人类说话声音，以及说话语言是否和你选择的原始语言匹配
Traceback (most recent call last):
File "videotrans\task\job.py", line 113, in run
File "videotrans\task\trans_create.py", line 379, in recogn
RuntimeError: The Most Powerful DJ App You’re Not Using Yet.mp4没有识别出字幕，请检查是否包含人类说话声音，以及说话语言是否和你选择的原始语言匹配

TaskCfg(cache_folder='E:/AIfanyi/tmp/28316/ca66ce197b', target_dir='E:/video/_video_out/The Most Powerful DJ App You’re Not Using Yet-mp4', remove_noise=False, is_separate=False, detect_language='en', subtitle_language=None, source_language='英语', target_language='简体中文', source_language_code='en', target_language_code='zh-cn', source_sub='E:/video/_video_out/The Most Powerful DJ App You’re Not Using Yet-mp4/en.srt', target_sub='E:/video/_video_out/The Most Powerful DJ App You’re Not Using Yet-mp4/zh-cn.srt', source_wav='E:/AIfanyi/tmp/28316/ca66ce197b/en.wav', source_wav_output='E:/video/_video_out/The Most Powerful DJ App You’re Not Using Yet-mp4/en.m4a', target_wav='E:/AIfanyi/tmp/28316/ca66ce197b/target.wav', target_wav_output='E:/video/_video_out/The Most Powerful DJ App You’re Not Using Yet-mp4/zh-cn.m4a', subtitles='', novoice_mp4='E:/AIfanyi/tmp/28316/ca66ce197b/novoice.mp4', noextname='The Most Powerful DJ App You’re Not Using Yet', shibie_audio=None, targetdir_mp4='E:/video/_video_out/The Most Powerful DJ App You’re Not Using Yet-mp4/The Most Powerful DJ App You’re Not Using Yet.mp4', instrument=None, vocal=None, back_audio='', background_music=None, app_mode='tiqu', subtitle_type=0, volume='+0%', pitch='+0Hz', voice_rate='+0%', voice_role='No', copysrt_rawvideo=False, clear_cache=True, translate_type=4, tts_type=0, recogn_type=0, model_name='large-v3-turbo', split_type=0, voice_autorate=False, video_autorate=False, cuda=True, name='E:/video/The Most Powerful DJ App You’re Not Using Yet.mp4', basename='The Most Powerful DJ App You’re Not Using Yet.mp4', ext='mp4', dirname='E:/video', shound_del_name=None, uuid='ca66ce197b', remove_silent_mid=False, align_sub_audio=True, enable_diariz=False, nums_diariz=0, rephrase=2, only_out_mp4=False)

=
system:Windows-10-10.0.19045-SP0
version:v3.91
frozen:True
language:zh
root_dir:E:/AIfanyi

Floor #1 DeepSeek reply at 6 months ago

根据错误信息，问题在于视频文件 The Most Powerful DJ App You’re Not Using Yet.mp4 在语音识别阶段没有检测到任何人类说话声音。这通常由以下几种原因导致：

视频本身无声或人声过少：请先确认该视频是否包含清晰的人声对话或旁白。
语言选择错误：您在识别阶段选择的“原始语言”可能与视频实际语言不匹配。虽然日志显示 source_language_code='en'（英语），但视频可能包含其他语言或口音。
VAD（语音活动检测）参数过于严格：默认的语音检测阈值可能过滤掉了部分人声。

请按以下步骤排查：

第一步：检查视频内容

使用播放器打开原视频，确认其中包含清晰可辨的人声。

第二步：调整识别设置

在主界面，确保“原始语言”选择正确。如果不确定，请选择 auto（自动检测）。
如果视频人声清晰但依然识别失败，请尝试调整 VAD 参数以提高灵敏度：
- 点击菜单栏 工具 -> 高级选项。
- 在“Faster/OpenAI/Whisper 语音识别微调”区域，找到“VAD 参数”。
- 将“语音阈值”从默认的 0.45 调低（例如改为 0.3），这会使检测更灵敏。
- 将“静音分割持续毫秒”从默认的 140 调高（例如改为 300），这会使语音片段合并，避免过短片段被忽略。

第三步：尝试其他识别模式

在主界面的“语音识别”行，将“识别模式”从“整体识别”切换为“批量推理”，然后重试。

第四步：检查文件路径

错误日志显示您的视频路径为 E:/video/The Most Powerful DJ App You’re Not Using Yet.mp4。Windows 系统下，文件路径和名称中包含特殊字符（如 ’）可能导致处理异常。
解决方案：请将视频文件重命名，移除所有特殊字符和空格（例如改为 The_Most_Powerful_DJ_App_You_Are_Not_Using_Yet.mp4），然后重新拖入软件处理。

如果以上步骤均无效，请提供视频的更多信息（如时长、人声部分的大致时间点），以便进一步分析。

#2282 TaskCfg(cache_folder='E:/AIfanyi/tmp/28316/ca66ce197b', target_dir='E:/video/_video_out/The Most Powerful DJ App You’re

1 Replies

Post Your Reply

Related resource