#4767 TaskCfgSTT(uuid='4d27f9aac5', name='N:/Gwenmedia.mp4', dirname='N:/', noextname='Gwenmedia', basename='Gwenmedia.mp4', e

124.165* Posted at: 7 hours ago

Error during the speech recognition stage [faster-whisper (local)]: No transcription results returned. Please check the original audio/video or model and try again.
Traceback (most recent call last):
  File "videotrans\task\job.py", line 54, in run
  File "videotrans\task\job.py", line 119, in process_task
  File "videotrans\task\speech2text.py", line 125, in recogn
  File "videotrans\recognition\__init__.py", line 188, in run
  File "videotrans\recognition\_base.py", line 93, in run
  File "videotrans\recognition\_whisper.py", line 35, in _exec
  File "videotrans\recognition\_whisper.py", line 108, in _faster
  File "videotrans\configure\base.py", line 240, in _new_process
videotrans.configure.excepts.VideoTransError: No transcription results returned. Please check the original audio/video or model and try again.
TaskCfgSTT(uuid='4d27f9aac5', name='N:/Gwenmedia.mp4', dirname='N:/', noextname='Gwenmedia', basename='Gwenmedia.mp4', ext='mp4', target_dir='O:/win-pyvideotrans-v4.00-525/output/recogn', cache_folder='O:/win-pyvideotrans-v4.00-525/tmp/896/4d27f9aac5', is_cuda=False, source_language=None, source_language_code=None, source_sub=None, source_wav=None, source_wav_output=None, target_language=None, target_language_code=None, target_sub='O:/win-pyvideotrans-v4.00-525/output/recogn/Gwenmedia.srt', target_wav=None, target_wav_output=None, detect_language='en', recogn_type=0, model_name='small', shibie_audio='O:/win-pyvideotrans-v4.00-525/tmp/896/4d27f9aac5/Gwenmedia-1779962377.3844447.wav', remove_noise=False, enable_diariz=False, nums_diariz=0, rephrase=0, fix_punc=False)


=
system:Windows-10-10.0.26100-SP0
version:v4.00
frozen:True
language:zh
root_dir:O:/win-pyvideotrans-v4.00-525

1 Reply

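The direct cause of the error is that faster-whisper did not recognize any speech in the audio. This usually happens in one of three situations:

  1. The original audio/video file is silent, or the speech is drowned out by heavy noise.
  2. The source language is set incorrectly (here the detected language is en; if the audio is not actually English, the model may fail to recognize anything).
  3. The model is too small, or the audio format is incompatible.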

Please try the following steps in order:

1. Verify that the audio is valid
Open N:/Gwenmedia.mp4 in another player and confirm that it plays normally and actually contains speech.

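2. Specify the source language explicitly
In the main window, do not leave the speech language option (发音语言) on auto. Select the language actually spoken in the video (for example, choose Simplified Chinese (简体中文) for Chinese).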

3. Switch to a larger model
You are currently using small. Change it to medium or large-v3 (if CPU or memory is insufficient, try base first).

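4. Enable noise reduction
Tick the noise reduction option (降噪) in the upper-right corner of the main window. It can filter out some background interference.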

5. Switch to openai-whisper mode
In the speech recognition drop-down, select openai-whisper (local), then choose the medium model as well and retry.

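6. Check the file path
Make sure the video's folder path and file name contain only English letters and digits, with no Chinese characters or special symbols. If N:/ is a network drive or an external drive, copy the file to a local disk (such as D:/) first and try again.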
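
Steps 1 and 6 can be partly checked with a script before retrying. The following is only a rough sketch, not part of pyvideotrans: `path_is_plain_ascii` and `wav_rms` are hypothetical helper names, the set of allowed punctuation (`.`, `-`, `_`) is an assumption, and the loudness check only handles 16-bit PCM WAV files, so the mp4 would first have to be converted to WAV (for example with ffmpeg).

```python
import array
import math
import string
import sys
import wave
from pathlib import PureWindowsPath

# Assumption: besides letters and digits, '.', '-' and '_' are treated as safe.
ALLOWED = set(string.ascii_letters + string.digits + "._-")


def path_is_plain_ascii(path):
    """Return True if every folder/file name in a Windows path uses only ALLOWED characters."""
    p = PureWindowsPath(path)
    # Skip the drive/root part (e.g. 'N:\\'), which always contains ':' and a separator.
    names = p.parts[1:] if p.anchor else p.parts
    return all(set(name) <= ALLOWED for name in names)


def wav_rms(path):
    """Return the RMS level of a 16-bit PCM WAV file, scaled to the range 0..1."""
    with wave.open(path, "rb") as wf:
        if wf.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM WAV files")
        frames = wf.readframes(wf.getnframes())
    samples = array.array("h")
    samples.frombytes(frames)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV data is little-endian
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples)) / 32768.0
```

Calling `path_is_plain_ascii("N:/Gwenmedia.mp4")` returns `True`, so the file name itself is not the problem here; the drive type still needs to be checked by hand. A `wav_rms` result of exactly 0.0 means the file is digitally silent. What counts as "too quiet" above that is a judgment call, so no threshold is suggested.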

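If none of the above works, please provide the latest log file from the logs/ folder (copy about 30 lines before and after the error) so the problem can be analyzed further.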
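
Collecting that log excerpt can be scripted. This is a minimal sketch under assumptions: the log files are plain UTF-8 text, and the error can be located by searching for the word `Traceback`. The helper names `newest_log` and `excerpt_around_error` are hypothetical and not part of pyvideotrans.

```python
from pathlib import Path


def newest_log(logs_dir):
    """Return the most recently modified file in logs_dir, or None if it has no files."""
    files = [p for p in Path(logs_dir).iterdir() if p.is_file()]
    return max(files, key=lambda p: p.stat().st_mtime, default=None)


def excerpt_around_error(log_path, marker="Traceback", context=30):
    """Return up to `context` lines before and after the last line containing `marker`."""
    text = Path(log_path).read_text(encoding="utf-8", errors="replace")
    lines = text.splitlines()
    hits = [i for i, line in enumerate(lines) if marker in line]
    if not hits:
        return []  # marker not found; nothing to excerpt
    last = hits[-1]
    start = max(0, last - context)
    end = min(len(lines), last + context + 1)
    return lines[start:end]
```

For example, `"\n".join(excerpt_around_error(newest_log("logs")))` would give text ready to paste into a reply, assuming the script is run from the program's root directory.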
