#5103 TaskCfgVTT(uuid='caa652c3ef', name='E:/short/zip/顾少的小宠妻/40.mp4', dirname='E:/short/zip/顾少的小宠妻', noextname='40', basename

240e:3b7* Posted at: 2 hours ago

Error during the speech recognition stage [Alibaba Bailian Qwen3-ASR]: index out of bounds while processing a list or sequence: list index out of range
Traceback (most recent call last):

File "videotrans\task\job.py", line 35, in run

File "videotrans\task\job.py", line 100, in process_task

File "videotrans\task\trans_create.py", line 322, in recogn

File "videotrans\recognition\__init__.py", line 190, in run

File "<string>", line 22, in __init__

File "videotrans\recognition\_qwen3asr.py", line 18, in __post_init__

File "videotrans\recognition\_base.py", line 214, in cut_audio

IndexError: list index out of range
TaskCfgVTT(uuid='caa652c3ef', name='E:/short/zip/顾少的小宠妻/40.mp4', dirname='E:/short/zip/顾少的小宠妻', noextname='40', basename='40.mp4', ext='mp4', target_dir='E:/short/zip/顾少的小宠妻/subtitles/40-mp4', cache_folder='E:/short/app/win-pyvideotrans-v4.01-0602/tmp/123284/caa652c3ef', is_cuda=True, source_language='简体中文', source_language_code='zh-cn', source_sub='E:/short/zip/顾少的小宠妻/subtitles/40-mp4/zh-cn.srt', source_wav='E:/short/app/win-pyvideotrans-v4.01-0602/tmp/123284/caa652c3ef/zh-cn.wav', source_wav_output='E:/short/zip/顾少的小宠妻/subtitles/40-mp4/zh-cn.m4a', target_language='英语', target_language_code='en', target_sub='E:/short/zip/顾少的小宠妻/subtitles/40-mp4/en.srt', target_wav='E:/short/app/win-pyvideotrans-v4.01-0602/tmp/123284/caa652c3ef/target.wav', target_wav_output='E:/short/zip/顾少的小宠妻/subtitles/40-mp4/en.m4a', translate_type=1, tts_type=0, volume='+0%', pitch='+0Hz', voice_rate='+0%', voice_role='No', voice_autorate=False, video_autorate=False, remove_silent_mid=False, align_sub_audio=True, detect_language='zh-cn', recogn_type=7, model_name='large-v3-turbo', shibie_audio=None, remove_noise=False, enable_diariz=False, nums_diariz=0, rephrase=0, fix_punc=False, subtitle_language=None, app_mode='tiqu', subtitles='', targetdir_mp4='E:/short/zip/顾少的小宠妻/subtitles/40-mp4/40.mp4', novoice_mp4='E:/short/app/win-pyvideotrans-v4.01-0602/tmp/123284/caa652c3ef/novoice.mp4', is_separate=False, embed_bgm=True, instrument=None, vocal='E:/short/app/win-pyvideotrans-v4.01-0602/tmp/123284/caa652c3ef/vocal.wav', clear_cache=True, background_music='', subtitle_type=0, only_out_mp4=False, recogn2pass=False, output_srt=0, copysrt_rawvideo=False, loop_backaudio=0, backaudio_volume=0.8)


=
system:Windows-10-10.0.19045-SP0
version:v4.01
frozen:True
language:zh
root_dir:E:/short/app/win-pyvideotrans-v4.01-0602

1 Replies

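This error occurs because the audio-cutting stage produced an empty list, so the cut_audio method went out of range when indexing into it. The usual cause is a problem with the audio file (no human voice, abnormal format) or VAD parameters that leave no valid speech segments detected.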
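
The failure mode can be sketched as follows. This is not the actual cut_audio code from videotrans\recognition\_base.py, which is not shown in this thread; the function name `last_segment_end` and the `(start_ms, end_ms)` tuple layout are assumptions made purely for illustration. It shows how indexing an empty segment list raises the same IndexError, and how an explicit check could turn it into a clearer message.

```python
from typing import List, Tuple

Segment = Tuple[int, int]  # (start_ms, end_ms); hypothetical layout


def last_segment_end(segments: List[Segment]) -> int:
    """Return the end time of the last speech segment.

    Raises ValueError with a descriptive message instead of a bare
    IndexError when no speech segment was detected at all.
    """
    if not segments:
        raise ValueError("no speech segments detected; check the audio or VAD settings")
    return segments[-1][1]


if __name__ == "__main__":
    empty: List[Segment] = []
    try:
        empty[-1]  # what an unguarded index does on an empty list
    except IndexError as exc:
        print("unguarded:", exc)
    try:
        last_segment_end(empty)
    except ValueError as exc:
        print("guarded:", exc)
    print(last_segment_end([(0, 1200), (1500, 3000)]))
```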

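Solutions:

  1. Check the audio source: confirm that the original video contains clear speech. Open the file in a media player and listen to it.
  2. Check the task configuration: the current app_mode='tiqu' (extraction mode), and the source language is Simplified Chinese. Make sure the speech in the video really is Chinese.
  3. Temporarily switch the recognition channel: in the speech recognition drop-down, select faster-whisper (local) with the large-v3 model to rule out a Qwen3-ASR-specific problem.
  4. Adjust the VAD parameters (if necessary):

    • Click Menu → Tools → Advanced Options → Speech Recognition Parameters
    • Lower the speech threshold from 0.5 to 0.3 (more sensitive)
    • Lower the silence split duration (ms) from 500 to 200
    • Uncheck "Whisper pre-split audio?"
  5. Process this video on its own: check "Clear generated" in the top-left corner of the main window, then try again.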
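
To illustrate why the VAD adjustments in step 4 can help, here is a toy threshold-based segmenter. It is not the VAD that pyVideoTrans uses; the function `find_speech`, the per-frame score list, and the 100 ms frame length are all assumptions. It only shows how a lower speech threshold and a shorter minimum-silence duration can turn an empty result into one or more segments.

```python
from typing import List, Optional, Tuple


def find_speech(scores: List[float], threshold: float,
                min_silence_ms: int, frame_ms: int = 100) -> List[Tuple[int, int]]:
    """Group frames whose score >= threshold into (start_ms, end_ms) segments.

    An open segment is closed only after at least min_silence_ms of
    consecutive below-threshold frames.
    """
    segments: List[Tuple[int, int]] = []
    start: Optional[int] = None  # start time of the open segment, if any
    last_voiced_end = 0          # end time of the most recent voiced frame
    silence = 0                  # silence accumulated inside the open segment
    for i, score in enumerate(scores):
        t = i * frame_ms
        if score >= threshold:
            if start is None:
                start = t
            last_voiced_end = t + frame_ms
            silence = 0
        elif start is not None:
            silence += frame_ms
            if silence >= min_silence_ms:
                segments.append((start, last_voiced_end))
                start = None
    if start is not None:
        segments.append((start, last_voiced_end))
    return segments


if __name__ == "__main__":
    # Quiet speech: no frame reaches 0.5, several reach 0.3.
    scores = [0.1, 0.4, 0.45, 0.1, 0.1, 0.35, 0.4, 0.1]
    print(find_speech(scores, threshold=0.5, min_silence_ms=500))  # []
    print(find_speech(scores, threshold=0.3, min_silence_ms=500))  # [(100, 700)]
    print(find_speech(scores, threshold=0.3, min_silence_ms=200))  # [(100, 300), (500, 700)]
```

With the default-like values (0.5, 500) the toy input yields no segments at all, which is the situation that leads to the empty list described above.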

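If it still fails, please provide the log file for the corresponding date from the software's logs/ directory (roughly the last 30 lines) so the output of the audio preprocessing stage can be investigated further.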
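
One way to collect the last 30 lines of a log is sketched below. The helper name `tail_lines` is an assumption, and the file name used in the usage note is a placeholder, since the thread does not say how the log files are named.

```python
from collections import deque
from pathlib import Path
from typing import List


def tail_lines(path: Path, n: int = 30) -> List[str]:
    """Return the last n lines of a text file.

    deque(maxlen=n) keeps only the most recent n lines while the file
    is read once from start to end.
    """
    with path.open("r", encoding="utf-8", errors="replace") as fh:
        return [line.rstrip("\n") for line in deque(fh, maxlen=n)]
```

For example, `print("\n".join(tail_lines(Path("logs/your-log-file.log"))))` prints the tail of a log, where `your-log-file.log` stands for whichever file in logs/ matches the date of the failed task.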
