#5271 TaskCfgVTT(uuid='6ad3492ec2', name='C:/Users/zwsoft/Desktop/3X Milling Improvements_ENG_Finished.mp4', dirname='C:/Users

31.22* Posted at: 2 days ago

Batch size mismatch: audio=8, context=0:Traceback (most recent call last):
  File "videotrans\process\stt_fun.py", line 559, in qwen3asr_fun
  File "torch\utils\_contextlib.py", line 116, in decorate_context
    return func(*args, **kwargs)
  File "D:\videotrans\_internal\qwen_asr\inference\qwen3_asr.py", line 345, in transcribe
    raise ValueError(f"Batch size mismatch: audio={n}, context={len(ctxs)}")
ValueError: Batch size mismatch: audio=8, context=0
[Qwen-ASR (local, built-in), DeepSeek, Edge-TTS (free)]
Traceback (most recent call last):
  File "videotrans\task\only_one.py", line 47, in run
  File "videotrans\task\trans_create.py", line 317, in recogn
  File "videotrans\recognition\__init__.py", line 190, in run
  File "videotrans\recognition\_base.py", line 94, in run
  File "videotrans\recognition\_qwenasrlocal.py", line 45, in _exec
  File "videotrans\configure\base.py", line 268, in _new_process
videotrans.configure.excepts.VideoTransError: Batch size mismatch: audio=8, context=0:Traceback (most recent call last):
  File "videotrans\process\stt_fun.py", line 559, in qwen3asr_fun
  File "torch\utils\_contextlib.py", line 116, in decorate_context
    return func(*args, **kwargs)
  File "D:\videotrans\_internal\qwen_asr\inference\qwen3_asr.py", line 345, in transcribe
    raise ValueError(f"Batch size mismatch: audio={n}, context={len(ctxs)}")
ValueError: Batch size mismatch: audio=8, context=0
TaskCfgVTT(uuid='6ad3492ec2', name='C:/Users/zwsoft/Desktop/3X Milling Improvements_ENG_Finished.mp4', dirname='C:/Users/zwsoft/Desktop', noextname='3X Milling Improvements_ENG_Finished', basename='3X Milling Improvements_ENG_Finished.mp4', ext='mp4', target_dir='3X Milling Improvements_ENG_Finished-mp4', cache_folder='D:/videotrans/tmp/22188/6ad3492ec2', is_cuda=True, source_language='英语', source_language_code='en', source_sub='3X Milling Improvements_ENG_Finished-mp4/en.srt', source_wav='D:/videotrans/tmp/22188/6ad3492ec2/en.wav', source_wav_output='3X Milling Improvements_ENG_Finished-mp4/en.m4a', target_language='简体中文', target_language_code='zh-cn', target_sub='3X Milling Improvements_ENG_Finished-mp4/zh-cn.srt', target_wav='D:/videotrans/tmp/22188/6ad3492ec2/target.wav', target_wav_output='3X Milling Improvements_ENG_Finished-mp4/zh-cn.m4a', translate_type=4, tts_type=0, volume='+0%', pitch='+0Hz', voice_rate='+0%', voice_role='No', voice_autorate=True, video_autorate=False, remove_silent_mid=False, align_sub_audio=True, detect_language='en', recogn_type=2, model_name='1.7B', shibie_audio=None, remove_noise=False, enable_diariz=False, nums_diariz=0, rephrase=0, fix_punc=0, subtitle_language=None, app_mode='biaozhun', subtitles='', targetdir_mp4='3X Milling Improvements_ENG_Finished-mp4/3X Milling Improvements_ENG_Finished.mp4', novoice_mp4='D:/videotrans/tmp/22188/6ad3492ec2/novoice.mp4', is_separate=True, embed_bgm=True, instrument='D:/videotrans/tmp/22188/6ad3492ec2/instrument.wav', vocal='D:/videotrans/tmp/22188/6ad3492ec2/vocal.wav', clear_cache=True, background_music='', subtitle_type=1, only_out_mp4=False, recogn2pass=False, output_srt=2, copysrt_rawvideo=False, loop_backaudio=1, backaudio_volume=0.8)


=
system:Windows-10-10.0.22631-SP0
version:v4.02
frozen:True
language:zh
root_dir:D:/videotrans

1 Reply

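This is an internal consistency error raised by the local Qwen-ASR model: it received 8 audio segments but 0 context entries for them. This usually happens when the separated vocal file (vocal.wav) is empty or contains only a very short stretch of silence, so the model cannot extract valid context.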
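
The one thing the log states directly is the check at line 345 of qwen3_asr.py, which compares the number of audio chunks with the number of context entries. The sketch below mimics that kind of check. `check_batch` and `pad_contexts` are hypothetical names, not functions from qwen_asr or videotrans, and the padding guard is only an assumed way a caller might avoid the mismatch, not the project's actual fix.

```python
def check_batch(audios, contexts):
    """Hypothetical stand-in for the length check seen in the traceback."""
    n = len(audios)
    ctxs = list(contexts)
    if len(ctxs) != n:
        # Same message format as the line quoted in the traceback.
        raise ValueError(f"Batch size mismatch: audio={n}, context={len(ctxs)}")
    return list(zip(audios, ctxs))


def pad_contexts(audios, contexts, default=""):
    """Assumed caller-side guard: one context entry per audio chunk."""
    ctxs = list(contexts or [])
    if not ctxs:
        # No context supplied at all: fall back to an empty string per chunk.
        ctxs = [default] * len(audios)
    return ctxs
```

With 8 chunks and an empty context list, `check_batch` raises the same message as in the report; passing the result of `pad_contexts` first lets the check succeed.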

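Troubleshoot in the following order:

  1. Uncheck "Separate vocals and background audio" (分离人声背景声)
    Main window → "Set more parameters" (设置更多参数) → uncheck the vocal/background separation option, then run the task again. If it now succeeds, the separated vocal file was corrupt or silent.
  2. Check the vocal file
    Open D:/videotrans/tmp/22188/6ad3492ec2/vocal.wav and confirm that the file is larger than 1 MB and plays back normal speech. If the file is tiny or silent, redo the separation with a dedicated tool (such as UVR5), or feed in the original video without separation.
  3. Switch the speech recognition channel
    Stop using Qwen-ASR (local) and switch to faster-whisper (local) with the large-v3 model. This channel is more stable and tolerates empty audio segments better.
  4. Update the software
    Confirm that you are on v4.02. If a newer patch is available, overwrite sp.exe with it and retry. This error occurred occasionally in earlier versions and has since been partially mitigated.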
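
The check in step 2 can also be done without opening an audio player. The following is a minimal sketch using only the Python standard library. It assumes vocal.wav is 16-bit PCM (not confirmed by the report), and the `min_seconds` and `min_peak` thresholds are arbitrary illustrative values, not limits used by videotrans.

```python
import os
import sys
import wave
from array import array


def inspect_wav(path):
    """Return (size_bytes, duration_seconds, peak_amplitude) for a 16-bit PCM WAV."""
    size = os.path.getsize(path)
    with wave.open(path, "rb") as wf:
        if wf.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        frames = wf.getnframes()
        rate = wf.getframerate()
        data = wf.readframes(frames)
    # Drop a trailing odd byte so the buffer splits cleanly into 16-bit samples.
    data = data[: len(data) - len(data) % 2]
    samples = array("h")
    samples.frombytes(data)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV samples are stored little-endian
    peak = max((abs(s) for s in samples), default=0)
    return size, frames / rate, peak


def looks_usable(path, min_seconds=1.0, min_peak=500):
    """Heuristic: long enough and not (near-)silent. Thresholds are assumptions."""
    _, seconds, peak = inspect_wav(path)
    return seconds >= min_seconds and peak >= min_peak
```

For example, `inspect_wav("D:/videotrans/tmp/22188/6ad3492ec2/vocal.wav")` would report the size, duration, and loudest sample of the file named in step 2. A peak near 0 suggests the separation produced silence.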

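If none of the steps above help, please provide a listing of all files in the tmp/22188/6ad3492ec2/ directory, together with the full text of the latest log in the logs/ folder, so the problem can be narrowed down further.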
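
A file listing and the newest log can be collected with a few lines of standard-library Python. This is only a sketch: `list_files` and `newest_file` are hypothetical helper names, and it assumes the logs/ folder sits directly under root_dir (D:/videotrans), which the report does not state explicitly.

```python
from pathlib import Path


def list_files(folder):
    """Return sorted (relative_path, size_bytes) pairs for every file under folder."""
    root = Path(folder)
    return sorted(
        (p.relative_to(root).as_posix(), p.stat().st_size)
        for p in root.rglob("*")
        if p.is_file()
    )


def newest_file(folder):
    """Return the most recently modified file directly inside folder, or None."""
    files = [p for p in Path(folder).iterdir() if p.is_file()]
    return max(files, key=lambda p: p.stat().st_mtime, default=None)
```

Calling `list_files("D:/videotrans/tmp/22188/6ad3492ec2")` and `newest_file("D:/videotrans/logs")` would give the two pieces of information requested. The task config shows clear_cache=True, so the cache folder may already be gone once the task has ended.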
