#3533 TaskCfgVTT(is_cuda=False, uuid='342bdc5d2a', cache_folder='D:/数字媒体/win-pyvideotrans-v3.97-0304/tmp/5424/342bdc5d2a', tar

2409:895a* Posted at: 2 hours ago 👁9

语音识别阶段出错 [faster-whisper(本地)] Traceback (most recent call last):
File "videotrans\process\stt_fun.py", line 346, in faster_whisper
File "faster_whisper\transcribe.py", line 689, in init
RuntimeError: File model.bin is incomplete: failed to read a buffer of size 13107200 at position 1038051066

Traceback (most recent call last):
File "videotrans\task\job.py", line 105, in run
File "videotrans\task\trans_create.py", line 353, in recogn
File "videotrans\recognition\__init__.py", line 265, in run
File "videotrans\recognition\_base.py", line 143, in run
File "videotrans\recognition\_overall.py", line 33, in _exec
File "videotrans\recognition\_overall.py", line 105, in _faster
File "videotrans\configure\_base.py", line 288, in _new_process
RuntimeError: Traceback (most recent call last):
File "videotrans\process\stt_fun.py", line 346, in faster_whisper
File "faster_whisper\transcribe.py", line 689, in init
RuntimeError: File model.bin is incomplete: failed to read a buffer of size 13107200 at position 1038051066
TaskCfgVTT(is_cuda=False, uuid='342bdc5d2a', cache_folder='D:/数字媒体/win-pyvideotrans-v3.97-0304/tmp/5424/342bdc5d2a', target_dir='D:/数字媒体/_video_out/5. 量化 - 数字音频基础【机翻双字】-mp4', source_language='英语', source_language_code='en', source_sub='D:/数字媒体/_video_out/5. 量化 - 数字音频基础【机翻双字】-mp4/en.srt', source_wav='D:/数字媒体/win-pyvideotrans-v3.97-0304/tmp/5424/342bdc5d2a/en.wav', source_wav_output='D:/数字媒体/_video_out/5. 量化 - 数字音频基础【机翻双字】-mp4/en.m4a', target_language='简体中文', target_language_code='zh-cn', target_sub='D:/数字媒体/_video_out/5. 量化 - 数字音频基础【机翻双字】-mp4/zh-cn.srt', target_wav='D:/数字媒体/win-pyvideotrans-v3.97-0304/tmp/5424/342bdc5d2a/target.wav', target_wav_output='D:/数字媒体/_video_out/5. 量化 - 数字音频基础【机翻双字】-mp4/zh-cn.m4a', name='D:/数字媒体/5. 量化 - 数字音频基础【机翻双字】.mp4', noextname='5. 量化 - 数字音频基础【机翻双字】', basename='5. 量化 - 数字音频基础【机翻双字】.mp4', ext='mp4', dirname='D:/数字媒体', shound_del_name=None, translate_type=0, tts_type=0, volume='+0%', pitch='+0Hz', voice_rate='+0%', voice_role='Xiaoxiao(Female/CN)', voice_autorate=True, video_autorate=False, remove_silent_mid=False, align_sub_audio=True, detect_language='en', recogn_type=0, model_name='large-v3', shibie_audio=None, remove_noise=False, enable_diariz=False, nums_diariz=0, rephrase=0, fix_punc=False, subtitle_language=None, app_mode='biaozhun', subtitles='', targetdir_mp4='D:/数字媒体/_video_out/5. 量化 - 数字音频基础【机翻双字】-mp4/5. 量化 - 数字音频基础【机翻双字】.mp4', novoice_mp4='D:/数字媒体/win-pyvideotrans-v3.97-0304/tmp/5424/342bdc5d2a/novoice.mp4', is_separate=False, instrument=None, vocal=None, back_audio='', clear_cache=False, background_music=None, subtitle_type=1, only_out_mp4=False, recogn2pass=True, output_srt=0, copysrt_rawvideo=False)


=
system:Windows-10-10.0.26100-SP0
version:v3.97
frozen:True
language:zh
root_dir:D:/数字媒体/win-pyvideotrans-v3.97-0304

1 Replies

这是由于 faster-whisper(本地) 语音识别模型文件 model.bin 下载不完整或已损坏导致的。

解决方案:

  1. 手动下载并替换模型文件

    • 根据你使用的模型 large-v3,在软件根目录下的 models 文件夹内,创建名为 models--Systran--faster-whisper-large-v3 的文件夹。
    • 访问模型下载地址:https://huggingface.co/Systran/faster-whisper-large-v3/tree/main
    • 将该页面所有 .json.bin.txt 文件下载下来,并复制到上一步创建的文件夹内(覆盖已存在的文件)。
  2. 检查路径:确保你的软件安装路径(D:/数字媒体/win-pyvideotrans-v3.97-0304/)不包含中文字符。建议将软件解压到纯英文路径下,如 D:\pyVideoTrans,然后重新尝试。
  3. 清理缓存:在软件主界面,勾选“清理已生成”选项,然后重新处理视频,以确保使用新下载的模型文件。

注意:手动下载模型时,如果访问 HuggingFace 官网困难,可以尝试使用镜像站 https://hf-mirror.com,但需注意网络稳定性。

请查阅相关文档:

Post Your Reply
Open source and free maintenance is not easy. If this project is helpful to you, please consider making a small donation to help the project continue to maintain and update.

Related resource