#3324 TaskCfg(cache_folder='D:/Program Files/win-pyvideotrans-v3.97/tmp/18236/0ad1796be1', target_dir='D:/绿色软件/win-pyvideotran

120.211* Posted at: 19 hours ago 👁16

语音识别阶段出错 [faster-whisper(本地)] Traceback (most recent call last):
File "videotrans\process\stt_fun.py", line 264, in faster_whisper
File "faster_whisper\transcribe.py", line 586, in _batched_segments_generator
File "faster_whisper\transcribe.py", line 120, in forward
File "faster_whisper\transcribe.py", line 209, in generate_segment_batched
File "faster_whisper\transcribe.py", line 1400, in encode
RuntimeError: mkl_malloc: failed to allocate memory

Traceback (most recent call last):
File "videotrans\task\job.py", line 106, in run
File "videotrans\task\_speech2text.py", line 156, in recogn
File "videotrans\recognition\__init__.py", line 268, in run
File "videotrans\recognition\_base.py", line 141, in run
File "videotrans\recognition\_overall.py", line 63, in _exec
File "videotrans\recognition\_overall.py", line 142, in _faster
File "videotrans\configure\_base.py", line 291, in _new_process
RuntimeError: Traceback (most recent call last):
File "videotrans\process\stt_fun.py", line 264, in faster_whisper
File "faster_whisper\transcribe.py", line 586, in _batched_segments_generator
File "faster_whisper\transcribe.py", line 120, in forward
File "faster_whisper\transcribe.py", line 209, in generate_segment_batched
File "faster_whisper\transcribe.py", line 1400, in encode
RuntimeError: mkl_malloc: failed to allocate memory
TaskCfg(cache_folder='D:/Program Files/win-pyvideotrans-v3.97/tmp/18236/0ad1796be1', target_dir='D:/绿色软件/win-pyvideotrans-v3.97/output/recogn', remove_noise=False, is_separate=False, detect_language='en', subtitle_language=None, source_language=None, target_language=None, source_language_code=None, target_language_code=None, source_sub=None, target_sub='D:/绿色软件/win-pyvideotrans-v3.97/output/recogn/01-Introduction 01-Track 1 (1).srt', source_wav=None, source_wav_output=None, target_wav=None, target_wav_output=None, subtitles='', novoice_mp4=None, noextname='01-Introduction 01-Track 1 (1)', shibie_audio='D:/Program Files/win-pyvideotrans-v3.97/tmp/18236/0ad1796be1/01-Introduction 01-Track 1 (1)-1771817670.1137197.wav', targetdir_mp4=None, instrument=None, vocal=None, back_audio=None, background_music=None, app_mode='biaozhun', subtitle_type=0, volume='+0%', pitch='+0Hz', voice_rate='+0%', voice_role=None, copysrt_rawvideo=False, clear_cache=False, translate_type=None, tts_type=None, recogn_type=0, model_name='small.en', voice_autorate=False, video_autorate=False, is_cuda=False, name='D:/Downloads/01-Introduction 01-Track 1 (1).mp3', basename='01-Introduction 01-Track 1 (1).mp3', ext='mp3', dirname='D:/Downloads/q', shound_del_name=None, uuid='0ad1796be1', remove_silent_mid=False, align_sub_audio=True, enable_diariz=False, nums_diariz=0, rephrase=0, only_out_mp4=False, fix_punc=False, recogn2pass=False)


=
system:Windows-10-10.0.19045-SP0
version:v3.97
frozen:True
language:zh
root_dir:D:/Program Files/win-pyvideotrans-v3.97

1 Replies

这是由于系统内存(RAM)不足导致 faster-whisper 模型在加载或处理时分配内存失败。

解决方案:

  1. 关闭不必要的程序:关闭浏览器、其他大型软件,释放系统内存。
  2. 降低模型大小:在语音识别设置中,将模型从 small.en 更换为更小的模型,例如 tiny.enbase.en,以减少内存占用。
  3. 启用 CUDA 加速:如果您的电脑有 NVIDIA 显卡,请安装 CUDA 12.8 和 cuDNN 9.x,然后在软件设置中开启 GPU 加速,将计算负载从内存转移到显存。
  4. 检查软件安装路径:您的软件安装在 D:/Program Files/win-pyvideotrans-v3.97,该路径包含空格。虽然预打包版对此有一定容忍度,但最佳实践是将其移动到无空格和无中文的路径,例如 D:\pyVideoTrans
  5. 增加虚拟内存:在 Windows 系统中适当增加页面文件大小。

参考文档:

Post Your Reply