#4144 TaskCfgSTT(is_cuda=True, uuid='0f4c8160a8', cache_folder='D:/xiazai/win-pyvideotrans-v3.98-0410/tmp/16640/0f4c8160a8', t

240e:38c* Posted at: 12 hours ago 👁16

语音识别阶段出错 [faster-whisper(本地)] 出错了,可能内存或显存不足 Model:large-v3-turbo GPU0
Traceback (most recent call last):
File "videotrans\configure\_base.py", line 285, in _new_process
File "concurrent\futures\_base.py", line 458, in result
File "concurrent\futures\_base.py", line 403, in __get_result
concurrent.futures.process.BrokenProcessPool: A process in the process pool was terminated abruptly while the future was running or pending.

Traceback (most recent call last):
File "videotrans\configure\_base.py", line 285, in _new_process
File "concurrent\futures\_base.py", line 458, in result
File "concurrent\futures\_base.py", line 403, in __get_result
concurrent.futures.process.BrokenProcessPool: A process in the process pool was terminated abruptly while the future was running or pending.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "videotrans\task\job.py", line 105, in run
File "videotrans\task\_speech2text.py", line 159, in recogn
File "videotrans\recognition\__init__.py", line 292, in run
File "videotrans\recognition\_base.py", line 143, in run
File "videotrans\recognition\_overall.py", line 33, in _exec
File "videotrans\recognition\_overall.py", line 105, in _faster
File "videotrans\configure\_base.py", line 303, in _new_process
RuntimeError: 出错了,可能内存或显存不足 Model:large-v3-turbo GPU0
Traceback (most recent call last):
File "videotrans\configure\_base.py", line 285, in _new_process
File "concurrent\futures\_base.py", line 458, in result
File "concurrent\futures\_base.py", line 403, in __get_result
concurrent.futures.process.BrokenProcessPool: A process in the process pool was terminated abruptly while the future was running or pending.
TaskCfgSTT(is_cuda=True, uuid='0f4c8160a8', cache_folder='D:/xiazai/win-pyvideotrans-v3.98-0410/tmp/16640/0f4c8160a8', target_dir='D:/xiazai/win-pyvideotrans-v3.98-0410/output/recogn', source_language=None, source_language_code=None, source_sub=None, source_wav=None, source_wav_output=None, target_language=None, target_language_code=None, target_sub='D:/xiazai/win-pyvideotrans-v3.98-0410/output/recogn/第一部分.srt', target_wav=None, target_wav_output=None, name='D:/xiazai/第一部分.mp3', noextname='第一部分', basename='第一部分.mp3', ext='mp3', dirname='D:/xiazai', shound_del_name=None, detect_language='hi', recogn_type=0, model_name='large-v3-turbo', shibie_audio='D:/xiazai/win-pyvideotrans-v3.98-0410/tmp/16640/0f4c8160a8/第一部分-1776516915.4165194.wav', remove_noise=False, enable_diariz=False, nums_diariz=0, rephrase=0, fix_punc=False)


=
system:Windows-10-10.0.26200-SP0
version:v3.98
frozen:True
language:zh
root_dir:D:/xiazai/win-pyvideotrans-v3.98-0410

之前用也没问题,突然怎么会报这个错误

1 Replies

这是由于显存或内存不足导致语音识别进程崩溃。错误信息明确指出“可能内存或显存不足 Model:large-v3-turbo GPU0”。虽然之前使用正常,但处理不同视频时(如更长、更复杂的音频)或系统后台有其他程序占用资源,都可能突然触发此问题。

解决方案:

  1. 降低模型规格:在语音识别渠道的下拉菜单中,将模型从 large-v3-turbo 更换为更小的模型,例如 mediumsmallbaselarge-v3 系列模型最低需要约 10GB 显存。
  2. 调整高级设置:前往 菜单 -> 工具/选项 -> 高级选项,在“语音识别参数”区域进行以下调整,以降低资源消耗:

    • CUDA数据类型:将 float32 改为 float16int8
    • beam_size:将 5 改为 1
    • best_of:将 5 改为 1
    • 上下文:将 true 改为 false
  3. 检查二次识别:如果你在翻译视频时勾选了主界面右上角的“二次识别”,请先取消勾选。同时翻译多个视频时,二次识别可能占用双倍显存。
  4. 释放系统资源:关闭不必要的应用程序,尤其是占用大量显存/内存的软件(如游戏、浏览器、其他AI工具),然后重试。
  5. 检查多显卡:如果你有多个显卡,软件默认使用第一块(GPU0)。v3.99版本已优化为默认使用显存最大的显卡。可以考虑升级到v3.99版本。

请查阅相关文档:

Post Your Reply
Open source and free maintenance is not easy. If this project is helpful to you, please consider making a small donation to help the project continue to maintain and update.

Related resource