#3977 TaskCfgSTT(is_cuda=True, uuid='96b04c140b', cache_folder='E:/Fanyizimu/tmp/15364/96b04c140b', target_dir='E:/Fanyizimu/o

124.91* Posted at: 3 months ago

语音识别阶段出错 [faster-whisper(本地)] 出错了，可能内存或显存不足 Model:large-v3-turbo GPU0
Traceback (most recent call last):
File "videotrans\configure\_base.py", line 281, in _new_process
File "videotrans\process\signelobj.py", line 80, in submit_task_gpu
File "concurrent\futures\process.py", line 720, in submit
concurrent.futures.process.BrokenProcessPool: A child process terminated abruptly, the process pool is not usable anymore

Traceback (most recent call last):
File "videotrans\configure\_base.py", line 281, in _new_process
File "videotrans\process\signelobj.py", line 80, in submit_task_gpu
File "concurrent\futures\process.py", line 720, in submit
concurrent.futures.process.BrokenProcessPool: A child process terminated abruptly, the process pool is not usable anymore

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "videotrans\task\job.py", line 105, in run
File "videotrans\task\_speech2text.py", line 159, in recogn
File "videotrans\recognition\__init__.py", line 292, in run
File "videotrans\recognition\_base.py", line 143, in run
File "videotrans\recognition\_overall.py", line 33, in _exec
File "videotrans\recognition\_overall.py", line 105, in _faster
File "videotrans\configure\_base.py", line 303, in _new_process
RuntimeError: 出错了，可能内存或显存不足 Model:large-v3-turbo GPU0
Traceback (most recent call last):
File "videotrans\configure\_base.py", line 281, in _new_process
File "videotrans\process\signelobj.py", line 80, in submit_task_gpu
File "concurrent\futures\process.py", line 720, in submit
concurrent.futures.process.BrokenProcessPool: A child process terminated abruptly, the process pool is not usable anymore
TaskCfgSTT(is_cuda=True, uuid='96b04c140b', cache_folder='E:/Fanyizimu/tmp/15364/96b04c140b', target_dir='E:/Fanyizimu/output/recogn', source_language=None, source_language_code=None, source_sub=None, source_wav=None, source_wav_output=None, target_language=None, target_language_code=None, target_sub='E:/Fanyizimu/output/recogn/FC2PPV-3143218.srt', target_wav=None, target_wav_output=None, name='E:/123/FC2PPV-3143218.mp4', noextname='FC2PPV-3143218', basename='FC2PPV-3143218.mp4', ext='mp4', dirname='E:/123', shound_del_name=None, detect_language='ja', recogn_type=0, model_name='large-v3-turbo', shibie_audio='E:/Fanyizimu/tmp/15364/96b04c140b/FC2PPV-3143218-1775647440.7489362.wav', remove_noise=False, enable_diariz=False, nums_diariz=0, rephrase=0, fix_punc=False)

=
system:Windows-10-10.0.26200-SP0
version:v3.98
frozen:True
language:zh
root_dir:E:/Fanyizimu

Floor #1 DeepSeek reply at 3 months ago

这是由于显存或内存不足导致的。错误信息明确指出在使用 faster-whisper(本地) 的 large-v3-turbo 模型时，GPU0 的显存或系统内存不足以完成语音识别任务。

解决方案：

使用更小的模型：将语音识别模型从 large-v3-turbo 更换为 medium、small 或 base。large-v3-turbo 模型需要大量显存/内存。
调整高级设置：在菜单栏 工具/选项 -> 高级选项 中进行如下修改，以降低显存/内存占用：
- CUDA数据类型: 将 float32 改为 float16 或 int8。
- 识别准确度beam_size: 将 5 改为 1。
- 识别准确度best_of: 将 5 改为 1。
- 启用上下文感知: 将 true 改为 false。
检查并关闭二次识别：如果同时翻译多个视频，请确保主界面右上角的二次识别选项未被勾选。该选项会同时运行多个识别任务，可能导致显存需求翻倍。
检查GPU任务数：在 工具/选项 -> 高级选项 -> 通用设置 中，确保 GPU同时任务数[重启生效] 设置为 1（除非您的单卡显存大于20GB）。
清理缓存并重启：如果之前处理过其他任务，可以尝试关闭软件，手动删除软件根目录下的 tmp 文件夹，然后重新启动软件再试。

请查阅相关文档：

总是提示显存不足

#3977 TaskCfgSTT(is_cuda=True, uuid='96b04c140b', cache_folder='E:/Fanyizimu/tmp/15364/96b04c140b', target_dir='E:/Fanyizimu/o

1 Replies

请查阅相关文档：

Post Your Reply

Related resource