#3672 TaskCfgSTT(is_cuda=True, uuid='a4f6bf975a', cache_folder='D:/video/win-pyvideotrans-v3.98-312/tmp/33160/a4f6bf975a', tar

185.241* Posted at: 1 day ago 👁21

语音识别阶段出错 [openai-whisper(本地)] 出错了,可能内存或显存不足
A child process terminated abruptly, the process pool is not usable anymore
Traceback (most recent call last):
File "videotrans\configure\_base.py", line 280, in _new_process
File "videotrans\process\signelobj.py", line 81, in submit_task_gpu
File "concurrent\futures\process.py", line 720, in submit
concurrent.futures.process.BrokenProcessPool: A child process terminated abruptly, the process pool is not usable anymore

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "videotrans\task\job.py", line 105, in run
File "videotrans\task\_speech2text.py", line 152, in recogn
File "videotrans\recognition\__init__.py", line 265, in run
File "videotrans\recognition\_base.py", line 143, in run
File "videotrans\recognition\_overall.py", line 31, in _exec
File "videotrans\recognition\_overall.py", line 73, in _openai
File "videotrans\configure\_base.py", line 294, in _new_process
RuntimeError: 出错了,可能内存或显存不足
A child process terminated abruptly, the process pool is not usable anymore
TaskCfgSTT(is_cuda=True, uuid='a4f6bf975a', cache_folder='D:/video/win-pyvideotrans-v3.98-312/tmp/33160/a4f6bf975a', target_dir='D:/video/win-pyvideotrans-v3.98-312/output/recogn', source_language=None, source_language_code=None, source_sub=None, source_wav=None, source_wav_output=None, target_language=None, target_language_code=None, target_sub='D:/video/win-pyvideotrans-v3.98-312/output/recogn/9. 怎样从股票玩你逆袭成你玩股票?.srt', target_wav=None, target_wav_output=None, name='H:/CRTubeGet Downloaded/奇衡解读股票作手操盘术/9. 怎样从股票玩你逆袭成你玩股票?.mp4', noextname='9. 怎样从股票玩你逆袭成你玩股票?', basename='9. 怎样从股票玩你逆袭成你玩股票?.mp4', ext='mp4', dirname='H:/CRTubeGet Downloaded/奇衡解读股票作手操盘术', shound_del_name=None, detect_language='zh-cn', recogn_type=1, model_name='tiny', shibie_audio='D:/video/win-pyvideotrans-v3.98-312/tmp/33160/a4f6bf975a/9. 怎样从股票玩你逆袭成你玩股票?-1773815388.0926006.wav', remove_noise=False, enable_diariz=False, nums_diariz=0, rephrase=0, fix_punc=False)


=
system:Windows-10-10.0.19045-SP0
version:v3.98
frozen:True
language:zh
root_dir:D:/video/win-pyvideotrans-v3.98-312

1 Replies

这是由于内存或显存不足导致子进程崩溃。错误信息明确指出“可能内存或显存不足”,且使用了 openai-whisper(本地)tiny 模型,虽然 tiny 模型较小,但在处理较长视频或并发任务时仍可能耗尽资源。

解决方案:

  1. 降低资源消耗

    • 在“语音识别”阶段,尝试将模型从 tiny 更换为更小的 distil 系列模型(如果视频是英语)或 faster-whisper 模式下的 tiny 模型(通常资源占用更低)。
    • 在“菜单-工具-高级选项”中,将“GPU同时任务数”设置为 1,并将“最大进程数”设置为较小的值(如 23),然后重启软件。
    • 如果视频较长,可以尝试将视频分割成较短的片段分别处理。
  2. 检查CUDA环境

    • 确认已正确安装 CUDA 12.8 和 cuDNN 9.x。在 CMD 中输入 nvcc -Vnvidia-smi 验证。
    • 如果未安装 CUDA 或安装不正确,请卸载后重新安装。安装后需重启计算机。
  3. 调整高级参数

    • 在“菜单-工具-高级选项-语音识别参数”中,尝试降低 识别准确度beam_size识别准确度best_of 的值(例如设为 1)。
    • 如果启用了“启用上下文感知”,请关闭此选项。
  4. 检查文件路径

    • 你的输入文件路径和名称较长且包含中文和特殊符号(如 )。这可能导致命令行处理异常。请将文件重命名为简短、无特殊符号的英文或数字名称(例如 9.mp4),并移动到较浅的目录(如 D:/video/input.mp4)再尝试处理。
  5. 使用命令行模式(可选)

    • 如果 GUI 模式持续失败,可以尝试使用命令行模式进行单任务处理,可能更稳定。命令示例:uv run cli.py --task stt --name "D:/video/input.mp4" --source_language_code zh

请查阅相关文档:

Post Your Reply
Open source and free maintenance is not easy. If this project is helpful to you, please consider making a small donation to help the project continue to maintain and update.

Related resource