#4077 TaskCfgSTT(is_cuda=False, uuid='5c87c4068e', cache_folder='D:/win-pyvideotrans-v3.99/tmp/6748/5c87c4068e', target_dir='D

39.188* Posted at: 1 day ago 👁31

语音识别阶段出错 [阿里FunASR(本地)] 出错了,可能内存或显存不足 Model:paraformer-zh
Traceback (most recent call last):
File "videotrans\configure\_base.py", line 278, in _new_process
File "videotrans\process\signelobj.py", line 75, in submit_task_cpu
File "concurrent\futures\process.py", line 720, in submit
concurrent.futures.process.BrokenProcessPool: A child process terminated abruptly, the process pool is not usable anymore

Traceback (most recent call last):
File "videotrans\configure\_base.py", line 278, in _new_process
File "videotrans\process\signelobj.py", line 75, in submit_task_cpu
File "concurrent\futures\process.py", line 720, in submit
concurrent.futures.process.BrokenProcessPool: A child process terminated abruptly, the process pool is not usable anymore

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "videotrans\task\job.py", line 105, in run
File "videotrans\task\_speech2text.py", line 159, in recogn
File "videotrans\recognition\__init__.py", line 262, in run
File "videotrans\recognition\_base.py", line 143, in run
File "videotrans\recognition\_funasr.py", line 64, in _exec
File "videotrans\configure\_base.py", line 303, in _new_process
RuntimeError: 出错了,可能内存或显存不足 Model:paraformer-zh
Traceback (most recent call last):
File "videotrans\configure\_base.py", line 278, in _new_process
File "videotrans\process\signelobj.py", line 75, in submit_task_cpu
File "concurrent\futures\process.py", line 720, in submit
concurrent.futures.process.BrokenProcessPool: A child process terminated abruptly, the process pool is not usable anymore
TaskCfgSTT(is_cuda=False, uuid='5c87c4068e', cache_folder='D:/win-pyvideotrans-v3.99/tmp/6748/5c87c4068e', target_dir='D:/win-pyvideotrans-v3.99/output/recogn', source_language=None, source_language_code=None, source_sub=None, source_wav=None, source_wav_output=None, target_language=None, target_language_code=None, target_sub='D:/win-pyvideotrans-v3.99/output/recogn/凡人修仙传.srt', target_wav=None, target_wav_output=None, name='C:/Users/Administrator/Downloads/凡人修仙传/凡人修仙传.mp4', noextname='凡人修仙传', basename='凡人修仙传.mp4', ext='mp4', dirname='C:/Users/Administrator/Downloads/凡人修仙传', shound_del_name=None, detect_language='zh-cn', recogn_type=3, model_name='paraformer-zh', shibie_audio='D:/win-pyvideotrans-v3.99/tmp/6748/5c87c4068e/凡人修仙传-1776250447.1452572.wav', remove_noise=False, enable_diariz=False, nums_diariz=0, rephrase=0, fix_punc=False)


=
system:Windows-10-10.0.22631-SP0
version:v3.99
frozen:True
language:zh
root_dir:D:/win-pyvideotrans-v3.99

1 Replies

这是由于内存或显存不足导致阿里FunASR本地模型(paraformer-zh)在语音识别阶段进程崩溃。

解决方案:

  1. 降低资源消耗:在“菜单-工具-高级选项-语音识别参数”中,尝试将“最长语音持续秒数”调小(例如设为3-5秒),以减少单次处理的数据量。
  2. 检查并关闭其他占用内存/显存的程序,确保有足够资源供软件使用。
  3. 更换模型:如果上述方法无效,建议改用对资源要求更低的语音识别渠道。对于中文视频,可以尝试:

    • 免费本地模型qwen-asr(本地)
    • 在线API:如果网络条件允许,可使用“阿里百炼Qwen3-ASR”或“字节火山极速版”等在线渠道,它们不依赖本地硬件。
  4. 检查CUDA:如果你有NVIDIA显卡并希望使用GPU加速,请确保已正确安装CUDA 12.8和cuDNN,并在主界面勾选“CUDA加速”。如果未安装CUDA,本地模型将完全依赖CPU和内存,处理长视频时容易内存不足。

请查阅相关文档:

Post Your Reply
Open source and free maintenance is not easy. If this project is helpful to you, please consider making a small donation to help the project continue to maintain and update.

Related resource