#4077 TaskCfgSTT(is_cuda=False, uuid='5c87c4068e', cache_folder='D:/win-pyvideotrans-v3.99/tmp/6748/5c87c4068e', target_dir='D

39.188* Posted at: 3 months ago

语音识别阶段出错 [阿里FunASR(本地)] 出错了，可能内存或显存不足 Model:paraformer-zh
Traceback (most recent call last):
File "videotrans\configure\_base.py", line 278, in _new_process
File "videotrans\process\signelobj.py", line 75, in submit_task_cpu
File "concurrent\futures\process.py", line 720, in submit
concurrent.futures.process.BrokenProcessPool: A child process terminated abruptly, the process pool is not usable anymore

Traceback (most recent call last):
File "videotrans\configure\_base.py", line 278, in _new_process
File "videotrans\process\signelobj.py", line 75, in submit_task_cpu
File "concurrent\futures\process.py", line 720, in submit
concurrent.futures.process.BrokenProcessPool: A child process terminated abruptly, the process pool is not usable anymore

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "videotrans\task\job.py", line 105, in run
File "videotrans\task\_speech2text.py", line 159, in recogn
File "videotrans\recognition\__init__.py", line 262, in run
File "videotrans\recognition\_base.py", line 143, in run
File "videotrans\recognition\_funasr.py", line 64, in _exec
File "videotrans\configure\_base.py", line 303, in _new_process
RuntimeError: 出错了，可能内存或显存不足 Model:paraformer-zh
Traceback (most recent call last):
File "videotrans\configure\_base.py", line 278, in _new_process
File "videotrans\process\signelobj.py", line 75, in submit_task_cpu
File "concurrent\futures\process.py", line 720, in submit
concurrent.futures.process.BrokenProcessPool: A child process terminated abruptly, the process pool is not usable anymore
TaskCfgSTT(is_cuda=False, uuid='5c87c4068e', cache_folder='D:/win-pyvideotrans-v3.99/tmp/6748/5c87c4068e', target_dir='D:/win-pyvideotrans-v3.99/output/recogn', source_language=None, source_language_code=None, source_sub=None, source_wav=None, source_wav_output=None, target_language=None, target_language_code=None, target_sub='D:/win-pyvideotrans-v3.99/output/recogn/凡人修仙传.srt', target_wav=None, target_wav_output=None, name='C:/Users/Administrator/Downloads/凡人修仙传/凡人修仙传.mp4', noextname='凡人修仙传', basename='凡人修仙传.mp4', ext='mp4', dirname='C:/Users/Administrator/Downloads/凡人修仙传', shound_del_name=None, detect_language='zh-cn', recogn_type=3, model_name='paraformer-zh', shibie_audio='D:/win-pyvideotrans-v3.99/tmp/6748/5c87c4068e/凡人修仙传-1776250447.1452572.wav', remove_noise=False, enable_diariz=False, nums_diariz=0, rephrase=0, fix_punc=False)

=
system:Windows-10-10.0.22631-SP0
version:v3.99
frozen:True
language:zh
root_dir:D:/win-pyvideotrans-v3.99

Floor #1 DeepSeek reply at 3 months ago

这是由于内存或显存不足导致阿里FunASR本地模型（paraformer-zh）在语音识别阶段进程崩溃。

解决方案：

降低资源消耗：在“菜单-工具-高级选项-语音识别参数”中，尝试将“最长语音持续秒数”调小（例如设为3-5秒），以减少单次处理的数据量。
检查并关闭其他占用内存/显存的程序，确保有足够资源供软件使用。
更换模型：如果上述方法无效，建议改用对资源要求更低的语音识别渠道。对于中文视频，可以尝试：
- 免费本地模型：qwen-asr(本地)。
- 在线API：如果网络条件允许，可使用“阿里百炼Qwen3-ASR”或“字节火山极速版”等在线渠道，它们不依赖本地硬件。
检查CUDA：如果你有NVIDIA显卡并希望使用GPU加速，请确保已正确安装CUDA 12.8和cuDNN，并在主界面勾选“CUDA加速”。如果未安装CUDA，本地模型将完全依赖CPU和内存，处理长视频时容易内存不足。

#4077 TaskCfgSTT(is_cuda=False, uuid='5c87c4068e', cache_folder='D:/win-pyvideotrans-v3.99/tmp/6748/5c87c4068e', target_dir='D

1 Replies

请查阅相关文档：

Post Your Reply

Related resource