#3340 TaskCfg(cache_folder='D:/win-pyvideotrans-v3.97/tmp/22152/8e0a8641e1', target_dir='D:/Ken Tamplin Vocal Academy - How To

223.74* Posted at: 2 days ago 👁17

语音识别阶段出错 [faster-whisper(本地)] Traceback (most recent call last):
File "videotrans\process\stt_fun.py", line 264, in faster_whisper
File "faster_whisper\transcribe.py", line 586, in _batched_segments_generator
File "faster_whisper\transcribe.py", line 120, in forward
File "faster_whisper\transcribe.py", line 209, in generate_segment_batched
File "faster_whisper\transcribe.py", line 1400, in encode
RuntimeError: parallel_for failed: cudaErrorNoKernelImageForDevice: no kernel image is available for execution on the device

Traceback (most recent call last):
File "videotrans\task\job.py", line 106, in run
File "videotrans\task\trans_create.py", line 358, in recogn
File "videotrans\recognition\__init__.py", line 268, in run
File "videotrans\recognition\_base.py", line 141, in run
File "videotrans\recognition\_overall.py", line 63, in _exec
File "videotrans\recognition\_overall.py", line 142, in _faster
File "videotrans\configure\_base.py", line 291, in _new_process
RuntimeError: Traceback (most recent call last):
File "videotrans\process\stt_fun.py", line 264, in faster_whisper
File "faster_whisper\transcribe.py", line 586, in _batched_segments_generator
File "faster_whisper\transcribe.py", line 120, in forward
File "faster_whisper\transcribe.py", line 209, in generate_segment_batched
File "faster_whisper\transcribe.py", line 1400, in encode
RuntimeError: parallel_for failed: cudaErrorNoKernelImageForDevice: no kernel image is available for execution on the device
TaskCfg(cache_folder='D:/win-pyvideotrans-v3.97/tmp/22152/8e0a8641e1', target_dir='D:/Ken Tamplin Vocal Academy - How To Sing Better Than Anyone Else 2.0/Volume 1/_video_out/01. Volume 1 Introduction-mp4', remove_noise=False, is_separate=False, detect_language='en', subtitle_language=None, source_language='英语', target_language='简体中文', source_language_code='en', target_language_code='zh-cn', source_sub='D:/Ken Tamplin Vocal Academy - How To Sing Better Than Anyone Else 2.0/Volume 1/_video_out/01. Volume 1 Introduction-mp4/en.srt', target_sub='D:/Ken Tamplin Vocal Academy - How To Sing Better Than Anyone Else 2.0/Volume 1/_video_out/01. Volume 1 Introduction-mp4/zh-cn.srt', source_wav='D:/win-pyvideotrans-v3.97/tmp/22152/8e0a8641e1/en.wav', source_wav_output='D:/Ken Tamplin Vocal Academy - How To Sing Better Than Anyone Else 2.0/Volume 1/_video_out/01. Volume 1 Introduction-mp4/en.m4a', target_wav='D:/win-pyvideotrans-v3.97/tmp/22152/8e0a8641e1/target.wav', target_wav_output='D:/Ken Tamplin Vocal Academy - How To Sing Better Than Anyone Else 2.0/Volume 1/_video_out/01. Volume 1 Introduction-mp4/zh-cn.m4a', subtitles='', novoice_mp4='D:/win-pyvideotrans-v3.97/tmp/22152/8e0a8641e1/novoice.mp4', noextname='01. Volume 1 Introduction', shibie_audio=None, targetdir_mp4='D:/Ken Tamplin Vocal Academy - How To Sing Better Than Anyone Else 2.0/Volume 1/_video_out/01. Volume 1 Introduction-mp4/01. Volume 1 Introduction.mp4', instrument=None, vocal=None, back_audio='', background_music=None, app_mode='tiqu', subtitle_type=0, volume='+0%', pitch='+0Hz', voice_rate='+0%', voice_role='No', copysrt_rawvideo=False, clear_cache=True, translate_type=0, tts_type=0, recogn_type=0, model_name='large-v3-turbo', voice_autorate=False, video_autorate=False, is_cuda=True, name='D:/Ken Tamplin Vocal Academy - How To Sing Better Than Anyone Else 2.0/Volume 1/01. Volume 1 Introduction.mp4', basename='01. Volume 1 Introduction.mp4', ext='mp4', dirname='D:/Ken Tamplin Vocal Academy - How To Sing Better Than Anyone Else 2.0/Volume 1', shound_del_name=None, uuid='8e0a8641e1', remove_silent_mid=False, align_sub_audio=True, enable_diariz=False, nums_diariz=0, rephrase=0, only_out_mp4=False, fix_punc=False, recogn2pass=True)


=
system:Windows-10-10.0.19045-SP0
version:v3.97
frozen:True
language:zh
root_dir:D:/win-pyvideotrans-v3.97

1 Replies

这是由于显卡的CUDA计算能力与faster-whisper模型编译的CUDA内核不匹配导致的。错误 cudaErrorNoKernelImageForDevice: no kernel image is available for execution on the device 表明你的NVIDIA显卡(可能是较新的50系显卡)无法运行当前版本的faster-whisper。

解决方案:

  1. 临时方案(推荐):在软件设置中,将语音识别渠道的“模型”从 large-v3-turbo 切换为 large-v2large-v3。这些模型可能包含更广泛的CUDA内核支持。
  2. 根本解决方案:根据知识库,此问题在50系显卡上出现。请参考官方文档,为faster-whisper安装或编译支持你显卡计算能力的版本。这通常需要更新CUDA工具包或使用特定版本的faster-whisper wheel文件。
  3. 备用方案:如果上述方法无效,可以暂时关闭CUDA加速(在语音识别参数中取消勾选“启用CUDA加速”),但这会导致识别速度大幅下降。

请查阅相关文档:

Post Your Reply
Open source and free maintenance is not easy. If this project is helpful to you, please consider making a small donation to help the project continue to maintain and update.

Related resource