#3986 TaskCfgVTT(is_cuda=False, uuid='0da178fd04', cache_folder='D:/pyvideotran/tmp/32348/0da178fd04', target_dir='E:/1/_video

49.85* Posted at: 7 hours ago 👁11

语音识别阶段出错 [faster-whisper(本地)] Traceback (most recent call last):
File "videotrans\process\stt_fun.py", line 346, in faster_whisper
File "faster_whisper\transcribe.py", line 689, in init
ValueError: Requested float16 compute type, but the target device or backend do not support efficient float16 computation.

Traceback (most recent call last):
File "videotrans\task\job.py", line 105, in run
File "videotrans\task\trans_create.py", line 360, in recogn
File "videotrans\recognition\__init__.py", line 292, in run
File "videotrans\recognition\_base.py", line 143, in run
File "videotrans\recognition\_overall.py", line 33, in _exec
File "videotrans\recognition\_overall.py", line 105, in _faster
File "videotrans\configure\_base.py", line 289, in _new_process
RuntimeError: Traceback (most recent call last):
File "videotrans\process\stt_fun.py", line 346, in faster_whisper
File "faster_whisper\transcribe.py", line 689, in init
ValueError: Requested float16 compute type, but the target device or backend do not support efficient float16 computation.
TaskCfgVTT(is_cuda=False, uuid='0da178fd04', cache_folder='D:/pyvideotran/tmp/32348/0da178fd04', target_dir='E:/1/_video_out/SAN-437-mp4', source_language='日语', source_language_code='ja', source_sub='E:/1/_video_out/SAN-437-mp4/ja.srt', source_wav='D:/pyvideotran/tmp/32348/0da178fd04/ja.wav', source_wav_output='E:/1/_video_out/SAN-437-mp4/ja.m4a', target_language='简体中文', target_language_code='zh-cn', target_sub='E:/1/_video_out/SAN-437-mp4/zh-cn.srt', target_wav='D:/pyvideotran/tmp/32348/0da178fd04/target.wav', target_wav_output='E:/1/_video_out/SAN-437-mp4/zh-cn.m4a', name='E:/1/SAN-437.mp4', noextname='SAN-437', basename='SAN-437.mp4', ext='mp4', dirname='E:/1', shound_del_name=None, translate_type=0, tts_type=0, volume='+0%', pitch='+0Hz', voice_rate='+0%', voice_role='No', voice_autorate=False, video_autorate=False, remove_silent_mid=False, align_sub_audio=False, detect_language='ja', recogn_type=0, model_name='large-v3-turbo', shibie_audio=None, remove_noise=False, enable_diariz=False, nums_diariz=0, rephrase=0, fix_punc=False, subtitle_language=None, app_mode='tiqu', subtitles='', targetdir_mp4='E:/1/_video_out/SAN-437-mp4/SAN-437.mp4', novoice_mp4='D:/pyvideotran/tmp/32348/0da178fd04/novoice.mp4', is_separate=False, embed_bgm=False, instrument=None, vocal=None, back_audio='', clear_cache=False, background_music=None, subtitle_type=0, only_out_mp4=False, recogn2pass=False, output_srt=0, copysrt_rawvideo=False)


=
system:Windows-10-10.0.26200-SP0
version:v3.98
frozen:True
language:zh
root_dir:D:/pyvideotran

1 Replies

这是由于 CUDA 数据类型设置与显卡硬件不兼容导致的。错误信息表明,软件请求使用 float16 数据类型进行计算,但你的目标设备或后端不支持高效的 float16 运算。

解决方案:

  1. 打开软件,点击顶部菜单栏的 工具/选项 -> 高级选项
  2. 语音识别参数 区域,找到 CUDA数据类型 下拉菜单。
  3. 将选项从 float16default 修改为 float32
  4. 点击 保存,然后重新尝试语音识别任务。

如果修改后问题依旧,或者你的显卡不支持 CUDA 加速,请尝试以下步骤:

  1. 在软件主界面上,取消勾选右上角的 CUDA加速 复选框。
  2. 这将强制软件使用 CPU 进行计算,虽然速度会变慢,但可以避免此错误。

请查阅相关文档:

Post Your Reply
Open source and free maintenance is not easy. If this project is helpful to you, please consider making a small donation to help the project continue to maintain and update.

Related resource