#5334 TaskCfg(cache_folder='D:/My Softwares/视频翻译配音 Pyvideotran 3.96-24/tmp/15316/b10709ee8a', target_dir='E:/5/_video_out/55-m

120.217* Posted at: 7 hours ago

语音识别阶段出错 [faster-whisper(本地)] Traceback (most recent call last):
File "videotrans\process\stt_fun.py", line 179, in faster_whisper
File "faster_whisper\transcribe.py", line 689, in init
ValueError: Requested float16 compute type, but the target device or backend do not support efficient float16 computation.

Traceback (most recent call last):
File "videotrans\task\job.py", line 106, in run
File "videotrans\task\trans_create.py", line 358, in recogn
File "videotrans\recognition\__init__.py", line 282, in run
File "videotrans\recognition\_base.py", line 141, in run
File "videotrans\recognition\_overall.py", line 63, in _exec
File "videotrans\recognition\_overall.py", line 142, in _faster
File "videotrans\configure\_base.py", line 276, in _new_process
RuntimeError: Traceback (most recent call last):
File "videotrans\process\stt_fun.py", line 179, in faster_whisper
File "faster_whisper\transcribe.py", line 689, in init
ValueError: Requested float16 compute type, but the target device or backend do not support efficient float16 computation.
TaskCfg(cache_folder='D:/My Softwares/视频翻译配音 Pyvideotran 3.96-24/tmp/15316/b10709ee8a', target_dir='E:/5/_video_out/55-mp4', remove_noise=False, is_separate=False, detect_language='en', subtitle_language=None, source_language='英语', target_language='简体中文', source_language_code='en', target_language_code='zh-cn', source_sub='E:/5/_video_out/55-mp4/en.srt', target_sub='E:/5/_video_out/55-mp4/zh-cn.srt', source_wav='D:/My Softwares/视频翻译配音 Pyvideotran 3.96-24/tmp/15316/b10709ee8a/en.wav', source_wav_output='E:/5/_video_out/55-mp4/en.m4a', target_wav='D:/My Softwares/视频翻译配音 Pyvideotran 3.96-24/tmp/15316/b10709ee8a/target.wav', target_wav_output='E:/5/_video_out/55-mp4/zh-cn.m4a', subtitles='', novoice_mp4='D:/My Softwares/视频翻译配音 Pyvideotran 3.96-24/tmp/15316/b10709ee8a/novoice.mp4', noextname='55', shibie_audio=None, targetdir_mp4='E:/5/_video_out/55-mp4/55.mp4', instrument=None, vocal=None, back_audio='', background_music=None, app_mode='tiqu', subtitle_type=0, volume='+0%', pitch='+0Hz', voice_rate='+0%', voice_role='No', copysrt_rawvideo=False, clear_cache=True, translate_type=1, tts_type=0, recogn_type=0, model_name='large-v3-turbo', voice_autorate=False, video_autorate=False, cuda=False, name='E:/5/55.mp4', basename='55.mp4', ext='mp4', dirname='E:/5', shound_del_name=None, uuid='b10709ee8a', remove_silent_mid=False, align_sub_audio=True, enable_diariz=False, nums_diariz=0, rephrase=0, only_out_mp4=False, fix_punc=False, recogn2pass=True)


=
system:Windows-10-10.0.26200-SP0
version:v3.96
frozen:True
language:zh
root_dir:D:/My Softwares/视频翻译配音 Pyvideotran 3.96-24

1 Replies

核心原因是 CPU 模式下计算数据类型被设为 float16,而 float16 仅支持 NVIDIA GPU 加速,在 CPU 上会报错。

解决方法:

  1. 打开 菜单 → 工具 → 高级选项 → 语音识别参数
  2. 找到 计算数据类型 选项,改为 float32int8(推荐 int8,CPU 速度更快)。
  3. 保存后重启软件,重新执行任务。

附加建议:

  • 若你有 NVIDIA 显卡并安装了 CUDA,请勾选主界面上的 CUDA 加速,此时可正常使用 float16 获得最高速度。
  • 当前软件版本 v3.96 较旧,建议下载最新 v4.03 以解决更多潜在问题。

请查阅相关文档:

Post Your Reply