#4504 TaskCfgSTT(is_cuda=False, uuid='3a494f4df7', cache_folder='C:/Users/user/Documents/WhisperJAV/output/ja.whisperjav/win-p

183.179* Posted at: 2 hours ago 👁11

语音识别阶段出错 [Deepgram.com] 程序内部错误:'>' not supported between instances of 'int' and 'str'
Traceback (most recent call last):
File "videotrans\recognition\_base.py", line 143, in run
File "tenacity\__init__.py", line 338, in wrapped_f
File "tenacity\__init__.py", line 477, in call
File "tenacity\__init__.py", line 378, in iter
File "tenacity\__init__.py", line 421, in exc_check
tenacity.RetryError: RetryError[]

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "videotrans\task\job.py", line 105, in run
File "videotrans\task\_speech2text.py", line 191, in recogn
File "videotrans\recognition\__init__.py", line 265, in run
File "videotrans\recognition\_base.py", line 180, in run
File "tenacity\__init__.py", line 480, in call
File "videotrans\recognition\_deepgram.py", line 98, in _exec
File "deepgram_captions\srt.py", line 10, in srt
File "deepgram_captions\converters.py", line 18, in get_lines
TypeError: '>' not supported between instances of 'int' and 'str'
TaskCfgSTT(is_cuda=False, uuid='3a494f4df7', cache_folder='C:/Users/user/Documents/WhisperJAV/output/ja.whisperjav/win-pyvideotrans-v3.99-0508/tmp/31344/3a494f4df7', target_dir='C:/Users/user/Documents/WhisperJAV/output/ja.whisperjav/win-pyvideotrans-v3.99-0508/output/recogn', source_language=None, source_language_code=None, source_sub=None, source_wav=None, source_wav_output=None, target_language=None, target_language_code=None, target_sub='C:/Users/user/Documents/WhisperJAV/output/ja.whisperjav/win-pyvideotrans-v3.99-0508/output/recogn/FC2-PPV-238629 綾波笹美流出.srt', target_wav=None, target_wav_output=None, name='F:/qb bt3/WAV/pyvideotrans test/FC2-PPV-238629 綾波笹美流出.wav', noextname='FC2-PPV-238629 綾波笹美流出', basename='FC2-PPV-238629 綾波笹美流出.wav', ext='wav', dirname='F:/qb bt3/WAV/pyvideotrans test', shound_del_name=None, detect_language='ja', recogn_type=10, model_name='nova-3', shibie_audio='C:/Users/user/Documents/WhisperJAV/output/ja.whisperjav/win-pyvideotrans-v3.99-0508/tmp/31344/3a494f4df7/FC2-PPV-238629 綾波笹美流出-1778408064.1699138.wav', remove_noise=False, enable_diariz=False, nums_diariz=0, rephrase=0, fix_punc=False)


=
system:Windows-10-10.0.26200-SP0
version:v3.99
frozen:True
language:zh
root_dir:C:/Users/user/Documents/WhisperJAV/output/ja.whisperjav/win-pyvideotrans-v3.99-0508

1 Replies

这是由于 Deepgram 返回的语音识别结果数据类型不匹配(deepgram_captions 库的 bug),导致时间戳比较时出现 intstr 类型错误。

解决方案:

  1. 升级软件版本:请下载最新版 pyVideoTrans(v3.99 之后可能有紧急修复补丁),覆盖安装后重试。
  2. 临时更换语音识别渠道:推荐改用 faster-whisper(本地)openai-whisper(本地),效果稳定且免费,无需依赖在线 API。
  3. 若必须使用 Deepgram:尝试在“高级设置”中调整“最短语音持续毫秒”或“最长语音持续秒数”参数,或等待后续版本修复。

请查阅相关文档:

Post Your Reply
Open source and free maintenance is not easy. If this project is helpful to you, please consider making a small donation to help the project continue to maintain and update.

Related resource