tdt这个语音模型识别英文非常好 但是很经常会出现一天字幕几十秒甚至几分钟 今天我用ai改了下代码 发现改后效果更好了 首先还是按原来代码那样识别语音 然后呢对超过12秒的字幕根据标点符号分割 分割后再检查字幕 如果还有超过25秒的 再次调用tdt识别相关片段 最后再检查如果有超过12秒的字幕 再一次用标点符号分割。另外建议切语音 8分钟为一段就行 我16g显存 默认10分钟 偶尔会很卡
#786 tdt 语音识别模型优化
119.125*
Posted at: 3 months ago 👁98
1 Replies
Post Your Reply
Open source and free maintenance is not easy. If this project is helpful to you, please consider making a small donation to help the project continue to maintain and update.
Trending Questions
-
【这样设置,才能取得最佳断句效果-faster-whisper语音识别!】... 💬11 👁1478
-
edge_tts.exceptions.NoAudioReceived: No audio was received. Please verify that your parameters are correct.... 💬9 👁1310
-
ImportError: cannot import name 'Sentinel' from 'typing_extensions' (/home/Timoshenko/miniconda3/lib... 💬1 👁942
-
翻译断句过长,全屏字幕问题... 💬22 👁936
-
【faster-whisper模式】下载模型失败解决办法... 💬1 👁858
-
RuntimeError: Data processing error: CAS service error : Reqwest Error: HTTP status client error (401 Unauthorized), dom... 💬2 👁839
-
语音转字幕问题... 💬8 👁666
-
大佬帮帮忙,总是到结尾出现这样,不知道哪里出了问题... 💬1 👁619
-
Edge-TTS(免费)不能用了吗?... 💬17 👁582
-
助力项目持续维护... 💬0 👁577
-
识别语音设置怎么感觉没用啊... 💬4 👁562
-
No audio was received. Please verify that your parameters are correct.... 💬3 👁540
-
翻译字幕阶段出错:Traceback (most recent call last): File "videotrans\translator\_base.py", line 77, in run File "... 💬1 👁530
-
翻译钢铁侠2,使用批量翻译srt,中间有字幕没有翻译,同时出现把参数显示出来了。... 💬1 👁524
-
声画不同步的问题... 💬2 👁517
-
2025-09-10 11:09:02,333 - VideoTrans - ERROR - FFmpeg 命令执行失败 (force_cpu=True)。 命令: ffmpeg -hide_banner -ignore_unknown -... 💬11 👁508
-
Edge-TTS无法使用的解决办法... 💬5 👁503
-
{'code': 6, 'msg': 'CUDA error: no kernel image is available for execution on the device\nCUDA ... 💬12 👁500
-
Unable to open file 'model.bin' in model 'C:/Users/wujin/Desktop/win-videotrans-v3.78-0821/models\models-... 💬3 👁488
-
Unable to open file 'model.bin' in model 'D:/BaiduNetdiskDownload/win-videotrans-v3.78-0821/models\models... 💬1 👁477