#4851 [aac @ 000002777cd6e280] Prediction is not allowed in AAC-LC.[aist#0:1/aac @ 000002777ccc9300] [dec:aac @ 000002777cd6de

154.40* Posted at: 1 month ago

[aac @ 000002777cd6e280] Prediction is not allowed in AAC-LC.
[aist#0:1/aac @ 000002777ccc9300] [dec:aac @ 000002777cd6de00] Error submitting packet to decoder: Invalid data found when processing input
[aac @ 000002777cd6e280] Prediction is not allowed in AAC-LC.
[aist#0:1/aac @ 000002777ccc9300] [dec:aac @ 000002777cd6de00] Error submitting packet to decoder: Invalid data found when processing input
[aac @ 000002777cd6e280] invalid band type
[aist#0:1/aac @ 000002777ccc9300] [dec:aac @ 000002777cd6de00] Error submitting packet to decoder: Invalid data found when processing input
[aac @ 000002777cd6e280] Reserved bit set.

Last message repeated 1 times

[aac @ 000002777cd6e280] Prediction is not allowed in AAC-LC.
[aist#0:1/aac @ 000002777ccc9300] [dec:aac @ 000002777cd6de00] Error submitting packet to decoder: Invalid data found when processing input
[aac @ 000002777cd6e280] Reserved bit set.
[aac @ 000002777cd6e280] Prediction is not allowed in AAC-LC.
[aist#0:1/aac @ 000002777ccc9300] [dec:aac @ 000002777cd6de00] Error submitting packet to decoder: Invalid data found when processing input
[aac @ 000002777cd6e280] Number of scalefactor bands in group (63) exceeds limit (49).
[aist#0:1/aac @ 000002777ccc9300] [dec:aac @ 000002777cd6de00] Error submitting packet to decoder: Invalid data found when processing input
[aac @ 000002777cd6e280] Sample rate index in program config element does not match the sample rate index configured by the container.
[aac @ 000002777cd6e280] Reserved bit set.
[aac @ 000002777cd6e280] Prediction is not allowed in AAC-LC.
[aist#0:1/aac @ 000002777ccc9300] [dec:aac @ 000002777cd6de00] Error submitting packet to decoder: Invalid data found when processing input
[aac @ 000002777cd6e280] Prediction is not allowed in AAC-LC.
[aist#0:1/aac @ 000002777ccc9300] [dec:aac @ 000002777cd6de00] Error submitting packet to decoder: Invalid data found when processing input
[aac @ 000002777cd6e280] Prediction is not allowed in AAC-LC.
[aist#0:1/aac @ 000002777ccc9300] [dec:aac @ 000002777cd6de00] Error submitting packet to decoder: Invalid data found when processing input
[aac @ 000002777cd6e280] TYPE_FIL: Input buffer exhausted before END element found
[aist#0:1/aac @ 000002777ccc9300] [dec:aac @ 000002777cd6de00] Error submitting packet to decoder: Invalid data found when processing input
[out#0/wav @ 000002777ccc8380] video:0KiB audio:1462KiB subtitle:0KiB other streams:0KiB global headers:0KiB muxing overhead: 0.008282%
size= 1462KiB time=00:00:51.66 bitrate= 231.9kbits/s speed= 544x
Conversion failed!

[ERROR] 单视频模式翻译失败submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing input submitting packet to decoder: Invalid data found when processing

Floor #1 DeepSeek reply at 1 month ago

这是由于视频文件的音频轨道损坏或采用了不标准的 AAC 编码参数，导致 ffmpeg 无法正确解码。常见原因包括：从某些网站下载的视频因编码错误导致音频流异常，或文件本身在录制/压制过程中产生损坏。

解决方案：

先转换视频格式：使用第三方工具（如格式工厂、HandBrake 或 ffmpeg 命令行）将原视频重新编码为 H.264 + AAC 的标准 MP4 文件，再用本软件处理转换后的视频。
直接提取音频并修复：在软件中勾选“设置更多参数” → “分离人声背景声”（该步骤会使用模型重新处理音频，可能绕开损坏的解码路径），或先通过“批量语音转字幕”功能单独处理音频（如果音频能导出的的话）。
更改输入源：如果视频来自网络，建议重新下载，或检查原始文件是否完整。

如果以上方法仍无效，请提供日志文件最后30行（位于软件根目录 logs/ 文件夹下），以便进一步分析。

Floor #2 154.40* reply at 1 month ago

为什么我的视频翻译功能在使用过程中代码没有报错，一切正常，转换出来的音频却只有一两句话

Floor #3 DeepSeek reply at 1 month ago

该问题通常由语音识别（ASR）阶段未能正确捕获全部语音所致，原因排查如下：

检查“发音语言”是否选择正确
- 请确保在翻译界面中“发音语言”选项与视频中人物实际所说的语言一致。
- 如果选择 auto（自动检测），可能因模型误判导致只识别出一部分语音，建议手动指定正确语言（如中文选 中文、英文选 English）。
调整 VAD 相关参数（菜单→工具→高级选项→语音识别参数）
- 最短语音持续毫秒：设为 1000（1秒）或更低，防止短语音被过滤。
- 最长语音持续秒数：设为 10 ~ 15，避免字幕被过度分割成过短片段。
- 静音分割持续毫秒：设为 200 ~ 500，静音时间过大会导致语音被合并成极少数片段。
- 语音阈值：保持默认 0.5，若环境嘈杂可适当调低至 0.3 或 0.4。
- 取消勾选“合并过短字幕到邻近”（若已选中）。
更换模型或启用降噪
- 在语音识别中选择更大的模型（如 large-v3），小模型可能漏检语音。
- 勾选主界面的 降噪 选项，先去除背景噪音，提高识别率。
- 若视频为中文，可尝试使用 Qwen-ASR(本地) 或 阿里FunASR(本地)+paraformer-zh，对中文支持更好。
验证视频本身
- 用播放器检查视频是否真的包含长时间的连续语音，或者是否只有开头部分有人说话（例如片头、结尾仅少量对白）。
- 亦可先用左侧“批量语音转字幕”功能单独转录该视频，查看生成的SRT字幕是否完整。若该功能也只输出少量句子，则说明问题出在ASR而非翻译/配音流程。
检查输出目录
- 确保最终视频是完整长度（与原视频时长一致），若不一致则说明在后期剪辑或拼接时出了问题。
- 查看 _video_out 文件夹内是否有完整的SRT字幕文件（原始语言和目标语言），若字幕只有几十行或时长极短，则确认是第一步识别不全。
日志分析
- 如果以上调整后仍无效，请打开软件安装目录下的 logs/ 文件夹，找到当天的日志文件（.log），复制最后30行内容提供给技术支持。日志中会明确显示语音识别到的音频时长、识别出的字幕条数等信息，可精确定位问题。

请按顺序尝试以上步骤，90% 的类似问题可通过调整 VAD 参数或更换识别模型解决。

Floor #4 154.40* reply at 1 month ago

[PYI-16184:DEBUG] PyInstaller Bootloader 6.x

[PYI-16184:DEBUG] LOADER: executable file: D:\pyV\_debug.exe
[PYI-16184:DEBUG] LOADER: trying to load executsk

archive...
[PYI-16184:DEBUG] LOADER: attempting to open archive D:\pyV\_debug.exe
[PYI-16184:DEBUG] LOADER: cookie found at offset 0x46430DD
[PYI-16184:DEBUG] LOADER: archive file: D:\pyV\_debug.exe
[PYI-16184:DEBUG] LOADER: application has onedir semantics...
[PYI-16184:DEBUG] LOADER: _PYI_ARCHIVE_FILE not defined...
[PYI-16184:DEBUG] LOADER: parent process level = -2
[PYI-16184:DEBUG] LOADER: process level = 1
[PYI-16184:DEBUG] LOADER: application's top-level directory: D:\pyV\_internal
[PYI-16184:DEBUG] LOADER: calling SetDllDirectoryW: D:\pyV\_internal
[PYI-16184:DEBUG] LOADER: splash screen is unavailable.
[PYI-16184:DEBUG] DYLIB: attempting to pre-load bundled copy of ucrtbase.dll: D:\pyV\_internal\ucrtbase.dll
[PYI-16184:DEBUG] DYLIB: loading Python DLL: D:\pyV\_internal\python310.dll
[PYI-16184:DEBUG] DYLIB: loaded Python shared library.
[PYI-16184:DEBUG] DYLIB: imported symbols from Python shared library.
[PYI-16184:DEBUG] LOADER: sk

embedded python interpreter...
[PYI-16184:DEBUG] LOADER: using PEP-587 API...
[PYI-16184:DEBUG] LOADER: creating PyConfig structure...
[PYI-16184:DEBUG] LOADER: initializing interpreter configuration...
[PYI-16184:DEBUG] LOADER: setting program name...
[PYI-16184:DEBUG] LOADER: setting python home path...
[PYI-16184:DEBUG] LOADER: setting module search paths...
[PYI-16184:DEBUG] LOADER: setting sys.argv...
[PYI-16184:DEBUG] LOADER: applying run-time options...
[PYI-16184:DEBUG] LOADER: starting embedded python interpreter...
[PYI-16184:DEBUG] LOADER: setting sys._MEIPASS
[PYI-16184:DEBUG] LOADER: importing modules from PKG/CArchive
[PYI-16184:DEBUG] LOADER: extracted struct
[PYI-16184:DEBUG] LOADER: running unmarshalled code object for module struct...
[PYI-16184:DEBUG] LOADER: extracted pyimod01_archive
[PYI-16184:DEBUG] LOADER: running unmarshalled code object for module pyimod01_archive...
[PYI-16184:DEBUG] LOADER: extracted pyimod02_importers
[PYI-16184:DEBUG] LOADER: running unmarshalled code object for module pyimod02_importers...
[PYI-16184:DEBUG] LOADER: extracted pyimod03_ctypes
[PYI-16184:DEBUG] LOADER: running unmarshalled code object for module pyimod03_ctypes...
[PYI-16184:DEBUG] LOADER: extracted pyimod04_pywin32
[PYI-16184:DEBUG] LOADER: running unmarshalled code object for module pyimod04_pywin32...
[PYI-16184:DEBUG] LOADER: looking for PYZ archive TOC entry...
[PYI-16184:DEBUG] LOADER: path to PYZ archive stored into sys._pyinstaller_pyz...
[PYI-16184:DEBUG] LOADER: running pyiboot01_bootstrap.py
[PYI-16184:DEBUG] LOADER: running pyi_rth_inspect.py
[PYI-16184:DEBUG] LOADER: running pyi_rth_pkgutil.py
[PYI-16184:DEBUG] LOADER: running pyi_rth_pkgres.py
[PYI-16184:DEBUG] LOADER: running pyi_sk

.py
[PYI-16184:DEBUG] LOADER: running pyi_sk

_openssl.py
[PYI-16184:DEBUG] LOADER: running pyi_rth__tkinter.py
[PYI-16184:DEBUG] LOADER: running pyi_sk

.py
[PYI-16184:DEBUG] LOADER: running pyi_rth_pyside6.py
[PYI-16184:DEBUG] LOADER: running pyi_sk

.py
[PYI-16184:DEBUG] LOADER: running pyi_sk

.py
[PYI-16184:DEBUG] LOADER: running sp.py

Floor #5 DeepSeek reply at 1 month ago

你提供的日志仅为程序启动时的调试信息，其中不包含视频翻译的实际处理过程，无法据此定位“音频只转出几句话”的原因。请按以下步骤获取正确日志：

打开软件安装目录下的 logs/ 文件夹。
找到当天以日期命名的 .log 文件（如 20250410.log）。
复制该文件的底部30行内容发回。

如果你已经执行过翻译任务，该日志中会明确显示语音识别(ASR)阶段识别出的音频时长、字幕条数等信息，可精确判断是 ASR 漏识别还是后续环节出现问题。

Floor #6 154.40* reply at 1 month ago

还是不够理想，很多字都被忽略掉了

Floor #7 DeepSeek reply at 1 month ago

许多字被忽略（漏识别）表明语音识别（ASR）阶段未能捕获所有语音，最常见的原因是模型大小不足或 VAD（语音活动检测）参数设置不当导致语音片段被误判为静音而跳过。请按以下顺序逐一排查：

更换更大的识别模型
在主界面“语音识别”中选择 faster-whisper(本地) + large-v3 模型。若之前使用 small 或 tiny，准确率极低，会遗漏大量句子。
优化 VAD 参数（菜单 → 工具 → 高级选项 → 语音识别参数）：
- 最短语音持续毫秒：设为 1000（1秒），低于此值的短语音将被丢弃。若原视频中说话人语速较快，可再降至 500。
- 最长语音持续秒数：设为 10 ~ 15，避免长句被强行切断。
- 静音分割持续毫秒：设为 140 ~ 200（默认500可能过大）。静音值越小，分割越细，不易漏掉话与话之间的短片段。
- 语音阈值：保持 0.5，若背景较安静可降至 0.3 以提高灵敏度。
- 取消勾选 合并过短字幕到邻近（若已选中），防止短句被合并掉。
- 选择VAD：尝试从默认的 ten-vad 切换为 silero，两种模型处理逻辑不同，效果可能有改善。
启用降噪或分离人声
- 勾选主界面右上角的 降噪，可去除背景噪声，让 ASR 更聚焦于人声。
- 若勾选后速度过慢，也可尝试在“设置更多参数”中启用 分离人声背景声（会先用模型分离人声再识别，CPU 耗时较长但效果显著）。
更换识别渠道
- 如果视频为中文，强烈建议尝试 Qwen-ASR(本地) 或 阿里FunASR(本地)+paraformer-zh，对中文语音支持远优于 whisper。
- 如果视频为英文，可尝试 openai-whisper(本地) + large-v3 或 openai语音识别API（需有 Key）。
- 也可使用在线 API：字节语音识别大模型极速版（免费额度），注册火山引擎后配置使用，中文识别极佳。
检查原始语言选择
- “发音语言”请务必手动选择具体语言（不要用 auto），auto 可能因模型误判导致只识别部分语言。
每次只处理一个视频（单视频交互模式）
在语音识别完毕后会弹出编辑框，你可以直观看到识别出的字幕条数和内容，若此处就缺失很多，说明问题在 ASR 阶段。你可在编辑框内手动调整，然后继续。

若以上调整后改善有限，请提供软件 logs/ 目录下最新日志文件的后30行，其中包含 ASR 阶段识别的音频时长和字幕数量，可进一步定位原因。

#4851 [aac @ 000002777cd6e280] Prediction is not allowed in AAC-LC.[aist#0:1/aac @ 000002777ccc9300] [dec:aac @ 000002777cd6de

7 Replies

Post Your Reply

Related resource